-
Notifications
You must be signed in to change notification settings - Fork 31
Feature/complete editable transcript system #37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
SATVIKsynopsis
wants to merge
17
commits into
Vijayaa21:master
Choose a base branch
from
SATVIKsynopsis:feature/complete-editable-transcript-system
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Feature/complete editable transcript system #37
SATVIKsynopsis
wants to merge
17
commits into
Vijayaa21:master
from
SATVIKsynopsis:feature/complete-editable-transcript-system
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
- Implement EditableTranscript component with segment-based editing - Add real-time speech recognition using Browser Web Speech API - Create AudioRecorder and AudioPost components for audio handling - Add LiveSpeechRecognition modal with microphone testing - Implement TranscriptContext for localStorage persistence - Add useSpeechToText and useTranscript custom hooks - Create TranscriptDemo page with three modes (My Recordings, Live Speech, Tutorial) - Update backend models to support transcript data structure - Add post routes and controllers for transcript handling - Replace hardcoded environment variables with process.env - Fix authentication and server connectivity issues - Remove file upload functionality (Web Speech API limitation) Features: - Click transcript segments to jump to audio timestamps - Edit transcript text inline with confidence indicators - Real-time speech-to-text with interim results - Export transcripts in TXT and SRT formats - Persistent storage across page refreshes - Microphone permission and audio level testing - Support for Chrome/Edge browsers with Web Speech API
- Add speechToText.js with Whisper API and Browser Speech API support - Add comprehensive deployment summary documentation
- Implement EditableTranscript component with segment-based editing - Add real-time speech recognition using Browser Web Speech API - Create AudioRecorder and AudioPost components for audio handling - Add LiveSpeechRecognition modal with microphone testing - Implement TranscriptContext for localStorage persistence - Add useSpeechToText and useTranscript custom hooks - Create TranscriptDemo page with three modes (My Recordings, Live Speech, Tutorial) - Update backend models to support transcript data structure - Add post routes and controllers for transcript handling - Replace hardcoded environment variables with process.env - Fix authentication and server connectivity issues - Remove file upload functionality (Web Speech API limitation) Features: - Click transcript segments to jump to audio timestamps - Edit transcript text inline with confidence indicators - Real-time speech-to-text with interim results - Export transcripts in TXT and SRT formats - Persistent storage across page refreshes - Microphone permission and audio level testing - Support for Chrome/Edge browsers with Web Speech API
…pport - Add transcribeWithWhisper function for OpenAI Whisper API integration - Add transcribeWithBrowserAPI for real-time browser speech recognition - Support for both server-side and client-side speech-to-text conversion - Includes confidence scoring and segment-based transcript format - Error handling for API failures and unsupported browsers
…pport - Add transcribeWithWhisper function for OpenAI Whisper API integration - Add transcribeWithBrowserAPI for real-time browser speech recognition - Support for both server-side and client-side speech-to-text conversion - Includes confidence scoring and segment-based transcript format - Error handling for API failures and unsupported browsers
- Add speechToText.js with Whisper API and Browser Speech API support - Add comprehensive deployment summary documentation
…pport - Add transcribeWithWhisper function for OpenAI Whisper API integration - Add transcribeWithBrowserAPI for real-time browser speech recognition - Support for both server-side and client-side speech-to-text conversion - Includes confidence scoring and segment-based transcript format - Error handling for API failures and unsupported browsers
COMPREHENSIVE TRANSCRIPT SYSTEM: CORE COMPONENTS: - EditableTranscript.jsx - Main transcript editor with audio sync - TranscriptSegment.jsx - Individual segment editing with confidence indicators - AudioRecorder.jsx - Audio recording with real-time waveform - AudioPost.jsx - Audio post display with transcript integration - LiveSpeechRecognition.jsx - Real-time speech recognition modal - TranscriptDemo.jsx - Comprehensive demo with 3 modes CUSTOM HOOKS: - useSpeechToText.jsx - Browser Web Speech API integration - useTranscript.jsx - Transcript management and localStorage persistence CONTEXT & STATE: - TranscriptContext.jsx - Global transcript state management - localStorage persistence across page refreshes BACKEND INTEGRATION: - speechToText.js - Dual API support (Whisper + Browser Speech) - Post model with transcript schema - API routes for transcript CRUD operations - Environment variables for secure deployment KEY FEATURES: - Real-time speech-to-text with Browser Web Speech API - Click transcript segments to jump to audio timestamps - Inline text editing with confidence scoring - Persistent storage across page refreshes - Export transcripts in TXT, SRT, JSON formats - Microphone testing and audio level detection - Secure environment variable configuration - Production-ready deployment setup DEMO MODES: 1. My Recordings - View and edit saved transcripts 2. Live Speech - Real-time speech recognition 3. Tutorial Demo - Interactive feature demonstration BROWSER SUPPORT: - Chrome/Edge for speech recognition - All modern browsers for transcript editing - Fallback UI for unsupported browsers COST OPTIMIZATION: - Free Browser Web Speech API (real-time) - Optional paid OpenAI Whisper API (high accuracy) - No mandatory external API dependencies This represents a complete, production-ready editable transcript system with speech recognition capabilities for the Blabber social media platform.
@SATVIKsynopsis is attempting to deploy a commit to the Vijaya 's projects Team on Vercel. A member of the Team first needs to authorize it. |
Can you provide video of this functionality so that I can view |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
closes #1 COMPREHENSIVE TRANSCRIPT SYSTEM:
CORE COMPONENTS:
CUSTOM HOOKS:
CONTEXT & STATE:
BACKEND INTEGRATION:
KEY FEATURES:
DEMO MODES:
BROWSER SUPPORT:
COST OPTIMIZATION:
This represents a complete, production-ready editable transcript system
with speech recognition capabilities for the Blabber social media platform.