mirror of
https://github.com/openai/whisper.git
synced 2025-11-24 06:26:03 +00:00
- Created whisper/enhancements module for enhanced functionality - Implemented HallucinationDetector with multi-method detection: * Pattern-based detection (YouTube artifacts, repetitive phrases) * Statistical analysis (compression ratios, log probabilities) * Repetition analysis (looping behavior detection) * Temporal analysis (silence-based detection) - Added ConfidenceScorer for comprehensive transcription quality assessment - Enhanced transcribe() function with new parameters: * enhanced_hallucination_detection: Enable advanced detection * hallucination_detection_language: Language-specific patterns * strict_hallucination_filtering: Strict vs permissive filtering * confidence_threshold: Minimum confidence for segments - Maintains full backward compatibility - Added CLI arguments for new functionality Addresses: OpenAI Whisper Discussion #679 - Hallucinations & Repetition Loops
15 lines
534 B
Python
15 lines
534 B
Python
# Whisper Enhancements Module
|
|
"""
|
|
This module contains enhanced functionality for the OpenAI Whisper speech recognition system.
|
|
These enhancements provide additional features while maintaining backward compatibility with the core Whisper API.
|
|
"""
|
|
|
|
from .hallucination_detector import HallucinationDetector, detect_hallucinations
|
|
from .confidence_scorer import ConfidenceScorer, calculate_confidence_score
|
|
|
|
__all__ = [
|
|
'HallucinationDetector',
|
|
'detect_hallucinations',
|
|
'ConfidenceScorer',
|
|
'calculate_confidence_score'
|
|
] |