Microsoft interns developed AVED, a framework enabling synchronized audio and video editing via text prompts without retraining models.
arxiv
3 Articles
3
A new speech emotion recognition system uses deep learning to interpret emotions in speech through audio analysis, enabling broader accessibility.
