Search Results for author: Andrew Brown

Found 13 papers, 4 papers with code

Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model

no code implementations • 1 Feb 2024 • Andrew Brown, Jiading Zhu, Mohamed Abdelwahab, Alec Dong, Cindy Wang, Jonathan Rose

Many will be motivated to distill specific capabilities of foundational models into smaller models that can be owned and controlled.

Chatbot, Language Modelling

Motion-Conditioned Image Animation for Video Editing

no code implementations • 30 Nov 2023 • Wilson Yan, Andrew Brown, Pieter Abbeel, Rohit Girdhar, Samaneh Azadi

We introduce MoCA, a Motion-Conditioned Image Animation approach for video editing.

Image Animation, Video Editing

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

no code implementations • 17 Nov 2023 • Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

We present Emu Video, a text-to-video generation model that factorizes the generation into two steps: first generating an image conditioned on the text, and then generating a video conditioned on the text and the generated image.

Text-to-Video Generation, Video Generation

Causal Video Summarizer for Video Exploration

no code implementations • 4 Jul 2023 • Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Andrew Brown, Marcel Worring

Multi-modal video summarization has a video input and a text-based query input.

Video Summarization

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

1 code implementation • 20 Feb 2023 • Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman

This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022.

Speaker Diarization, Speaker Recognition, +1

In search of strong embedding extractors for speaker diarisation

no code implementations • 26 Oct 2022 • Jee-weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesung Huh, Andrew Brown, Youngki Kwon, Shinji Watanabe, Joon Son Chung

First, the evaluation is not straightforward because the features required for better performance differ between speaker verification and diarisation.

Data Augmentation, Speaker Verification

End-to-End Visual Editing with a Generatively Pre-Trained Artist

no code implementations • 3 May 2022 • Andrew Brown, Cheng-Yang Fu, Omkar Parkhi, Tamara L. Berg, Andrea Vedaldi

We consider the targeted image editing problem: blending a region in a source image with a driver image that specifies the desired change.

Face, Body, Voice: Video Person-Clustering with Multiple Modalities

no code implementations • 20 May 2021 • Andrew Brown, Vicky Kalogeiton, Andrew Zisserman

In this paper we make contributions to address both these deficiencies: first, we introduce a Multi-Modal High-Precision Clustering algorithm for person-clustering in videos using cues from several modalities (face, body, and voice).

Clustering, Face Clustering

Automated Video Labelling: Identifying Faces by Corroborative Evidence

no code implementations • 10 Feb 2021 • Andrew Brown, Ernesto Coto, Andrew Zisserman

We present a method for automatically labelling all faces in video archives, such as TV broadcasts, by combining multiple evidence sources and multiple modalities (visual and audio).

Domain Adaptation, Image Retrieval

VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge

no code implementations • 12 Dec 2020 • Arsha Nagrani, Joon Son Chung, Jaesung Huh, Andrew Brown, Ernesto Coto, Weidi Xie, Mitchell McLaren, Douglas A Reynolds, Andrew Zisserman

We held the second installment of the VoxCeleb Speaker Recognition Challenge in conjunction with Interspeech 2020.

Speaker Recognition

Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval

2 code implementations • ECCV 2020 • Andrew Brown, Weidi Xie, Vicky Kalogeiton, Andrew Zisserman

Optimising a ranking-based metric, such as Average Precision (AP), is notoriously challenging due to the fact that it is non-differentiable, and hence cannot be optimised directly using gradient-descent methods.

Ranked #4 on Vehicle Re-Identification on VehicleID Medium
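
The non-differentiability mentioned in the Smooth-AP summary above can be seen in a small sketch. Exact AP depends on the scores only through their ranking, so a small change in a score that does not reorder the items leaves AP unchanged and gives no gradient signal. The second function replaces each hard rank comparison with a sigmoid, which is one common relaxation; the function names, the temperature parameter `tau`, and the toy scores are assumptions made for this illustration and are not taken from this listing or from the authors' released code.

```python
import math


def average_precision(scores, labels):
    """Exact AP: sort by descending score, average the precision at each positive."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits, total = 0, 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i]:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def sigmoid(x, tau):
    """Numerically stable logistic function of x / tau."""
    z = x / tau
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def relaxed_average_precision(scores, labels, tau=0.01):
    """AP with every hard comparison s_j > s_i replaced by sigmoid((s_j - s_i) / tau).

    Hypothetical helper for illustration only: smaller tau tracks exact AP more
    closely, larger tau gives a smoother function of the scores.
    """
    pos = [i for i, y in enumerate(labels) if y]
    if not pos:
        return 0.0
    total = 0.0
    for i in pos:
        # Soft rank of item i among the positives, and among all items.
        rank_pos = 1.0 + sum(sigmoid(scores[j] - scores[i], tau) for j in pos if j != i)
        rank_all = 1.0 + sum(
            sigmoid(scores[j] - scores[i], tau) for j in range(len(scores)) if j != i
        )
        total += rank_pos / rank_all
    return total / len(pos)


labels = [1, 0, 1, 0]
before = [0.9, 0.8, 0.30, 0.1]
after = [0.9, 0.8, 0.31, 0.1]  # one score nudged, ranking unchanged

# Exact AP is identical for both score vectors; the relaxed value moves.
print(average_precision(before, labels), average_precision(after, labels))
print(relaxed_average_precision(before, labels, tau=0.1),
      relaxed_average_precision(after, labels, tau=0.1))
```

The relaxed version is written with plain Python floats to keep the sketch dependency-free; in practice the same expression would be computed on tensors in an autodiff framework so that gradients with respect to the scores are available.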