no code implementations • 18 Apr 2024 • Christoph Reich, Oliver Hahn, Daniel Cremers, Stefan Roth, Biplob Debnath
The use of standardized codecs, such as JPEG or H. 264, is prevalent and required to ensure interoperability.
no code implementations • 18 Apr 2024 • Md Adnan Arefeen, Biplob Debnath, Md Yusuf Sarwar Uddin, Srimat Chakradhar
Use of RAG for combined understanding of multimodal data such as text, images and videos is appealing but two critical limitations exist: one-time, upfront capture of all content in large multimodal data as text descriptions entails high processing times, and not all information in the rich multimodal data is typically in the text descriptions.
1 code implementation • 13 Sep 2023 • Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar
the input image, the JPEG quality, the quantization tables, and the color conversion parameters.
no code implementations • 2 Sep 2023 • Md Adnan Arefeen, Biplob Debnath, Srimat Chakradhar
Additionally, if free pretrained LLM-based summarizers are used to reduce context (into human consumable summaries), LeanContext can further modify the reduced context to enhance the accuracy (ROUGE-1 score) by $13. 22\%$ to $24. 61\%$.
no code implementations • 30 Aug 2023 • Christoph Reich, Biplob Debnath, Deep Patel, Tim Prangemeier, Daniel Cremers, Srimat Chakradhar
To overcome the deterioration of vision performance, this paper presents the first end-to-end learnable deep video codec control that considers both bandwidth constraints and downstream deep vision performance, while adhering to existing standardization.
no code implementations • 3 Sep 2021 • Kunal Rao, Giuseppe Coviello, Min Feng, Biplob Debnath, Wang-Pin Hsiung, Murugan Sankaradas, Yi Yang, Oliver Po, Utsav Drolia, Srimat Chakradhar
Identification of people with elevated body temperature can reduce or dramatically slow down the spread of infectious diseases like COVID-19.