no code implementations • GermEval 2021 • Jaqueline Böck, Daria Liakhovets, Mina Schütz, Armin Kirchknopf, Djordje Slijepčević, Matthias Zeppelzauer, Alexander Schindler
Our best model is GottBERT (i. e., a BERT transformer pre-trained on German texts) fine-tuned on the GermEval 2021 data.
no code implementations • 1 Mar 2024 • Tin Nguyen, Lam Pham, Phat Lam, Dat Ngo, Hieu Tang, Alexander Schindler
In this paper, we propose a deep learning based model for Acoustic Anomaly Detection of Machines, the task for detecting abnormal machines by analysing the machine sound.
no code implementations • 29 Jan 2024 • Phat Lam, Lam Pham, Tin Nguyen, Hieu Tang, Seidl Michael, Alexander Schindler
For this reason, the role of sentence embedding is crucial for capturing both the semantic information between words in the sentence and the contextual relationship of sentences within the abstract to provide a comprehensive representation for better classification.
no code implementations • 27 Dec 2023 • Cam Le, Lam Pham, Jasmin Lampert, Matthias Schlögl, Alexander Schindler
Finally, we propose a combined loss function which leverages Focal loss and IoU loss to train the network.
no code implementations • 16 May 2023 • Lam Pham, Dat Ngo, Cam Le, Anahid Jalali, Alexander Schindler
In the second phase, the student network, which presents a low complexity model, is trained with the embeddings extracted from the teacher.
no code implementations • 25 Feb 2023 • Lam Pham, Cam Le, Dat Ngo, Anh Nguyen, Jasmin Lampert, Alexander Schindler, Ian McLoughlin
In this paper, we present a high-performance and light-weight deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the aerial scene of a remote sensing image.
no code implementations • 16 Oct 2022 • Lam Pham, Dusan Salovic, Anahid Jalali, Alexander Schindler, Khoa Tran, Canh Vu, Phu X. Nguyen
In this paper, we present a comprehensive analysis of Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording from its acoustic signature.
no code implementations • 20 Jun 2022 • Lam Pham, Khoa Tran, Dat Ngo, Jasmin Lampert, Alexander Schindler
The task of remote sensing image scene classification (RSISC), which aims at classifying remote sensing images into groups of semantic categories based on their contents, has taken the important role in a wide range of applications such as urban planning, natural hazards detection, environment monitoring, vegetation mapping, or geospatial object detection.
no code implementations • 13 Jun 2022 • Lam Pham, Dat Ngo, Anahid Jalali, Alexander Schindler
In this report, we presents low-complexity deep learning frameworks for acoustic scene classification (ASC).
no code implementations • 23 Mar 2022 • Lam Pham, Khoa Dinh, Dat Ngo, Hieu Tang, Alexander Schindler
In this paper, we present a robust and low complexity system for Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording.
1 code implementation • 16 Dec 2021 • Lam Pham, Dat Ngo, Phu X. Nguyen, Truong Hoang, Alexander Schindler
This paper presents a task of audio-visual scene classification (SC) where input videos are classified into one of five real-life crowded scenes: 'Riot', 'Noise-Street', 'Firework-Event', 'Music-Event', and 'Sport-Atmosphere'.
no code implementations • 9 Jun 2021 • Mina Schütz, Jaqueline Boeck, Daria Liakhovets, Djordje Slijepčević, Armin Kirchknopf, Manuel Hecht, Johannes Bogensperger, Sven Schlarb, Alexander Schindler, Matthias Zeppelzauer
For both tasks our best model is XLM-R with unsupervised pre-training on the EXIST data and additional datasets and fine-tuning on the provided dataset.
no code implementations • 2 Apr 2020 • Alexander Schindler, Andrew Lindley, Anahid Jalali, Martin Boyer, Sergiu Gordea, Ross King
Specifically, Audio Event Detection is applied to index the content according to attack-specific acoustic concepts.
no code implementations • 27 Mar 2020 • Alexander Schindler, Sergiu Gordea, Peter Knees
We present an approach to unsupervised audio representation learning.
no code implementations • 1 Feb 2020 • Alexander Schindler
In a further consequence it can be concluded that this visual information is music related and thus should be beneficial for the corresponding MIR tasks such as music genre classification or mood recognition.
no code implementations • 15 Jan 2020 • Alexander Schindler, Thomas Lidy, Sebastian Böck
Deep Learning has become state of the art in visual computing and continuously emerges into the Music Information Retrieval (MIR) and audio retrieval domain.
no code implementations • 16 Apr 2019 • Anahid Jalali, Clemens Heistracher, Alexander Schindler, Bernhard Haslhofer, Tanja Nemeth, Robert Glawar, Wilfried Sihn, Peter De Boer
Predicting unscheduled breakdowns of plasma etching equipment can reduce maintenance costs and production losses in the semiconductor industry.
no code implementations • 28 Nov 2018 • Alexander Schindler, Martin Boyer, Andrew Lindley, David Schreiber, Thomas Philipp
The heterogeneous results of the analytical modules are fused into a distributed index of visual and acoustic concepts to facilitate rapid start of investigations, following traits and investigating witness reports.
no code implementations • 11 Nov 2018 • Alexander Schindler, Thomas Lidy, Stephan Karner, Matthias Hecker
We present an empirical study of applying deep Convolutional Neural Networks (CNN) to the task of fashion and apparel image classification to improve meta-data enrichment of e-commerce applications.
no code implementations • 11 Nov 2018 • Botond Fazeka, Alexander Schindler, Thomas Lidy, Andreas Rauber
The audio is fed into a Convolutional Neural Network (CNN) using four convolutional layers.
Sound Audio and Speech Processing