Commit 0644fc3
Parent(s): 6638929

Display some submission instructions on the leaderboard itself

Files changed:
- app.py +0 -1
- ui/coming_soon.py +0 -39
- ui/metrics.py +0 -27
- ui/submission.py +45 -16
app.py
CHANGED
@@ -5,7 +5,6 @@ import pandas as pd
 import gradio as gr
 from ui.leaderboard import render_leader_board, render_info_html, render_citation
 from ui.evaluation import render_eval_info
-from ui.coming_soon import render_coming_soon
 from ui.submission import render_submission_page
 import os
 from utils import load_leaderboard, custom_css
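The commit shows only the import changes to app.py, so for context, here is a minimal sketch of how these render functions might be wired into a tabbed Gradio app. The tab names, layout, and the `load_leaderboard()` usage are assumptions for illustration, not the actual app.py body.

```python
import gradio as gr

from ui.leaderboard import render_leader_board, render_info_html, render_citation
from ui.evaluation import render_eval_info
from ui.submission import render_submission_page
from utils import load_leaderboard, custom_css

# Hypothetical wiring: tab names and ordering are illustrative only.
df = load_leaderboard()

with gr.Blocks(css=custom_css) as demo:
    render_info_html()
    with gr.Tab("Leaderboard"):
        render_leader_board(df)
    with gr.Tab("Evaluation"):
        render_eval_info()
    with gr.Tab("Submit"):
        render_submission_page()
    render_citation()

if __name__ == "__main__":
    demo.launch()
```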
ui/coming_soon.py
DELETED
@@ -1,39 +0,0 @@
-import gradio as gr
-
-def render_coming_soon():
-    text = r"""
-
-### **1. More evaluation metrics**
-- Accuracy
-- Precision, Recall and F1 Score
-- minDCF
-
-#### **2. More Datasets and Models:**
-
-**Datasets:**
-
-- MLAAD
-- Latin-American-Spanish-Deepfake-Dataset
-- CodecFake-Omni
-- Hindi audio-video-Deepfake
-- SpoofCeleb
-- VoiceWukong
-- CodecFake Haibin Wu et al.
-- LRPD
-- EmoFake
-
-
-**Models:**
-- Wav2Vec2-AASIST
-- RawNet3
-- AASIST2
-
-#### **3. Top performing DF systems live demo**
-
-Run inference using your own audio samples on top performing DF systems. Get probability scores for each system.
-
-"""
-
-
-
-    return gr.Markdown(text)
ui/metrics.py
DELETED
@@ -1,27 +0,0 @@
-import gradio as gr
-
-def render_metrics():
-    text = r"""
-We use **Equal Error Rate (EER %)**, a standard metric used in biometric and anti-spoofing systems.
-
-### **What is EER?**
-Equal Error Rate (EER) is a performance metric used to evaluate biometric systems. It represents the point at which the **False Acceptance Rate (FAR)** and **False Rejection Rate (FRR)** are equal. A lower EER indicates a more accurate system.
-
-#### **False Acceptance Rate (FAR)**
-FAR is the proportion of **unauthorized** users incorrectly accepted by the system.
-
-$FAR = \frac{\text{False Acceptances}}{\text{Total Imposter Attempts}}$
-
-A high FAR means the system is too lenient, allowing unauthorized access.
-
-#### **False Rejection Rate (FRR)**
-FRR is the proportion of **genuine** users incorrectly rejected by the system.
-
-$FRR = \frac{\text{False Rejections}}{\text{Total Genuine Attempts}}$
-
-A high FRR means the system is too strict, denying access to legitimate users.
-
-### EER is the point at which FAR and FRR are equal.
-"""
-
-    return gr.Markdown(text, latex_delimiters=[{"left": "$", "right": "$", "display": False}])
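With metrics.py deleted, the EER definition above survives only as prose. For reference, here is a minimal, generic sketch of computing EER from genuine and spoofed score arrays by sweeping thresholds until FAR and FRR cross; it illustrates the metric itself and is not necessarily the leaderboard's own implementation.

```python
import numpy as np

def compute_eer(genuine_scores, spoof_scores):
    """Return EER (as a fraction) at the threshold where FAR and FRR cross.

    Assumes higher scores mean "more likely genuine".
    """
    # Sweep every observed score as a candidate decision threshold.
    thresholds = np.sort(np.concatenate([genuine_scores, spoof_scores]))
    # FAR: spoofed trials accepted; FRR: genuine trials rejected.
    far = np.array([(spoof_scores >= t).mean() for t in thresholds])
    frr = np.array([(genuine_scores < t).mean() for t in thresholds])
    idx = int(np.argmin(np.abs(far - frr)))
    return (far[idx] + frr[idx]) / 2

# Toy example: two overlapping score distributions.
rng = np.random.default_rng(0)
genuine = rng.normal(2.0, 1.0, 1000)
spoof = rng.normal(0.0, 1.0, 1000)
print(f"EER: {compute_eer(genuine, spoof) * 100:.2f}%")
```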
ui/submission.py
CHANGED
@@ -1,28 +1,57 @@
 import gradio as gr
 
+
 def render_submission_page():
-
-
-
-
-
-
-
-
-
+
+    text = r"""
+Want to submit your own system to the leaderboard? We accept submissions from both open source and proprietary systems.
+Instructions and the submission form can be found here: [Submission Form](https://drive.google.com/file/d/1YmW3da68hYAWeTmMAJOcEgUlJG3iGXGx/view?usp=sharing). We request that submitting teams fill out this form
+and reach out to us at <speech.arena.eval@gmail.com>.
+
+## General instructions for commercial and open source systems
+In order to include the scores on this leaderboard
+and to facilitate verification of the submitted system, the submitting team has to
+provide the following artifacts along with the signed submission form:
+
+- Protocol files used to generate the scores for all the evaluation datasets listed on
+the leaderboard at the time of submission.
+- Score files generated by the submitted system for all the evaluation datasets
+listed on the leaderboard at the time of submission.
+- The number of parameters of the system to be submitted.
+
+## The submitting team must abide by the following terms for the scores to be considered for evaluation:
+- The submitted system has not been trained, directly or indirectly, on the
+evaluation (test) or development sets of any dataset with a public license. This includes, but is not limited to, any form of supervised or unsupervised training, fine-tuning, or hyperparameter optimization involving these sets.
+
+- Reported scores correspond to a single system evaluated consistently across
+the evaluation sets with the same checkpoint and parameters, with no modifications to the hyperparameters.
+- Commercial systems with a proprietary license agree to grant the DF Arena team API access, strictly for verification purposes, if required.
+
+- The DF Arena leaderboard will be updated periodically to include new datasets. The submitting team agrees to evaluate and submit scores on these additional datasets if requested, in order to maintain a valid presence on the leaderboard.
+
+The submitting team acknowledges that any violation of the above may result in disqualification of the submission, which includes removal of the system from the leaderboard and public disclosure of the disqualification on DF Arena’s official communication channels.
+
+
+Details regarding the list of evaluation datasets and the URLs / sources used to obtain them
+can be found below:
+
+- [ASVSpoof2019](https://zenodo.org/records/6906306)
+- [ASVSpoof2021LA](https://zenodo.org/records/4837263)
+- [ASVSpoof2021DF](https://zenodo.org/records/4837263)
+- [ASVSpoof2024-Eval](https://zenodo.org/records/14498691)
 - [FakeOrReal](https://bil.eecs.yorku.ca/datasets/)
 - [Codecfake Yuankun et al.](https://github.com/xieyuankun/Codecfake)
-- [ADD2022 Track 1](http://addchallenge.cn/
-- [ADD2022 Track 3](http://addchallenge.cn/
-- [ADD 2023 R1](http://addchallenge.cn/
-- [ADD2023 R2](http://addchallenge.cn/
+- [ADD2022 Track 1](http://addchallenge.cn/databases2023)
+- [ADD2022 Track 3](http://addchallenge.cn/databases2023)
+- [ADD 2023 R1](http://addchallenge.cn/databases2023)
+- [ADD2023 R2](http://addchallenge.cn/databases2023)
 - [DFADD](https://github.com/isjwdu/DFADD)
 - [LibriVoc](https://github.com/csun22/Synthetic-Voice-Detection-Vocoder-Artifacts)
 - [SONAR](https://github.com/Jessegator/SONAR)
-"""
 
-
-
+"""
+
+    return gr.Markdown(text)
 
 
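Since the new terms require protocol files and score files for every evaluation set, here is a hedged sketch of one way a submitting team might sanity-check a score file against a protocol file before sending it in. The two-column `utt_id label` / `utt_id score` file formats are assumptions for illustration only; the commit does not specify a format.

```python
def check_score_file(protocol_path: str, score_path: str) -> None:
    """Verify that a score file covers exactly the trials in a protocol file.

    Assumed formats (illustrative, not specified by the leaderboard):
      protocol: <utt_id> <label>   e.g. "LA_E_1001 spoof"
      scores:   <utt_id> <score>   e.g. "LA_E_1001 -3.417"
    """
    with open(protocol_path) as f:
        protocol_ids = {line.split()[0] for line in f if line.strip()}
    with open(score_path) as f:
        score_ids = {line.split()[0] for line in f if line.strip()}

    missing = protocol_ids - score_ids   # trials with no score
    extra = score_ids - protocol_ids     # scores for unknown trials
    if missing or extra:
        raise ValueError(f"{len(missing)} trials missing, {len(extra)} unexpected")
    print(f"OK: {len(score_ids)} trials scored")
```

A check like this catches the most common mismatch (a score file generated against a stale or partial protocol) before the verification step described in the terms above.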