Speech-Arena-2025 commited on
Commit
0644fc3
·
1 Parent(s): 6638929

Display some submission instructions on the leaderboard itself

Browse files
Files changed (4) hide show
  1. app.py +0 -1
  2. ui/coming_soon.py +0 -39
  3. ui/metrics.py +0 -27
  4. ui/submission.py +45 -16
app.py CHANGED
@@ -5,7 +5,6 @@ import pandas as pd
5
  import gradio as gr
6
  from ui.leaderboard import render_leader_board, render_info_html, render_citation
7
  from ui.evaluation import render_eval_info
8
- from ui.coming_soon import render_coming_soon
9
  from ui.submission import render_submission_page
10
  import os
11
  from utils import load_leaderboard, custom_css
 
5
  import gradio as gr
6
  from ui.leaderboard import render_leader_board, render_info_html, render_citation
7
  from ui.evaluation import render_eval_info
 
8
  from ui.submission import render_submission_page
9
  import os
10
  from utils import load_leaderboard, custom_css
ui/coming_soon.py DELETED
@@ -1,39 +0,0 @@
1
- import gradio as gr
2
-
3
- def render_coming_soon():
4
- text = r"""
5
-
6
- ### **1. More evaluation metrics**
7
- - Accuracy
8
- - Precision, Recall and F1 Score
9
- - minDCF
10
-
11
- #### **2. More Datasets and Models:**
12
-
13
- **Datasets:**
14
-
15
- - MLAAD
16
- - Latin-American-Spanish-Deepfake-Dataset
17
- - CodecFake-Omni
18
- - Hindi audio-video-Deepfake
19
- - SpoofCeleb
20
- - VoiceWukong
21
- - CodecFake Haibin Wu et.al.
22
- - LRPD
23
- - EmoFake
24
-
25
-
26
- **Models:**
27
- - Wav2Vec2-AASIST
28
- - RawNet3
29
- - AASIST2
30
-
31
- #### **3. Top performing DF systems live demo**
32
-
33
- Run inference using your own audio samples on top performing DF systems. Get probability scores for each system
34
-
35
- """
36
-
37
-
38
-
39
- return gr.Markdown(text)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ui/metrics.py DELETED
@@ -1,27 +0,0 @@
1
- import gradio as gr
2
-
3
- def render_metrics():
4
- text = r"""
5
- We use **Equal Error Rate (EER %)** a standard method used in bimoretric and anti-spoofing systems.
6
-
7
- ### **What is EER?**
8
- Equal Error Rate (EER) is a performance metric used to evaluate biometric systems. It represents the point at which the **False Acceptance Rate (FAR)** and **False Rejection Rate (FRR)** are equal. A lower EER indicates a more accurate system.
9
-
10
- #### **False Acceptance Rate (FAR)**
11
- FAR is the proportion of **unauthorized** users incorrectly accepted by the system.
12
-
13
- $FAR = \frac{\text{False Acceptances}}{\text{Total Imposter Attempts}}$
14
-
15
- A high FAR means the system is too lenient, allowing unauthorized access.
16
-
17
- #### **False Rejection Rate (FRR)**
18
- FRR is the proportion of **genuine** users incorrectly rejected by the system.
19
-
20
- $FRR = \frac{\text{False Rejections}}{\text{Total Genuine Attempts}}$
21
-
22
- A high FRR means the system is too strict, denying access to legitimate users.
23
-
24
- ### EER is the point at which FAR and FRR are equal.
25
- """
26
-
27
- return gr.Markdown(text, latex_delimiters=[ {"left": "$", "right": "$", "display": False }])
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ui/submission.py CHANGED
@@ -1,28 +1,57 @@
1
  import gradio as gr
2
 
 
3
  def render_submission_page():
4
- text = r"""Want to submit your own system to the leaderboard? We accept submissions from both open source and closed source systems.
5
- Instructions and submission form can be found here: [Submission Form](https://drive.google.com/file/d/1YmW3da68hYAWeTmMAJOcEgUlJG3iGXGx/view?usp=sharing). We request submitting teams to fill out this form and
6
- and reach out to use at <speech.arena.eval@gmail.com>. Below is the List of currently included datasets in the leaderboard:
7
-
8
- - [ASVSpoof2019](https://www.asvspoof.org/index2019.html)
9
- - [ASVSpoof2021LA](https://www.asvspoof.org/index2021.html)
10
- - [ASVSpoof2021DF](https://www.asvspoof.org/index2021.html)
11
- - [ASVSpoof2024-Dev](https://www.asvspoof.org/workshop2024)
12
- - [ASVSpoof2024-Eval](https://www.asvspoof.org/workshop2024)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  - [FakeOrReal](https://bil.eecs.yorku.ca/datasets/)
14
  - [Codecfake Yuankun et. al.](https://github.com/xieyuankun/Codecfake)
15
- - [ADD2022 Track 1](http://addchallenge.cn/add2022)
16
- - [ADD2022 Track 3](http://addchallenge.cn/add2022)
17
- - [ADD 2023 R1](http://addchallenge.cn/add2023)
18
- - [ADD2023 R2](http://addchallenge.cn/add2023)
19
  - [DFADD](https://github.com/isjwdu/DFADD)
20
  - [LibriVoc](https://github.com/csun22/Synthetic-Voice-Detection-Vocoder-Artifacts)
21
  - [SONAR](https://github.com/Jessegator/SONAR)
22
- """
23
 
24
-
25
- return gr.Markdown(text)
 
26
 
27
 
28
 
 
1
  import gradio as gr
2
 
3
+
4
  def render_submission_page():
5
+
6
+ text = r"""
7
+ Want to submit your own system to the leaderboard? We accept submissions from both open source and proprietary systems.
8
+ Instructions and submission form can be found here: [Submission Form](https://drive.google.com/file/d/1YmW3da68hYAWeTmMAJOcEgUlJG3iGXGx/view?usp=sharing). We request submitting teams to fill out this form and
9
+ and reach out to use at <speech.arena.eval@gmail.com>
10
+
11
+ ## General Instructions for commercial and open source systems
12
+ In order to include the scores on this leaderboard
13
+ and facilitate the verification of the system to be submitted, the submitting team has to
14
+ provide the following artifacts along with the signed submission form.
15
+
16
+ - Protocol files used to generate the scores for all the evaluation datasets listed on
17
+ the leaderboard at the time of submission.
18
+ - Score files generated using the submitted system for all the evaluation datasets
19
+ listed on the leaderboard at the time of submission.
20
+ - Number of parameters used in the system to be submitted
21
+
22
+ ## The submitting team must abide by the following terms for the scores to be considered for evaluation:
23
+ - The submitted system has not been trained, directly or indirectly, on the
24
+ evaluation (test) or development sets of any dataset with a public license. This includes, but is not limited to, any form of supervised or unsupervised training, finetuning, or hyperparameter optimization involving these sets.
25
+
26
+ - Reported scores correspond to a single single system evaluated consistently across
27
+ the evaluation sets with the same checkpoint and parameters with no modifications in the hyperparameters.
28
+ - Commercial systems with a proprietary license agree to grant API access to the DF Arena team if required strictly for verification purposes.
29
+
30
+ - The DF Arena leaderboard will be updated periodically to include new datasets. The submitting team agrees to evaluate and submit scores on these additional datasets upon if requested in order to maintain a valid presence on the leaderboard.
31
+
32
+ Submitting team acknowledges that any violation of the above may result in disqualification of the submission which includes removal of the system from the leaderboard and public disclosure of the disqualification on DF Arena’s official communication channels.
33
+
34
+
35
+ Details regarding the list and URLs / sources used to obtain the evaluation datasets
36
+ can be found as follows:-
37
+
38
+ - [ASVSpoof2019](https://zenodo.org/records/6906306)
39
+ - [ASVSpoof2021LA](https://zenodo.org/records/4837263)
40
+ - [ASVSpoof2021DF](https://zenodo.org/records/4837263)
41
+ - [ASVSpoof2024-Eval](https://zenodo.org/records/14498691)
42
  - [FakeOrReal](https://bil.eecs.yorku.ca/datasets/)
43
  - [Codecfake Yuankun et. al.](https://github.com/xieyuankun/Codecfake)
44
+ - [ADD2022 Track 1](http://addchallenge.cn/databases2023)
45
+ - [ADD2022 Track 3](http://addchallenge.cn/databases2023)
46
+ - [ADD 2023 R1](http://addchallenge.cn/databases2023)
47
+ - [ADD2023 R2](http://addchallenge.cn/databases2023)
48
  - [DFADD](https://github.com/isjwdu/DFADD)
49
  - [LibriVoc](https://github.com/csun22/Synthetic-Voice-Detection-Vocoder-Artifacts)
50
  - [SONAR](https://github.com/Jessegator/SONAR)
 
51
 
52
+ """
53
+
54
+ return gr.Markdown(text)
55
 
56
 
57