Spaces:
Runtime error
Runtime error
Commit
Β·
07926c5
1
Parent(s):
ca4743e
Add metric description
Browse files
README.md
CHANGED
|
@@ -14,16 +14,11 @@ pinned: false
|
|
| 14 |
|
| 15 |
# Metric Card for action_generation
|
| 16 |
|
| 17 |
-
***Module Card Instructions:*** *Fill out the following subsections. Feel free to take a look at existing metric cards if you'd like examples.*
|
| 18 |
-
|
| 19 |
## Metric Description
|
| 20 |
-
|
|
|
|
| 21 |
|
| 22 |
## How to Use
|
| 23 |
-
*Give general statement of how to use the metric*
|
| 24 |
-
|
| 25 |
-
*Provide simplest possible example for using the metric*
|
| 26 |
-
|
| 27 |
```python
|
| 28 |
import evaluate
|
| 29 |
valid_labels = [
|
|
@@ -64,8 +59,6 @@ print(result)
|
|
| 64 |
|
| 65 |
*State the range of possible values that the metric's output can take, as well as what in that range is considered good. For example: "This metric can take on any value between 0 and 100, inclusive. Higher scores are better."*
|
| 66 |
|
| 67 |
-
#### Values from Popular Papers
|
| 68 |
-
*Give examples, preferrably with links to leaderboards or publications, to papers that have reported this metric, along with the values they have reported.*
|
| 69 |
|
| 70 |
### Examples
|
| 71 |
*Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.*
|
|
|
|
| 14 |
|
| 15 |
# Metric Card for action_generation
|
| 16 |
|
|
|
|
|
|
|
| 17 |
## Metric Description
|
| 18 |
+
Evaluate the result of action generation task.
|
| 19 |
+
Consider the output format `/class/phrase`. Compute the scores for both `/class` and `phrase` separately, and then perform a weighted sum of these scores.
|
| 20 |
|
| 21 |
## How to Use
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
```python
|
| 23 |
import evaluate
|
| 24 |
valid_labels = [
|
|
|
|
| 59 |
|
| 60 |
*State the range of possible values that the metric's output can take, as well as what in that range is considered good. For example: "This metric can take on any value between 0 and 100, inclusive. Higher scores are better."*
|
| 61 |
|
|
|
|
|
|
|
| 62 |
|
| 63 |
### Examples
|
| 64 |
*Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.*
|