Spaces:

madmax3366
/

AutoEnv_demo

Running

App Files Files Community

madmax3366 commited on Nov 11

Commit

8ca7ca7

verified ·

1 Parent(s): 72a8e5c

Update index.html

Browse files

Files changed (1) hide show

index.html +40 -0

index.html CHANGED Viewed

@@ -234,6 +234,46 @@
   </div>
 </section>
 <!-- BibTeX -->
 <section class="section" id="BibTeX">
   <div class="container is-max-desktop content">

   </div>
 </section>
+<!-- Results and Analysis -->
+<section class="section" id="results-analysis">
+  <div class="container is-max-desktop">
+    <div class="columns is-centered">
+      <div class="column is-four-fifths">
+        <h2 class="title is-3">Results and Analysis</h2>
+        <div class="content has-text-justified">
+          <p>
+            We evaluate multiple agent configurations on <strong>Automotive-ENV</strong>, reporting success
+            rates across General tasks (Explicit Control, Implicit Intent) and Safety-Aware tasks
+            (Driving Alignment, Environment Alerts). We also analyze the effect of GPS-aware context
+            on inference token usage and task-wise performance across hotspot categories.
+          </p>
+        </div>
+        <!-- Figure 1: Success rates -->
+        <figure class="system-figure has-text-centered" style="margin-top:12px;">
+          <img src="./static/images/results.jpg" alt="Success rates of different agent configurations across task groups">
+          <figcaption class="subtitle is-6" style="margin-top:8px;">
+            Success rates (SR %) of different agent configurations on Automotive-ENV. Results are
+            reported across General tasks (Explicit Control, Implicit Intent) and Safety-Aware tasks
+            (Driving Alignment, Environment Alerts).
+          </figcaption>
+        </figure>
+        <!-- Figure 2: Tokens & GPS comparison -->
+        <figure class="system-figure has-text-centered" style="margin-top:18px;">
+          <img src="./static/images/task_and_check.jpg" alt="Token length distributions and task-wise performance with vs. without GPS">
+          <figcaption class="subtitle is-6" style="margin-top:8px;">
+            Comparison of inference tokens with and without GPS information. Left: distribution of
+            token lengths. Right: task-wise performance across hotspot categories.
+          </figcaption>
+        </figure>
+      </div>
+    </div>
+  </div>
+</section>
 <!-- BibTeX -->
 <section class="section" id="BibTeX">
   <div class="container is-max-desktop content">