madmax3366 commited on
Commit
7b60214
·
verified ·
1 Parent(s): 8697998

Update index.html

Browse files
Files changed (1) hide show
  1. index.html +161 -28
index.html CHANGED
@@ -1,29 +1,162 @@
1
- <!doctype html>
2
- <html>
3
- <head>
4
- <meta charset="utf-8" />
5
- <meta name="viewport" content="width=device-width,initial-scale=1" />
6
- <title>Demo</title>
7
- <style>
8
- html, body { height: 100%; margin: 0; background: #0b0b0b; }
9
- .wrap { height: 100%; display: grid; place-items: center; }
10
- video { max-width: 100%; max-height: 100vh; outline: none; }
11
- h1 { position: absolute; top: 12px; left: 16px; margin: 0; font: 600 16px/1.2 system-ui, sans-serif; color: #cfcfcf; opacity: .7; }
12
- </style>
13
- </head>
14
- <body>
15
- <h1>Demo</h1>
16
- <div class="wrap">
17
- <video
18
- src="demo.mp4"
19
- autoplay
20
- muted
21
- loop
22
- playsinline
23
- controls
24
- preload="metadata">
25
- Your browser does not support the video tag.
26
- </video>
27
- </div>
28
- </body>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
29
  </html>
 
1
+ <!DOCTYPE html>
2
+ <html lang="en">
3
+ <head>
4
+ <meta charset="utf-8" />
5
+ <meta name="viewport" content="width=device-width, initial-scale=1" />
6
+ <title>AUTOMOTIVE-ENV: Benchmarking Multimodal Agents in Vehicle Interface Systems</title>
7
+ <meta name="description" content="AUTOMOTIVE-ENV: Benchmarking Multimodal Agents in Vehicle Interface Systems" />
8
+ <link rel="preconnect" href="https://fonts.googleapis.com">
9
+ <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
10
+ <link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600;700&display=swap" rel="stylesheet">
11
+ <style>
12
+ :root {
13
+ --bg: #ffffff;
14
+ --fg: #0a0a0a;
15
+ --muted: #555;
16
+ --border: #e6e6e6;
17
+ --accent: #111;
18
+ --maxw: 960px;
19
+ --radius: 14px;
20
+ --shadow: 0 1px 2px rgba(0,0,0,0.05), 0 6px 20px rgba(0,0,0,0.06);
21
+ }
22
+ * { box-sizing: border-box; }
23
+ html, body { margin: 0; padding: 0; background: var(--bg); color: var(--fg); font-family: Inter, system-ui, -apple-system, Segoe UI, Roboto, Helvetica, Arial, "Apple Color Emoji", "Segoe UI Emoji"; }
24
+ a { color: var(--accent); text-decoration: none; border-bottom: 1px solid rgba(0,0,0,0.1); }
25
+ a:hover { border-bottom-color: rgba(0,0,0,0.3); }
26
+ .wrap { max-width: var(--maxw); margin: 0 auto; padding: 32px 20px 80px; }
27
+ header { text-align: center; padding: 40px 0 24px; }
28
+ h1 { font-size: clamp(28px, 4.5vw, 40px); line-height: 1.15; margin: 0 0 16px; letter-spacing: -0.02em; }
29
+ .lead { color: var(--muted); margin: 8px auto 18px; font-size: clamp(16px, 2vw, 18px); max-width: 840px; }
30
+ .authors, .affils { margin: 10px auto 0; color: var(--muted); font-size: 15px; }
31
+ .authors a { border-bottom: 1px dashed rgba(0,0,0,0.2); }
32
+ .badgebar { display: inline-flex; gap: 10px; margin-top: 18px; }
33
+ .badge { display: inline-block; font-size: 14px; padding: 8px 12px; border: 1px solid var(--border); border-radius: 999px; box-shadow: var(--shadow); background: #fff; }
34
+ .section { margin: 30px 0; }
35
+ .card { border: 1px solid var(--border); border-radius: var(--radius); box-shadow: var(--shadow); background: #fff; padding: 20px; }
36
+ .card h2 { margin: 0 0 12px; font-size: 22px; }
37
+ .video { overflow: hidden; }
38
+ .video video, .video iframe { width: 100%; height: auto; display: block; border-radius: 12px; }
39
+ .grid { display: grid; grid-template-columns: 1fr; gap: 16px; }
40
+ @media (min-width: 900px) { .grid.two { grid-template-columns: 1fr 1fr; } }
41
+ footer { margin-top: 40px; padding-top: 20px; border-top: 1px solid var(--border); color: var(--muted); text-align: center; font-size: 14px; }
42
+ .mono { font-family: ui-monospace, SFMono-Regular, Menlo, Monaco, Consolas, "Liberation Mono", "Courier New", monospace; font-size: 13px; white-space: pre-wrap; word-break: break-word; background: #fafafa; border: 1px solid var(--border); border-radius: 8px; padding: 12px; }
43
+ </style>
44
+ </head>
45
+ <body>
46
+ <div class="wrap">
47
+ <!-- header -->
48
+ <header>
49
+ <h1>AUTOMOTIVE-ENV: BENCHMARKING MULTIMODAL AGENTS IN VEHICLE INTERFACE SYSTEMS</h1>
50
+ <div class="authors">
51
+ <strong>Junfeng Yan</strong><sup>*1</sup>, <strong>Biao Wu</strong><sup>*1</sup>, <strong>Meng Fang</strong><sup>2</sup>, <strong>Ling Chen</strong><sup>1</sup>
52
+ </div>
53
+ <div class="affils">
54
+ <sup>1</sup>Australian Artificial Intelligence Institute, Sydney, Australia &nbsp;&nbsp;|&nbsp;&nbsp; <sup>2</sup>University of Liverpool, Liverpool, United Kingdom
55
+ </div>
56
+ <p class="lead">
57
+ Multimodal agents have shown strong general GUI abilities, but in-vehicle systems impose unique constraints: limited driver attention, strict safety, and location-aware interaction. <em>Automotive-ENV</em> is a high-fidelity benchmark and interaction environment for vehicle GUIs with 185 parameterized tasks and reproducible checks. We further propose <em>ASURADA</em>, a geo-aware agent that leverages GPS context for safer decisions.
58
+ </p>
59
+ <div class="badgebar">
60
+ <a class="badge" href="https://arxiv.org/abs/2509.21143" target="_blank" rel="noopener">Paper</a>
61
+ <a class="badge" href="#" target="_blank" rel="noopener">Code: Release soon</a>
62
+ </div>
63
+ </header>
64
+
65
+ <!-- demo video -->
66
+ <section class="section">
67
+ <div class="card video" aria-label="demo video">
68
+ <!-- Place demo.mp4 at the repo root (same folder as this index.html) -->
69
+ <video src="demo.mp4" autoplay muted loop playsinline controls></video>
70
+ </div>
71
+ </section>
72
+
73
+ <!-- abstract + quick highlights -->
74
+ <section class="section grid two">
75
+ <div class="card">
76
+ <h2>Abstract</h2>
77
+ <p>
78
+ In-vehicle GUIs present distinct challenges: drivers’ limited attention, strict safety
79
+ requirements, and complex location-based interaction patterns. We introduce
80
+ <strong>Automotive-ENV</strong>, the first high-fidelity benchmark and interaction
81
+ environment tailored for vehicle GUIs. The platform defines <strong>185 parameterized tasks</strong>
82
+ spanning explicit control, implicit intent, and safety-aware tasks, and provides structured
83
+ multimodal observations with precise programmatic checks for reproducible evaluation.
84
+ </p>
85
+ <p>
86
+ Building on this benchmark, we propose <strong>ASURADA</strong>, a geo-aware multimodal agent that
87
+ integrates GPS-informed context to adapt actions by location, environment, and regional norms.
88
+ Experiments show geo-awareness significantly improves safety-aware task success. We will release
89
+ Automotive-ENV, with tasks and tooling, to advance safe and adaptive in-vehicle agents.
90
+ </p>
91
+ </div>
92
+ <div class="card">
93
+ <h2>Highlights</h2>
94
+ <ul>
95
+ <li>High-fidelity vehicle GUI environment with reproducible checks.</li>
96
+ <li>185 parameterized tasks across control, intent, and safety categories.</li>
97
+ <li>Structured multimodal observations and programmatic success criteria.</li>
98
+ <li>ASURADA: GPS/geo-aware planning boosts safety-aware task success.</li>
99
+ </ul>
100
+ </div>
101
+ </section>
102
+
103
+ <!-- tasks placeholder (you can expand later) -->
104
+ <section class="section">
105
+ <div class="card">
106
+ <h2>Tasks (preview)</h2>
107
+ <p>
108
+ This section is reserved for a compact task overview similar to os-world:
109
+ categories, difficulty tiers, and a few illustrative examples with thumbnails or short clips.
110
+ </p>
111
+ <div class="grid two">
112
+ <div>
113
+ <h3>Explicit Control</h3>
114
+ <ul>
115
+ <li>Climate, media, navigation, connectivity</li>
116
+ <li>Deterministic UI manipulations with constraints</li>
117
+ </ul>
118
+ </div>
119
+ <div>
120
+ <h3>Implicit Intent</h3>
121
+ <ul>
122
+ <li>Goal inference from short user context</li>
123
+ <li>Minimal UI steps with preference awareness</li>
124
+ </ul>
125
+ </div>
126
+ <div>
127
+ <h3>Safety-Aware</h3>
128
+ <ul>
129
+ <li>Sensor + context classification (danger vs. do-nothing)</li>
130
+ <li>Strict action gating and escalation logic</li>
131
+ </ul>
132
+ </div>
133
+ <div>
134
+ <h3>Evaluation</h3>
135
+ <ul>
136
+ <li>Programmatic checks, success/failure traces</li>
137
+ <li>Generalization splits and ablations</li>
138
+ </ul>
139
+ </div>
140
+ </div>
141
+ </div>
142
+ </section>
143
+
144
+ <!-- bibtex -->
145
+ <section class="section">
146
+ <div class="card">
147
+ <h2>Citation</h2>
148
+ <pre class="mono">@article{yan2025automotive_env,
149
+ title = {AUTOMOTIVE-ENV: Benchmarking Multimodal Agents in Vehicle Interface Systems},
150
+ author = {Yan, Junfeng and Wu, Biao and Fang, Meng and Chen, Ling},
151
+ journal = {arXiv preprint arXiv:2509.21143},
152
+ year = {2025}
153
+ }</pre>
154
+ </div>
155
+ </section>
156
+
157
+ <footer>
158
+ © 2025 automotive-env • hosted on GitHub Pages
159
+ </footer>
160
+ </div>
161
+ </body>
162
  </html>