Update README.md
Browse files
README.md
CHANGED
|
@@ -46,11 +46,15 @@ model-index:
|
|
| 46 |
type: code-generation
|
| 47 |
---
|
| 48 |
|
| 49 |
-
#
|
| 50 |
|
| 51 |
-
The
|
| 52 |
|
| 53 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 54 |
|
| 55 |
<style>
|
| 56 |
body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
|
|
@@ -67,6 +71,14 @@ body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
|
|
| 67 |
.month-box { z-index: 2; background: white; }
|
| 68 |
</style>
|
| 69 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 70 |
<div id="snake-timeline-container" style="max-width: 1000px; margin: 20px auto; padding: 20px; background: white; border-radius: 12px; box-shadow: 0 2px 10px rgba(0,0,0,0.1);">
|
| 71 |
Loading timeline...
|
| 72 |
</div>
|
|
@@ -240,6 +252,35 @@ function createSnakeTimeline() {
|
|
| 240 |
if (container) {
|
| 241 |
container.innerHTML = html;
|
| 242 |
}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 243 |
}
|
| 244 |
|
| 245 |
// Run with multiple fallbacks
|
|
@@ -251,32 +292,3 @@ if (document.readyState === 'loading') {
|
|
| 251 |
createSnakeTimeline();
|
| 252 |
}
|
| 253 |
</script>
|
| 254 |
-
|
| 255 |
-
### π οΈ Tooling & Infrastructure
|
| 256 |
-
|
| 257 |
-
The foundation of ML research is tooling and infrastructure and we are working on a range of tools such as [datatrove](www.github.com/huggingface/datatrove), [nanotron](www.github.com/huggingface/nanotron), [TRL](www.github.com/huggingface/trl), [LeRobot](www.github.com/huggingface/lerobot), and [lighteval](www.github.com/huggingface/lighteval).
|
| 258 |
-
|
| 259 |
-
### π Datasets
|
| 260 |
-
|
| 261 |
-
High quality datasets are the powerhouse of LLMs and require special care and skills to build. We focus on building high-quality datasets such as [no-robots](https://huggingface.co/datasets/HuggingFaceH4/no_robots), [FineWeb](https://huggingface.co/datasets/HuggingFaceFW/fineweb), [The Stack](https://huggingface.co/datasets/bigcode/the-stack-v2), and [FineVideo](https://huggingface.co/datasets/HuggingFaceFV/finevideo).
|
| 262 |
-
|
| 263 |
-
### π€ Open Models
|
| 264 |
-
|
| 265 |
-
The datatsets and training recipes of most state-of-the-art models are not released. We build cutting-edge models and release the full training pipeline as well fostering more innovation and reproducibility, such as [Zephyr](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta), [StarCoder2](https://huggingface.co/bigcode/starcoder2-15b), or [SmolLM2](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct).
|
| 266 |
-
|
| 267 |
-
### πΈ Collaborations
|
| 268 |
-
|
| 269 |
-
Research and collaboration go hand in hand. That's why we like to organize and participate in large open collaborations such as [BigScience](https://bigscience.huggingface.co) and [BigCode](https://www.bigcode-project.org), as well as lots of smaller partnerships such as [Leaderboards on the Hub](https://huggingface.co/blog?tag=leaderboard).
|
| 270 |
-
|
| 271 |
-
### βοΈ Infrastructre
|
| 272 |
-
|
| 273 |
-
The research team is organized in small teams with typically <4 people and the science cluster consists of 96 x 8xH100 nodes as well as an auto-scalable CPU cluster for dataset processing. In this setup, even a small research team can build and push out impactful artifacts.
|
| 274 |
-
|
| 275 |
-
### π Educational material
|
| 276 |
-
|
| 277 |
-
Besides writing tech reports of research projects we also like to write more educational content to help newcomers get started to the field or practitioners. We built for example the [alignment handbook](https://github.com/huggingface/alignment-handbook), the [evaluation guidebook](https://github.com/huggingface/evaluation-guidebook), the [pretraining tutorial](https://www.youtube.com/watch?v=2-SPH9hIKT8), or the [FineWeb blog](https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1).
|
| 278 |
-
|
| 279 |
-
### π€ Join us!
|
| 280 |
-
|
| 281 |
-
We are actively hiring for both full-time and internships. Check out [hf.co/jobs](https://hf.co/jobs)
|
| 282 |
-
|
|
|
|
| 46 |
type: code-generation
|
| 47 |
---
|
| 48 |
|
| 49 |
+
# Science Timeline
|
| 50 |
|
| 51 |
+
The org card can't display JS HTML embedded in the markdown. So the workaround is to render it in a model card (here) and export the rendered HTML. Steps:
|
| 52 |
|
| 53 |
+
- Edit the `const releases` section in embedded HTML here with the new release data
|
| 54 |
+
- Export the static HTML with the buttons below
|
| 55 |
+
- If you are nice, prettify the resulting HTML
|
| 56 |
+
- Copy-paste the result into the Org [README](https://huggingface.co/spaces/science/README/blob/main/README.md)
|
| 57 |
+
- Tada!
|
| 58 |
|
| 59 |
<style>
|
| 60 |
body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
|
|
|
|
| 71 |
.month-box { z-index: 2; background: white; }
|
| 72 |
</style>
|
| 73 |
|
| 74 |
+
<!-- ADD THESE BUTTONS WHEREVER YOU WANT THEM -->
|
| 75 |
+
<div style="margin: 20px 0;">
|
| 76 |
+
<button onclick="exportStaticHTML()" style="background: #007bff; color: white; border: none; padding: 10px 20px; border-radius: 6px; cursor: pointer; margin-right: 10px;">π Export Static HTML</button>
|
| 77 |
+
<button onclick="copyToClipboard()" style="background: #28a745; color: white; border: none; padding: 10px 20px; border-radius: 6px; cursor: pointer;">π Copy to Clipboard</button>
|
| 78 |
+
</div>
|
| 79 |
+
|
| 80 |
+
<div id="static-output" style="display: none;"></div>
|
| 81 |
+
|
| 82 |
<div id="snake-timeline-container" style="max-width: 1000px; margin: 20px auto; padding: 20px; background: white; border-radius: 12px; box-shadow: 0 2px 10px rgba(0,0,0,0.1);">
|
| 83 |
Loading timeline...
|
| 84 |
</div>
|
|
|
|
| 252 |
if (container) {
|
| 253 |
container.innerHTML = html;
|
| 254 |
}
|
| 255 |
+
return html;
|
| 256 |
+
}
|
| 257 |
+
|
| 258 |
+
// Export functions - ADD THESE TO YOUR EXISTING SCRIPT
|
| 259 |
+
function exportStaticHTML() {
|
| 260 |
+
const staticHTML = createSnakeTimeline();
|
| 261 |
+
const output = document.getElementById('static-output');
|
| 262 |
+
if (!output) {
|
| 263 |
+
// Create output div if it doesn't exist
|
| 264 |
+
const outputDiv = document.createElement('div');
|
| 265 |
+
outputDiv.id = 'static-output';
|
| 266 |
+
outputDiv.style.cssText = 'background: #f8f9fa; border: 1px solid #ddd; padding: 15px; border-radius: 6px; font-family: monospace; font-size: 12px; max-height: 300px; overflow-y: auto; white-space: pre-wrap; word-break: break-all; margin-top: 15px;';
|
| 267 |
+
document.body.appendChild(outputDiv);
|
| 268 |
+
}
|
| 269 |
+
output.style.display = 'block';
|
| 270 |
+
output.textContent = `<div id="snake-timeline-container" style="max-width: 1000px; margin: 20px auto; padding: 20px; background: white; border-radius: 12px; box-shadow: 0 2px 10px rgba(0,0,0,0.1);">
|
| 271 |
+
${staticHTML}
|
| 272 |
+
</div>`;
|
| 273 |
+
}
|
| 274 |
+
|
| 275 |
+
function copyToClipboard() {
|
| 276 |
+
const output = document.getElementById('static-output');
|
| 277 |
+
if (output && output.textContent) {
|
| 278 |
+
navigator.clipboard.writeText(output.textContent).then(() => {
|
| 279 |
+
alert('β
Static HTML copied to clipboard!');
|
| 280 |
+
});
|
| 281 |
+
} else {
|
| 282 |
+
alert('β οΈ Please export first!');
|
| 283 |
+
}
|
| 284 |
}
|
| 285 |
|
| 286 |
// Run with multiple fallbacks
|
|
|
|
| 292 |
createSnakeTimeline();
|
| 293 |
}
|
| 294 |
</script>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|