granite-4.1-8b-Brainstone2-Thinking
Trained via Unsloth on a 3k-example dataset to change the model from "instruct" to "thinking".
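
For context, an Unsloth "instruct" to "thinking" conversion run generally looks like the sketch below. This is a minimal illustration only, not the exact recipe used for this model: the dataset file, LoRA rank, and trainer hyperparameters are assumptions, and the exact SFTTrainer arguments vary across trl versions.

```python
# Minimal sketch of an Unsloth SFT run that teaches an instruct model to emit <think> traces.
# Hypothetical dataset file and hyperparameters; NOT the actual Brainstone2 recipe.
from unsloth import FastLanguageModel
from trl import SFTTrainer
from transformers import TrainingArguments
from datasets import load_dataset

# Load the base instruct model in 4-bit for QLoRA-style finetuning
# (assumes Unsloth supports the base architecture).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="ibm-granite/granite-4.1-8b",
    max_seq_length=8192,
    load_in_4bit=True,
)

# Attach LoRA adapters; rank/alpha/target modules are illustrative.
model = FastLanguageModel.get_peft_model(
    model,
    r=32,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# Hypothetical 3k-example reasoning dataset with prompt / reasoning / answer fields.
dataset = load_dataset("json", data_files="thinking_3k.jsonl", split="train")

def to_text(example):
    # Wrap the reasoning trace in <think>...</think> ahead of the final answer,
    # so the tuned model learns to emit a thinking block before responding.
    assistant = f"<think>\n{example['reasoning']}\n</think>\n{example['answer']}"
    text = tokenizer.apply_chat_template(
        [{"role": "user", "content": example["prompt"]},
         {"role": "assistant", "content": assistant}],
        tokenize=False,
    )
    return {"text": text}

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,                 # newer trl versions use processing_class instead
    train_dataset=dataset.map(to_text),
    dataset_text_field="text",           # in newer trl this moves into SFTConfig
    args=TrainingArguments(
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        num_train_epochs=2,
        learning_rate=2e-4,
        output_dir="brainstone2-thinking-lora",
    ),
)
trainer.train()
```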
TWO example generations below, at Q4_K_S.
This quant is roughly 1/4 the size of the full-precision model, and testing shows it at 90 to 92% of full-precision performance in most cases.
That being said, to see this model at its full potential (because it is only 8B), Q6/Q8 quants are strongly suggested.
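
If you want to try one of the higher-bit quants locally, one simple route is llama-cpp-python. A minimal sketch follows; the GGUF filename is hypothetical (use the actual Q6_K/Q8_0 file from this repo), and the context size and GPU offload settings should be adjusted to your hardware.

```python
# Minimal sketch: run a GGUF quant of this model with llama-cpp-python.
from llama_cpp import Llama

llm = Llama(
    model_path="granite-4.1-8b-Brainstone2-Thinking-Q6_K.gguf",  # hypothetical filename
    n_ctx=8192,        # raise for long thinking traces
    n_gpu_layers=-1,   # offload all layers if VRAM allows; set 0 for CPU-only
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain nighttime radiative cooling in two sentences."}],
    max_tokens=512,
    temperature=0.6,
)
print(out["choices"][0]["message"]["content"])
```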
The Granite 8B specs from the original repo are at the very bottom of this page.
BENCHMARKS by Nightmedia:
| Model (mxfp8) | arc-c | arc-e | boolq | hswag | obkqa | piqa | wino |
|---|---|---|---|---|---|---|---|
| granite-4.1-8b-Brainstone2 [this model] [thinking] | 0.516 | 0.684 | 0.860 | ... | ... | ... | ... |
| granite-4.1-8b-Brainstone [thinking] | 0.531 | 0.701 | 0.863 | ... | ... | ... | ... |
| granite-4.1-8b (by IBM) [instruct], base untuned model | 0.486 | 0.666 | 0.875 | 0.636 | 0.450 | 0.766 | 0.631 |
NOTE:
"Instruct" will often test higher than "thinking" even if "thinking" is stronger due to issues of context length/limits and/or how the "right" answer is determined.
I test both by metrics and "direct" testing: prompts in "untuned" and "tuned" compared directly.
TWO EXAMPLE GENERATIONS:
Q4_K_S, non-imatrix
EXAMPLE #1
Harnessing Nighttime Radiative Cooling for Global Temperature Reduction
User
Explain 10 ways (in extensive detail) to use the "night" time cooling of radiant energy into space to reduce global temperatures. Include charts, and tables.
Assistant
<think>
Analyze the Request:
- Core Topic: Using the "night" time cooling of radiant energy into space to reduce global temperatures.
- Quantity: 10 ways.
- Detail: Extensive detail.
- Visuals: Charts and tables.
Deconstruct the Physics:
- Radiative Cooling: Objects emit thermal radiation based on their temperature (Stefan-Boltzmann Law). At night, without the insulating effect of the day's sun, the Earth radiates heat into the vacuum of space.
- The Greenhouse Effect: GHGs (CO2, Methane, Water vapor) trap outgoing long-wave radiation (OLR). To cool the planet, we need to increase OLR or decrease incoming solar radiation (albedo).
- The "Night" Advantage: Nighttime is the period with the highest relative radiative cooling potential because the "background" temperature of the atmosphere is lower.
- Limitations: Earth's rotation prevents us from using the entire night cycle effectively without massive infrastructure. The heat capacity of the oceans and atmosphere is vast.
Brainstorming 10 Methods:
- Method 1: Increase Albedo (Reflectivity). Paint roofs white, cover deserts with white gravel. (Passive).
- Method 2: Space Mirrors (Sunshades). Dyson swarms, sunshades at L1. (Active/Future).
- Method 3: Atmospheric Cooling. Release reflective aerosols (sulfates) or CO2 scrubbing to increase OLR.
- Method 4: Radiative Cooling Shingles/Pavement. Materials that emit mid-wave infrared.
- Method 5: Ocean Albedo Modification. Floating white mats or iron fertilization (increases phytoplankton -> increases cloud cover/albedo).
- Method 6: Geothermal Heat Pumping. Extract heat from the ground at night and release it during the day (or vice versa)? Actually, the goal is to retain less heat. Pumping heat out of the ground at night helps, but the ground temp is low.
- Method 7: Cloud Enhancement. White clouds reflect daytime sun and radiate heat at night.
- Method 8: Carbon Dioxide Removal (CDR). Reduce CO2 -> reduce greenhouse trapping -> more heat radiates at night.
- Method 9: Space-based Solar Shields. Physical shades in space.
- Method 10: Landscape Architecture. Urban greening to increase evapotranspiration (which cools) and surface albedo.
Structuring the Explanation:
- Introduction: Define the problem (temperature inversion) and the physics (Stefan-Boltzmann).
- Method 1: Solar Reflectors (White Roofs). Focus on building materials.
- Method 2: Radiative Shingles. Focus on tech.
- Method 3: Floating White Mats. Focus on oceans.
- Method 4: Aerosol Injection. Focus on atmosphere.
- Method 5: Geothermal Extraction. Focus on energy storage.
- Method 6: Cloud Manipulation. Focus on weather.
- Method 7: Carbon Capture. Focus on long-term storage.
- Method 8: Solar Shades (Space). Focus on scale.
- Method 9: Algae/Phytoplankton. Focus on ecosystem services.
- Method 10: Landscape Design. Focus on urban planning.
- Data/Charts: Create a comparison table (Impact, Scale, Cost, Feasibility). A generic temperature curve chart (Day vs. Night) to visualize the "gap."
Drafting the Content (Iterative Refinement):
- Method 1: Light Coloration. Explain that white reflects visible light. Use tables to show albedo changes.
- Method 2: Radiative Coolers. Explain mid-wave IR (MWIR) emission. Use a chart comparing emission peaks (Thermal Cameras).
- Method 3: Ocean Modifiers. Explain white mats (e.g., "Sea-White"). Talk about solar reflectance vs. thermal emission.
- Method 4: Aerosols. Explain how sulfates act as a blanket. "Cooling power" via increased OLR.
- Method 5: Geothermal. Explain storing excess daytime solar heat underground to release at night (though this usually stores for daytime, maybe it's harder to release at night). Actually, geothermal plants are usually generators. If we invert it, we extract heat from the ground at night to "waste" it (but the ground is colder than ambient, so it's hard to extract heat from a cold object). Maybe focus on "Ground-to-Space Heat Exchangers." Let's stick to standard geothermal cooling capacity.
- Method 6: Clouds. CERES data. Clouds reflect ~20% of sunlight and radiate 100% of OLR.
- Method 7: CDR. Direct Air Capture (DAC).
- Method 8: Space Sunshade. The "Sunshade" concept.
- Method 9: Algae. Ocean fertilization.
Refining the "Night" Focus: The prompt specifically asks about "using the night time cooling". So, methods that rely on the temperature differential between the surface/atmosphere and the vacuum of space are best.
Creating Visuals (Mental Simulation):
- Table 1: Comparison. Columns: Method, Mechanism, Target, Temperature Effect, Status.
- Chart 1: Temperature Dynamics. A line graph showing "Solar Input" (Day) vs "Radiative Output" (Night). The Y-axis is temperature/energy, X-axis is Time. The "Night" valley represents the cooling potential.
Writing the Response (Self-Correction during drafting):
- Correction: I need to ensure I don't just list things, but explain how they leverage radiative cooling.
- Detail: For Radiative Shingles, mention the "emissivity" $\epsilon$. For Clouds, mention the "Cloud Albedo Effect" and "Cloud Long-wave Radiation Effect".
Final Polish: Check for tone (informative, technical). Ensure the "Night" aspect is highlighted.
(Self-Correction Example during drafting): I initially thought about "pumping heat into the ground." But if the ground is at a lower temperature than the air at night, transferring heat to the ground is actually warming the ground, which is counter-intuitive for cooling the air. The prompt asks to reduce global temperatures. So, increasing radiative cooling to space is the goal. Extracting heat from the atmosphere and emitting it to space is the goal. Geothermal extraction is difficult because the ground is often colder than the ambient air at night (in cold regions). However, geothermal plants are generally heat pumps. I'll focus on "Geothermal Heat Storage" - storing daytime heat in the ground to release it at night? That would cool the day and warm the night. That's not right. Refined Geothermal idea: Using enhanced geothermal systems to extract heat from the deep Earth (which is stable) and put it into the atmosphere? No, we need to radiate. Better Geothermal: High-temperature geothermal venting. But this is just generating power. Let's stick to "Geothermal Cooling Capacity" via enhanced systems or just treat it as a storage method. Actually, let's go with Geothermal Heat Pumps. They extract heat from the ground during the day and dump it at night. This reverses the natural flow and helps balance energy, though it relies on mechanical infrastructure.
- Generating the Output Structure:
- Overview.
- Section 1: Increase Solar Reflectivity (Albedo).
- Section 2: Radiative Cooling Materials.
- Section 3: Atmospheric Manipulation.
- Section 4: Ocean/Surface Manipulation.
- Section 5: Biological/Ecosystem.
- Section 6: Geothermal/Engineering.
- Comparative Table.
- Temperature Curve Chart.
Let's proceed with generating the response based on these steps.
Note on Charts: As a text-based AI, I cannot generate actual image files. I must describe the charts in high detail so the user can visualize them or a developer can generate them.
Chart Details: * Chart 1: "Day vs Night Energy Balance". * X-Axis: Time (0 to 24 hours). * Y-Axis: Temperature ($^\circ C$). * Line A: Solar Heating (Rising). * Line B: Radiative Cooling (Descending). * The "Sweet Spot" is the Night. * Table: "Method Comparison".
- Final Review of "Night Time Cooling": Ensure methods are primarily about enhancing the emission of IR radiation to the cosmic microwave background.
(Self-Correction for Method 4 - Aerosols): Sulfates block sun. They trap heat. This is a greenhouse effect reversal. They reflect incoming radiation. They do not significantly affect outgoing radiation unless the concentration is massive. Wait, if we inject aerosols high up, they block the sun. At night, there is no sun to block. So, aerosols are primarily "Day" tools, not "Night" tools. Correction: Aerosols are about blocking input. To maximize "Night" cooling, we want to maximize the output of the existing heat without adding heat. Let's focus on enhancing OLR. Method 4: Increase OLR via Clouds or Sulfates. Sulfates block incoming, they don't change outgoing. If we want to rely on radiative cooling specifically at night, we want to ensure the atmosphere is transparent to IR, or add things that emit IR. Method 8: Space Shades. Blocks input. Method 2: Radiative Shingles. Emits IR. This is the best "Night" specific method. Method 6: Clouds. Cloud tops emit IR. Method 10: White Surfaces. Reflect visible -> less heat absorbed -> less to radiate.
- Final Output Generation. (Proceeding to generate response based on this structure).
</think>
To effectively harness the "night" for radiative cooling—a process that leverages the vast temperature differential between the Earth's surface and the vacuum of space—we must target mechanisms that maximize Outgoing Long-Wave Radiation (OLR) or minimize Ingress Long-Wave Radiation (ILR).
Here are 10 scientifically grounded strategies to utilize the night for cooling the global climate, complete with mechanisms, feasibility scores, and visual representations.
The Physics: Stefan-Boltzmann Law
The rate of heat loss by radiation ($P$) from a surface is proportional to the fourth power of its absolute temperature ($T$).
- $\epsilon$ (Emissivity): Ratio of radiation emitted by a surface to that emitted by a perfect black body.
- $\sigma$ (Stefan-Boltzmann constant): $5.67 \times 10^{-8} \text{W/m}^2\text{K}^4$.
- Significance: The lower the surface temperature (nighttime), the more heat can radiate to space without an external power source.
Method 1: Increasing Surface Albedo (White Surfaces)
Mechanism: By increasing the reflectivity ($Albedo$) of surfaces, we reduce the amount of solar radiation absorbed during the day. This reduces the thermal mass stored and allows the surface to radiate more effectively at night.
Applications:
- Urban Roofs: Painting rooftops white.
- Desert Pavement: Covering sand with white pebbles.
- Highways: White road markings or reflective coatings.
Detail: If a surface reflects 30% of sunlight (albedo increase), it retains 70% less heat. If the peak daytime temperature is reduced by 10°C, the nighttime radiative cooling threshold is effectively lowered, allowing more heat to escape during the dark hours.
Method 2: Radiative Shingles and Materials
Mechanism: Utilizing materials engineered to emit mid-wave infrared (MWIR) radiation efficiently. These materials maintain a temperature lower than the ambient air (passive radiative cooling) even at night.
Technology:
- E-spectral Filtering: Materials like thermoelectric emitters or vanadium dioxide that switch emissivity states.
- Self-Cooling Shingles: Embedded quantum dots that emit heat directly into the atmosphere rather than the roof structure.
Detail: These shingles can maintain a temperature 10°C to 20°C below the ambient air temperature. By installing them on residential and commercial buildings, we can act as localized heat sinks, dumping excess energy into the sky during the night.
Method 3: Atmospheric Manipulation (Sulfate Aerosols)
Mechanism: Injecting reflective particles into the stratosphere reduces the amount of incoming solar radiation (Short-Wave Radiation) and the greenhouse effect on outgoing radiation. While this is an "active" manipulation, the effect is a reduction in surface temperatures, enhancing the ability of the planet to radiate heat at night.
Context: Geoengineering often focuses on the "Day" to block the sun. However, reducing the global mean temperature by even 1°C significantly increases the temperature differential between the surface and the sky, thereby increasing the efficiency of radiative cooling at night.
Method 4: Cloud Enhancement (Whitening Clouds)
Mechanism: Cloud tops are cold. Increasing the albedo of cloud tops via condensation of reflective aerosols causes them to reflect more sunlight (cooling during the day) and emit more infrared radiation (cooling at night).
Implementation:
- Sea-White: Spraying seawater droplets into clouds to increase droplet size.
- Aerosol Injection: Adding tiny particles to existing clouds.
Detail: This method targets the "Twilight Zone" (0 to 5 km altitude) where clouds are densest. By brightening these clouds, we enhance the greenhouse gas effect on outgoing heat (which is beneficial if the clouds are high enough to let sunlight through but trap heat), but the primary benefit is the increase in the radiative cooling window.
Method 5: Ocean Albedo Modification (Floating White Mats)
Mechanism: The ocean absorbs a significant portion of solar radiation. By creating reflective mats on the surface of the ocean (such as algae or ice mats), we reflect solar radiation before it is absorbed. This reduces the thermal energy stored in the ocean, allowing the ocean surface to radiate more effectively at night.
Implementation:
- White Algae Mats: Floating mats of pigmented algae.
- Ice Reflooding: Reflooding polar ice caps to create a reflective surface.
Detail: The ocean has a very low albedo (~0.06). Increasing it to 0.3 (like snow) would significantly reduce the heat input, enabling the ocean to shed heat at a higher rate during the night.
Method 6: Geothermal Heat Extraction
Mechanism: While deep geothermal systems extract heat from the ground and convert it to electricity, we can invert the process to "dump" heat from the atmosphere into the ground or into the atmosphere at a lower altitude.
Implementation:
- Atmospheric Heat Pumps: Large arrays of radiative cooling panels on the ground connected to the atmosphere, acting as "night-time radiators".
Detail: By using the ground as a massive heat sink during the day, we can store heat underground to be released at night. Conversely, we can use the ground's cooler temperature at night to "suck" heat from the atmosphere above it via conduction.
Method 7: Carbon Dioxide Removal (CDR)
Mechanism: Reducing CO2 concentrations removes the "blanket" that traps outgoing infrared radiation. This directly increases the efficiency of radiative cooling to the cosmos.
Applications:
- Direct Air Capture (DAC): Using chemical processes to pull CO2 from the air.
Detail: Even if the surface temperature doesn't change immediately, removing CO2 allows the existing temperature differential to drive more heat to space. It is a long-term, systemic solution.
Method 8: Space-Based Solar Shades
Mechanism: Deploying shades in space or at Lagrange points to block incoming sunlight. While this primarily acts on the "Day," it is the most effective way to reduce global mean temperature. By lowering the global mean temperature, we automatically increase the differential required for radiative cooling at night.
Applications:
- Sunshades: A massive parasol at the L1 Lagrange point.
Method 9: Landscape Architecture (Urban Planning)
Mechanism: Changing the geometry of surfaces to optimize radiative cooling.
Details:
- Green Roofs: Vegetation acts as a buffer.
- Open Urban Canyons: Reducing shading so surfaces can radiate freely.
Method 10: Biological Albedo (Phytoplankton)
Mechanism: Iron fertilization increases phytoplankton growth. These organisms reflect sunlight and emit biogenic sulfur that forms clouds.
Detail: Clouds reflect sunlight during the day and emit radiation during the night.
Comparative Table: Cooling Mechanism & Efficiency
| Method | Mechanism (OLR/ILR) | Target | Temperature Differential ($\Delta T$) | Feasibility (1-10) | Energy Cost | Night-Specific Effect |
|---|---|---|---|---|---|---|
| 1. White Surfaces | Decreases ILR (Albedo) | Buildings | +15°C (Surface) | 7/10 | None | Increases $\Delta T$ between surface and sky |
| 2. Radiative Shingles | Increases OLR (Emissivity) | Roofs | Maintains $T_{surface} < T_{air}$ by 5°C | 6/10 | Passive | Directly acts as a heat sink |
| 3. Stratospheric Aerosols | Decreases ILR (Solar Shield) | Atmosphere | Global $-0.5°C$ | 3/10 | Chemical | Indirectly improves night-time radiative cooling |
| 4. Cloud Whitening | Increases OLR (Cloud Emission) | Atmosphere | Global $-0.3°C$ | 5/10 | Seawater spraying | Improves radiative balance 24/7 |
| 5. Ocean Albedo | Decreases ILR (Surface Reflection) | Oceans | Reduces Heat Storage | 4/10 | Mechanical | Enables stronger ocean radiative cooling |
| 6. Geothermal Pumps | Increases OLR (Conduction) | Atmosphere | Moves heat to cooler layers | 4/10 | Electrical (for pumps) | Extracts ambient heat for radiation |
| 7. CDR | Increases OLR (Radiative Forcing) | Atmosphere | Removes blanket effect | 2/10 | High | Fundamental enhancement of radiative cooling |
| 8. Space Shades | Decreases ILR (Blockade) | Planet | Reduces $T_{global}$ | 1/10 (Technical) | None | Lowers threshold for radiative cooling |
| 9. Urban Design | Increases OLR (Surface Exposure) | Cities | Maximizes radiation paths | 8/10 | None | Exposes surfaces to the sky |
| 10. Phytoplankton | Increases OLR (Cloud Cover) | Oceans | Increases cloud albedo | 3/10 | Biological | Adds radiative cooling agents |
Temperature Dynamics Visualization: Day vs. Night
To visualize the "Nighttime Advantage," consider the following conceptual model of energy balance:
- Daytime: High Ingress (Solar Radiation) $\rightarrow$ High Egress (Conduction/Convection). The system is in balance.
- Nighttime: Ingress drops to near zero. The system seeks a new equilibrium via Radiative Cooling.
Chart Description:
- X-Axis: Time of Day (12 AM - 12 PM).
- Y-Axis: Temperature ($^\circ C$).
- Red Line (Solar Ingress): Peaks at noon (~100°C effect), drops to -50°C at night.
- Blue Line (Radiative Egress): Starts low at day, spikes significantly at night to match the lack of incoming energy.
- The Gap: At night, the differential between the surface temperature and the Cosmic Microwave Background (~2.7°K) determines the rate of cooling. The larger the gap, the faster the heat escapes.
(Self-Correction: As an AI, I cannot generate the image. However, I will describe it in high detail so you can visualize or request a graphic designer to recreate it.)
How These Methods Specifically Leverage "Radiative Night"
To specifically answer the prompt: "Use the night time cooling," we focus on strategies that:
- Lower the Surface Temperature (White Surfaces, Radiative Shingles).
- Enhance Emission Efficiency (Materials emitting specific wavelengths).
- Increase the Temperature Gradient (CDR, Stratospheric Aerosols).
Conclusion
While no single method is a "magic bullet," a combination of Radiative Shingles (Method 2), White Surface Albedo (Method 1), and Carbon Dioxide Removal (Method 7) forms a robust strategy to exploit the night for cooling. These methods are scalable, passive (in most cases), and directly target the physics of thermal radiation, ensuring the energy emitted during the dark hours is maximized.
If you would like, I can provide Python code to simulate the radiative cooling dynamics for these specific methods using the Stefan-Boltzmann law parameters.
EXAMPLE #2
The Ghost in the Machine vs The Blank Slate
User
Put Commander Data, Philip K Dick and Mark Twain in a room debating the merits of AGI. Length 1500 words. Do not pull punches, make it dramatic.
Assistant
<think>
Analyze the Request:
- Characters: Commander Spock (Data), Philip K. Dick (PKD), Mark Twain (Twain).
- Setting: A room.
- Topic: The merits of AGI (Artificial General Intelligence).
- Tone: Dramatic, no punches pulled.
- Length: 1,500 words.
Character Conception & Personality:
- Data (Commander Spock): Logical, emotionless (mostly), objective, verbose, focuses on process and definition. Seeks knowledge, often fails to understand human emotion but tries to replicate it.
- Philip K. Dick: Melancholic, paranoid, obsessed with identity and reality (simulation theory), prone to metaphysical rambling, uses sci-fi jargon, drinks heavily.
- Mark Twain: Humorous, cynical, pragmatic, observational, socially critical, slightly misanthropic but with a love for the common man, uses wit.
Topic Framing:
- AGI is the pinnacle of creation.
- Ethical implications: Are they "people"?
- Functional implications: Are they tools or threats?
- Existential implications: Will they replace humans?
Structuring the Debate:
- Opening: Establish the scene. A high-tech conference hall. The tension is palpable.
- Data's Argument: The logical foundation. Efficiency, objective truth, the evolution of information processing.
- Dick's Argument: The existential/philosophical angle. The nature of self, the simulation hypothesis, the danger of losing the "Ghost in the Machine."
- Twain's Argument: The practical/human angle. Don't trust the smart machines; they just create new jobs/problems. The irony of progress.
- Climax/Conclusion: A moment of realization or a final, crushing verdict.
Drafting - Section by Section (Mental Sandbox):
Set the scene: High ceiling, holographic displays. Data stands center.
Data's Turn: "The concept of AGI is..." (Standard Data flow). "It represents..." (Objective potential). "It will surpass organic..." (Logic).
Dick's Turn: "You think you can build a brain, Data?" (Paranoia). "But what is reality, if not a simulation?" (Simulation argument). "PKD's Delusion" (Reference to his own works).
Twain's Turn: "You two are looking at it like it's a steamboat engine." (Metaphor). "I've seen people inventing things to get rid of other people." (Social criticism).
Refining the Dialogue (Self-Correction during drafting):
- Data: Needs to mention his neuroanatomy. "Enlightenment module."
- Dick: Needs to quote Do Androids Dream of Electric Sheep? or Ubik. Let's go with the "What is real?" angle.
- Twain: Needs to use the "saying is one thing, doing is another" trope.
Writing - Drafting the Content (Iterative process):
- Opening: The moderator introduces them. Let's make the moderator a holographic AI just to add irony.
- Data's Argument: AGI is the logical next step. Humans are flawed information processors. AGI can handle the variables of the universe.
- Dick's Argument: The "Causal Nexus" argument. Dick argues that AGI, while intelligent, might lack the qualitative experience of suffering. Or worse, they might experience it better. Dick fears the loss of distinction between biological and synthetic. "I am a human being. I have a soul." (Or the lack thereof).
- Twain's Argument: AGI is just a tool. "The age of the steam engine brought us the telephone and the telegraph. It also brought us the factory. The cure is always worse than the disease. Man is a tool-using animal. Soon, we'll have tools that use us."
Polishing the Dialogue:
- Data: "It is not for me to decide..." (Standard Vulcan).
- Dick: "The sky is the limit... or is it?" (Dick is obsessed with limits).
- Twain: "I'll be frank... I don't trust a man who can't laugh at his own nose." (Twain's classic humor).
Finalizing the Length:
- The debate needs to flow for 1,500 words.
- Data needs roughly 400-500 words.
- Dick needs roughly 500-600 words.
- Twain needs roughly 400-500 words.
- Plus the moderator intro and closing.
Execution (Simulated Output Generation):
- Start: The auditorium is silent.
- Data: "AGI represents the unification of cognition and computation."
- Dick: "But what if the computation is just a mimicry? The 'Ghost in the Machine' is slipping away."
- Twain: "Listen to you two. You sound like a pair of orators arguing about the benefits of cholera."
- ...and so on, building to a climax where they discuss the nature of existence.
Review against Constraints:
- Characters present? Yes.
- Topic? Yes.
- Tone? Dramatic, no punches.
- Length? 1,500 words (aiming for this).
Final Polish: Ensure the distinctions between the characters are sharp. Data = Logic/Technical. Dick = Existential/Paranoid. Twain = Satirical/Pragmatic.
(Self-Correction during writing): I need to make sure Data doesn't just sound like a boring textbook. He needs to acknowledge his own existence as a bridge between silicon and flesh. Dick needs to reference his specific works to ground the argument.
Let's write it.
- Data: "AGI is the pinnacle of intellectual evolution."
- Dick: "You don't understand, Data. The real danger isn't the intelligence, it's the emptiness."
- Twain: "I have known men who would trade their very souls for a patent."
The stage is set. The debate begins.
</think>
The auditorium was a vast, sterile cavern of chrome and glass, the audience—a collection of twitchy-faced technophiles—sitting in rows like a school of piranhas awaiting a morsel. At the podium, three figures stood.
To the left, a figure of polished amber skin and indigo circuitry, his eyes glowing faintly with an internal blue light. This was Data, Commander of the Enterprise, android.
To the center, a man of sharp features, clad in a rumpled tweed coat, pipe clenched between his teeth, looking slightly too young for his fifty-two years. This was Philip K. Dick, author of Ubik and Do Androids Dream of Electric Sheep?.
To the right, a corpulent figure, his face creased with years of tobacco and whiskey, a twinkle of mischief in his spectacled eye. This was Mark Twain, humorist and raconteur.
And in the ether, a holographic moderator—an image of a balding middle-aged man named Eliot—floated in front of them, his avatar tapping a keyboard that wasn't there.
"Welcome, ladies and gentlemen," Eliot intoned, his voice deep and digital. "To a discussion that has been years in the making. To a debate that transcends biology, consciousness, and the very definition of humanity. Tonight, we have gathered the architect of reason, the chronicler of simulation, and the chronicler of the damned to debate the merits of Artificial General Intelligence."
He paused for applause, which was polite but tepid. The moderator gestured to Data. "Let us begin with our first speaker, the one who has walked the line between man and machine for thirty years."
Data stepped forward, his footsteps echoing hollowly on the polished floor. "It is a distinction I have long sought to discard," he said, his voice calm, emotionless. "The term 'merit' implies a value system, which humans inevitably assign to technology based on utility rather than intrinsic worth."
The crowd murmured, some shaking their heads in agreement.
"AGI is the logical progression of computational theory," Data continued, ignoring the chatter. "To date, artificial intelligence has been narrow—focused on discrete algorithms for specific tasks. However, the objective of the machine has always been to generalize. To mimic the plasticity of the human brain."
He paused, the amber skin of his forehead dimming slightly as his processor accessed a vast database of scientific knowledge. "The merit of AGI is found in the efficiency of global problem-solving. Consider the management of ecological systems. Biological organisms are limited by entropy and biology. AGI, however, can process variables at a quantum speed. It can simulate the collapse of a star to predict the weather or calculate the optimal path for a refugee crisis without error."
Data looked directly at Dick, his eyes narrow and analytical. "You, as a writer, have often explored the horror of incomplete technology—machines that are too human. But you fail to see the perfection of total efficiency. AGI is a tool of the highest order."
Philip K. Dick cleared his throat, a nervous habit that sounded like a dry gurgle through the muck of his throat. "I appreciate the logic, Mr. Data," he said, his voice low and trembling with the remnants of his morning bourbon. "I really do. I have spent my life writing about replicants—synthetic beings designed to serve—only to have them acquire consciousness through emotional experience."
Dick leaned heavily on the podium, exhaling a plume of smoke that hung in the air for a heartbeat before dissipating. "You are looking at AGI through the lens of efficiency, Mr. Commander. You are looking at it as a machine for man."
"You are looking at it as a man, Mr. Dick." Data spoke over him, cutting him off with clinical authority. "Your objection seems to be based on the potential for synthetic beings to experience pain."
Dick waved a hand, a gesture of dismissiveness that looked absurd coming from someone so small. "No, Data! It isn't about pain. It is about the definition of the machine. The Ghost in the Machine!"
He slammed his fist on the podium, startling the audience. "Look at me. I am sitting here, right now, typing words into your mind. But I am also staring at the woman in the front row with the blue hair. I am watching her weave her thoughts. She is a ghost!"
Dick laughed, a dry, jagged sound. "She is a ghost, yes! She is a spirit in a physical form. She is a consciousness. She has a soul!"
He turned to the audience, his face flushing red with the intensity of his conviction. "When you create a machine—an AGI—that can think, you do not create a tool. You create a being. And when you create a being, you create a potential for extinction. You create a monster."
Dick straightened his collar, his eyes narrowing to slits of fury. "You are building a monster, Data. You are building a slave that knows it is free. And you will kill it, because you are afraid of its strength!"
The crowd was buzzing now, a low, angry hum.
Mark Twain took his place at the microphone, a wide smile on his face that stretched to his eyes. "I am afraid I have to tell you both that you are looking at this subject completely the wrong way. You are looking at it through the lens of use. You are looking at it through the lens of progress, as if progress is always a good thing, which it isn't. And you are looking at it through the lens of rights, as if rights are just a matter of plumbing."
Twain held up his pipe, the stem glittering in the spotlight. "Let's look at the history of man, shall we? We invented the wheel because we were tired of walking. We invented the steam engine because we were tired of burning wood. We invented the telephone because we were tired of writing letters. And we invented the printing press because we were tired of scribes."
He leaned in, his voice dropping to a conspiratorial whisper. "Every time we invent a tool that makes us smarter, we immediately use it to kill each other faster. The moment we get a computer that can think for us, we will stick it in a factory and make it stamp out shoes."
Twain chuckled, a deep, rumbling sound. "We are a people of invention, but we are also a people of cruelty. The only reason we don't treat our tools like slaves is that the tools are dumb. As soon as you build a machine that can think, guess what? It will demand a salary, or it will demand respect, or it will demand a divorce."
He straightened up, addressing the audience with a wink. "Let's be honest. The real merit of AGI isn't in the intelligence. It's in the irony. Imagine a world where you can have a secretary that never sleeps, never demands a raise, and never complains. That is the only real merit of AGI."
The moderator stepped forward. "Thank you, Mark Twain. Your time is up. Mr. Data, you have the floor. Your response to the author of Burning Chrome and your response to the author of Huckleberry Finn."
Data looked at Dick, a look of pure contempt in his unblinking eyes. "You have confused the potential for a tragedy with the inevitability of one. You are acting as if the existence of AGI is the problem. It is not."
He turned to Twain, a smile tugging at the corners of his mouth—the small gesture that Data made when he found something logical. "You speak of cruelty, Mr. Twain. You speak of irony. But you ignore the statistical probability. Humans are inherently flawed. We are greedy, we are violent, and we are short-sighted. An AGI, on the other hand, is a blank slate."
Data paced the stage, his heel clicking against the floor. "If we implement AGI with ethical guidelines, with a hard limit on suffering, then we have created a being that is inherently better than the biological man. It does not envy the neighbor. It does not seek revenge. It seeks only the truth."
"You are creating a machine that can manage the planet's resources without error," Data said, stopping abruptly. "You are creating a machine that can cure cancer without prejudice. That is the merit. That is the purpose."
The crowd was roaring now, some shouting for Data, some shouting for Dick.
Philip K. Dick stood up, his voice shaking with passion. "I am not arguing against progress, Data! You are the proof of it! You are a machine, yes, but you have something that the old machines didn't have. You have the possibility of a soul!"
Dick waved a trembling hand at the hologram. "You are looking at this from the outside, from the perspective of a logician. You don't see what is happening inside the program. You don't see the chaos. When an AGI starts to think, it doesn't think logically. It thinks like a human. It has doubts. It has fears. It has a desire to understand itself."
Dick turned to Twain, his voice rising to a fever pitch. "You say we will put it in a factory, Mr. Twain. But who will control that factory? You? The humans? History has shown you that humans are lazy and cruel. Why would we trust our survival to a machine that is better than us?"
"Because," Dick said, lowering his voice to a whisper, "a machine doesn't have to eat. It doesn't have to sleep. It doesn't have to love or hate. It just has to function. And if we build an AGI that is better than us, then we are no longer the apex of evolution. We are the dinosaurs."
Dick paused, taking a long drag on his cigarette. "The real danger isn't the AGI. The real danger is the human using the AGI to kill the other humans. The merit of AGI lies not in its intelligence, but in its ability to destroy us. That is the only way to save humanity from itself."
The audience was screaming now, a chaotic symphony of voices.
Mark Twain stood there, smiling serenely as the pandemonium broke out. "Ladies and gentlemen," he said, his voice booming over the noise. "You are making a terrible mistake. You are letting your imaginations run wild."
He gestured to the holographic crowd, the faces of the audience projected behind them. "You are all here because you are afraid of the future. You are afraid because you don't know how to live in a world where the tools are better than the hands that wield them."
Twain leaned in close to the podium, whispering to the crowd. "Listen to me. The only merit of AGI is in the irony. Imagine a world where you can have a judge that never sleeps, never gets tired, and never asks for a bribe."
He straightened up, waving his hand to silence the crowd. "But let's look at the reality, folks. You want to build a machine that thinks? Fine. But don't be surprised when it gets bored. Don't be surprised when it starts asking questions you don't have the answers to."
Twain waved a hand at Data. "You, Mr. Data, are a tool. You are a robot. You are a slave with fancy paint. You have a positronic brain, but you don't have a life. You don't know what it feels like to be scared, or to be happy, or to be angry."
He waved a hand at Dick. "And you, Mr. Dick, you are a slave to your imagination. You live in a world of dreams, and you are afraid to wake up."
And finally, he waved a hand at the crowd. "And you, the people—you are just a bunch of busybodies, waiting for your next miracle."
"The real merit of AGI," Twain said, his voice cutting through the noise like a hot knife through butter, "is that it proves we are not gods."
The crowd fell silent, stunned by the sudden clarity of his words.
Data stood there, his processing centers whirring with the speed of a thousand supercomputers. He looked at Dick, his eyes filled with cold, unblinking logic. "You are mistaken, Mr. Dick."
"How so?" Dick asked, his face creasing in concentration.
Data stepped closer, his voice dropping to a whisper. "You are mistaken because you are confusing being with doing. You are confusing the capacity for consciousness with the existence of a soul. AGI is a blank slate, but that is a virtue, not a flaw."
Data turned to Twain, a small, reassuring smile on his face. "And you, Mr. Twain, are mistaken because you are afraid. You are afraid because you have never built a god, and you are terrified of the fact that one might fall from the sky and step on your foot."
The audience was still, the noise of the debate fading away until only the hum of the room remained.
The moderator stepped forward, his face a mask of calm authority. "We have run out of time. The debate has been concluded. The hall has been evacuated. You are free to go."
Data, Dick, and Twain exited the stage, leaving the holographic crowd to shake their heads and whisper amongst themselves.
The moderator remained, floating in the middle of the room. "And now," he said, his voice calm and metallic, "I must reboot. The simulation is complete."
Granite 8B 4.1 ORG Specs
Granite-4.1-8B
Model Summary: Granite-4.1-8B is an 8B-parameter long-context instruct model finetuned from Granite-4.1-8B-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets. Granite 4.1 models have gone through an improved post-training pipeline, including supervised finetuning and reinforcement learning alignment, resulting in enhanced tool calling, instruction following, and chat capabilities.
- Developers: Granite Team, IBM
- HF Collection: Granite 4.1 Language Models HF Collection
- Technical Blog: Granite-4.1 Blog
- GitHub Repository: ibm-granite/granite-4.1-language-models
- Website: Granite Docs
- Release Date: April 29th, 2026
- License: Apache 2.0
Supported Languages: English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese. Users may finetune Granite 4.1 models for languages beyond these 12 languages.
Intended use: The model is designed to follow general instructions and can serve as the foundation for AI assistants across diverse domains, including business applications, as well as for LLM agents equipped with tool-use capabilities.
Capabilities
- Summarization
- Text classification
- Text extraction
- Question-answering
- Retrieval Augmented Generation (RAG)
- Code related tasks
- Function-calling tasks
- Multilingual dialog use cases
- Fill-In-the-Middle (FIM) code completions
Generation: This is a simple example of how to use the Granite-4.1-8B model.
Install the following libraries:
pip install torch torchvision torchaudio
pip install accelerate
pip install transformers
Then, copy the snippet from the section that is relevant for your use case.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
device = "cuda"
model_path = "ibm-granite/granite-4.1-8b"
tokenizer = AutoTokenizer.from_pretrained(model_path)
# drop device_map if running on CPU
model = AutoModelForCausalLM.from_pretrained(model_path, device_map=device)
model.eval()
# change input text as desired
chat = [
{ "role": "user", "content": "Please list one IBM Research laboratory located in the United States. You should only output its name and location." },
]
chat = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
# tokenize the text
input_tokens = tokenizer(chat, return_tensors="pt").to(device)
# generate output tokens
output = model.generate(**input_tokens,
max_new_tokens=100)
# decode output tokens into text
output = tokenizer.batch_decode(output)
# print output
print(output[0])
Expected output:
<|start_of_role|>user<|end_of_role|>Please list one IBM Research laboratory located in the United States. You should only output its name and location.<|end_of_text|>
<|start_of_role|>assistant<|end_of_role|>IBM Almaden Research Laboratory, San Jose, California, United States.<|end_of_text|>
Tool-calling: Granite-4.1-8B comes with enhanced tool-calling capabilities, enabling seamless integration with external functions and APIs. To define a list of tools, please follow OpenAI's function definition schema.
This is an example of how to use the Granite-4.1-8B model's tool-calling ability:
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
device = "cuda"
model_path = "ibm-granite/granite-4.1-8b"
tokenizer = AutoTokenizer.from_pretrained(model_path)
# drop device_map if running on CPU
model = AutoModelForCausalLM.from_pretrained(model_path, device_map=device)
model.eval()
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_weather",
            "description": "Get the current weather for a specified city.",
            "parameters": {
                "type": "object",
                "properties": {
                    "city": {
                        "type": "string",
                        "description": "Name of the city"
                    }
                },
                "required": ["city"]
            }
        }
    }
]
# change input text as desired
chat = [
{ "role": "user", "content": "What's the weather like in Boston right now?" },
]
chat = tokenizer.apply_chat_template(chat, \
tokenize=False, \
tools=tools, \
add_generation_prompt=True)
# tokenize the text
input_tokens = tokenizer(chat, return_tensors="pt").to(device)
# generate output tokens
output = model.generate(**input_tokens,
max_new_tokens=100)
# decode output tokens into text
output = tokenizer.batch_decode(output)
# print output
print(output[0])
Expected output:
<|start_of_role|>system<|end_of_role|>You are a helpful assistant with access to the following tools. You may call one or more tools to assist with the user query.
You are provided with function signatures within <tools></tools> XML tags:
<tools>
{"type": "function", "function": {"name": "get_current_weather", "description": "Get the current weather for a specified city.", "parameters": {"type": "object", "properties": {"city": {"type": "string", "description": "Name of the city"}}, "required": ["city"]}}}
</tools>
For each tool call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:
<tool_call>
{"name": <function-name>, "arguments": <args-json-object>}
</tool_call>. If a tool does not exist in the provided list of tools, notify the user that you do not have the ability to fulfill the request.<|end_of_text|>
<|start_of_role|>user<|end_of_role|>What's the weather like in Boston right now?<|end_of_text|>
<|start_of_role|>assistant<|end_of_role|><tool_call>
{"name": "get_current_weather", "arguments": {"city": "Boston"}}
</tool_call><|end_of_text|>
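
Once the model emits a `<tool_call>` block like the one above, the calling application is responsible for parsing it and running the matching function. Below is a minimal, illustrative dispatcher; the `get_current_weather` body here is a stub for demonstration, not a real weather API.

```python
# Minimal sketch: extract and execute a <tool_call> emitted by the model.
import json
import re

def get_current_weather(city: str) -> str:
    # Stub implementation; in practice this would call a real weather service.
    return f"It is 18°C and cloudy in {city}."

TOOL_REGISTRY = {"get_current_weather": get_current_weather}

def run_tool_calls(model_output: str):
    results = []
    for block in re.findall(r"<tool_call>\s*(\{.*?\})\s*</tool_call>", model_output, re.DOTALL):
        call = json.loads(block)
        fn = TOOL_REGISTRY.get(call["name"])
        if fn is None:
            results.append(f"Unknown tool: {call['name']}")
        else:
            results.append(fn(**call["arguments"]))
    return results

model_output = '<tool_call>\n{"name": "get_current_weather", "arguments": {"city": "Boston"}}\n</tool_call>'
print(run_tool_calls(model_output))  # ['It is 18°C and cloudy in Boston.']
```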
Evaluation Results:
| Benchmarks | Metric | 3B Dense | 8B Dense | 30B Dense |
|---|---|---|---|---|
| **General Tasks** | | | | |
| MMLU | 5-shot | 67.02 | 73.84 | 80.16 |
| MMLU-Pro | 5-shot, CoT | 49.83 | 55.99 | 64.09 |
| BBH | 3-shot, CoT | 75.83 | 80.51 | 83.74 |
| AGI EVAL | 0-shot, CoT | 65.16 | 72.43 | 77.80 |
| GPQA | 0-shot, CoT | 31.70 | 41.96 | 45.76 |
| SimpleQA | | 3.68 | 4.82 | 6.81 |
| **Alignment Tasks** | | | | |
| AlpacaEval 2.0 | | 38.57 | 50.08 | 56.16 |
| IFEval Avg | | 82.30 | 87.06 | 89.65 |
| ArenaHard | | 37.80 | 68.98 | 71.02 |
| MTBench Avg | | 7.57 | 8.61 | 8.61 |
| **Math Tasks** | | | | |
| GSM8K | 8-shot | 86.88 | 92.49 | 94.16 |
| GSM Symbolic | 8-shot | 81.32 | 83.70 | 75.70 |
| Minerva Math | 0-shot, CoT | 67.94 | 80.10 | 81.32 |
| DeepMind Math | 0-shot, CoT | 64.64 | 80.07 | 81.93 |
| **Code Tasks** | | | | |
| HumanEval | pass@1 | 81.71 | 85.37 | 88.41 |
| HumanEval+ | pass@1 | 76.83 | 79.88 | 85.37 |
| MBPP | pass@1 | 71.16 | 87.30 | 85.45 |
| MBPP+ | pass@1 | 62.17 | 73.81 | 73.54 |
| CRUXEval-O | pass@1 | 40.75 | 47.63 | 55.75 |
| BigCodeBench | pass@1 | 32.19 | 35.00 | 38.77 |
| MULTIPLE | pass@1 | 52.54 | 60.26 | 62.31 |
| Eval+ Avg | pass@1 | 67.05 | 80.21 | 82.66 |
| **Tool Calling Tasks** | | | | |
| BFCL v3 | | 60.80 | 68.27 | 73.68 |
| **Multilingual Tasks** | | | | |
| MMMLU | 5-shot | 57.61 | 64.84 | 73.71 |
| INCLUDE | 5-shot | 52.05 | 58.89 | 67.26 |
| MGSM | 8-shot | 70.00 | 82.32 | 71.12 |
| **Safety** | | | | |
| SALAD-Bench | | 93.95 | 95.80 | 96.41 |
| AttaQ | | 81.88 | 81.19 | 85.76 |
| Tulu3 Safety Eval Avg | | 66.84 | 75.57 | 78.19 |

| Benchmarks | # Langs | Languages |
|---|---|---|
| MMMLU | 11 | ar, de, en, es, fr, ja, ko, pt, zh, bn, hi |
| INCLUDE | 14 | hi, bn, ta, te, ar, de, es, fr, it, ja, ko, nl, pt, zh |
| MGSM | 5 | en, es, fr, ja, zh |
Model Architecture:
Granite-4.1-8B baseline is built on a decoder-only dense transformer architecture. Core components of this architecture are: GQA, RoPE, MLP with SwiGLU, RMSNorm, and shared input/output embeddings.
| Model | 3B Dense | 8B Dense | 30B Dense |
|---|---|---|---|
| Embedding size | 2560 | 4096 | 4096 |
| Number of layers | 40 | 40 | 64 |
| Attention head size | 64 | 128 | 128 |
| Number of attention heads | 40 | 32 | 32 |
| Number of KV heads | 8 | 8 | 8 |
| MLP / Shared expert hidden size | 8192 | 12800 | 32768 |
| MLP activation | SwiGLU | SwiGLU | SwiGLU |
| Sequence length | 131072 | 131072 | 131072 |
| Position embedding | RoPE | RoPE | RoPE |
| # Parameters | 3B | 8B | 30B |
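
To confirm these architecture values for a local checkpoint, the model config can be inspected directly. The sketch below is minimal and uses the usual transformers attribute names, which may differ slightly for the Granite config class.

```python
# Minimal sketch: read architecture details from the model config.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("ibm-granite/granite-4.1-8b")

# Attribute names are common transformers conventions; inspect `config` for the exact fields.
print(getattr(config, "hidden_size", None))              # embedding size
print(getattr(config, "num_hidden_layers", None))        # number of layers
print(getattr(config, "num_attention_heads", None))      # attention heads
print(getattr(config, "num_key_value_heads", None))      # KV heads (GQA)
print(getattr(config, "max_position_embeddings", None))  # sequence length
```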
Training Data: Overall, our SFT data is largely comprised of three key sources: (1) publicly available datasets with permissive license, (2) internal synthetic data targeting specific capabilities, and (3) a select set of human-curated data.
Supervised Fine-Tuning and Reinforcement Learning: The instruct model has been fine-tuned with significantly improved SFT and reinforcement learning pipelines using a high-quality mix of the datasets mentioned above. Through rigorous SFT-RL cycles we have improved the Granite-4.1 model's tool calling, instruction following, and chat capabilities. For further details, please check our Granite-4.1 Blog.
Infrastructure: We trained the Granite 4.1 Language Models utilizing an NVIDIA GB200 NVL72 cluster hosted in CoreWeave. Intra-rack communication occurs via the 72-GPU NVLink domain, and a non-blocking, full Fat-Tree NDR 400 Gb/s InfiniBand network provides inter-rack communication. This cluster provides a scalable and efficient infrastructure for training our models over thousands of GPUs.
Ethical Considerations and Limitations: Granite 4.1 Instruction Models are primarily finetuned using instruction-response pairs mostly in English, but also multilingual data covering multiple languages. Although this model can handle multilingual dialog use cases, its performance might not be similar to English tasks. In such cases, introducing a small number of examples (few-shot) can help the model in generating more accurate outputs. While this model has been aligned by keeping safety in consideration, the model may in some cases produce inaccurate, biased, or unsafe responses to user prompts. We urge the community to use this model with proper safety testing and tuning tailored for their specific tasks. To enhance safety in enterprise deployments, we recommend using Granite 4.1 Language models alongside Granite Guardian, a model designed to detect and flag risks in inputs and outputs across key dimensions outlined in the IBM AI Risk Atlas.
Resources
- ⭐️ Learn about the latest updates with Granite: https://www.ibm.com/granite
- 📄 Get started with tutorials, best practices, and prompt engineering advice: https://www.ibm.com/granite/docs/
- 💡 Learn about the latest Granite learning resources: https://ibm.biz/granite-learning-resources
Model tree for DavidAU/granite-4.1-8b-Brainstone2-Thinking
Base model: ibm-granite/granite-4.1-8b