SilentCanoe
🌿
Anne Story Generator

"I'm so glad I live in a world where there are Octobers." - Anne Shirley

Free AI Photo Story Generator: Works Offline

🔒

Your photos never leave your device. Both models, ViT-GPT2 (image understanding) and Llama 3.2 (story generation), run entirely in your browser using WebAssembly and WebGPU.

📸 Your Photos

Add up to 12 photos · drag to reorder

🌸

Drop photos here

JPG, PNG, WebP, HEIC

✍️ Story Settings

🤖 AI Engines

ViT-GPT2 Vision (~246 MB)
Image understanding
Llama 3.2 3B (~2 GB)
Story generation

Models download once and cache locally

🌿

Every photo holds a story

Upload your photos and let Anne Shirley's spirit weave them into a heartwarming Avonlea-style tale

📸 Add up to 12 photos
🤖 AI understands each photo
🌿 Anne crafts your story
💌 Export as PDF postcard or video

Frequently Asked Questions

How does the AI understand my photos?

The tool uses ViT-GPT2, a vision-language model that runs locally in your browser via WebAssembly. It generates a descriptive caption for each photo, which is then passed to Llama 3.2 to write the Anne Shirley-style story.
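The caption-to-story handoff described above can be sketched roughly as follows. This is only an illustration: `CaptionedPhoto`, `buildStoryPrompt`, and the prompt wording are hypothetical names and text, not taken from the tool, which may structure its prompt quite differently.

```typescript
// Hypothetical shape for a photo plus the caption a vision model produced for it.
interface CaptionedPhoto {
  fileName: string;
  caption: string;
}

// Turn the ordered captions into a single text prompt for the story model.
// The instructions in the prompt are an assumption made for this sketch.
function buildStoryPrompt(photos: CaptionedPhoto[]): string {
  if (photos.length === 0) {
    throw new Error("At least one captioned photo is required");
  }
  // One numbered scene per photo, in the order the user arranged them.
  const scenes = photos
    .map((photo, index) => `Scene ${index + 1}: ${photo.caption.trim()}`)
    .join("\n");
  return [
    "Write a short, heartwarming story in the voice of Anne Shirley.",
    "Use one chapter per scene, in the order given.",
    "",
    scenes,
  ].join("\n");
}
```

Keeping the prompt builder as a pure function makes it easy to regenerate the story after a caption is edited: rebuild the prompt from the updated captions and run the story model again.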

Do the AI models need to download every time?

No. ViT-GPT2 (~246 MB) and Llama 3.2 3B (~2 GB) are cached in your browser after the first download. On subsequent visits the models load from your local cache, with no internet connection required.
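A download-once, cache-first loader of the kind described above might look like the sketch below. The `ModelCache` interface, `loadModel`, and the `download` callback are hypothetical; the page does not say which browser storage the tool uses. In a browser, the Cache API or IndexedDB could sit behind this interface.

```typescript
// Hypothetical minimal cache interface; a real implementation could wrap
// the browser Cache API or IndexedDB.
interface ModelCache<T> {
  get(key: string): Promise<T | undefined>;
  set(key: string, value: T): Promise<void>;
}

// Cache-first load: return the cached model if present, otherwise
// download it once and store it for later visits.
async function loadModel<T>(
  url: string,
  cache: ModelCache<T>,
  download: (url: string) => Promise<T>,
): Promise<T> {
  const cached = await cache.get(url);
  if (cached !== undefined) {
    return cached; // no network access needed
  }
  const fresh = await download(url);
  await cache.set(url, fresh);
  return fresh;
}
```

Injecting the cache and the download function keeps the logic testable outside a browser and independent of any particular storage backend.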

What export formats are available?

You can export a PDF postcard (with photo + caption, in Avonlea Green, Ivory Parchment, or Moonlit Night styles at 4×6", 5.5×8.5", or 6×6" sizes), a Ken Burns-style video slideshow (WebM), or social media images optimised for Facebook (1200×630), Instagram (1080×1080), or Stories (1080×1920).
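To make the social media sizes concrete, here is a small sketch that maps each preset listed above to its pixel dimensions and computes a centered crop of a source photo to the matching aspect ratio. The pixel sizes come from the answer above; `SOCIAL_PRESETS`, `centerCrop`, and the center-crop strategy itself are assumptions for illustration, since the page does not describe how the tool fits photos to each format.

```typescript
// Pixel dimensions as listed in the FAQ answer.
const SOCIAL_PRESETS = {
  facebook: { width: 1200, height: 630 },
  instagram: { width: 1080, height: 1080 },
  stories: { width: 1080, height: 1920 },
} as const;

type SocialPreset = keyof typeof SOCIAL_PRESETS;

interface CropRect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Compute the largest centered region of the source image that has the
// same aspect ratio as the chosen preset (assumed strategy).
function centerCrop(srcWidth: number, srcHeight: number, preset: SocialPreset): CropRect {
  const target = SOCIAL_PRESETS[preset];
  const targetRatio = target.width / target.height;
  const srcRatio = srcWidth / srcHeight;
  if (srcRatio > targetRatio) {
    // Source is wider than the target: keep full height, trim the sides.
    const width = Math.round(srcHeight * targetRatio);
    return { x: Math.round((srcWidth - width) / 2), y: 0, width, height: srcHeight };
  }
  // Source is taller than (or equal to) the target: keep full width, trim top and bottom.
  const height = Math.round(srcWidth / targetRatio);
  return { x: 0, y: Math.round((srcHeight - height) / 2), width: srcWidth, height };
}
```

For example, a 4000×3000 photo cropped for the Instagram preset keeps a 3000×3000 square starting 500 pixels from the left edge, which can then be scaled down to 1080×1080.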

Can I edit the generated captions?

Yes. Click the pencil icon (✏️) on any story chapter to open an editable text area. Your changes are saved automatically and used in all exports. You can also regenerate individual captions or the entire story with one click.

What browsers support the AI features?

Llama 3.2 story generation requires WebGPU (Chrome 113+ or Edge 113+ on desktop). ViT-GPT2 image captioning uses WebAssembly and works on all modern browsers, including Firefox and Safari. Template-based fallback captions are provided if Llama 3.2 cannot load.
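The WebGPU check and fallback described above could be sketched as follows. In a browser, WebGPU is exposed as `navigator.gpu`, and `requestAdapter()` resolves to `null` when no suitable adapter is available. The `NavigatorLike` type, `chooseStoryEngine`, and the engine labels are hypothetical; the tool's actual detection logic is not shown on this page.

```typescript
// Minimal structural types so the sketch does not depend on WebGPU typings.
interface GpuLike {
  requestAdapter(): Promise<object | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

type StoryEngine = "llama-webgpu" | "template-fallback";

// Decide which story engine to use: the WebGPU-backed model when an
// adapter is available, otherwise the template-based fallback.
async function chooseStoryEngine(nav: NavigatorLike | undefined): Promise<StoryEngine> {
  if (!nav || !nav.gpu) {
    return "template-fallback"; // browser has no WebGPU support
  }
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "llama-webgpu" : "template-fallback";
  } catch {
    // Treat any failure while probing the GPU as "not available".
    return "template-fallback";
  }
}
```

In a page script, the function would be called with the real `navigator` object cast to `NavigatorLike`; passing the navigator in as a parameter keeps the check easy to exercise with stand-in objects.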