Writer’s latest models can generate text from images including charts and graphs

Writer, a San Francisco-based startup, is at the forefront of generative AI for enterprise applications. Their latest innovation, Palmyra-Vision, enables text generation from images, including graphs and charts.

CEO May Habib explains that Writer’s strategic focus lies in multimodal content, emphasizing text output. While they currently analyze images rather than create them, Habib envisions the possibility of generating charts and graphs from data in the future. For now, Palmyra-Vision excels at extracting text from various image types.

The approach uses a multi-model framework in which each model handles a specific task, such as identifying what an image contains or generating accurate text from it. The applications are diverse, ranging from e-commerce websites that automatically update product descriptions as images change to interpreting insights from visual data. Palmyra-Vision can also aid compliance checks, helping ensure that FDA-regulated ad copy adheres to the guidelines outlined in associated documents.

Photoroom, unlike many startups in the AI application space, has taken a unique approach by developing its own custom models from scratch. This strategic decision involves investing in computational power and securing image rights from agencies and creators. The company is actively recruiting technical talent to enhance the efficiency and functionality of these models. Notably, Photoroom’s proprietary architecture accelerates image generation by up to 40% compared to other visual AI platforms.

CEO Matthieu Rouif emphasizes that the company’s foundation model lets businesses create stunning product photos without expertise in prompt engineering or photography. The model excels at product photography and adapts quickly to user feedback.

In addition to its homegrown models, Photoroom is introducing several new features, including Photoroom Instant Diffusion. This tool gives product images consistent styling and a professional studio look, regardless of where or how they were captured. Other features include AI-generated backgrounds, scene expansions, and a variety of image editing tools. Photoroom’s tools can also apply automatic adjustments when processing images in bulk.

While we await conversations with CEO Rouif and potential investors, Balderton Capital has already recognized Photoroom’s journey and its commitment to a user-centric vision. Bernard Liautaud, Managing Partner at Balderton, commends Photoroom’s achievements and execution.

Photoroom has raised a total of $64 million to date. The company plans to use this funding to expand its team, invest in research and development, and enhance its infrastructure. Despite industry-wide layoffs, Photoroom currently employs around 50 people and aims to double its workforce by year-end. The latest Palmyra release, featuring image-to-text capabilities, is available starting today.
