Content factory: how to build a system of automatic generation of videos, carousels and posts for social networks
Main chat
A chat for vibe coders: news, guides, live cases, marketplace, and finding executors.
The difference between someone who puts 2-3 posts a day by hand and someone who runs the assembly line is measured not only in hours, but also in scale: manual posting is five open tabs, copying text and hoping nothing falls off. A content factory is dozens of pieces of content on all sites simultaneously, without daily human participation.
In this article – a complete analysis of how it works: the architecture of the conveyor, specific tools for each stage, real cases with numbers and – importantly – pitfalls that are not written in advertising posts services.
What is a content factory and what it consists of
A content factory is an automated chain of processes for creating and publishing content, where the human factor is minimized, and the main tasks are performed by software.
Any content factory, regardless of the niche, consists of the same five blocks connected in the conveyor:
Source of ideas → Scenario generation → Visual/video generation → Assembly and branding → Publication on sites
↓ ↓ ↓
Google Trends, ChatGPT/Claude HeyGen, Sora, FFmpeg, Water Blotato, Ayrshare,
Apify parsers, writes text, Veo, Midjourney, signs, subtitles SMMplanner -
RSS, hand hook, Carouselin hashtags for Whisper dozens of venues
carouseling
The link for all of these blocks is an automation tool (orchestrator) that runs the chain on a schedule, transfers data from one service to another, and makes sure nothing is lost in a crash.
Orchestra Choice: N8n, Make or Code
Before choosing specific neural networks, you need to decide on what to build the chain itself.
n8n – the main choice for content factories
n8n is an open-source solution for business process automation, which has become a real micro-trend in the Russian-language no-code/low-code community. Unlike Zapier or Make, n8n does not limit the number of operations and does not tie the user to a third-party cloud – this is especially important for large amounts of content generation and work with sensitive data.
The built-in logic of branches, queues and error resistance allow you to build complex workflow, which act not just as a bot with a prompt, but as a real orchestrator for many AI services at the same time.
You can deploy n8n on your own server (self-hosted, free with VPS) or use the cloud version with a subscription. For a constantly working content factory self-hosted option is more profitable in the long run.
Make.com is easier for beginners
A clear, novice-friendly visual constructor is a good entry point if n8n seems overly complex at first glance. Pay for the number of transactions, which with large amounts of content can be more expensive than self-hosted n8n.
Python/LangChain code for mature processes
One practicing automatizer described the transition from n8n to native code in Python and LangChain as follows: the first system on n8n was something like a “content harvester” – it collected videos from pre-prepared footage and music tracks, the source code was stored in Supabase, and internal logic ensured that the combinations did not repeat. The second option, on pure code, gave more control over logic and made it easier to debug complex conditions.
Starting with n8n is faster and requires no code writing. Switch to custom code only when you run into the real limitations of a visual designer – for most content factories, this moment does not come at all.
Block 1: Source of ideas and trends
A content factory without topical themes is just a template noise generator. The first and most underrated block is where content topics come from.
Google Trends - a node in n8n with a Cron trigger initiates a periodic search for current trends. Suitable for broad, non-niche topics.
Apify parsers is a parsing service that analyzes competitors: the growth of subscribers for the week, the involvement of rollers, which hooks (leads in the first seconds) are working in a particular niche right now. The system finds working ideas before the money is spent on shooting.
Top news parsing (e.g. Hacker News or industry sources) – for hot news + expert commentary content, particularly effective for B2B and tech niches.
Manual input via Google Table or Notion – even in the most automated assembly line, human task setting is normal and even recommended. In one of the cases described in detail, a person spends about 5 minutes on the topic in Notion, and the entire subsequent cycle goes automatically.
Table as a management interface is a practical solution: you don’t go to n8n every time, but just fill the lines with data and set the Ready status for the ones that need to be processed. Typical set of columns: theme, generation mode (video only, carousel only, combination), status (Ready / In work / Ready for publication / Error), reference to the finished material, text for signature.
Block 2: script and text generation
This stage is where the language model writes the basis: script, hook, description, hashtags.
ChatGPT / Claude via API – unique texts, slogans, hashtags and emojis are formed on the basis of the theme. For short videos, the typical script size is 50-100 words with a viral hook in the first lines.
The structure of a good prompt to generate a short video script:
Subject: {topic from the table}
Vertical video format 30-60 seconds for TikTok/Reels/Shorts
Structure:
1. Hook (first 3 seconds) – question or unexpected fact
2. The main part - 2-3 key thoughts with specifics
3. A call to action at the end
Tone: {write in a separate file tone-of-voice - expert / conversational / provocative}
Limitation: no more than 100 words
Additional: 5 relevant hashtags, short description for post signature
An important detail that is often overlooked: the tone of the brand should be fixed separately, as an independent file instruction (for example, tone-of-voice.md), and not re-entered into each prompt – this simplifies the support of the conveyor and does not give the tone to “float” over time.
Block 3: Video generation – core technologies
Here begins the most resource-intensive part of the conveyor. There are two fundamentally different approaches: video with a talking AI-avatar and video without a person (footage, animation, generative scenes).
Approach 1: AI avatar (talking head)
*HeyGen creates a video with an avatar that moves its lips synchronously to text or speech. The principle of operation: record 5 minutes of video once - the avatar stays with you forever, and then generate an unlimited number of videos without re-shooting.
Typical chain: script from LLM → voiceover via ElevenLabs (realistic voice synthesis) → HeyGen combines avatar with voiceover → FFmpeg adds subtitles, music and background elements → finished video.
Where to use AI-avatar: regular training content, news reports, weekly updates, answers to typical questions. Where not to use: live broadcasts with a live audience, discussions of specific projects where a real expert reputation is needed, intimate communication - the audience quickly reads syntheticity in such formats, and this harms trust more than it helps.
A separate legal and ethical point: revealing the fact of using an AI avatar to an audience is not a formality, but a matter of trust over a long distance.
Approach 2: generative video without avatar
Sora 2 Pro (OpenAI), Veo 3 (Google), Seedance – generate video directly over a textual prompt, without the need for a real or avatar face. The finished video sequence is stored in cloud storage (usually Google Drive) and from there is automatically uploaded to the publishing site.
The combined pipeline for a full-fledged commercial looks like this: NanoBanana generates a visual → Seedance adds animation → Suno creates a musical accompaniment → everything is collected in a ready-made video and published on all platforms at once.
Technical components of assembly
FFmpeg is an open tool for final assembly: subtitles, music, background elements, watermarks and image branding (via a bundle with ImageMagick).
Whisper - speech recognition for automatic generation of subtitles, including with frame-by-frame text stylization (word-by-word, when words are illuminated in turn synchronously with speech - a format that now dominates in short videos).
Block 4: Carousel generation
Carousels are the most underrated format for a content factory, and in 2026 they are experiencing a real boom: algorithms show the carousels again if the user did not interact with the post on the first display – the tape will offer it again, from another slide.
Specialized services for carousels
Unlike video, for carousels over the past year, a whole separate direction of neural networks that do not require design skills has appeared:
** Carouselin** is a neural network that collects the structure, draws up slides and produces a ready-made series of pictures from an idea, text or photo for Instagram*, Vkontakte, Telegram and Pinterest. Charging is for loans where 1 credit corresponds to the generation of one slide.
Carousel Maker – turns the topic, ready-made text or link to an article into a ready-made series of slides with the ability to download your own backgrounds and branding (logo, link to the site).
Telegram-bot for instant generation – a card without the participation of the designer is collected in five minutes through a specialized bot, which is especially convenient if the carousel is only one of the formats in a wider conveyor and does not require a separate interface.
Canva with AI functions - Magic Write generates structured text for a given theme, and algorithms automatically build a visual hierarchy and color harmony of the template. Suitable if you need maximum control over the final design while maintaining some automation.
Prompt structure for carousel generation
Subject:
Number of slides: 6-8
Slide 1: A catchy headline-promise (what the reader learns)
Slides 2-6: one key idea per slide, short title + 1-2 sentences
Slide 7: summary / checklist
Slide 8: Call to Action (Subscribe, Save, Write to Direct)
Style of design: {description of the brand palette and fonts}
Integration of carousels into a common conveyor
Carousels are easily included in the same n8n conveyor as video: after the text structure is generated through the LLM, the request is transmitted to the API of the carousel generation service, the finished images are saved to the cloud, and then go to the same publishing module as the video.
Block 5: publication – cross-subscribing to dozens of sites
This is a critical block that has its own serious technical problem: official social media APIs are either closed or severely restricted. Instagram* does not allow you to publish through the API without app approval, which is almost impossible for a small project. TikTok and Threads are similar.
Intermediaries for autoposting
Blotato – circumvents these restrictions by giving a single entry point to publish on multiple social networks at once. The fee starts at $29 per month; the free version without access to the API is useless for automation.
Ayrshare is a similar service with a similar model of operation, often referred to as an alternative to Blotato in more advanced conveyors.
*Upload-Post – used in conjunction with video generators for direct publication with description and tags on YouTube and similar sites.
Postmypost.io and SMMplanner are simpler cross-posting services: download the finished video, cover and description, connect accounts (Reels, TikTok, Facebook* Video, YouTube Shorts, VKontakte and others) and set the publication schedule.
** CIS-focused planners** - SMMplanner, Onlypult, Amplifer are especially strong in working with VKontakte, Odnoklassniki and Telegram, support publication queues, slots and automatic first comment (relevant for sites where links in the main text of the post are limited).
Trigger launch of publication
For a content factory, the best option for a n8n trigger is to run on schedule through the Schedule Trigger node. The setting depends on the tasks: you can publish every hour at a fixed minute, you can - once a day at a specific time. A table with row statuses (ready/in progress/published/error) allows the system to determine which materials are ready for publication right now.
Real Cases: What Does It Do In Practice
Practice figures are more important than any theoretical reasoning about efficiency.
Motorcycle case
The motorcycle company, before the launch of the content factory, published only dry content - just showed the motorcycle and indicated the price, without involvement and interest, because of which the videos did not generate applications. After the introduction of the conveyor: rollers began to steadily collect tens and hundreds of thousands of views, with individual bursts of up to millions. In the first month, one garage was completely sold out - 45 motorcycles through social networks. Content on a constant basis began to bring a flow of orders, and social networks have become a real sales channel.
Case with a sharp acceleration of production
In one of the described conveyors, the research time was reduced from 45 to 5 minutes due to the automation of the idea search phase. The performance of the system is up to 40 videos per week while maintaining two manual control points: task setting (about 5 minutes) and final editing before publication (about 10 minutes through a Telegram bot).
Case of cleaning business
A Toronto-based cleaning company has introduced a content factory that produces daily English-language posts about cleaning furniture and home care – an example of a niche, narrow-themed pipeline designed not for virality, but for a stable local SEO presence on social networks.
Common pattern of successful cases
In all cases, the same structure is repeated: the person spends minutes setting the task and final verification, and all mechanical work – from researching the topic to publishing on several sites – is performed automatically.
Pitfalls that are not talked about in advertising services
This is the most important part of the article – what distinguishes a working content factory from a beautiful presentation that falls apart after a month.
Algorithmic filter against massively generated content
The algorithm “Typhoon” Yandex from 2025 reduces the mass-generated texts. Social networks also cut coverage for frankly the same, template content. The Russian audience already feels the difference between live and template text – we are talking about most users.
A practical rule worth remembering literally: automate the process, but not the meaning. The idea, scenario and pitch angle should be created by a person (or, at least, a carefully calibrated and constantly updated prompt), the neural network is a tool for accelerating and replicating, not replacing the content part.
Degradation when scaling to multiple accounts
When dozens of videos are published on a grid of several dozen accounts from one device and one IP address, sites see the connection between profiles. This is a direct path to massive shadow coverage restriction (shedouban) or complete blocking of the entire network of accounts at the same time. If a content factory works for several brands or accounts at the same time, you need to divide them by IP and devices (anti-detect browsers or proxies for each group of accounts), otherwise the risk of a cascading ban increases in proportion to the number of profiles.
Dependence on external services and data loss
Neural networks and publishing services sometimes block accounts without warning and without the possibility of appeal within a reasonable time. If the entire database of scripts, ready-made content and developments is stored only inside a third-party cloud service, when you block an account, this database is lost entirely. Keep all database files locally or in your own cloud storage, not tied to a specific content generator.
Micro-variations vs. algorithmic duplication recognition
Each video is slightly changed before it is laid out – brightness, contrast, playback speed – imperceptibly to the viewer, but significant to the platform algorithm, which may otherwise recognize too similar content as repetitive or spam. This is not a fraud of the algorithm in the strict sense, but a way not to fall under automatic filtering of duplicates when replicating one idea in several formats.
The illusion of “complete automation without human intervention”
Despite the high level of automation, manually checking content before publication remains mandatory in almost all documented successful cases. A fully autonomous pipeline without a single point of human control is a recipe for publishing factual errors, failed wording, or simply low-quality content on an industrial scale. Minimum control points – task setting and final check before exit – are worth the 15-20 minutes they require.
How to build a minimum working content factory: step by step
If you start from scratch, you don’t need to build a complete system of ten connected services. Reasonable sequence:
Step 1. Deploy n8n (self-hosted on a VPS or cloud version) and connect one source of ideas - even a simple Google Table that you fill in manually.
Step 2. Connect one LLM (ChatGPT or Claude via API) to generate script text or carousel structure based on the theme from the table.
Step 3. Choose one format to start – either a carousel through a dedicated service (this is technically easier) or an AI avatar video via HeyGen. Do not try to run both formats simultaneously in the first version of the conveyor.
Step 4. Connect one cross-site service (Blotato, Ayrshare or simpler SMMplanner) to publish on 2-3 key sites instead of all ten possible sites.
Step 5. Add a manual checkpoint — for example, through a Telegram bot that sends the finished material for approval before publication.
Step 6. Only after this minimum cycle has been running smoothly for 2-3 weeks, add new formats, pads and manual point automation.
The base plant is assembled this way over the weekend, without the need for a paid course or a complex multi-stage architecture from the start.
Summary table of instruments by phase
| Этап конвейера | Инструменты | Примерная стоимость |
|---|---|---|
| Оркестрация | n8n (self-hosted), Make.com | n8n бесплатен на своём сервере, Make — от операций |
| Источник идей | Google Trends, Apify, RSS, Google Sheets | Apify — по использованию, остальное бесплатно/недорого |
| Генерация текста | ChatGPT API, Claude API | По токенам, обычно недорого при умеренном объёме |
| AI-аватар видео | HeyGen + ElevenLabs | От нескольких десятков $ в месяц |
| Генеративное видео | Sora 2 Pro, Veo 3, Seedance | По генерации, варьируется значительно |
| Карусели | Каруселин, Carousel Maker, Canva AI | От бесплатного тарифа до подписки по кредитам |
| Сборка/субтитры | FFmpeg, Whisper | Бесплатны (open-source) |
| Публикация | Blotato, Ayrshare, SMMplanner, Postmypost | От $29/мес и выше |
Outcome
The content factory is not a one-time “and forgot” setting, but a living system that needs to be maintained: update the prompts for current trends, monitor the reaction of site algorithms, divorce accounts technically when scaling, and be sure to maintain at least a minimum point of human control before publication.
Technically, assembling a working assembly line is a weekend task, not a month-long one, especially if you start with a minimal version of one format and multiple sites. But long-term success is determined not by the complexity of a n8n circuit, but by how well-built live feeding angle and quality control are into it – without that, even the most technically advanced assembly line will produce content that algorithms and audiences alike ignore.
*Relevant to June 2026. Tools and terms of use of services change quickly - check the current tariffs and restrictions before integrating into your pipeline. *
* Meta Platforms Inc. (Facebook, Instagram) is recognized as an extremist organization and its activities are prohibited in the Russian Federation.