You found ten great articles. You want to put them in one place. Easy, right?
Wrong.
Here's the thing — if you just copy-paste them, you're in trouble. And I don't mean minor trouble. I mean a cease-and-desist letter that ruins your Tuesday, or worse, a lawsuit that costs more than your laptop.
The phrase "your new material may aggregate or bring together" sounds boring. Bureaucratic, even. Consider this: it’s the clause in a contract that trips up freelancers. But it’s the sentence that gets people sued. It’s the legal gray area that keeps editors up at night.
If you are creating anything online — a blog, a newsletter, a news digest, or a software feature — you need to understand what happens when you take other people’s work and put it in a box. Because that box has rules Which is the point..
What Is Aggregating or Bringing Together Material
Let's strip away the legalese. When we talk about aggregating or bringing together material, we are talking about taking existing content and packaging it into something new. That "something new" could be a curated list, a summary, a database, or a mashup.
Think of it like a smoothie. You didn't grow the strawberries. You didn't milk the cow. But you blended them together into a drink. That’s aggregation Practical, not theoretical..
But there’s a difference between blending and just dumping the fruit in a bowl.
Aggregation vs. Curation
Aggregation is mechanical. It’s pulling data from Source A, Source B, and Source C and displaying it in a list. So it’s often automated. Google News is aggregation Small thing, real impact. Worth knowing..
Curation is editorial. It’s you picking the best articles on a topic and explaining why they matter. It requires judgment. It requires a human voice.
The line between them matters legally. Day to day, if you just repost links, you’re safer. If you republish the text, you’re walking a tightrope.
Derivative Works vs. Compilations
Copyright law splits this into two buckets.
A compilation is when you take independent works and put them in a collection. Think of a poetry anthology. Each poem belongs to someone else, but the order and the selection are yours.
A derivative work is when you modify something. A translation is a derivative work. A movie based on a book is a derivative work. A mashup of two songs is a derivative work Which is the point..
When you "bring together" material, you are usually making a compilation. But if you change the format — say, turning an article into a video script — you might be creating a derivative work. And that changes the stakes Simple, but easy to overlook..
Why It Matters
Why does this matter? Because the internet runs on other people’s stuff.
Every time you write a blog post, you’re standing on the shoulders of giants. You’re referencing studies, quoting tweets, and linking to news stories. But there’s a difference between referencing and republishing.
Here’s why most people care about this topic:
- Money: Creators lose ad revenue when their content is scraped. This is why the "Content Network" is such a hot topic.
- Legal Risk: One bad aggregation decision can result in a DMCA takedown, a copyright strike on your YouTube channel, or a lawsuit. These aren't theoretical. They happen to small blogs every single day.
- Reputation: Getting caught stealing content—even unintentionally—kills trust.
Look at the big players. Google fought the Associated Press for years over aggregation. Which means the lesson? Google News now pays for licenses in many countries. This leads to the result? News Corp wanted Google to pay for linking to their articles. "Free" content often has a hidden price tag.
How It Works (The Legal Stuff)
I know, I know. You didn't come here for a law lecture. But you need to know the basics, or you’ll step on a landmine.
The Copyright Owner's Rights
When someone creates an original work—text, image, video, code—they own it. That said, they have the exclusive right to reproduce it, distribute it, and make derivative works. When you aggregate that material, you are stepping on one or more of those rights No workaround needed..
Unless you have permission. Or unless you fall under an exception.
Fair Use and Transformative Use
This is the big one. Fair use is the shield that lets you use copyrighted material without asking. But it’s not a magic spell.
But it's not a magic spell. It's a legal doctrine with four factors that courts weigh. Get these wrong, and you're not protected.
The Four Factors
Courts consider:
-
Purpose and character of the use. Is it commercial or educational? Are you adding something new, or just copying? This is where "transformative" comes in. If you add commentary, criticism, or new meaning, you're on stronger ground. If you're just reposting with minimal changes, you're vulnerable.
-
Nature of the copyrighted work. Facts and ideas aren't protected—only the expression. Using a news article about a stock price is different from copying a poet's exact words. Fictional works get stronger protection than factual ones.
-
Amount used. There's no magic word count, but taking the "heart" of a work—its most memorable or creative part—hurts your case. A thumbnail of an image can be infringement. A 30-second clip can be infringement. It depends on context.
-
Effect on the market. Does your use hurt the original's value? If people can get your version instead of paying for the original, that's a problem. This is why streaming movies or downloading full articles is so risky—it's a direct substitute.
No single factor decides. Practically speaking, courts balance all four. But the first one—transformative use—is where most modern fair use arguments live or die The details matter here. No workaround needed..
What "Transformative" Actually Means
People throw around "transformative" like it means "I changed three words." It doesn't.
True transformation adds meaning, message, or insight. A parody that mocks the original is transformative. A critical review that quotes passages to analyze them is transformative. A compilation that organizes data in a new way to reveal patterns is transformative Small thing, real impact. Practical, not theoretical..
Reposting an article with a link back? That's why not transformative. Adding a few emojis? Turning a blog post into a video where you read it aloud? Not transformative. That's a gray area—you've changed the format, but you haven't necessarily added new expression.
The key question: Are you creating something new, or are you just repackaging someone else's work to profit from it?
Practical Guidance
Here's how to stay on the right side of the line.
When You're Probably Safe
- Linking and embedding. Generally fine, as long as you're not bypassing paywalls or scraping content.
- Quoting short passages for commentary, criticism, or news reporting. A sentence or two with attribution usually holds up.
- Summarizing someone else's work in your own words. This is aggregation at its safest.
- Using public domain or Creative Commons material. Check the specific license. Some require attribution. Some prohibit commercial use.
When You're Walking a Risk
- Republishing full articles without permission, even with attribution. The more you take, the bigger the risk.
- Scraping content to build your own database. This is what AI companies are fighting about right now.
- Translating works without permission. That's a derivative work, not a compilation.
- Using images you didn't create. Images are the most frequently sued-over content online.
The Attribution Question
Here's something many people get wrong: attribution does not equal permission.
Credit the creator? Not necessarily. If you copy a paragraph and tag the author, you still copied a paragraph. Good manners. Legally protective? Attribution is a courtesy, not a license Worth keeping that in mind..
The AI Complication
We'd be remiss not to mention what's happening now. Which means aI tools can scrape, summarize, and generate content at scale. Lawsuits are flying. That said, the New York Times is suing OpenAI. Now, authors are suing tech companies. No one knows exactly how this plays out.
But the underlying principles haven't changed. In real terms, taking someone else's work to train a model or generate output? That's still using copyrighted material. The legal system is catching up, and it's going to get messy before it gets clear.
If you're building anything with AI, assume the conservative position: don't use content you wouldn't feel comfortable asking permission for.
The Bottom Line
Copyright isn't just a legal issue—it's a trust issue. The internet works because people share. But sharing only works when creators feel respected.
Here's the practical framework:
- When in doubt, ask. Most creators are happy to grant permission if you ask nicely.
- When you can't ask, transform. Don't just copy—add value. Comment, analyze, build on it.
- When it's not yours, give credit. Not because it protects you legally, but because it's the right thing to do.
- When it's valuable, pay for it. If someone's work is worth using, it's worth compensating.
The line between aggregation and infringement isn't always clear. But it's not mysterious either. It comes down to this: are you building something, or are you just taking something?
Build something. That's how you stay safe, stay ethical, and stay in business.