SitePoint Sponsor

User Tag List

Page 1 of 2 12 LastLast
Results 1 to 25 of 28
  1. #1
    SitePoint Zealot
    Join Date
    Jan 2008
    Posts
    143
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)

    Question % of Duplicate Content to Save Your Site?

    Do you believe that there is some % of the text on the page that can be duplicate content without making any troubles for your site?

    Before some might try to answer that dup content will be found in any way and the dup content publisher will sooner or later have problems - let me give an example that can change the focus.

    Imagine that I am running a site about AQUA (yes-yes, there was such a band with a pretty girl Lene somewhere from Scandinavia). And on the page I am writing why they are so great, blah-blah AND...

    ...then use a quote from their song. Let's say I quote 8 lines from a song.

    This is surely dup content, because somewhere on AQUA's site there must be all lyrics of their songs. Or on the sites of their fans.

    But I am sure (cannot prove it, just feel like this) that if the % of this dup content stays beyond some line, this is ok for search engines. Otherwise even quoting the name of this forum is a potential danger

    I understand that you cannot know that exact %, but does it seem logical to you that this % should exist?
    Backlinks inside blog posts + FREE content = Free Traffic System

  2. #2
    He's No Good To Me Dead silver trophybronze trophy stymiee's Avatar
    Join Date
    Feb 2003
    Location
    Slave I
    Posts
    23,424
    Mentioned
    2 Post(s)
    Tagged
    1 Thread(s)
    Having partial duplicate content is ok for the very kind of reason you cited. Duplicate content becomes an issue when the page offers nothing new to a user.

  3. #3
    SitePoint Wizard bronze trophy bigalreturns's Avatar
    Join Date
    Mar 2006
    Posts
    1,295
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    You run a site about AQUA?!?! Sir, if this we were not online I would shake your hand!
    "The proper function of man is to live - not to exist."
    Get a Free TomTom


  4. #4
    SitePoint Zealot
    Join Date
    Jan 2008
    Posts
    143
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by bigalreturns View Post
    You run a site about AQUA?!?! Sir, if this we were not online I would shake your hand!
    A little early for a handshake - just plan to do that - that it why made this thread to double check. But thanks for the kind words.
    Backlinks inside blog posts + FREE content = Free Traffic System

  5. #5
    Error 404: Life not found silver trophybronze trophy
    Join Date
    Dec 2007
    Location
    UK Nr Manchester
    Posts
    3,460
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Off Topic:

    Quote Originally Posted by bigalreturns View Post
    You run a site about AQUA?!?! Sir, if this we were not online I would shake your hand!
    You would? Why?
    It's 530 people, but do you really get it?
    ImgWebDesign - Web design in Buxton, High Peak, Derbyshire UK.

  6. #6
    SitePoint Wizard bronze trophy hooperman's Avatar
    Join Date
    Jan 2006
    Location
    Manchester, UK
    Posts
    4,301
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Off Topic:

    Quote Originally Posted by JJMcClure View Post
    Off Topic:



    You would? Why?
    Big Al thinks that life in plastic is fantastic

  7. #7
    SitePoint Enthusiast Chapichupapa's Avatar
    Join Date
    Feb 2008
    Posts
    63
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    yes that is possible, at least 30% of the content should be different from the original would not be penalize by search engines.

  8. #8
    He's No Good To Me Dead silver trophybronze trophy stymiee's Avatar
    Join Date
    Feb 2003
    Location
    Slave I
    Posts
    23,424
    Mentioned
    2 Post(s)
    Tagged
    1 Thread(s)
    Quote Originally Posted by Chapichupapa View Post
    yes that is possible, at least 30% of the content should be different from the original would not be penalize by search engines.
    Where did you get that percentage from?

  9. #9
    SitePoint Guru Rebirth Studios's Avatar
    Join Date
    Mar 2003
    Posts
    621
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by besttools View Post
    Do you believe that there is some % of the text on the page that can be duplicate content without making any troubles for your site?
    Quote Originally Posted by stymiee View Post
    Having partial duplicate content is ok for the very kind of reason you cited. Duplicate content becomes an issue when the page offers nothing new to a user.
    Reference - The Googles Webmaster Guidelines:
    Duplicate content on a site is not grounds for action on that site unless it appears that the intent of the duplicate content is to be deceptive and manipulate search engine results. If your site suffers from duplicate content issues, and you don't follow the advice listed above, we do a good job of choosing a version of the content to show in our search results.
    You can watch one of Matt Cutts' video sessions in which he talks about this. The question comes up at about 1:10

    You can also watch this segment with Vanessa Fox

  10. #10
    SitePoint Zealot Jim Beam's Avatar
    Join Date
    Jan 2008
    Posts
    171
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    i think that content can be duplicated in a way...The thing is you can't always create and imagine something new. While communicating you learn a lot from people. Sometimes you say words you heard from somebody, sometimes you rewrite what you heard by your own words. That's normal...

  11. #11
    SitePoint Enthusiast
    Join Date
    Dec 2004
    Location
    India
    Posts
    52
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by besttools View Post
    Do you believe that there is some % of the text on the page that can be duplicate content without making any troubles for your site?

    Before some might try to answer that dup content will be found in any way and the dup content publisher will sooner or later have problems - let me give an example that can change the focus.

    Imagine that I am running a site about AQUA (yes-yes, there was such a band with a pretty girl Lene somewhere from Scandinavia). And on the page I am writing why they are so great, blah-blah AND...

    ...then use a quote from their song. Let's say I quote 8 lines from a song.

    This is surely dup content, because somewhere on AQUA's site there must be all lyrics of their songs. Or on the sites of their fans.

    But I am sure (cannot prove it, just feel like this) that if the % of this dup content stays beyond some line, this is ok for search engines. Otherwise even quoting the name of this forum is a potential danger

    I understand that you cannot know that exact %, but does it seem logical to you that this % should exist?
    i think if you have partial duplicate content then search engine will you panelize you.

  12. #12
    SitePoint Wizard bronze trophy bigalreturns's Avatar
    Join Date
    Mar 2006
    Posts
    1,295
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by Indiaseo View Post
    i think if you have partial duplicate content then search engine will you panelize you.
    Bizarrely my site has duplicate content on every single page. It takes the form of a menu, and also a copyright notice in the footer. Will I be penalised?
    "The proper function of man is to live - not to exist."
    Get a Free TomTom


  13. #13
    Galactic Overlord gold trophysilver trophybronze trophy
    HAWK's Avatar
    Join Date
    Aug 2003
    Location
    New Zealand
    Posts
    12,600
    Mentioned
    987 Post(s)
    Tagged
    14 Thread(s)
    Most definitely. No doubt about it. You need to remove that menu at once.

  14. #14
    He's No Good To Me Dead silver trophybronze trophy stymiee's Avatar
    Join Date
    Feb 2003
    Location
    Slave I
    Posts
    23,424
    Mentioned
    2 Post(s)
    Tagged
    1 Thread(s)
    Quote Originally Posted by Indiaseo View Post
    i think if you have partial duplicate content then search engine will you panelize you.
    Fortunately that isn't true.

  15. #15
    Error 404: Life not found silver trophybronze trophy
    Join Date
    Dec 2007
    Location
    UK Nr Manchester
    Posts
    3,460
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by stymiee View Post
    Fortunately that isn't true.
    I've never been panelized, do you get beaten until you're quite flat and then used as siding or do you have to sit in front of several people answering questions, in that case I have been panelized now I think about it....

    Maybe Google has a panelizing team that goes around beating webmasters severely for using dup content.
    It's 530 people, but do you really get it?
    ImgWebDesign - Web design in Buxton, High Peak, Derbyshire UK.

  16. #16
    SitePoint Wizard bronze trophy hooperman's Avatar
    Join Date
    Jan 2006
    Location
    Manchester, UK
    Posts
    4,301
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by JJMcClure View Post
    I've never been panelized, do you get beaten until you're quite flat and then used as siding or do you have to sit in front of several people answering questions, in that case I have been panelized now I think about it....

    Maybe Google has a panelizing team that goes around beating webmasters severely for using dup content.

  17. #17
    SitePoint Evangelist MrCat's Avatar
    Join Date
    Aug 2007
    Posts
    494
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    You guys are rude
    ClickXposure.com - PPC Management

  18. #18
    Galactic Overlord gold trophysilver trophybronze trophy
    HAWK's Avatar
    Join Date
    Aug 2003
    Location
    New Zealand
    Posts
    12,600
    Mentioned
    987 Post(s)
    Tagged
    14 Thread(s)
    I think it's more a case of venting a bit of frustration over someone that makes sweeping generalisations that have no foundation. Understandable...

    Although we need to keep it nice.

  19. #19
    SitePoint Enthusiast
    Join Date
    May 2007
    Posts
    61
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    okey, what about duplicate contents within a site???

  20. #20
    SitePoint Wizard bronze trophy hooperman's Avatar
    Join Date
    Jan 2006
    Location
    Manchester, UK
    Posts
    4,301
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Ideally it will get filtered out of the SERPs. No point in it appearing twice, even if it's on the same domain.

  21. #21
    SitePoint Enthusiast
    Join Date
    Mar 2008
    Posts
    53
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    There's no % threshold, according to Adam Lasnik. My tests also show that two pages with identical content may both be indexed. Even 90% duplicate doesn't always seem to trip a filter.

    Duplicate content algorithms are pretty complicated to say the least. Just read a couple of patents and you'll start to see why Google is way beyond determining dupes using a percentage.

  22. #22
    SitePoint Wizard bronze trophy hooperman's Avatar
    Join Date
    Jan 2006
    Location
    Manchester, UK
    Posts
    4,301
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by ExtraArm View Post
    My tests also show that two pages with identical content may both be indexed.
    That's interesting ExtraArm. Over what period did you run your test?

    I did a similar test maybe 2 years ago, and eventually one of the duplicate pages was removed from the index completely.

  23. #23
    SitePoint Enthusiast
    Join Date
    Mar 2008
    Posts
    53
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    That's interesting ExtraArm. Over what period did you run your test?
    I ran the test back in 2006 for about 10 months or so when people still believed duplicate content and supplemental results were causally related. At one point, the entire subdirectory I was running the dupe content test on was de-indexed, but then the pages came back. Many of them made it into the main index.

    If there's not enough juice flowing into the test pages, they may fall out of the index not necessarily due to dupe content filter but due to lack of PageRank.

  24. #24
    He's No Good To Me Dead silver trophybronze trophy stymiee's Avatar
    Join Date
    Feb 2003
    Location
    Slave I
    Posts
    23,424
    Mentioned
    2 Post(s)
    Tagged
    1 Thread(s)
    That's a long time ago in Internet time. You can bet Google is much better at filtering it out now.

  25. #25
    SitePoint Enthusiast
    Join Date
    Mar 2008
    Posts
    53
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    You can bet Google is much better at filtering it out now.
    Betting and guessing is for newbs. Besides, Google's filtering mechanism improving has absolutely nada to do with using similarity % to filter out duplicate content.


Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •