sasogeek
  • sasogeek
how can you tell what a website is about from its source code?
Computer Science
  • Stacey Warren - Expert brainly.com
Hey! We 've verified this expert answer for you, click below to unlock the details :)
SOLVED
At vero eos et accusamus et iusto odio dignissimos ducimus qui blanditiis praesentium voluptatum deleniti atque corrupti quos dolores et quas molestias excepturi sint occaecati cupiditate non provident, similique sunt in culpa qui officia deserunt mollitia animi, id est laborum et dolorum fuga. Et harum quidem rerum facilis est et expedita distinctio. Nam libero tempore, cum soluta nobis est eligendi optio cumque nihil impedit quo minus id quod maxime placeat facere possimus, omnis voluptas assumenda est, omnis dolor repellendus. Itaque earum rerum hic tenetur a sapiente delectus, ut aut reiciendis voluptatibus maiores alias consequatur aut perferendis doloribus asperiores repellat.
chestercat
  • chestercat
I got my questions answered at brainly.com in under 10 minutes. Go to brainly.com now for free help!
sasogeek
  • sasogeek
ok so i've written a crawler and i'm storing the links in a database but i don't want all links in one database. i want specific links in specific databases. so that links who's pages are about specific topics or subjects or categories of some ideas are placed in specific databases. and to do that i need to know what the page that link is from is about.
sasogeek
  • sasogeek
or even if same database, different tables, i'd still need to find a way to place them in the right places... :/
across
  • across
You want something that can 'read' the links, 'know what they're about', and store them accordingly?

Looking for something else?

Not the answer you are looking for? Search for more explanations.

More answers

sasogeek
  • sasogeek
yes :)
sasogeek
  • sasogeek
for now, i can do the reading part, "know what they're about" is what i'm in search of atm...
across
  • across
This is artificial intelligence, specifically, natural language processing. You want something that will go through a website and store it in a finite number of categories. Either that, or you can find a gigantic table of website-category associations.
across
  • across
It has to be customized, however, since the word 'category' is rather vague and you may have it defined it differently than someone else.
sasogeek
  • sasogeek
a finite number of categories is what i plan to start with, but as time goes on, users are allowed to submit categories of their interest that are not in the default list and that might be added later depending on how many people really would need such a category
across
  • across
This is exactly what's going on at math.stackexchange.com, except that the categorization is performed by users, not parsers. This is due to the immense difficulty of processing natural language. ;)
sasogeek
  • sasogeek
the idea is certainly to make it customizable so that users get data based on personal preferences... but i guess everything starts from somewhere, that's where i'm looking for right now :/
sasogeek
  • sasogeek
are meta tags a good idea?
across
  • across
It depends on the 'context.'
across
  • across
The biggest lesson that I learned as a CS undergrad is that the field has to be insanely well-defined for it to be feasible.
sasogeek
  • sasogeek
what would you suggest i attempt?
across
  • across
Find a Ph.D., or group thereof, carrying out research in the field of natural language processing and assist them in their pursuit of the betterment of e-mankind!
across
  • across
http://cms.dt.uh.edu/faculty/chenp/
across
  • across
Notice the guy's working on a "node-based web crawler."
sasogeek
  • sasogeek
how am i supposed to help? :s
sasogeek
  • sasogeek
assist i suppose is the right word in place of "help" :/ but still ....
across
  • across
You're now in a university. ;) Let's talk to some professors!
sasogeek
  • sasogeek
well i'm guessing it's not possible that i get in contact by any chance with the person in the link you gave me with respect to his research.... :/
across
  • across
He's just one in a million Ph.D.s carrying out research in this field.
sasogeek
  • sasogeek
i see, well it seems difficult than it looks anyway. :/ looks like i have a long way ahead of me lol
anonymous
  • anonymous
yo...you are thinking to much into what you are trying to progress forward on...lol... keep it simple.. and build parts and other fixs... i have a book that you might find interest in..it's called Constructing intelligent Agents with Java.. read the book from start to finish then step back , make a plan then go with it..
Pradius
  • Pradius
ehhh what about metatags? they should tell you what are the links about
anonymous
  • anonymous
Yes, the meta information can be helpful. Also if you try to get the content of the link then do some indexing for example if you know which word repeated more that can help you to know the context of this link (document). I know it's heavy, because you need to process many links so a lot of content, but it's just an idea!
sasogeek
  • sasogeek
Thanks guys :) for now I'll be using the meta tags but in later development, i may change it to something sophisticated, i just want it working okay for now :) thanks for mentioning it again x
anonymous
  • anonymous
Welcome

Looking for something else?

Not the answer you are looking for? Search for more explanations.