Difference between revisions 932891 and 971731 on enwiki

Famous [[Korea|Korean]] people:

'''Note:''' In Korean names, the [[Korean family name|family name]] is typically placed first (for example, the family name of "Kim Daejung" is "Kim").

* Historical
** [[Sejong of Joseon|King Sejong]], inventor of [[Hangeul]]
** [[Yi Sunsin]], admiral

* Politicians
** [[Kim Dae-jung]]
** [[Kim Il-Sung]]
** [[Kim Jong-il]]
** [[Kim Jongpil]]
** [[Lee Hoichang]]
** [[Park Junghee]]
** [[Rhee Syngman]]
** [[Roh Moo-hyun]]
** [[Roh Taewoo]]
::See also: [[President of South Korea]]

* Businessperson
** [[Chung Juyung]]
** [[Kim Woojung]]
** [[Lee Byungchul]]

* Artists
** [[Nam June Paik|Paik Nam-june]]

* Film-makers
** [[Im Kwontaek]]
** [[Lee Changdong]]

* Religious
** [[Sun Myung Moon|Moon Sun-Myung]]
** [[Seok-heon Ham|Ham Seok-heon]]

* Scientists and Engineers
** [[Lee Huiso]] aka [[Benjamin Lee]]

* Sportmen and -women
** [[Cha Bumkeun]] ([[Bundesliga]] Football Player)
** [[Kim Byunghyun]] ([[MLB]] Player)
** [[Kim Mihyun]] ([[LPGA]] Player)
** [[Park Chan Ho]] ([[MLB]] Player)
** [[Pak Seri]] ([[LPGA]] Player)
** [[Park Chanho]] ([[MLB]] Player)
** [[Hee Seop Choi]] ([[MLB]] Player)
** [[Sohn Kee-jung]] ([[Marathon (sport)|Marathon]] Gold medalist in the [[1936_Summer_Olympics|Berlin Olympics]])

See also: [[List of people by nationality'''Screen scraping''' is the act of capturing [[data]] from a system or program by snooping the contents of some display that is not actually intended for data transport or inspection by programs. Around [[1980]] this term referred to tricks like reading the display memory of a smart terminal through its auxiliary port. Nowadays it often refers to [[parsing]] the [[HTML]] in generated [[web pages]] with [[Computer_program|programs]] designed to mine out particular patterns of content. In either guise screen-scraping is an ugly, ad-hoc, last-resort technique that is very likely to break on even minor changes to the format of the data being snooped.

In spite of the inelegance of consuming data from a web page using screen scaping, 
the emergence of [[web services]] has lent itself
to the creation of technologies that turn web page screen scraping into a science 
(though still a very imperfect one). [[Microsoft]], for example,
has built into its implementation of web services the ability to create a web service 
which extracts its data from a web page with the
help of an extention to the [[WSDL]] standard and the use of [[regular expressions]]. 
For more information on this technique see the [[MSDN]] document [http://msdn.microsoft.com/library/default.asp?url=/library/en-us/cpguide/html/cpconcreatingclientsthatparsecontentofotherwebpages.asp Creating XML Web Services That Parse the Contents of a Web Page]

Regular expressions themselves are a traditional and very powerful technology used for screen scraping. Screen scraping
requires intensive text [[parsing]] [[algorithms]]. Computer language, like [[Perl]] which have strong support for 
regular expression, are a popular choice for writing screen scraping programs.

Also in recent years, [[PHP]] has been developing in areas ideally suited to creating screen scraping applications. The release of PHP5 included many new [[XML]] and [[DOM]] additions, including functions to parse badly formed [[HTML]] documents into DOM-trees, 
and work on them as if they were well-formed XML. Having the webpage in an XML format makes it easier for programs 
to parse web pages.

==Books==

* Hemenway, Kevin and Calishain, Tara,  ''Spidering Hacks'', O'Reilly, 2003, ISBN 0596005776


[[Category:Software]]