Skip to content

Commit

Permalink
act12
Browse files Browse the repository at this point in the history
  • Loading branch information
luisagalva committed Jan 27, 2018
1 parent f597772 commit 61d607e
Show file tree
Hide file tree
Showing 27 changed files with 286 additions and 416 deletions.
Binary file modified .DS_Store
Binary file not shown.
Empty file removed Chinook
Empty file.
Binary file modified bin/.DS_Store
Binary file not shown.
Binary file removed bin/db/Chinook.class
Binary file not shown.
Binary file removed bin/db/Customer.class
Binary file not shown.
Binary file removed bin/db/CustomerAdmin.class
Binary file not shown.
Binary file added bin/lib/jsoup-1.11.1.jar
Binary file not shown.
Binary file added bin/lib/log4j-api-2.9.1.jar
Binary file not shown.
Binary file added bin/lib/log4j-core-2.9.1.jar
Binary file not shown.
9 changes: 4 additions & 5 deletions bin/log4j2.xml
Original file line number Diff line number Diff line change
Expand Up @@ -4,17 +4,16 @@
<Console name="Console" target="SYSTEM_OUT">
<PatternLayout pattern="%d{HH:mm:ss.SSS} [%t] %-5level %logger{36} - %msg%n" />
</Console>
<File name="Logfile" fileName="logs/aadmin.log" immediateFlush="true"
<File name="Logfile" fileName="logs/crawler.log" immediateFlush="true"
append="true">
<PatternLayout
pattern="%d{yyy-MM-dd HH:mm:ss.SSS} [%t] %-5level %logger{36} - %msg%n" />
</File>
</Appenders>
<Loggers>
<Logger name="db.CustomerAdmin" level="info">

</Logger>
<Root level="error"><AppenderRef ref="Logfile" /></Root>


<Root level="info"><AppenderRef ref="Logfile" /></Root>
</Loggers>

</Configuration>
Binary file added bin/web/Crawler.class
Binary file not shown.
Binary file added bin/web/MainCrawler.class
Binary file not shown.
Binary file removed database/Chinook.db
Binary file not shown.
39 changes: 39 additions & 0 deletions jsoup.org.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
/cookbook/modifying-data/set-html
//jhy.io/
http://www.opensource.org/
/news/release-1.11.2
discussion
/cookbook/input/parse-body-fragment
/cookbook/modifying-data/set-text
bugs
download
/cookbook
/cookbook/input/load-document-from-file
/apidocs/
/cookbook/
/cookbook/input/load-document-from-url
/apidocs/org/jsoup/select/Elements.html#html--
/news/
/download
/cookbook/introduction/
//try.jsoup.org/~LGB7rk_atM2roavV0d-czMt3J_g
/cookbook/modifying-data/set-attributes
http://whatwg.org/html
/colophon
/cookbook/input/parse-document-from-string
https://github.com/jhy/jsoup/blob/master/src/main/java/org/jsoup/examples/Wikipedia.java
/bugs
/cookbook/extracting-data/attributes-text-html
/apidocs/index.html?org/jsoup/select/Elements.html
/
/cookbook/extracting-data/selector-syntax
http://en.wikipedia.org/wiki/Main_Page
/cookbook/extracting-data/dom-navigation
/cookbook/extracting-data/working-with-urls
license
/cookbook/introduction/parsing-a-document
/cookbook/cleaning-html/whitelist-sanitizer
/discussion
//try.jsoup.org/
http://github.com/jhy/jsoup/
/cookbook/extracting-data/example-list-links
14 changes: 14 additions & 0 deletions logs/crawler.log
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
2017-11-20 22:22:49.453 [main] INFO web.Crawler - Starting to crawl.
2017-11-20 22:22:51.521 [main] INFO web.Crawler - Artículos encontrados: 45
2017-11-20 22:22:51.521 [main] INFO web.Crawler - Subcrawling //try.jsoup.org/~LGB7rk_atM2roavV0d-czMt3J_g
2017-11-20 22:22:51.522 [main] WARN web.Crawler - java.lang.IllegalArgumentException: Malformed URL: //try.jsoup.org/~LGB7rk_atM2roavV0d-czMt3J_g
2017-11-20 22:22:51.522 [main] INFO web.Crawler - Subcrawling //try.jsoup.org/
2017-11-20 22:22:51.522 [main] WARN web.Crawler - java.lang.IllegalArgumentException: Malformed URL: //try.jsoup.org/
2017-11-21 13:09:28.489 [main] INFO web.Crawler - Starting to crawl.
2017-11-21 13:09:30.367 [main] INFO web.Crawler - Artículos encontrados: 7
2017-11-24 12:58:58.284 [main] INFO web.Crawler - Starting to crawl.
2017-11-24 12:59:02.903 [main] INFO web.Crawler - Artículos encontrados: 45
2017-11-24 12:59:02.903 [main] INFO web.Crawler - Subcrawling //try.jsoup.org/~LGB7rk_atM2roavV0d-czMt3J_g
2017-11-24 12:59:02.912 [main] WARN web.Crawler - java.lang.IllegalArgumentException: Malformed URL: //try.jsoup.org/~LGB7rk_atM2roavV0d-czMt3J_g
2017-11-24 12:59:02.912 [main] INFO web.Crawler - Subcrawling //try.jsoup.org/
2017-11-24 12:59:02.912 [main] WARN web.Crawler - java.lang.IllegalArgumentException: Malformed URL: //try.jsoup.org/
Binary file modified src/.DS_Store
Binary file not shown.
107 changes: 0 additions & 107 deletions src/db/Chinook.java

This file was deleted.

146 changes: 0 additions & 146 deletions src/db/Customer.java

This file was deleted.

Loading

0 comments on commit 61d607e

Please sign in to comment.