You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
358 lines
40 KiB
358 lines
40 KiB
2026-05-19 10:48:15.013 [main] INFO com.example.datacollect.Main - Starting CLI Crawler application
|
|
2026-05-19 10:48:15.022 [main] INFO c.e.d.strategy.StrategyFactory - StrategyFactory initialized with 3 strategies
|
|
2026-05-19 10:48:15.025 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: help
|
|
2026-05-19 10:48:15.027 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: list
|
|
2026-05-19 10:48:15.027 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: crawl
|
|
2026-05-19 10:48:15.028 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: exit
|
|
2026-05-19 10:48:15.029 [main] INFO c.e.d.controller.CrawlerController - CrawlerController initialized with 4 commands
|
|
2026-05-19 10:48:15.031 [main] INFO c.e.datacollect.view.ConsoleView - Success: Welcome to CLI Crawler (w10_3)! Type help for commands.
|
|
2026-05-19 10:48:15.032 [main] INFO com.example.datacollect.Main - Application ready, waiting for input
|
|
2026-05-19 10:48:15.033 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 10:49:49.548 [main] WARN c.e.d.controller.CrawlerController - Unknown command: crawl<https://news.hnu.edu.cn>
|
|
2026-05-19 10:49:49.548 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Unknown command: crawl<https://news.hnu.edu.cn>
|
|
2026-05-19 10:49:49.559 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:04:21.863 [main] INFO com.example.datacollect.Main - Starting CLI Crawler application
|
|
2026-05-19 11:04:21.878 [main] INFO c.e.d.strategy.StrategyFactory - StrategyFactory initialized with 6 strategies
|
|
2026-05-19 11:04:21.883 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: help
|
|
2026-05-19 11:04:21.884 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: list
|
|
2026-05-19 11:04:21.885 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: crawl
|
|
2026-05-19 11:04:21.886 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: exit
|
|
2026-05-19 11:04:21.886 [main] INFO c.e.d.controller.CrawlerController - CrawlerController initialized with 4 commands
|
|
2026-05-19 11:04:21.886 [main] INFO c.e.datacollect.view.ConsoleView - Success: Welcome to CLI Crawler (w10_3)! Type help for commands.
|
|
2026-05-19 11:04:21.887 [main] INFO com.example.datacollect.Main - Application ready, waiting for input
|
|
2026-05-19 11:04:21.887 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:05:05.592 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
|
|
2026-05-19 11:05:05.594 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: <https://s.weibo.com>
|
|
2026-05-19 11:05:05.596 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL <https://s.weibo.com>: false
|
|
2026-05-19 11:05:05.596 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL <https://s.weibo.com>: false
|
|
2026-05-19 11:05:05.597 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL <https://s.weibo.com>: false
|
|
2026-05-19 11:05:05.603 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL <https://s.weibo.com>: true
|
|
2026-05-19 11:05:05.604 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeiboHotStrategy for URL: <https://s.weibo.com>
|
|
2026-05-19 11:05:05.606 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: <https://s.weibo.com>
|
|
2026-05-19 11:05:05.621 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: <https://s.weibo.com>
|
|
2026-05-19 11:05:05.626 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: <https://s.weibo.com>
|
|
2026-05-19 11:05:05.667 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 1 failed for URL <https://s.weibo.com>: The supplied URL, '<https://s.weibo.com>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:05:05.669 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
|
|
2026-05-19 11:05:06.672 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 2 to fetch URL: <https://s.weibo.com>
|
|
2026-05-19 11:05:06.675 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 2 failed for URL <https://s.weibo.com>: The supplied URL, '<https://s.weibo.com>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:05:06.677 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
|
|
2026-05-19 11:05:07.689 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 3 to fetch URL: <https://s.weibo.com>
|
|
2026-05-19 11:05:07.690 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 3 failed for URL <https://s.weibo.com>: The supplied URL, '<https://s.weibo.com>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:05:07.692 [main] ERROR c.e.datacollect.command.CrawlCommand - Failed to fetch URL after 3 attempts: <https://s.weibo.com>
|
|
2026-05-19 11:05:07.692 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Failed to fetch URL after 3 attempts: <https://s.weibo.com>
|
|
2026-05-19 11:05:07.694 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:06:22.788 [main] WARN c.e.d.controller.CrawlerController - Unknown command: ceawl
|
|
2026-05-19 11:06:22.788 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Unknown command: ceawl
|
|
2026-05-19 11:06:22.791 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:06:50.556 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
|
|
2026-05-19 11:06:50.557 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: <
|
|
2026-05-19 11:06:50.557 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL <: false
|
|
2026-05-19 11:06:50.558 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL <: false
|
|
2026-05-19 11:06:50.558 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL <: false
|
|
2026-05-19 11:06:50.558 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL <: false
|
|
2026-05-19 11:06:50.562 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL <: false
|
|
2026-05-19 11:06:50.563 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL <: false
|
|
2026-05-19 11:06:50.567 [main] WARN c.e.d.strategy.StrategyFactory - No strategy found for URL: <
|
|
2026-05-19 11:06:50.574 [main] WARN c.e.datacollect.command.CrawlCommand - No strategy found for: <
|
|
2026-05-19 11:06:50.576 [main] ERROR c.e.datacollect.view.ConsoleView - Error: No strategy found for: <
|
|
2026-05-19 11:06:50.580 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:07:24.657 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
|
|
2026-05-19 11:07:24.659 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: <https://tv.cctv.com>
|
|
2026-05-19 11:07:24.659 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL <https://tv.cctv.com>: false
|
|
2026-05-19 11:07:24.659 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL <https://tv.cctv.com>: false
|
|
2026-05-19 11:07:24.661 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL <https://tv.cctv.com>: false
|
|
2026-05-19 11:07:24.663 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL <https://tv.cctv.com>: false
|
|
2026-05-19 11:07:24.666 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL <https://tv.cctv.com>: true
|
|
2026-05-19 11:07:24.667 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy CctvNewsStrategy for URL: <https://tv.cctv.com>
|
|
2026-05-19 11:07:24.668 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: <https://tv.cctv.com>
|
|
2026-05-19 11:07:24.669 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: <https://tv.cctv.com>
|
|
2026-05-19 11:07:24.671 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: <https://tv.cctv.com>
|
|
2026-05-19 11:07:24.675 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 1 failed for URL <https://tv.cctv.com>: The supplied URL, '<https://tv.cctv.com>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:07:24.676 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
|
|
2026-05-19 11:07:25.678 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 2 to fetch URL: <https://tv.cctv.com>
|
|
2026-05-19 11:07:25.681 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 2 failed for URL <https://tv.cctv.com>: The supplied URL, '<https://tv.cctv.com>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:07:25.682 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
|
|
2026-05-19 11:07:26.696 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 3 to fetch URL: <https://tv.cctv.com>
|
|
2026-05-19 11:07:26.698 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 3 failed for URL <https://tv.cctv.com>: The supplied URL, '<https://tv.cctv.com>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:07:26.700 [main] ERROR c.e.datacollect.command.CrawlCommand - Failed to fetch URL after 3 attempts: <https://tv.cctv.com>
|
|
2026-05-19 11:07:26.701 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Failed to fetch URL after 3 attempts: <https://tv.cctv.com>
|
|
2026-05-19 11:07:26.702 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:14:42.973 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
|
|
2026-05-19 11:14:42.975 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:42.975 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL <https://www.tianqi.com/changsha>: false
|
|
2026-05-19 11:14:42.988 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL <https://www.tianqi.com/changsha>: false
|
|
2026-05-19 11:14:43.005 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL <https://www.tianqi.com/changsha>: false
|
|
2026-05-19 11:14:43.011 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL <https://www.tianqi.com/changsha>: false
|
|
2026-05-19 11:14:43.015 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL <https://www.tianqi.com/changsha>: false
|
|
2026-05-19 11:14:43.016 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL <https://www.tianqi.com/changsha>: true
|
|
2026-05-19 11:14:43.017 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeatherStrategy for URL: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:43.019 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:43.030 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:43.038 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:43.039 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 1 failed for URL <https://www.tianqi.com/changsha>: The supplied URL, '<https://www.tianqi.com/changsha>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:14:43.048 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
|
|
2026-05-19 11:14:44.062 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 2 to fetch URL: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:44.066 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 2 failed for URL <https://www.tianqi.com/changsha>: The supplied URL, '<https://www.tianqi.com/changsha>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:14:44.067 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
|
|
2026-05-19 11:14:45.078 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 3 to fetch URL: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:45.079 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 3 failed for URL <https://www.tianqi.com/changsha>: The supplied URL, '<https://www.tianqi.com/changsha>', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
|
|
2026-05-19 11:14:45.082 [main] ERROR c.e.datacollect.command.CrawlCommand - Failed to fetch URL after 3 attempts: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:45.083 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Failed to fetch URL after 3 attempts: <https://www.tianqi.com/changsha>
|
|
2026-05-19 11:14:45.090 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:17:17.250 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
|
|
2026-05-19 11:17:17.251 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: https://www.tianqi.com/changsha
|
|
2026-05-19 11:17:17.263 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:17:17.266 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL https://www.tianqi.com/changsha: true
|
|
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeatherStrategy for URL: https://www.tianqi.com/changsha
|
|
2026-05-19 11:17:17.269 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: https://www.tianqi.com/changsha
|
|
2026-05-19 11:17:17.269 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: https://www.tianqi.com/changsha
|
|
2026-05-19 11:17:17.269 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: https://www.tianqi.com/changsha
|
|
2026-05-19 11:17:18.361 [main] DEBUG c.e.d.strategy.WeatherStrategy - Parsing weather page: https://www.tianqi.com/changsha
|
|
2026-05-19 11:17:18.388 [main] INFO c.e.d.strategy.WeatherStrategy - Parsed 1 weather items
|
|
2026-05-19 11:17:18.391 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 2026年05月19日 长沙天气
|
|
2026-05-19 11:17:18.395 [main] INFO c.e.datacollect.command.CrawlCommand - Successfully crawled 1 articles from https://www.tianqi.com/changsha
|
|
2026-05-19 11:17:18.395 [main] INFO c.e.datacollect.view.ConsoleView - Success: Crawled 1 articles.
|
|
2026-05-19 11:17:18.395 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:17:25.633 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: list
|
|
2026-05-19 11:17:25.634 [main] DEBUG c.e.datacollect.command.ListCommand - Listing articles
|
|
2026-05-19 11:17:25.635 [main] DEBUG c.e.d.repository.ArticleRepository - Returning 1 articles (unmodifiable)
|
|
2026-05-19 11:17:25.636 [main] DEBUG c.e.datacollect.view.ConsoleView - Displaying 1 articles
|
|
2026-05-19 11:17:25.660 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:18:00.938 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
|
|
2026-05-19 11:18:00.939 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: https://tv.cctv.com
|
|
2026-05-19 11:18:00.939 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL https://tv.cctv.com: false
|
|
2026-05-19 11:18:00.942 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL https://tv.cctv.com: false
|
|
2026-05-19 11:18:00.942 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL https://tv.cctv.com: false
|
|
2026-05-19 11:18:00.948 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL https://tv.cctv.com: false
|
|
2026-05-19 11:18:00.950 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL https://tv.cctv.com: true
|
|
2026-05-19 11:18:00.950 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy CctvNewsStrategy for URL: https://tv.cctv.com
|
|
2026-05-19 11:18:00.951 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: https://tv.cctv.com
|
|
2026-05-19 11:18:00.951 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: https://tv.cctv.com
|
|
2026-05-19 11:18:00.952 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: https://tv.cctv.com
|
|
2026-05-19 11:18:01.315 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - Parsing CCTV news page: https://tv.cctv.com
|
|
2026-05-19 11:18:01.318 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - Found 0 news items
|
|
2026-05-19 11:18:01.402 [main] INFO c.e.d.strategy.CctvNewsStrategy - Parsed 189 news from CCTV
|
|
2026-05-19 11:18:01.403 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直 播
|
|
2026-05-19 11:18:01.403 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 节目单
|
|
2026-05-19 11:18:01.404 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 频道大全
|
|
2026-05-19 11:18:01.404 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 栏目大全
|
|
2026-05-19 11:18:01.404 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 主 持 人
|
|
2026-05-19 11:18:01.405 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 听音
|
|
2026-05-19 11:18:01.406 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 消费主张
|
|
2026-05-19 11:18:01.406 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 文化十分
|
|
2026-05-19 11:18:01.407 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事科技
|
|
2026-05-19 11:18:01.407 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日关注
|
|
2026-05-19 11:18:01.407 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 天下足球
|
|
2026-05-19 11:18:01.408 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综合
|
|
2026-05-19 11:18:01.408 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻
|
|
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 财经
|
|
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综艺
|
|
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 亚洲
|
|
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育
|
|
2026-05-19 11:18:01.411 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电影
|
|
2026-05-19 11:18:01.411 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国防军事
|
|
2026-05-19 11:18:01.411 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电视剧
|
|
2026-05-19 11:18:01.412 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 纪录
|
|
2026-05-19 11:18:01.412 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科教
|
|
2026-05-19 11:18:01.412 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 戏曲
|
|
2026-05-19 11:18:01.413 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 社会与法
|
|
2026-05-19 11:18:01.413 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 少儿
|
|
2026-05-19 11:18:01.413 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 音乐
|
|
2026-05-19 11:18:01.414 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 奥林匹克
|
|
2026-05-19 11:18:01.420 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 农业农村
|
|
2026-05-19 11:18:01.437 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 欧洲
|
|
2026-05-19 11:18:01.437 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 美洲
|
|
2026-05-19 11:18:01.439 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育赛事
|
|
2026-05-19 11:18:01.439 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微视频
|
|
2026-05-19 11:18:01.440 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 搜片库
|
|
2026-05-19 11:18:01.440 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 找栏目
|
|
2026-05-19 11:18:01.440 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国经济大讲堂
|
|
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 健康中国
|
|
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 听音
|
|
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科幻地带
|
|
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 如何解读中美关系新定位?
|
|
2026-05-19 11:18:01.442 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 垃圾围村 曝多地违法倾倒乱象
|
|
2026-05-19 11:18:01.442 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 四川“大摆荡”坠亡事故调查
|
|
2026-05-19 11:18:01.443 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 孤寡老人留百万遗产 由谁继承
|
|
2026-05-19 11:18:01.443 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 带你破解肠道健康的隐秘真相
|
|
2026-05-19 11:18:01.444 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 坐着高铁看中国
|
|
2026-05-19 11:18:01.445 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: AI基建新潮涌
|
|
2026-05-19 11:18:01.451 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 一帘光影三代人
|
|
2026-05-19 11:18:01.451 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 如何把阳光“存进”大海里
|
|
2026-05-19 11:18:01.452 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 驾驭“蓝鲸” 潜航深海
|
|
2026-05-19 11:18:01.452 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 广告
|
|
2026-05-19 11:18:01.454 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 广告
|
|
2026-05-19 11:18:01.454 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 广告
|
|
2026-05-19 11:18:01.456 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 唐都生活指南(第二部)
|
|
2026-05-19 11:18:01.456 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 金石探文明
|
|
2026-05-19 11:18:01.457 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 赤壁之战
|
|
2026-05-19 11:18:01.457 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 先秦智慧
|
|
2026-05-19 11:18:01.458 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国史通鉴·两晋南北朝篇
|
|
2026-05-19 11:18:01.458 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《叶问》郑嘉颖乱世之中寻求武学真谛
|
|
2026-05-19 11:18:01.459 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《曾少年》张一山关晓彤爱情事业两不误
|
|
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《炊事班的故事Ⅱ》密集承包你的笑点
|
|
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《火蓝刀锋》海军陆战队亮刀锋展军魂
|
|
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《问天》致敬中国航天数十年的峥嵘岁月
|
|
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直播
|
|
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 节目单
|
|
2026-05-19 11:18:01.462 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 频道大全
|
|
2026-05-19 11:18:01.462 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 栏目大全
|
|
2026-05-19 11:18:01.462 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综合
|
|
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 财经
|
|
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综艺
|
|
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中文国际
|
|
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育
|
|
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电影
|
|
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国防军事
|
|
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电视剧
|
|
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 纪录
|
|
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科教
|
|
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 戏曲
|
|
2026-05-19 11:18:01.466 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 社会与法
|
|
2026-05-19 11:18:01.466 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻
|
|
2026-05-19 11:18:01.466 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 少儿
|
|
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 音乐
|
|
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育赛事
|
|
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 农业农村
|
|
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻联播
|
|
2026-05-19 11:18:01.468 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 挑战不可能
|
|
2026-05-19 11:18:01.468 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 开讲啦
|
|
2026-05-19 11:18:01.470 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 人与自然
|
|
2026-05-19 11:18:01.471 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 生活提示
|
|
2026-05-19 11:18:01.471 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国诗词大会
|
|
2026-05-19 11:18:01.472 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 对话
|
|
2026-05-19 11:18:01.472 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 第一时间
|
|
2026-05-19 11:18:01.473 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 一槌定音
|
|
2026-05-19 11:18:01.474 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 回家吃饭
|
|
2026-05-19 11:18:01.474 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 生活家
|
|
2026-05-19 11:18:01.475 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 金牌喜剧班
|
|
2026-05-19 11:18:01.475 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 舞蹈世界
|
|
2026-05-19 11:18:01.475 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 星光大道
|
|
2026-05-19 11:18:01.476 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 向幸福出发
|
|
2026-05-19 11:18:01.476 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 回声嘹亮
|
|
2026-05-19 11:18:01.476 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 健康中国
|
|
2026-05-19 11:18:01.477 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 深度国际
|
|
2026-05-19 11:18:01.478 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国文艺
|
|
2026-05-19 11:18:01.478 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国家记忆
|
|
2026-05-19 11:18:01.478 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 华人故事
|
|
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 运动一起赢
|
|
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体坛快讯
|
|
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 足球之夜
|
|
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日影评
|
|
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 时光军史馆
|
|
2026-05-19 11:18:01.481 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军迷行天下
|
|
2026-05-19 11:18:01.482 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事报道
|
|
2026-05-19 11:18:01.483 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事纪实
|
|
2026-05-19 11:18:01.483 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事纪录
|
|
2026-05-19 11:18:01.484 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国防故事
|
|
2026-05-19 11:18:01.484 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 剧说很好看
|
|
2026-05-19 11:18:01.484 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 寰宇视野
|
|
2026-05-19 11:18:01.485 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 特别呈现
|
|
2026-05-19 11:18:01.485 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 9视频
|
|
2026-05-19 11:18:01.485 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 百家讲坛
|
|
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 健康之路
|
|
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科幻地带
|
|
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 考古公开课
|
|
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 解码科技史
|
|
2026-05-19 11:18:01.487 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 戏曲青年说
|
|
2026-05-19 11:18:01.487 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国京剧音配像精粹
|
|
2026-05-19 11:18:01.487 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 过把瘾
|
|
2026-05-19 11:18:01.488 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 宝贝亮相吧
|
|
2026-05-19 11:18:01.488 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 剧懂法
|
|
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 一线
|
|
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 普法剧场
|
|
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 天网
|
|
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 心理访谈
|
|
2026-05-19 11:18:01.490 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 律师来了
|
|
2026-05-19 11:18:01.490 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 焦点访谈
|
|
2026-05-19 11:18:01.490 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 每周质量报告
|
|
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 午夜新闻
|
|
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻调查
|
|
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻周刊
|
|
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 周末动画片
|
|
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 音乐快递
|
|
2026-05-19 11:18:01.493 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻袋袋裤
|
|
2026-05-19 11:18:01.493 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 智慧树
|
|
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 快乐童行
|
|
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 乐享汇
|
|
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: CCTV音乐厅
|
|
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国音乐电视
|
|
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 聆听时刻
|
|
2026-05-19 11:18:01.496 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 童声唱
|
|
2026-05-19 11:18:01.496 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 冰球冰球
|
|
2026-05-19 11:18:01.496 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 田野欢歌
|
|
2026-05-19 11:18:01.497 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 三农长短说
|
|
2026-05-19 11:18:01.498 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 三农群英汇
|
|
2026-05-19 11:18:01.498 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 大地讲堂
|
|
2026-05-19 11:18:01.498 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 片库
|
|
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 热榜
|
|
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 看点
|
|
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微视频
|
|
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: ai美食
|
|
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 主持人
|
|
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 全部
|
|
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直播
|
|
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 节目单
|
|
2026-05-19 11:18:01.501 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 热榜
|
|
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 主持人
|
|
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 听音
|
|
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻联播
|
|
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 焦点访谈
|
|
2026-05-19 11:18:01.503 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 等着我
|
|
2026-05-19 11:18:01.503 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日说法
|
|
2026-05-19 11:18:01.503 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 海峡两岸
|
|
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日关注
|
|
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日亚洲
|
|
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 更多
|
|
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电视剧
|
|
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 动画片
|
|
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 纪录片
|
|
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 特别节目
|
|
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 更多
|
|
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微视频
|
|
2026-05-19 11:18:01.507 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 视频百科
|
|
2026-05-19 11:18:01.508 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微故事
|
|
2026-05-19 11:18:01.510 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: ai美食
|
|
2026-05-19 11:18:01.510 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日热门
|
|
2026-05-19 11:18:01.512 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直播导视
|
|
2026-05-19 11:18:01.512 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 精品
|
|
2026-05-19 11:18:01.514 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 片库
|
|
2026-05-19 11:18:01.515 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 看点
|
|
2026-05-19 11:18:01.516 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 央视大全
|
|
2026-05-19 11:18:01.518 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 手机访问 扫描下载央 视影音客户端 扫一扫 手机继续看
|
|
2026-05-19 11:18:01.521 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 返回顶部
|
|
2026-05-19 11:18:01.521 [main] INFO c.e.datacollect.command.CrawlCommand - Successfully crawled 189 articles from https://tv.cctv.com
|
|
2026-05-19 11:18:01.521 [main] INFO c.e.datacollect.view.ConsoleView - Success: Crawled 189 articles.
|
|
2026-05-19 11:18:01.522 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:18:12.244 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: list
|
|
2026-05-19 11:18:12.244 [main] DEBUG c.e.datacollect.command.ListCommand - Listing articles
|
|
2026-05-19 11:18:12.245 [main] DEBUG c.e.d.repository.ArticleRepository - Returning 190 articles (unmodifiable)
|
|
2026-05-19 11:18:12.246 [main] DEBUG c.e.datacollect.view.ConsoleView - Displaying 190 articles
|
|
2026-05-19 11:18:12.317 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:18:49.649 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
|
|
2026-05-19 11:18:49.650 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: https://www.tianqi.com/changsha
|
|
2026-05-19 11:18:49.651 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:18:49.651 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:18:49.651 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:18:49.652 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:18:49.652 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL https://www.tianqi.com/changsha: false
|
|
2026-05-19 11:18:49.652 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL https://www.tianqi.com/changsha: true
|
|
2026-05-19 11:18:49.663 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeatherStrategy for URL: https://www.tianqi.com/changsha
|
|
2026-05-19 11:18:49.666 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: https://www.tianqi.com/changsha
|
|
2026-05-19 11:18:49.668 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: https://www.tianqi.com/changsha
|
|
2026-05-19 11:18:49.669 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: https://www.tianqi.com/changsha
|
|
2026-05-19 11:18:49.912 [main] DEBUG c.e.d.strategy.WeatherStrategy - Parsing weather page: https://www.tianqi.com/changsha
|
|
2026-05-19 11:18:49.921 [main] INFO c.e.d.strategy.WeatherStrategy - Parsed 1 weather items
|
|
2026-05-19 11:18:49.923 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 2026年05月19日 长沙天气
|
|
2026-05-19 11:18:49.941 [main] INFO c.e.datacollect.command.CrawlCommand - Successfully crawled 1 articles from https://www.tianqi.com/changsha
|
|
2026-05-19 11:18:49.945 [main] INFO c.e.datacollect.view.ConsoleView - Success: Crawled 1 articles.
|
|
2026-05-19 11:18:49.948 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
2026-05-19 11:18:54.406 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: list
|
|
2026-05-19 11:18:54.406 [main] DEBUG c.e.datacollect.command.ListCommand - Listing articles
|
|
2026-05-19 11:18:54.407 [main] DEBUG c.e.d.repository.ArticleRepository - Returning 191 articles (unmodifiable)
|
|
2026-05-19 11:18:54.407 [main] DEBUG c.e.datacollect.view.ConsoleView - Displaying 191 articles
|
|
2026-05-19 11:18:54.473 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
|
|
|