├── data
    ├── crawler.sqlite
    └── emails.csv
├── .gitignore
├── README.md
├── LICENSE
├── docs
    ├── index.rst
    ├── make.bat
    ├── Makefile
    └── conf.py
├── ColorStreamHandler.py
└── logs
    └── pycrawler.log


/data/crawler.sqlite:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/andripwn/crawler-python/HEAD/data/crawler.sqlite


--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
1 | # Build and Release Folders
2 | bin-debug/
3 | bin-release/
4 | [Oo]bj/
5 | [Bb]in/
6 |
7 | # Other files and folders
8 | .settings/
9 |
10 | # Executables
11 | *.swf
12 | *.air
13 | *.ipa
14 | *.apk
15 |
16 | # Project files, i.e. `.project`, `.actionScriptProperties` and `.flexProperties`
17 | # should NOT be excluded as they contain compiler settings and other important
18 | # information for Eclipse / Flash Builder.
19 |


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | Python Email Crawler
2 | ====================
3 |
4 | This Python script searches Google for certain keywords, crawls the webpages from the results, and returns all emails found.
5 |
6 | Requirements
7 | ------------
8 |
9 | - sqlalchemy
10 | - urllib2
11 |
12 | If you don't have sqlalchemy, simply run `sudo pip install sqlalchemy`.
13 |
14 |
15 | Usage
16 | -------
17 |
18 | Start the search with a keyword. We use "iphone developers" as an example.
19 |
20 |     python email_crawler.py "iphone developers"
21 |
22 | The search and crawling process will take quite a while, as it retrieves up to 500 search results (from Google) and crawls up to 2 levels deep.
It should crawl around 10,000 webpages :) 23 | 24 | After the process finished, run this command to get the list of emails 25 | 26 | python email_crawler.py --emails 27 | 28 | The emails will be saved in ./data/emails.csv 29 | -------------------------------------------------------------------------------- /data/emails.csv: -------------------------------------------------------------------------------- 1 | 2 | ryan@macrumors.com 3 | privacy@wikimedia.org 4 | arn@normalkid.com 5 | 5588e3adb3e545248e893d6f2fc41ba2@wwwb-sentry.us.archive.org 6 | mike@wired.com 7 | jordan@techcrunch.com 8 | mitchel@macrumors.com 9 | jon_phillips@wired.com 10 | admin@docketalarm.com 11 | juli@macrumors.com 12 | partners@venturebeat.com 13 | Nathan_Olivarez-Giles@wired.com 14 | tim@macrumors.com 15 | Print-Digital-Bundles@2x.png 16 | info@mycompany.io 17 | alexandra_chang@wired.com 18 | support@docketalarm.com 19 | joel@gizmodo.com 20 | gadgetnews@wired.com 21 | shieber@techcrunch.com 22 | dan@macrumors.com 23 | inifixme@gmail.com 24 | joe@macrumors.com 25 | nathan_hurst@wired.com 26 | christina_bonnington@wired.com 27 | ericslivka@macrumors.com 28 | marianne@macrumors.com 29 | me@bzamayo.com 30 | chris.j@macrumors.com 31 | mcclellan@apple.com 32 | Roberto_Baldwin@wired.com 33 | kirsten.korosec@techcrunch.com 34 | contact@stevemoser.org 35 | natasha.m@techcrunch.com 36 | tips@macrumors.com 37 | megan.geuss@arstechnica.com 38 | info@archive.org 39 | wired.dylan+joel@gmail.com 40 | sales@docketalarm.com -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- 1 | MIT License 2 | 3 | Copyright (c) 2020 Duck Research 4 | 5 | Permission is hereby granted, free of charge, to any person obtaining a copy 6 | of this software and associated documentation files (the "Software"), to deal 7 | in the Software without restriction, including without limitation the rights 8 | to use, copy, 
modify, merge, publish, distribute, sublicense, and/or sell
9 | copies of the Software, and to permit persons to whom the Software is
10 | furnished to do so, subject to the following conditions:
11 |
12 | The above copyright notice and this permission notice shall be included in all
13 | copies or substantial portions of the Software.
14 |
15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21 | SOFTWARE.
22 |


--------------------------------------------------------------------------------
/docs/index.rst:
--------------------------------------------------------------------------------
1 | .. Python Email Crawler documentation master file, created by
2 |    sphinx-quickstart on Fri Aug 3 12:26:56 2012.
3 |    You can adapt this file completely to your liking, but it should at least
4 |    contain the root `toctree` directive.
5 |
6 | Python Email Crawler's documentation!
7 | ================================================
8 |
9 | This Python script searches for certain keywords on Google, crawls the webpages from the results, and returns all emails found.
10 |
11 | For each result from Google, the crawler will crawl that page for an email. If it cannot find an email, it will crawl the linked pages (up to the 2nd level).
12 |
13 | This is useful when the result returns the homepage of a website, and the email is usually on the Contact Us page.
14 |
15 | ------------
16 | Requirements
17 | ------------
18 |
19 | * sqlalchemy
20 | * urllib2
21 |
22 |
23 | ------
24 | Usage
25 | ------
26 | Start the search with a keyword. We use "iphone developers" as an example.
27 |
28 | .. code-block:: bash
29 |
30 |     $ ./email_crawler.py "iphone developers"
31 |
32 | The search and crawling process will take quite a while, as it retrieves up to 500 search results (from Google) and crawls up to 2 levels deep. It should crawl around 10,000 webpages :)
33 |
34 | After the process has finished, run this command to get the list of emails:
35 |
36 | .. code-block:: bash
37 |
38 |     $ ./email_crawler.py --emails
39 |
40 | The emails will be saved in ./data/emails.csv
41 |
42 |
43 | Contents:
44 |
45 | .. toctree::
46 |    :maxdepth: 2
47 |
48 |
49 |
50 | Indices and tables
51 | ==================
52 |
53 | * :ref:`genindex`
54 | * :ref:`modindex`
55 | * :ref:`search`
56 |
57 |


--------------------------------------------------------------------------------
/ColorStreamHandler.py:
--------------------------------------------------------------------------------
1 | import logging
2 | import curses
3 |
4 | class ColorStreamHandler(logging.Handler):
5 |
6 |     def __init__(self, use_colors):
7 |         logging.Handler.__init__(self)
8 |         self.use_colors = use_colors
9 |
10 |         # Initialize environment
11 |         curses.setupterm()
12 |
13 |         # Get the foreground color attribute for this environment
14 |         self.fcap = curses.tigetstr('setaf')
15 |
16 |         #Get the normal attribute
17 |         self.COLOR_NORMAL = curses.tigetstr('sgr0')
18 |
19 |         # Get + Save the color sequences
20 |         self.COLOR_INFO = curses.tparm(self.fcap, curses.COLOR_GREEN)
21 |         self.COLOR_ERROR = curses.tparm(self.fcap, curses.COLOR_RED)
22 |         self.COLOR_WARNING = curses.tparm(self.fcap, curses.COLOR_YELLOW)
23 |         self.COLOR_DEBUG = curses.tparm(self.fcap, curses.COLOR_BLUE)
24 |
25 |     def color(self, msg, level):
26 |         if level == "INFO":
27 |             return "%s%s%s" % (self.COLOR_INFO, msg, self.COLOR_NORMAL)
28 |         elif level == "WARNING":
29 |             return "%s%s%s" % (self.COLOR_WARNING, msg, self.COLOR_NORMAL)
30 |         elif level == "ERROR":
31 |             return "%s%s%s" % (self.COLOR_ERROR, msg, self.COLOR_NORMAL)
32 |         elif level == "DEBUG":
33 |
            return "%s%s%s" % (self.COLOR_DEBUG, msg, self.COLOR_NORMAL)
34 |         else:
35 |             return msg
36 |
37 |     def emit(self, record):
38 |         record.msg = record.msg.encode('utf-8', 'ignore')
39 |         msg = self.format(record)
40 |
41 |         # This just removes the date and milliseconds from asctime
42 |         temp = msg.split(']')
43 |         msg = '[' + temp[0].split(' ')[1].split(',')[0] + ']' + temp[1]
44 |
45 |         if self.use_colors:
46 |             msg = self.color(msg, record.levelname)
47 |         print msg
48 |
49 | # 'record' has the following attributes:
50 | # threadName
51 | # name
52 | # thread
53 | # created
54 | # process
55 | # processName
56 | # args
57 | # module
58 | # filename
59 | # levelno
60 | # exc_text
61 | # pathname
62 | # lineno
63 | # msg
64 | # exc_info
65 | # funcName
66 | # relativeCreated
67 | # levelname
68 | # msecs


--------------------------------------------------------------------------------
/docs/make.bat:
--------------------------------------------------------------------------------
1 | @ECHO OFF
2 |
3 | REM Command file for Sphinx documentation
4 |
5 | if "%SPHINXBUILD%" == "" (
6 | 	set SPHINXBUILD=sphinx-build
7 | )
8 | set BUILDDIR=_build
9 | set ALLSPHINXOPTS=-d %BUILDDIR%/doctrees %SPHINXOPTS% .
10 | set I18NSPHINXOPTS=%SPHINXOPTS% .
11 | if NOT "%PAPER%" == "" (
12 | 	set ALLSPHINXOPTS=-D latex_paper_size=%PAPER% %ALLSPHINXOPTS%
13 | 	set I18NSPHINXOPTS=-D latex_paper_size=%PAPER% %I18NSPHINXOPTS%
14 | )
15 |
16 | if "%1" == "" goto help
17 |
18 | if "%1" == "help" (
19 | 	:help
20 | 	echo.Please use `make ^<target^>` where ^<target^> is one of
21 | 	echo.  html       to make standalone HTML files
22 | 	echo.  dirhtml    to make HTML files named index.html in directories
23 | 	echo.  singlehtml to make a single large HTML file
24 | 	echo.  pickle     to make pickle files
25 | 	echo.  json       to make JSON files
26 | 	echo.  htmlhelp   to make HTML files and a HTML help project
27 | 	echo.  qthelp     to make HTML files and a qthelp project
28 | 	echo.  devhelp    to make HTML files and a Devhelp project
29 | 	echo.
epub to make an epub 30 | echo. latex to make LaTeX files, you can set PAPER=a4 or PAPER=letter 31 | echo. text to make text files 32 | echo. man to make manual pages 33 | echo. texinfo to make Texinfo files 34 | echo. gettext to make PO message catalogs 35 | echo. changes to make an overview over all changed/added/deprecated items 36 | echo. linkcheck to check all external links for integrity 37 | echo. doctest to run all doctests embedded in the documentation if enabled 38 | goto end 39 | ) 40 | 41 | if "%1" == "clean" ( 42 | for /d %%i in (%BUILDDIR%\*) do rmdir /q /s %%i 43 | del /q /s %BUILDDIR%\* 44 | goto end 45 | ) 46 | 47 | if "%1" == "html" ( 48 | %SPHINXBUILD% -b html %ALLSPHINXOPTS% %BUILDDIR%/html 49 | if errorlevel 1 exit /b 1 50 | echo. 51 | echo.Build finished. The HTML pages are in %BUILDDIR%/html. 52 | goto end 53 | ) 54 | 55 | if "%1" == "dirhtml" ( 56 | %SPHINXBUILD% -b dirhtml %ALLSPHINXOPTS% %BUILDDIR%/dirhtml 57 | if errorlevel 1 exit /b 1 58 | echo. 59 | echo.Build finished. The HTML pages are in %BUILDDIR%/dirhtml. 60 | goto end 61 | ) 62 | 63 | if "%1" == "singlehtml" ( 64 | %SPHINXBUILD% -b singlehtml %ALLSPHINXOPTS% %BUILDDIR%/singlehtml 65 | if errorlevel 1 exit /b 1 66 | echo. 67 | echo.Build finished. The HTML pages are in %BUILDDIR%/singlehtml. 68 | goto end 69 | ) 70 | 71 | if "%1" == "pickle" ( 72 | %SPHINXBUILD% -b pickle %ALLSPHINXOPTS% %BUILDDIR%/pickle 73 | if errorlevel 1 exit /b 1 74 | echo. 75 | echo.Build finished; now you can process the pickle files. 76 | goto end 77 | ) 78 | 79 | if "%1" == "json" ( 80 | %SPHINXBUILD% -b json %ALLSPHINXOPTS% %BUILDDIR%/json 81 | if errorlevel 1 exit /b 1 82 | echo. 83 | echo.Build finished; now you can process the JSON files. 84 | goto end 85 | ) 86 | 87 | if "%1" == "htmlhelp" ( 88 | %SPHINXBUILD% -b htmlhelp %ALLSPHINXOPTS% %BUILDDIR%/htmlhelp 89 | if errorlevel 1 exit /b 1 90 | echo. 
91 | 	echo.Build finished; now you can run HTML Help Workshop with the ^
92 | .hhp project file in %BUILDDIR%/htmlhelp.
93 | 	goto end
94 | )
95 |
96 | if "%1" == "qthelp" (
97 | 	%SPHINXBUILD% -b qthelp %ALLSPHINXOPTS% %BUILDDIR%/qthelp
98 | 	if errorlevel 1 exit /b 1
99 | 	echo.
100 | 	echo.Build finished; now you can run "qcollectiongenerator" with the ^
101 | .qhcp project file in %BUILDDIR%/qthelp, like this:
102 | 	echo.^> qcollectiongenerator %BUILDDIR%\qthelp\PythonEmailCrawler.qhcp
103 | 	echo.To view the help file:
104 | 	echo.^> assistant -collectionFile %BUILDDIR%\qthelp\PythonEmailCrawler.qhc
105 | 	goto end
106 | )
107 |
108 | if "%1" == "devhelp" (
109 | 	%SPHINXBUILD% -b devhelp %ALLSPHINXOPTS% %BUILDDIR%/devhelp
110 | 	if errorlevel 1 exit /b 1
111 | 	echo.
112 | 	echo.Build finished.
113 | 	goto end
114 | )
115 |
116 | if "%1" == "epub" (
117 | 	%SPHINXBUILD% -b epub %ALLSPHINXOPTS% %BUILDDIR%/epub
118 | 	if errorlevel 1 exit /b 1
119 | 	echo.
120 | 	echo.Build finished. The epub file is in %BUILDDIR%/epub.
121 | 	goto end
122 | )
123 |
124 | if "%1" == "latex" (
125 | 	%SPHINXBUILD% -b latex %ALLSPHINXOPTS% %BUILDDIR%/latex
126 | 	if errorlevel 1 exit /b 1
127 | 	echo.
128 | 	echo.Build finished; the LaTeX files are in %BUILDDIR%/latex.
129 | 	goto end
130 | )
131 |
132 | if "%1" == "text" (
133 | 	%SPHINXBUILD% -b text %ALLSPHINXOPTS% %BUILDDIR%/text
134 | 	if errorlevel 1 exit /b 1
135 | 	echo.
136 | 	echo.Build finished. The text files are in %BUILDDIR%/text.
137 | 	goto end
138 | )
139 |
140 | if "%1" == "man" (
141 | 	%SPHINXBUILD% -b man %ALLSPHINXOPTS% %BUILDDIR%/man
142 | 	if errorlevel 1 exit /b 1
143 | 	echo.
144 | 	echo.Build finished. The manual pages are in %BUILDDIR%/man.
145 | 	goto end
146 | )
147 |
148 | if "%1" == "texinfo" (
149 | 	%SPHINXBUILD% -b texinfo %ALLSPHINXOPTS% %BUILDDIR%/texinfo
150 | 	if errorlevel 1 exit /b 1
151 | 	echo.
152 | 	echo.Build finished. The Texinfo files are in %BUILDDIR%/texinfo.
153 | goto end 154 | ) 155 | 156 | if "%1" == "gettext" ( 157 | %SPHINXBUILD% -b gettext %I18NSPHINXOPTS% %BUILDDIR%/locale 158 | if errorlevel 1 exit /b 1 159 | echo. 160 | echo.Build finished. The message catalogs are in %BUILDDIR%/locale. 161 | goto end 162 | ) 163 | 164 | if "%1" == "changes" ( 165 | %SPHINXBUILD% -b changes %ALLSPHINXOPTS% %BUILDDIR%/changes 166 | if errorlevel 1 exit /b 1 167 | echo. 168 | echo.The overview file is in %BUILDDIR%/changes. 169 | goto end 170 | ) 171 | 172 | if "%1" == "linkcheck" ( 173 | %SPHINXBUILD% -b linkcheck %ALLSPHINXOPTS% %BUILDDIR%/linkcheck 174 | if errorlevel 1 exit /b 1 175 | echo. 176 | echo.Link check complete; look for any errors in the above output ^ 177 | or in %BUILDDIR%/linkcheck/output.txt. 178 | goto end 179 | ) 180 | 181 | if "%1" == "doctest" ( 182 | %SPHINXBUILD% -b doctest %ALLSPHINXOPTS% %BUILDDIR%/doctest 183 | if errorlevel 1 exit /b 1 184 | echo. 185 | echo.Testing of doctests in the sources finished, look at the ^ 186 | results in %BUILDDIR%/doctest/output.txt. 187 | goto end 188 | ) 189 | 190 | :end 191 | -------------------------------------------------------------------------------- /docs/Makefile: -------------------------------------------------------------------------------- 1 | # Makefile for Sphinx documentation 2 | # 3 | 4 | # You can set these variables from the command line. 5 | SPHINXOPTS = 6 | SPHINXBUILD = sphinx-build 7 | PAPER = 8 | BUILDDIR = _build 9 | 10 | # Internal variables. 11 | PAPEROPT_a4 = -D latex_paper_size=a4 12 | PAPEROPT_letter = -D latex_paper_size=letter 13 | ALLSPHINXOPTS = -d $(BUILDDIR)/doctrees $(PAPEROPT_$(PAPER)) $(SPHINXOPTS) . 14 | # the i18n builder cannot share the environment and doctrees with the others 15 | I18NSPHINXOPTS = $(PAPEROPT_$(PAPER)) $(SPHINXOPTS) . 
16 |
17 | .PHONY: help clean html dirhtml singlehtml pickle json htmlhelp qthelp devhelp epub latex latexpdf text man changes linkcheck doctest gettext
18 |
19 | help:
20 | 	@echo "Please use \`make <target>' where <target> is one of"
21 | 	@echo "  html       to make standalone HTML files"
22 | 	@echo "  dirhtml    to make HTML files named index.html in directories"
23 | 	@echo "  singlehtml to make a single large HTML file"
24 | 	@echo "  pickle     to make pickle files"
25 | 	@echo "  json       to make JSON files"
26 | 	@echo "  htmlhelp   to make HTML files and a HTML help project"
27 | 	@echo "  qthelp     to make HTML files and a qthelp project"
28 | 	@echo "  devhelp    to make HTML files and a Devhelp project"
29 | 	@echo "  epub       to make an epub"
30 | 	@echo "  latex      to make LaTeX files, you can set PAPER=a4 or PAPER=letter"
31 | 	@echo "  latexpdf   to make LaTeX files and run them through pdflatex"
32 | 	@echo "  text       to make text files"
33 | 	@echo "  man        to make manual pages"
34 | 	@echo "  texinfo    to make Texinfo files"
35 | 	@echo "  info       to make Texinfo files and run them through makeinfo"
36 | 	@echo "  gettext    to make PO message catalogs"
37 | 	@echo "  changes    to make an overview of all changed/added/deprecated items"
38 | 	@echo "  linkcheck  to check all external links for integrity"
39 | 	@echo "  doctest    to run all doctests embedded in the documentation (if enabled)"
40 |
41 | clean:
42 | 	-rm -rf $(BUILDDIR)/*
43 |
44 | html:
45 | 	$(SPHINXBUILD) -b html $(ALLSPHINXOPTS) $(BUILDDIR)/html
46 | 	@echo
47 | 	@echo "Build finished. The HTML pages are in $(BUILDDIR)/html."
48 |
49 | dirhtml:
50 | 	$(SPHINXBUILD) -b dirhtml $(ALLSPHINXOPTS) $(BUILDDIR)/dirhtml
51 | 	@echo
52 | 	@echo "Build finished. The HTML pages are in $(BUILDDIR)/dirhtml."
53 |
54 | singlehtml:
55 | 	$(SPHINXBUILD) -b singlehtml $(ALLSPHINXOPTS) $(BUILDDIR)/singlehtml
56 | 	@echo
57 | 	@echo "Build finished. The HTML page is in $(BUILDDIR)/singlehtml."
58 | 59 | pickle: 60 | $(SPHINXBUILD) -b pickle $(ALLSPHINXOPTS) $(BUILDDIR)/pickle 61 | @echo 62 | @echo "Build finished; now you can process the pickle files." 63 | 64 | json: 65 | $(SPHINXBUILD) -b json $(ALLSPHINXOPTS) $(BUILDDIR)/json 66 | @echo 67 | @echo "Build finished; now you can process the JSON files." 68 | 69 | htmlhelp: 70 | $(SPHINXBUILD) -b htmlhelp $(ALLSPHINXOPTS) $(BUILDDIR)/htmlhelp 71 | @echo 72 | @echo "Build finished; now you can run HTML Help Workshop with the" \ 73 | ".hhp project file in $(BUILDDIR)/htmlhelp." 74 | 75 | qthelp: 76 | $(SPHINXBUILD) -b qthelp $(ALLSPHINXOPTS) $(BUILDDIR)/qthelp 77 | @echo 78 | @echo "Build finished; now you can run "qcollectiongenerator" with the" \ 79 | ".qhcp project file in $(BUILDDIR)/qthelp, like this:" 80 | @echo "# qcollectiongenerator $(BUILDDIR)/qthelp/PythonEmailCrawler.qhcp" 81 | @echo "To view the help file:" 82 | @echo "# assistant -collectionFile $(BUILDDIR)/qthelp/PythonEmailCrawler.qhc" 83 | 84 | devhelp: 85 | $(SPHINXBUILD) -b devhelp $(ALLSPHINXOPTS) $(BUILDDIR)/devhelp 86 | @echo 87 | @echo "Build finished." 88 | @echo "To view the help file:" 89 | @echo "# mkdir -p $$HOME/.local/share/devhelp/PythonEmailCrawler" 90 | @echo "# ln -s $(BUILDDIR)/devhelp $$HOME/.local/share/devhelp/PythonEmailCrawler" 91 | @echo "# devhelp" 92 | 93 | epub: 94 | $(SPHINXBUILD) -b epub $(ALLSPHINXOPTS) $(BUILDDIR)/epub 95 | @echo 96 | @echo "Build finished. The epub file is in $(BUILDDIR)/epub." 97 | 98 | latex: 99 | $(SPHINXBUILD) -b latex $(ALLSPHINXOPTS) $(BUILDDIR)/latex 100 | @echo 101 | @echo "Build finished; the LaTeX files are in $(BUILDDIR)/latex." 102 | @echo "Run \`make' in that directory to run these through (pdf)latex" \ 103 | "(use \`make latexpdf' here to do that automatically)." 104 | 105 | latexpdf: 106 | $(SPHINXBUILD) -b latex $(ALLSPHINXOPTS) $(BUILDDIR)/latex 107 | @echo "Running LaTeX files through pdflatex..." 
108 | $(MAKE) -C $(BUILDDIR)/latex all-pdf 109 | @echo "pdflatex finished; the PDF files are in $(BUILDDIR)/latex." 110 | 111 | text: 112 | $(SPHINXBUILD) -b text $(ALLSPHINXOPTS) $(BUILDDIR)/text 113 | @echo 114 | @echo "Build finished. The text files are in $(BUILDDIR)/text." 115 | 116 | man: 117 | $(SPHINXBUILD) -b man $(ALLSPHINXOPTS) $(BUILDDIR)/man 118 | @echo 119 | @echo "Build finished. The manual pages are in $(BUILDDIR)/man." 120 | 121 | texinfo: 122 | $(SPHINXBUILD) -b texinfo $(ALLSPHINXOPTS) $(BUILDDIR)/texinfo 123 | @echo 124 | @echo "Build finished. The Texinfo files are in $(BUILDDIR)/texinfo." 125 | @echo "Run \`make' in that directory to run these through makeinfo" \ 126 | "(use \`make info' here to do that automatically)." 127 | 128 | info: 129 | $(SPHINXBUILD) -b texinfo $(ALLSPHINXOPTS) $(BUILDDIR)/texinfo 130 | @echo "Running Texinfo files through makeinfo..." 131 | make -C $(BUILDDIR)/texinfo info 132 | @echo "makeinfo finished; the Info files are in $(BUILDDIR)/texinfo." 133 | 134 | gettext: 135 | $(SPHINXBUILD) -b gettext $(I18NSPHINXOPTS) $(BUILDDIR)/locale 136 | @echo 137 | @echo "Build finished. The message catalogs are in $(BUILDDIR)/locale." 138 | 139 | changes: 140 | $(SPHINXBUILD) -b changes $(ALLSPHINXOPTS) $(BUILDDIR)/changes 141 | @echo 142 | @echo "The overview file is in $(BUILDDIR)/changes." 143 | 144 | linkcheck: 145 | $(SPHINXBUILD) -b linkcheck $(ALLSPHINXOPTS) $(BUILDDIR)/linkcheck 146 | @echo 147 | @echo "Link check complete; look for any errors in the above output " \ 148 | "or in $(BUILDDIR)/linkcheck/output.txt." 149 | 150 | doctest: 151 | $(SPHINXBUILD) -b doctest $(ALLSPHINXOPTS) $(BUILDDIR)/doctest 152 | @echo "Testing of doctests in the sources finished, look at the " \ 153 | "results in $(BUILDDIR)/doctest/output.txt." 
154 | -------------------------------------------------------------------------------- /docs/conf.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | # 3 | # Python Email Crawler documentation build configuration file, created by 4 | # sphinx-quickstart on Fri Aug 3 12:26:56 2012. 5 | # 6 | # This file is execfile()d with the current directory set to its containing dir. 7 | # 8 | # Note that not all possible configuration values are present in this 9 | # autogenerated file. 10 | # 11 | # All configuration values have a default; values that are commented out 12 | # serve to show the default. 13 | 14 | import sys, os 15 | 16 | # If extensions (or modules to document with autodoc) are in another directory, 17 | # add these directories to sys.path here. If the directory is relative to the 18 | # documentation root, use os.path.abspath to make it absolute, like shown here. 19 | #sys.path.insert(0, os.path.abspath('.')) 20 | 21 | # -- General configuration ----------------------------------------------------- 22 | 23 | # If your documentation needs a minimal Sphinx version, state it here. 24 | #needs_sphinx = '1.0' 25 | 26 | # Add any Sphinx extension module names here, as strings. They can be extensions 27 | # coming with Sphinx (named 'sphinx.ext.*') or your custom ones. 28 | extensions = ['sphinx.ext.autodoc'] 29 | 30 | # Add any paths that contain templates here, relative to this directory. 31 | templates_path = ['_templates'] 32 | 33 | # The suffix of source filenames. 34 | source_suffix = '.rst' 35 | 36 | # The encoding of source files. 37 | #source_encoding = 'utf-8-sig' 38 | 39 | # The master toctree document. 40 | master_doc = 'index' 41 | 42 | # General information about the project. 
43 | project = u'Python Email Crawler' 44 | copyright = u'2012, Junda Ong' 45 | 46 | # The version info for the project you're documenting, acts as replacement for 47 | # |version| and |release|, also used in various other places throughout the 48 | # built documents. 49 | # 50 | # The short X.Y version. 51 | version = '1.0' 52 | # The full version, including alpha/beta/rc tags. 53 | release = '1.0' 54 | 55 | # The language for content autogenerated by Sphinx. Refer to documentation 56 | # for a list of supported languages. 57 | #language = None 58 | 59 | # There are two options for replacing |today|: either, you set today to some 60 | # non-false value, then it is used: 61 | #today = '' 62 | # Else, today_fmt is used as the format for a strftime call. 63 | #today_fmt = '%B %d, %Y' 64 | 65 | # List of patterns, relative to source directory, that match files and 66 | # directories to ignore when looking for source files. 67 | exclude_patterns = ['_build'] 68 | 69 | # The reST default role (used for this markup: `text`) to use for all documents. 70 | #default_role = None 71 | 72 | # If true, '()' will be appended to :func: etc. cross-reference text. 73 | #add_function_parentheses = True 74 | 75 | # If true, the current module name will be prepended to all description 76 | # unit titles (such as .. function::). 77 | #add_module_names = True 78 | 79 | # If true, sectionauthor and moduleauthor directives will be shown in the 80 | # output. They are ignored by default. 81 | #show_authors = False 82 | 83 | # The name of the Pygments (syntax highlighting) style to use. 84 | pygments_style = 'sphinx' 85 | 86 | # A list of ignored prefixes for module index sorting. 87 | #modindex_common_prefix = [] 88 | 89 | 90 | # -- Options for HTML output --------------------------------------------------- 91 | 92 | # The theme to use for HTML and HTML Help pages. See the documentation for 93 | # a list of builtin themes. 
94 | html_theme = 'default'
95 |
96 | # Theme options are theme-specific and customize the look and feel of a theme
97 | # further. For a list of options available for each theme, see the
98 | # documentation.
99 | #html_theme_options = {}
100 |
101 | # Add any paths that contain custom themes here, relative to this directory.
102 | #html_theme_path = []
103 |
104 | # The name for this set of Sphinx documents. If None, it defaults to
105 | # "<project> v<release> documentation".
106 | #html_title = None
107 |
108 | # A shorter title for the navigation bar. Default is the same as html_title.
109 | #html_short_title = None
110 |
111 | # The name of an image file (relative to this directory) to place at the top
112 | # of the sidebar.
113 | #html_logo = None
114 |
115 | # The name of an image file (within the static path) to use as favicon of the
116 | # docs. This file should be a Windows icon file (.ico) being 16x16 or 32x32
117 | # pixels large.
118 | #html_favicon = None
119 |
120 | # Add any paths that contain custom static files (such as style sheets) here,
121 | # relative to this directory. They are copied after the builtin static files,
122 | # so a file named "default.css" will overwrite the builtin "default.css".
123 | html_static_path = ['_static']
124 |
125 | # If not '', a 'Last updated on:' timestamp is inserted at every page bottom,
126 | # using the given strftime format.
127 | #html_last_updated_fmt = '%b %d, %Y'
128 |
129 | # If true, SmartyPants will be used to convert quotes and dashes to
130 | # typographically correct entities.
131 | #html_use_smartypants = True
132 |
133 | # Custom sidebar templates, maps document names to template names.
134 | #html_sidebars = {}
135 |
136 | # Additional templates that should be rendered to pages, maps page names to
137 | # template names.
138 | #html_additional_pages = {}
139 |
140 | # If false, no module index is generated.
141 | #html_domain_indices = True
142 |
143 | # If false, no index is generated.
144 | #html_use_index = True
145 |
146 | # If true, the index is split into individual pages for each letter.
147 | #html_split_index = False
148 |
149 | # If true, links to the reST sources are added to the pages.
150 | #html_show_sourcelink = True
151 |
152 | # If true, "Created using Sphinx" is shown in the HTML footer. Default is True.
153 | #html_show_sphinx = True
154 |
155 | # If true, "(C) Copyright ..." is shown in the HTML footer. Default is True.
156 | #html_show_copyright = True
157 |
158 | # If true, an OpenSearch description file will be output, and all pages will
159 | # contain a <link> tag referring to it. The value of this option must be the
160 | # base URL from which the finished HTML is served.
161 | #html_use_opensearch = ''
162 |
163 | # This is the file name suffix for HTML files (e.g. ".xhtml").
164 | #html_file_suffix = None
165 |
166 | # Output file base name for HTML help builder.
167 | htmlhelp_basename = 'PythonEmailCrawlerdoc'
168 |
169 |
170 | # -- Options for LaTeX output --------------------------------------------------
171 |
172 | latex_elements = {
173 | # The paper size ('letterpaper' or 'a4paper').
174 | #'papersize': 'letterpaper',
175 |
176 | # The font size ('10pt', '11pt' or '12pt').
177 | #'pointsize': '10pt',
178 |
179 | # Additional stuff for the LaTeX preamble.
180 | #'preamble': '',
181 | }
182 |
183 | # Grouping the document tree into LaTeX files. List of tuples
184 | # (source start file, target name, title, author, documentclass [howto/manual]).
185 | latex_documents = [
186 |   ('index', 'PythonEmailCrawler.tex', u'Python Email Crawler Documentation',
187 |    u'Junda Ong', 'manual'),
188 | ]
189 |
190 | # The name of an image file (relative to this directory) to place at the top of
191 | # the title page.
192 | #latex_logo = None
193 |
194 | # For "manual" documents, if this is true, then toplevel headings are parts,
195 | # not chapters.
196 | #latex_use_parts = False 197 | 198 | # If true, show page references after internal links. 199 | #latex_show_pagerefs = False 200 | 201 | # If true, show URL addresses after external links. 202 | #latex_show_urls = False 203 | 204 | # Documents to append as an appendix to all manuals. 205 | #latex_appendices = [] 206 | 207 | # If false, no module index is generated. 208 | #latex_domain_indices = True 209 | 210 | 211 | # -- Options for manual page output -------------------------------------------- 212 | 213 | # One entry per manual page. List of tuples 214 | # (source start file, name, description, authors, manual section). 215 | man_pages = [ 216 | ('index', 'pythonemailcrawler', u'Python Email Crawler Documentation', 217 | [u'Junda Ong'], 1) 218 | ] 219 | 220 | # If true, show URL addresses after external links. 221 | #man_show_urls = False 222 | 223 | 224 | # -- Options for Texinfo output ------------------------------------------------ 225 | 226 | # Grouping the document tree into Texinfo files. List of tuples 227 | # (source start file, target name, title, author, 228 | # dir menu entry, description, category) 229 | texinfo_documents = [ 230 | ('index', 'PythonEmailCrawler', u'Python Email Crawler Documentation', 231 | u'Junda Ong', 'PythonEmailCrawler', 'One line description of project.', 232 | 'Miscellaneous'), 233 | ] 234 | 235 | # Documents to append as an appendix to all manuals. 236 | #texinfo_appendices = [] 237 | 238 | # If false, no module index is generated. 239 | #texinfo_domain_indices = True 240 | 241 | # How to display URL addresses: 'footnote', 'no', or 'inline'. 
242 | #texinfo_show_urls = 'footnote' 243 | -------------------------------------------------------------------------------- /logs/pycrawler.log: -------------------------------------------------------------------------------- 1 | [2020-05-12 00:31:08,449] INFO::(P:29691 T:140651032557376)::email_crawler - ---------------------------------------- 2 | [2020-05-12 00:31:08,449] INFO::(P:29691 T:140651032557376)::email_crawler - Keywords to Google for: iphone developers 3 | [2020-05-12 00:31:08,449] INFO::(P:29691 T:140651032557376)::email_crawler - ---------------------------------------- 4 | [2020-05-12 00:31:08,450] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=0 5 | [2020-05-12 00:31:09,280] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=10 6 | [2020-05-12 00:31:09,800] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=20 7 | [2020-05-12 00:31:10,323] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=30 8 | [2020-05-12 00:31:11,113] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=40 9 | [2020-05-12 00:31:11,689] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=50 10 | [2020-05-12 00:31:12,240] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=60 11 | [2020-05-12 00:31:12,805] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=70 12 | [2020-05-12 00:31:13,382] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=80 13 | [2020-05-12 00:31:13,993] INFO::(P:29691 
T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=90 14 | [2020-05-12 00:31:14,662] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=100 15 | [2020-05-12 00:31:15,332] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=110 16 | [2020-05-12 00:31:16,230] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=120 17 | [2020-05-12 00:31:16,893] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=130 18 | [2020-05-12 00:31:17,557] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.google.com/search?q=iphone+developers&start=140 19 | [2020-05-12 00:31:18,253] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://inifixme.com/tag/logo-apple/ 20 | [2020-05-12 00:31:22,366] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://id.wikipedia.org/wiki/Apple_Inc. 21 | [2020-05-12 00:31:22,931] INFO::(P:29691 T:140651032557376)::email_crawler - No email at level 1.. 
proceeding to crawl level 2 22 | [2020-05-12 00:31:22,946] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Perusahaan_teknologi&action=edit&redlink=1 23 | [2020-05-12 00:31:23,492] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Perusahaan_teknologi&action=edit&redlink=1 24 | HTTP Error 404: Not Found 25 | [2020-05-12 00:31:23,493] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/IWork 26 | [2020-05-12 00:31:24,772] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kendara_otonom&action=edit&redlink=1 27 | [2020-05-12 00:31:25,401] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Kendara_otonom&action=edit&redlink=1 28 | HTTP Error 404: Not Found 29 | [2020-05-12 00:31:25,402] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Sydney_Morning_Herald 30 | [2020-05-12 00:31:26,380] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Nike,_Inc. 31 | [2020-05-12 00:31:26,588] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Perusahaan_semikonduktor_besar 32 | [2020-05-12 00:31:27,560] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Worldwide_Developers_Conference 33 | [2020-05-12 00:31:29,180] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Razer_Inc. 
34 | [2020-05-12 00:31:30,225] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Northern_Trust&action=edit&redlink=1 35 | [2020-05-12 00:31:31,028] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Northern_Trust&action=edit&redlink=1 36 | HTTP Error 404: Not Found 37 | [2020-05-12 00:31:31,028] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.apple.com/ipad/ 38 | [2020-05-12 00:31:31,226] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://news.bbc.co.uk/2/hi/technology/3797261.stm 39 | [2020-05-12 00:31:32,468] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Michael_S._Malone&action=edit&redlink=1 40 | [2020-05-12 00:31:33,098] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Michael_S._Malone&action=edit&redlink=1 41 | HTTP Error 404: Not Found 42 | [2020-05-12 00:31:33,099] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=PowerMac_G4&action=edit&redlink=1 43 | [2020-05-12 00:31:33,930] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=PowerMac_G4&action=edit&redlink=1 44 | HTTP Error 404: Not Found 45 | [2020-05-12 00:31:33,930] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://sv.wikipedia.org/wiki/Apple_Inc. 
46 | [2020-05-12 00:31:34,210] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_VIAF 47 | [2020-05-12 00:31:35,615] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://web.archive.org/web/20080616163344/http://www.wired.com/gadgets/mac/commentary/cultofmac/2006/06/71138 48 | [2020-05-12 00:31:49,089] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Empat_pita&action=edit&redlink=1 49 | [2020-05-12 00:31:49,687] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Empat_pita&action=edit&redlink=1 50 | HTTP Error 404: Not Found 51 | [2020-05-12 00:31:49,687] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/IBook 52 | [2020-05-12 00:31:50,513] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mike_Markkula 53 | [2020-05-12 00:31:51,976] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Dolar_Amerika_Serikat 54 | [2020-05-12 00:31:52,246] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:City_of_Madrid_(18045362985).jpg 55 | [2020-05-12 00:31:53,351] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mozilla_Corporation 56 | [2020-05-12 00:31:53,577] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Ronald_Wayne 57 | [2020-05-12 00:31:54,431] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Perusahaan_perangkat_lunak_besar 58 | [2020-05-12 00:31:54,987] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Proyek_mobil_listrik_Apple&action=edit&redlink=1 59 | [2020-05-12 00:31:55,774] ERROR::(P:29691 
T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Proyek_mobil_listrik_Apple&action=edit&redlink=1 60 | HTTP Error 404: Not Found 61 | [2020-05-12 00:31:55,775] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Xilinx 62 | [2020-05-12 00:31:55,993] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:CS1_sumber_berbahasa_Inggris_(en) 63 | [2020-05-12 00:31:57,387] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://en.wikipedia.org/wiki/Apple_Inc. 64 | [2020-05-12 00:31:58,194] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://ca.wikipedia.org/wiki/Apple_Inc 65 | [2020-05-12 00:31:59,624] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Arthur_D._Levinson&action=edit&redlink=1 66 | [2020-05-12 00:32:00,192] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Arthur_D._Levinson&action=edit&redlink=1 67 | HTTP Error 404: Not Found 68 | [2020-05-12 00:32:00,192] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=BLU_Products&action=edit&redlink=1 69 | [2020-05-12 00:32:01,064] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=BLU_Products&action=edit&redlink=1 70 | HTTP Error 404: Not Found 71 | [2020-05-12 00:32:01,065] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.macrumors.com/2016/09/20/macos-server-updated-for-sierra/ 72 | [2020-05-12 00:32:02,052] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_BNF 73 | [2020-05-12 00:32:02,605] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Istimewa:Pranala_balik/Apple_Inc. 
74 | [2020-05-12 00:32:03,649] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Badai_Harvey 75 | [2020-05-12 00:32:04,503] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://finance.yahoo.com/q?s=AAPL 76 | [2020-05-12 00:32:05,662] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Anak_perusahaan 77 | [2020-05-12 00:32:05,868] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Sir_Isaac_Newton 78 | [2020-05-12 00:32:07,106] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Goldman_Sachs 79 | [2020-05-12 00:32:07,309] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Barcelona 80 | [2020-05-12 00:32:07,608] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://venturebeat.com/2018/02/01/idc-smartphone-shipments-down-6-3-in-q4-2017-apple-overtakes-samsung-for-top-spot/ 81 | [2020-05-12 00:32:09,713] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Silicon_Image&action=edit&redlink=1 82 | [2020-05-12 00:32:10,316] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Silicon_Image&action=edit&redlink=1 83 | HTTP Error 404: Not Found 84 | [2020-05-12 00:32:10,316] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://si.wikipedia.org/wiki/%E0%B6%87%E0%B6%B4%E0%B6%BD%E0%B7%8A_%E0%B6%89%E0%B6%B1%E0%B7%8A%E0%B6%9A%E0%B7%9D%E0%B6%B4%E0%B6%BB%E0%B7%9A%E0%B7%82%E0%B6%B1%E0%B7%8A 85 | [2020-05-12 00:32:11,547] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://da.wikipedia.org/wiki/Apple_Inc. 
86 | [2020-05-12 00:32:12,196] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/CBS 87 | [2020-05-12 00:32:12,477] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=International_Securities_Identification_Number&action=edit&redlink=1 88 | [2020-05-12 00:32:13,384] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=International_Securities_Identification_Number&action=edit&redlink=1 89 | HTTP Error 404: Not Found 90 | [2020-05-12 00:32:13,385] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/ILife 91 | [2020-05-12 00:32:14,440] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.mobilecrunch.com/2010/02/20/over-5000-apps-stricken-from-the-apple-app-store-new-rules-in-place/ 92 | [2020-05-12 00:32:17,913] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://bgr.com/2017/05/22/iphone-vs-android-switchers-ad-campaign/ 93 | [2020-05-12 00:32:19,526] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://foundation.wikimedia.org/wiki/Privacy_policy 94 | [2020-05-12 00:32:19,749] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wikipedia:Pancapilar 95 | [2020-05-12 00:32:19,984] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/United_Continental_Holdings 96 | [2020-05-12 00:32:20,175] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.bloomberg.com/news/articles/2016-09-16/the-apple-store-line-is-dying-as-iphone-fans-order-more-online 97 | [2020-05-12 00:32:23,079] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://gadgets.ndtv.com/mobiles/news/apple-ceo-tim-cook-i-love-india-but-247307 98 | [2020-05-12 00:32:23,548] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling 
https://www.theverge.com/2017/11/29/16715246/apple-releases-high-sierra-root-security-patch 99 | [2020-05-12 00:32:25,773] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Perusahaan_Dow_Jones_Industrial_Average&action=edit&redlink=1 100 | [2020-05-12 00:32:26,583] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Perusahaan_Dow_Jones_Industrial_Average&action=edit&redlink=1 101 | HTTP Error 404: Not Found 102 | [2020-05-12 00:32:26,583] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Reputation.com&action=edit&redlink=1 103 | [2020-05-12 00:32:27,441] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Reputation.com&action=edit&redlink=1 104 | HTTP Error 404: Not Found 105 | [2020-05-12 00:32:27,441] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Tecno_Mobile&action=edit&redlink=1 106 | [2020-05-12 00:32:27,991] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Tecno_Mobile&action=edit&redlink=1 107 | HTTP Error 404: Not Found 108 | [2020-05-12 00:32:27,992] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.statista.com/topics/870/iphone/ 109 | [2020-05-12 00:32:28,486] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Idexx_Laboratories&action=edit&redlink=1 110 | [2020-05-12 00:32:29,313] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Idexx_Laboratories&action=edit&redlink=1 111 | HTTP Error 404: Not Found 112 | [2020-05-12 00:32:29,313] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Industri_elektronik_di_Amerika_Serikat&action=edit&redlink=1 113 | [2020-05-12 00:32:29,911] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Industri_elektronik_di_Amerika_Serikat&action=edit&redlink=1 114 | HTTP Error 404: Not Found 115 | [2020-05-12 00:32:29,911] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&veaction=edit 116 | [2020-05-12 00:32:31,934] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://musicbrainz.org/artist/9b502a85-104b-4489-beff-ecedca81741c 117 | [2020-05-12 00:32:34,968] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:All_pages_needing_factual_verification&action=edit&redlink=1 118 | [2020-05-12 00:32:35,666] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.theverge.com/2017/9/12/16288806/apple-iphone-x-price-release-date-features-announced 119 | [2020-05-12 00:32:36,795] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wina 120 | [2020-05-12 00:32:37,505] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.irishtimes.com/business/economy/apple-s-irish-company-structure-key-to-eu-tax-finding-1.2775684 121 | [2020-05-12 00:32:38,715] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Singgahan_(komputasi)&action=edit&redlink=1 122 | [2020-05-12 00:32:39,267] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Singgahan_(komputasi)&action=edit&redlink=1 123 | HTTP Error 404: Not Found 124 | [2020-05-12 00:32:39,267] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=MacBook_(Retina)&action=edit&redlink=1 125 | 
[2020-05-12 00:32:40,033] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=MacBook_(Retina)&action=edit&redlink=1 126 | HTTP Error 404: Not Found 127 | [2020-05-12 00:32:40,033] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/720p 128 | [2020-05-12 00:32:40,219] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=IPad_Air_2&action=edit&redlink=1 129 | [2020-05-12 00:32:41,059] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPad_Air_2&action=edit&redlink=1 130 | HTTP Error 404: Not Found 131 | [2020-05-12 00:32:41,059] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Seleb_Apple 132 | [2020-05-12 00:32:42,065] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://fo.wikipedia.org/wiki/Apple 133 | [2020-05-12 00:32:43,090] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.climatecounts.org/scorecard_score.php?co=7 134 | [2020-05-12 00:32:46,551] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Google_Pixel&action=edit&redlink=1 135 | [2020-05-12 00:32:47,156] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Google_Pixel&action=edit&redlink=1 136 | HTTP Error 404: Not Found 137 | [2020-05-12 00:32:47,156] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://9to5mac.com/2017/02/17/ipad-pro-pc-ads/ 138 | [2020-05-12 00:32:48,153] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=NBCUniversal_News_Group&action=edit&redlink=1 139 | [2020-05-12 00:32:48,731] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=NBCUniversal_News_Group&action=edit&redlink=1 140 | HTTP Error 404: Not Found 141 | [2020-05-12 00:32:48,732] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Ron_Johnson_(pebisnis)&action=edit&redlink=1 142 | [2020-05-12 00:32:49,371] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Ron_Johnson_(pebisnis)&action=edit&redlink=1 143 | HTTP Error 404: Not Found 144 | [2020-05-12 00:32:49,372] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Penawaran_umum_perdana_tahun_1980-an&action=edit&redlink=1 145 | [2020-05-12 00:32:50,034] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Steven_Levy&action=edit&redlink=1 146 | [2020-05-12 00:32:50,694] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Steven_Levy&action=edit&redlink=1 147 | HTTP Error 404: Not Found 148 | [2020-05-12 00:32:50,694] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Wiko&action=edit&redlink=1 149 | [2020-05-12 00:32:51,327] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Wiko&action=edit&redlink=1 150 | HTTP Error 404: Not Found 151 | [2020-05-12 00:32:51,327] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.macrumors.com/2018/02/01/apple-now-has-1-3-billion-active-devices-worldwide/ 152 | [2020-05-12 00:32:52,251] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=The_Register&action=edit&redlink=1 153 | [2020-05-12 00:32:52,893] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=The_Register&action=edit&redlink=1 154 | 
HTTP Error 404: Not Found 155 | [2020-05-12 00:32:52,894] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_SUDOC 156 | [2020-05-12 00:32:54,242] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/IPhone_6S 157 | [2020-05-12 00:32:54,498] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Tingkat_pajak&action=edit&redlink=1 158 | [2020-05-12 00:32:55,027] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Tingkat_pajak&action=edit&redlink=1 159 | HTTP Error 404: Not Found 160 | [2020-05-12 00:32:55,027] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Actuate_Corporation&action=edit&redlink=1 161 | [2020-05-12 00:32:55,838] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Actuate_Corporation&action=edit&redlink=1 162 | HTTP Error 404: Not Found 163 | [2020-05-12 00:32:55,839] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Herzliya 164 | [2020-05-12 00:32:56,702] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Perusahaan_teknologi_yang_berpusat_di_Wilayah_Teluk_San_Francisco&action=edit&redlink=1 165 | [2020-05-12 00:32:57,277] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:Apple_store_fifth_avenue.jpg 166 | [2020-05-12 00:32:58,181] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.youtube.com/watch?v=CW0DUg63lqU&hd=1 167 | [2020-05-12 00:32:58,409] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Artikel_yang_mengandung_pernyataan_berpotensi_usang_sejak_Juli_2018&action=edit&redlink=1 168 | 
[2020-05-12 00:32:59,349] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.cultofmac.com/194455/apple-posts-steve-jobs-tribute-his-spirit-will-forever-be-the-foundation-of-apple/ 169 | [2020-05-12 00:33:02,439] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/HomePod 170 | [2020-05-12 00:33:02,920] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/PCI_Express 171 | [2020-05-12 00:33:03,109] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://web.archive.org/web/20101104233204/http://www.wired.com/gadgetlab/2010/11/foxconn-photo-gallery/?pid=731&viewall=true 172 | [2020-05-12 00:33:10,424] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Hak_pekerja 173 | [2020-05-12 00:33:11,058] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/IPad_2 174 | [2020-05-12 00:33:11,277] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Halaman_dengan_kesalahan_referensi 175 | [2020-05-12 00:33:13,355] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Produksi_nirpabrik&action=edit&redlink=1 176 | [2020-05-12 00:33:13,985] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Produksi_nirpabrik&action=edit&redlink=1 177 | HTTP Error 404: Not Found 178 | [2020-05-12 00:33:13,986] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://finance.yahoo.com/quote/AAPL/holders?p=AAPL 179 | [2020-05-12 00:33:15,238] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://ga.wikipedia.org/wiki/Apple_Inc. 
180 | [2020-05-12 00:33:15,999] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://web.archive.org/web/20100130142222/http://www.macrumors.com/2010/01/27/apple-tablet-media-event-today-come-see-our-latest-creation/ 181 | [2020-05-12 00:33:23,429] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&veaction=edit&section=25 182 | [2020-05-12 00:33:24,794] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mac_OS_X 183 | [2020-05-12 00:33:25,082] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Biblioth%C3%A8que_nationale_de_France 184 | [2020-05-12 00:33:26,339] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Liberty_Global&action=edit&redlink=1 185 | [2020-05-12 00:33:26,950] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Liberty_Global&action=edit&redlink=1 186 | HTTP Error 404: Not Found 187 | [2020-05-12 00:33:26,950] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://web.archive.org/web/20080725082132/http://www.apple.com/hotnews/agreenerapple/ 188 | [2020-05-12 00:33:30,211] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Layar_LCD_bercahaya_latar_LED&action=edit&redlink=1 189 | [2020-05-12 00:33:31,094] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Layar_LCD_bercahaya_latar_LED&action=edit&redlink=1 190 | HTTP Error 404: Not Found 191 | [2020-05-12 00:33:31,094] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://nla.gov.au/anbd.aut-an36551832 192 | [2020-05-12 00:33:33,425] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://kn.wikipedia.org/wiki/%E0%B2%86%E0%B2%AA%E0%B2%B2%E0%B3%8D 193 | [2020-05-12 00:33:34,539] 
INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://vi.wikipedia.org/wiki/Apple_Inc. 194 | [2020-05-12 00:33:34,845] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Adobe_Systems 195 | [2020-05-12 00:33:35,176] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Palo_Alto_Networks&action=edit&redlink=1 196 | [2020-05-12 00:33:35,798] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Palo_Alto_Networks&action=edit&redlink=1 197 | HTTP Error 404: Not Found 198 | [2020-05-12 00:33:35,799] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Fortune_500 199 | [2020-05-12 00:33:36,000] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Telepon_genggam#Produsen 200 | [2020-05-12 00:33:36,247] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=American_Airlines_Group&action=edit&redlink=1 201 | [2020-05-12 00:33:37,020] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=American_Airlines_Group&action=edit&redlink=1 202 | HTTP Error 404: Not Found 203 | [2020-05-12 00:33:37,021] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Oath_Inc.&action=edit&redlink=1 204 | [2020-05-12 00:33:37,630] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Oath_Inc.&action=edit&redlink=1 205 | HTTP Error 404: Not Found 206 | [2020-05-12 00:33:37,631] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Media_pemutaran_mengalir&action=edit&redlink=1 207 | [2020-05-12 00:33:38,177] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Media_pemutaran_mengalir&action=edit&redlink=1 208 | HTTP Error 404: Not Found 209 | [2020-05-12 00:33:38,178] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=HGST&action=edit&redlink=1 210 | [2020-05-12 00:33:39,063] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=HGST&action=edit&redlink=1 211 | HTTP Error 404: Not Found 212 | [2020-05-12 00:33:39,063] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://9to5mac.com/2017/05/17/iphone-made-in-india-2/ 213 | [2020-05-12 00:33:40,029] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://tech.fortune.cnn.com/2012/10/29/inside-apples-major-shakeup/ 214 | [2020-05-12 00:33:40,240] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://tech.fortune.cnn.com/2012/10/29/inside-apples-major-shakeup/ 215 | HTTP Error 502: Bad Gateway 216 | [2020-05-12 00:33:40,240] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Signetics&action=edit&redlink=1 217 | [2020-05-12 00:33:40,845] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Signetics&action=edit&redlink=1 218 | HTTP Error 404: Not Found 219 | [2020-05-12 00:33:40,846] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/USA_Today 220 | [2020-05-12 00:33:41,417] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.apple.com/newsroom/2001/05/15Apple-to-Open-25-Retail-Stores-in-2001/ 221 | [2020-05-12 00:33:42,057] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://id.wikivoyage.org/wiki/Special:Search/Cupertino 222 | [2020-05-12 00:33:44,319] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.apple.com/pr/library/2009/04/24appstore.html 223 | [2020-05-12 
00:33:45,578] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Robin_Williams 224 | [2020-05-12 00:33:45,809] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://appleinsider.com/articles/15/12/14/apple-buys-former-maxim-chip-fab-in-north-san-jose-neighboring-samsung-semiconductor- 225 | [2020-05-12 00:33:47,986] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Greenpeace 226 | [2020-05-12 00:33:48,217] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.macrumors.com/2015/09/24/iphone-6s-apple-store-lineups/ 227 | [2020-05-12 00:33:49,120] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Nelson_Mandela 228 | [2020-05-12 00:33:49,569] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/PDF 229 | [2020-05-12 00:33:49,789] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/PDB 230 | [2020-05-12 00:33:50,030] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/United_Technologies 231 | [2020-05-12 00:33:51,475] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Moskwa 232 | [2020-05-12 00:33:51,977] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Maps_(aplikasi)&action=edit&redlink=1 233 | [2020-05-12 00:33:52,521] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Maps_(aplikasi)&action=edit&redlink=1 234 | HTTP Error 404: Not Found 235 | [2020-05-12 00:33:52,521] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Dan_Riccio&action=edit&redlink=1 236 | [2020-05-12 00:33:53,059] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Dan_Riccio&action=edit&redlink=1 237 | HTTP Error 404: Not Found 238 | [2020-05-12 00:33:53,060] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/TechCrunch 239 | [2020-05-12 00:33:53,740] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.docketalarm.com/cases/AllNaturesOfSuit/Apple%2C%20Inc./ 240 | [2020-05-12 00:33:55,412] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://web.archive.org/web/20140304092046/http://www.wired.com/science/discoveries/news/2002/01/49652 241 | [2020-05-12 00:34:00,331] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://hr.wikipedia.org/wiki/Apple_Inc. 242 | [2020-05-12 00:34:01,591] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://pa.wikipedia.org/wiki/%E0%A8%90%E0%A8%AA%E0%A8%B2 243 | [2020-05-12 00:34:02,620] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/AppleTalk 244 | [2020-05-12 00:34:03,160] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Airport 245 | [2020-05-12 00:34:04,022] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://zh-min-nan.wikipedia.org/wiki/Apple_Inc. 246 | [2020-05-12 00:34:04,779] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/CNET 247 | [2020-05-12 00:34:05,610] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/MediaTek 248 | [2020-05-12 00:34:05,838] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Fujitsu 249 | [2020-05-12 00:34:06,090] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://id.wikinews.org/wiki/Special:Search/Apple_Inc. 
250 | [2020-05-12 00:34:08,788] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/The_Washington_Post 251 | [2020-05-12 00:34:09,260] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Pengelompokan_OpenCorporates 252 | [2020-05-12 00:34:09,891] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Yahoo! 253 | [2020-05-12 00:34:10,142] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/San_Jos%C3%A9,_California 254 | [2020-05-12 00:34:10,456] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=New_Town_Plaza&action=edit&redlink=1 255 | [2020-05-12 00:34:11,070] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=New_Town_Plaza&action=edit&redlink=1 256 | HTTP Error 404: Not Found 257 | [2020-05-12 00:34:11,070] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=IPad_(generasi_ke-4)&action=edit&redlink=1 258 | [2020-05-12 00:34:11,671] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPad_(generasi_ke-4)&action=edit&redlink=1 259 | HTTP Error 404: Not Found 260 | [2020-05-12 00:34:11,672] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wikipedia:Warung_Kopi 261 | [2020-05-12 00:34:11,934] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_MusicBrainz 262 | [2020-05-12 00:34:13,025] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kimball_International&action=edit&redlink=1 263 | [2020-05-12 00:34:13,607] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Kimball_International&action=edit&redlink=1 264 | HTTP Error 404: Not Found 265 | [2020-05-12 00:34:13,607] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/NEC 266 | [2020-05-12 00:34:15,234] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=LSI_Corporation&action=edit&redlink=1 267 | [2020-05-12 00:34:15,813] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=LSI_Corporation&action=edit&redlink=1 268 | HTTP Error 404: Not Found 269 | [2020-05-12 00:34:15,814] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/STMicroelectronics 270 | [2020-05-12 00:34:16,066] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Commodore_International 271 | [2020-05-12 00:34:16,952] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://arstechnica.com/apple/2015/09/apples-new-ipad-2-pro-is-an-expansive-12-9-inches/ 272 | [2020-05-12 00:34:19,860] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.greenpeace.org/international/en/campaigns/toxics/electronics/Guide-to-Greener-Electronics/which-companies-really-sell-gr/ 273 | [2020-05-12 00:34:22,690] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://www.greenpeace.org/international/en/campaigns/toxics/electronics/Guide-to-Greener-Electronics/which-companies-really-sell-gr/ 274 | HTTP Error 404: Not Found 275 | [2020-05-12 00:34:22,690] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Mark_Papermaster&action=edit&redlink=1 276 | [2020-05-12 00:34:23,288] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Mark_Papermaster&action=edit&redlink=1 277 | HTTP Error 404: Not Found 278 | [2020-05-12 
00:34:23,289] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://so.wikipedia.org/wiki/Apple_Inc 279 | [2020-05-12 00:34:24,357] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Alto 280 | [2020-05-12 00:34:24,537] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://google.brand.edgar-online.com/?sym=AAPL 281 | [2020-05-12 00:34:27,235] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Magnavox 282 | [2020-05-12 00:34:28,442] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=KEMET_Corporation&action=edit&redlink=1 283 | [2020-05-12 00:34:29,102] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=KEMET_Corporation&action=edit&redlink=1 284 | HTTP Error 404: Not Found 285 | [2020-05-12 00:34:29,102] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Cirque_Corporation&action=edit&redlink=1 286 | [2020-05-12 00:34:29,968] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Cirque_Corporation&action=edit&redlink=1 287 | HTTP Error 404: Not Found 288 | [2020-05-12 00:34:29,969] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/JD.com 289 | [2020-05-12 00:34:30,204] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.zdnet.com/blog/semantic-web/siri-acquired-by-apple-iphone-becomes-the-virtual-personal-assistant/371 290 | [2020-05-12 00:34:33,213] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: https://www.zdnet.com/blog/semantic-web/siri-acquired-by-apple-iphone-becomes-the-virtual-personal-assistant/371 291 | HTTP Error 404: Not Found 292 | [2020-05-12 00:34:33,213] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling 
http://id.wikipedia.org/wiki/Pemegang_saham 293 | [2020-05-12 00:34:33,397] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://web.archive.org/web/20100201014134/https://www.pcworld.com/article/188149/atandt_beefing_up_network_for_ipad_and_iphone.html 294 | [2020-05-12 00:34:38,918] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Jony_Ive&action=edit&redlink=1 295 | [2020-05-12 00:34:39,759] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Jony_Ive&action=edit&redlink=1 296 | HTTP Error 404: Not Found 297 | [2020-05-12 00:34:39,759] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://www.abc.net.au/news/stories/2010/10/26/3048024.htm 298 | [2020-05-12 00:34:41,341] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/ASML_Holding 299 | [2020-05-12 00:34:41,603] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=BlackBerry_Mobile&action=edit&redlink=1 300 | [2020-05-12 00:34:42,508] ERROR::(P:29691 T:140651032557376)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=BlackBerry_Mobile&action=edit&redlink=1 301 | HTTP Error 404: Not Found 302 | [2020-05-12 00:34:42,509] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/Procter_%26_Gamble 303 | [2020-05-12 00:34:42,801] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling http://id.wikipedia.org/wiki/PC_Magazine 304 | [2020-05-12 00:34:42,998] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://ja.wikipedia.org/wiki/%E3%82%A2%E3%83%83%E3%83%97%E3%83%AB_(%E4%BC%81%E6%A5%AD) 305 | [2020-05-12 00:34:43,644] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling https://www.apple.com/iwork/ 306 | [2020-05-12 00:34:44,428] INFO::(P:29691 T:140651032557376)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Layanan_penyimpanan_berkas&action=edit&redlink=1 307 | [2020-05-12 00:35:08,189] INFO::(P:29774 T:140343819659072)::email_crawler - ======================================== 308 | [2020-05-12 00:35:08,189] INFO::(P:29774 T:140343819659072)::email_crawler - Processing... 309 | [2020-05-12 00:35:08,191] INFO::(P:29774 T:140343819659072)::email_crawler - There are 40 emails 310 | [2020-05-12 00:35:08,192] INFO::(P:29774 T:140343819659072)::email_crawler - All emails saved to ./data/emails.csv 311 | [2020-05-12 00:35:08,192] INFO::(P:29774 T:140343819659072)::email_crawler - ======================================== 312 | [2020-05-12 00:35:16,441] INFO::(P:29776 T:140164560942912)::email_crawler - ---------------------------------------- 313 | [2020-05-12 00:35:16,441] INFO::(P:29776 T:140164560942912)::email_crawler - Keywords to Google for: --email 314 | [2020-05-12 00:35:16,441] INFO::(P:29776 T:140164560942912)::email_crawler - ---------------------------------------- 315 | [2020-05-12 00:35:16,441] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=0 316 | [2020-05-12 00:35:17,081] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=10 317 | [2020-05-12 00:35:17,569] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=20 318 | [2020-05-12 00:35:18,045] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=30 319 | [2020-05-12 00:35:18,531] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=40 320 | [2020-05-12 00:35:19,222] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=50 321 | [2020-05-12 00:35:19,911] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling 
http://www.google.com/search?q=--email&start=60 322 | [2020-05-12 00:35:20,473] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=70 323 | [2020-05-12 00:35:21,018] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=80 324 | [2020-05-12 00:35:21,749] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=90 325 | [2020-05-12 00:35:22,285] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=100 326 | [2020-05-12 00:35:22,769] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=110 327 | [2020-05-12 00:35:23,331] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=120 328 | [2020-05-12 00:35:23,874] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=130 329 | [2020-05-12 00:35:24,363] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.google.com/search?q=--email&start=140 330 | [2020-05-12 00:35:25,057] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://id.wikipedia.org/wiki/Apple_Inc. 331 | [2020-05-12 00:35:25,609] INFO::(P:29776 T:140164560942912)::email_crawler - No email at level 1.. 
proceeding to crawl level 2 332 | [2020-05-12 00:35:25,624] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Perusahaan_teknologi&action=edit&redlink=1 333 | [2020-05-12 00:35:26,234] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Perusahaan_teknologi&action=edit&redlink=1 334 | HTTP Error 404: Not Found 335 | [2020-05-12 00:35:26,235] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/IWork 336 | [2020-05-12 00:35:26,425] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kendara_otonom&action=edit&redlink=1 337 | [2020-05-12 00:35:26,972] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Kendara_otonom&action=edit&redlink=1 338 | HTTP Error 404: Not Found 339 | [2020-05-12 00:35:26,973] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Sydney_Morning_Herald 340 | [2020-05-12 00:35:27,162] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Nike,_Inc. 341 | [2020-05-12 00:35:27,373] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Perusahaan_semikonduktor_besar 342 | [2020-05-12 00:35:27,554] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Worldwide_Developers_Conference 343 | [2020-05-12 00:35:27,747] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Razer_Inc. 
344 | [2020-05-12 00:35:27,953] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Northern_Trust&action=edit&redlink=1 345 | [2020-05-12 00:35:28,574] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Northern_Trust&action=edit&redlink=1 346 | HTTP Error 404: Not Found 347 | [2020-05-12 00:35:28,575] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://www.apple.com/ipad/ 348 | [2020-05-12 00:35:28,788] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://news.bbc.co.uk/2/hi/technology/3797261.stm 349 | [2020-05-12 00:35:29,770] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Michael_S._Malone&action=edit&redlink=1 350 | [2020-05-12 00:35:30,377] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Michael_S._Malone&action=edit&redlink=1 351 | HTTP Error 404: Not Found 352 | [2020-05-12 00:35:30,377] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=PowerMac_G4&action=edit&redlink=1 353 | [2020-05-12 00:35:31,155] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=PowerMac_G4&action=edit&redlink=1 354 | HTTP Error 404: Not Found 355 | [2020-05-12 00:35:31,155] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://sv.wikipedia.org/wiki/Apple_Inc. 
356 | [2020-05-12 00:35:31,396] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_VIAF 357 | [2020-05-12 00:35:31,684] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://web.archive.org/web/20080616163344/http://www.wired.com/gadgets/mac/commentary/cultofmac/2006/06/71138 358 | [2020-05-12 00:35:33,907] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Empat_pita&action=edit&redlink=1 359 | [2020-05-12 00:35:34,807] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Empat_pita&action=edit&redlink=1 360 | HTTP Error 404: Not Found 361 | [2020-05-12 00:35:34,808] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/IBook 362 | [2020-05-12 00:35:34,985] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mike_Markkula 363 | [2020-05-12 00:35:35,173] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Dolar_Amerika_Serikat 364 | [2020-05-12 00:35:35,442] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:City_of_Madrid_(18045362985).jpg 365 | [2020-05-12 00:35:35,631] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mozilla_Corporation 366 | [2020-05-12 00:35:35,863] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Ronald_Wayne 367 | [2020-05-12 00:35:36,061] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Perusahaan_perangkat_lunak_besar 368 | [2020-05-12 00:35:36,243] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Proyek_mobil_listrik_Apple&action=edit&redlink=1 369 | [2020-05-12 00:35:36,773] ERROR::(P:29776 
T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Proyek_mobil_listrik_Apple&action=edit&redlink=1 370 | HTTP Error 404: Not Found 371 | [2020-05-12 00:35:36,774] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Xilinx 372 | [2020-05-12 00:35:36,985] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:CS1_sumber_berbahasa_Inggris_(en) 373 | [2020-05-12 00:35:37,187] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://en.wikipedia.org/wiki/Apple_Inc. 374 | [2020-05-12 00:35:38,057] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://ca.wikipedia.org/wiki/Apple_Inc 375 | [2020-05-12 00:35:38,284] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Arthur_D._Levinson&action=edit&redlink=1 376 | [2020-05-12 00:35:38,888] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Arthur_D._Levinson&action=edit&redlink=1 377 | HTTP Error 404: Not Found 378 | [2020-05-12 00:35:38,889] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=BLU_Products&action=edit&redlink=1 379 | [2020-05-12 00:35:39,465] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=BLU_Products&action=edit&redlink=1 380 | HTTP Error 404: Not Found 381 | [2020-05-12 00:35:39,466] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://www.macrumors.com/2016/09/20/macos-server-updated-for-sierra/ 382 | [2020-05-12 00:35:39,646] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_BNF 383 | [2020-05-12 00:35:39,835] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Istimewa:Pranala_balik/Apple_Inc. 
384 | [2020-05-12 00:35:40,796] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Badai_Harvey 385 | [2020-05-12 00:35:40,996] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://finance.yahoo.com/q?s=AAPL 386 | [2020-05-12 00:35:42,121] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Anak_perusahaan 387 | [2020-05-12 00:35:42,310] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Sir_Isaac_Newton 388 | [2020-05-12 00:35:42,626] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Goldman_Sachs 389 | [2020-05-12 00:35:42,821] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/Barcelona 390 | [2020-05-12 00:35:43,116] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://venturebeat.com/2018/02/01/idc-smartphone-shipments-down-6-3-in-q4-2017-apple-overtakes-samsung-for-top-spot/ 391 | [2020-05-12 00:35:43,340] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Silicon_Image&action=edit&redlink=1 392 | [2020-05-12 00:35:43,951] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Silicon_Image&action=edit&redlink=1 393 | HTTP Error 404: Not Found 394 | [2020-05-12 00:35:43,951] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://si.wikipedia.org/wiki/%E0%B6%87%E0%B6%B4%E0%B6%BD%E0%B7%8A_%E0%B6%89%E0%B6%B1%E0%B7%8A%E0%B6%9A%E0%B7%9D%E0%B6%B4%E0%B6%BB%E0%B7%9A%E0%B7%82%E0%B6%B1%E0%B7%8A 395 | [2020-05-12 00:35:44,126] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling https://da.wikipedia.org/wiki/Apple_Inc. 
396 | [2020-05-12 00:35:44,334] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/CBS 397 | [2020-05-12 00:35:44,612] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=International_Securities_Identification_Number&action=edit&redlink=1 398 | [2020-05-12 00:35:45,174] ERROR::(P:29776 T:140164560942912)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=International_Securities_Identification_Number&action=edit&redlink=1 399 | HTTP Error 404: Not Found 400 | [2020-05-12 00:35:45,174] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://id.wikipedia.org/wiki/ILife 401 | [2020-05-12 00:35:45,378] INFO::(P:29776 T:140164560942912)::email_crawler - Crawling http://www.mobilecrunch.com/2010/02/20/over-5000-apps-stricken-from-the-apple-app-store-new-rules-in-place/ 402 | [2020-05-12 00:36:15,044] INFO::(P:29811 T:139773745317696)::email_crawler - ---------------------------------------- 403 | [2020-05-12 00:36:15,044] INFO::(P:29811 T:139773745317696)::email_crawler - Keywords to Google for: @gmail.com 404 | [2020-05-12 00:36:15,044] INFO::(P:29811 T:139773745317696)::email_crawler - ---------------------------------------- 405 | [2020-05-12 00:36:15,045] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=0 406 | [2020-05-12 00:36:15,793] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=10 407 | [2020-05-12 00:36:16,302] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=20 408 | [2020-05-12 00:36:16,718] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=30 409 | [2020-05-12 00:36:17,265] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=40 410 | 
[2020-05-12 00:36:17,720] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=50 411 | [2020-05-12 00:36:18,165] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=60 412 | [2020-05-12 00:36:18,819] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=70 413 | [2020-05-12 00:36:19,289] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=80 414 | [2020-05-12 00:36:19,701] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=90 415 | [2020-05-12 00:36:20,191] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=100 416 | [2020-05-12 00:36:20,631] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=110 417 | [2020-05-12 00:36:21,157] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=120 418 | [2020-05-12 00:36:21,515] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=130 419 | [2020-05-12 00:36:21,928] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.google.com/search?q=%40gmail.com&start=140 420 | [2020-05-12 00:36:22,335] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://id.wikipedia.org/wiki/Apple_Inc. 421 | [2020-05-12 00:36:23,057] INFO::(P:29811 T:139773745317696)::email_crawler - No email at level 1.. 
proceeding to crawl level 2 422 | [2020-05-12 00:36:23,080] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Perusahaan_teknologi&action=edit&redlink=1 423 | [2020-05-12 00:36:23,645] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Perusahaan_teknologi&action=edit&redlink=1 424 | HTTP Error 404: Not Found 425 | [2020-05-12 00:36:23,645] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/IWork 426 | [2020-05-12 00:36:23,840] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kendara_otonom&action=edit&redlink=1 427 | [2020-05-12 00:36:24,371] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Kendara_otonom&action=edit&redlink=1 428 | HTTP Error 404: Not Found 429 | [2020-05-12 00:36:24,372] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Sydney_Morning_Herald 430 | [2020-05-12 00:36:24,564] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Nike,_Inc. 431 | [2020-05-12 00:36:24,778] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Perusahaan_semikonduktor_besar 432 | [2020-05-12 00:36:24,953] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Worldwide_Developers_Conference 433 | [2020-05-12 00:36:25,148] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Razer_Inc. 
434 | [2020-05-12 00:36:25,345] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Northern_Trust&action=edit&redlink=1 435 | [2020-05-12 00:36:26,145] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Northern_Trust&action=edit&redlink=1 436 | HTTP Error 404: Not Found 437 | [2020-05-12 00:36:26,146] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/ipad/ 438 | [2020-05-12 00:36:26,353] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://news.bbc.co.uk/2/hi/technology/3797261.stm 439 | [2020-05-12 00:36:29,484] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Michael_S._Malone&action=edit&redlink=1 440 | [2020-05-12 00:36:30,314] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Michael_S._Malone&action=edit&redlink=1 441 | HTTP Error 404: Not Found 442 | [2020-05-12 00:36:30,314] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=PowerMac_G4&action=edit&redlink=1 443 | [2020-05-12 00:36:31,082] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=PowerMac_G4&action=edit&redlink=1 444 | HTTP Error 404: Not Found 445 | [2020-05-12 00:36:31,082] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://sv.wikipedia.org/wiki/Apple_Inc. 
446 | [2020-05-12 00:36:31,325] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_VIAF 447 | [2020-05-12 00:36:31,611] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20080616163344/http://www.wired.com/gadgets/mac/commentary/cultofmac/2006/06/71138 448 | [2020-05-12 00:36:33,301] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Empat_pita&action=edit&redlink=1 449 | [2020-05-12 00:36:34,104] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Empat_pita&action=edit&redlink=1 450 | HTTP Error 404: Not Found 451 | [2020-05-12 00:36:34,104] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/IBook 452 | [2020-05-12 00:36:34,281] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mike_Markkula 453 | [2020-05-12 00:36:34,470] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Dolar_Amerika_Serikat 454 | [2020-05-12 00:36:34,779] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:City_of_Madrid_(18045362985).jpg 455 | [2020-05-12 00:36:34,968] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mozilla_Corporation 456 | [2020-05-12 00:36:35,187] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Ronald_Wayne 457 | [2020-05-12 00:36:35,375] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Perusahaan_perangkat_lunak_besar 458 | [2020-05-12 00:36:35,553] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Proyek_mobil_listrik_Apple&action=edit&redlink=1 459 | [2020-05-12 00:36:36,481] ERROR::(P:29811 
T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Proyek_mobil_listrik_Apple&action=edit&redlink=1 460 | HTTP Error 404: Not Found 461 | [2020-05-12 00:36:36,482] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Xilinx 462 | [2020-05-12 00:36:36,692] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:CS1_sumber_berbahasa_Inggris_(en) 463 | [2020-05-12 00:36:36,894] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://en.wikipedia.org/wiki/Apple_Inc. 464 | [2020-05-12 00:36:37,676] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://ca.wikipedia.org/wiki/Apple_Inc 465 | [2020-05-12 00:36:37,910] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Arthur_D._Levinson&action=edit&redlink=1 466 | [2020-05-12 00:36:38,895] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Arthur_D._Levinson&action=edit&redlink=1 467 | HTTP Error 404: Not Found 468 | [2020-05-12 00:36:38,895] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=BLU_Products&action=edit&redlink=1 469 | [2020-05-12 00:36:40,519] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=BLU_Products&action=edit&redlink=1 470 | HTTP Error 404: Not Found 471 | [2020-05-12 00:36:40,519] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.macrumors.com/2016/09/20/macos-server-updated-for-sierra/ 472 | [2020-05-12 00:36:40,716] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_BNF 473 | [2020-05-12 00:36:40,919] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Istimewa:Pranala_balik/Apple_Inc. 
474 | [2020-05-12 00:36:41,626] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Badai_Harvey 475 | [2020-05-12 00:36:41,826] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://finance.yahoo.com/q?s=AAPL 476 | [2020-05-12 00:36:43,138] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Anak_perusahaan 477 | [2020-05-12 00:36:43,325] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Sir_Isaac_Newton 478 | [2020-05-12 00:36:43,644] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Goldman_Sachs 479 | [2020-05-12 00:36:43,854] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Barcelona 480 | [2020-05-12 00:36:44,145] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://venturebeat.com/2018/02/01/idc-smartphone-shipments-down-6-3-in-q4-2017-apple-overtakes-samsung-for-top-spot/ 481 | [2020-05-12 00:36:45,669] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Silicon_Image&action=edit&redlink=1 482 | [2020-05-12 00:36:46,321] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Silicon_Image&action=edit&redlink=1 483 | HTTP Error 404: Not Found 484 | [2020-05-12 00:36:46,321] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://si.wikipedia.org/wiki/%E0%B6%87%E0%B6%B4%E0%B6%BD%E0%B7%8A_%E0%B6%89%E0%B6%B1%E0%B7%8A%E0%B6%9A%E0%B7%9D%E0%B6%B4%E0%B6%BB%E0%B7%9A%E0%B7%82%E0%B6%B1%E0%B7%8A 485 | [2020-05-12 00:36:46,494] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://da.wikipedia.org/wiki/Apple_Inc. 
486 | [2020-05-12 00:36:46,698] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/CBS 487 | [2020-05-12 00:36:47,048] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=International_Securities_Identification_Number&action=edit&redlink=1 488 | [2020-05-12 00:36:48,039] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=International_Securities_Identification_Number&action=edit&redlink=1 489 | HTTP Error 404: Not Found 490 | [2020-05-12 00:36:48,039] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/ILife 491 | [2020-05-12 00:36:48,242] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.mobilecrunch.com/2010/02/20/over-5000-apps-stricken-from-the-apple-app-store-new-rules-in-place/ 492 | [2020-05-12 00:36:51,262] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://bgr.com/2017/05/22/iphone-vs-android-switchers-ad-campaign/ 493 | [2020-05-12 00:36:52,361] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://foundation.wikimedia.org/wiki/Privacy_policy 494 | [2020-05-12 00:36:52,557] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wikipedia:Pancapilar 495 | [2020-05-12 00:36:52,864] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/United_Continental_Holdings 496 | [2020-05-12 00:36:53,048] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.bloomberg.com/news/articles/2016-09-16/the-apple-store-line-is-dying-as-iphone-fans-order-more-online 497 | [2020-05-12 00:36:54,300] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://gadgets.ndtv.com/mobiles/news/apple-ceo-tim-cook-i-love-india-but-247307 498 | [2020-05-12 00:36:54,582] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
https://www.theverge.com/2017/11/29/16715246/apple-releases-high-sierra-root-security-patch 499 | [2020-05-12 00:36:54,884] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Perusahaan_Dow_Jones_Industrial_Average&action=edit&redlink=1 500 | [2020-05-12 00:36:55,832] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Perusahaan_Dow_Jones_Industrial_Average&action=edit&redlink=1 501 | HTTP Error 404: Not Found 502 | [2020-05-12 00:36:55,833] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Reputation.com&action=edit&redlink=1 503 | [2020-05-12 00:36:56,473] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Reputation.com&action=edit&redlink=1 504 | HTTP Error 404: Not Found 505 | [2020-05-12 00:36:56,473] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Tecno_Mobile&action=edit&redlink=1 506 | [2020-05-12 00:36:57,115] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Tecno_Mobile&action=edit&redlink=1 507 | HTTP Error 404: Not Found 508 | [2020-05-12 00:36:57,116] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.statista.com/topics/870/iphone/ 509 | [2020-05-12 00:36:57,523] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Idexx_Laboratories&action=edit&redlink=1 510 | [2020-05-12 00:36:58,108] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Idexx_Laboratories&action=edit&redlink=1 511 | HTTP Error 404: Not Found 512 | [2020-05-12 00:36:58,109] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Industri_elektronik_di_Amerika_Serikat&action=edit&redlink=1 513 | [2020-05-12 00:36:58,634] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Industri_elektronik_di_Amerika_Serikat&action=edit&redlink=1 514 | HTTP Error 404: Not Found 515 | [2020-05-12 00:36:58,634] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&veaction=edit 516 | [2020-05-12 00:37:01,300] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://musicbrainz.org/artist/9b502a85-104b-4489-beff-ecedca81741c 517 | [2020-05-12 00:37:03,899] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:All_pages_needing_factual_verification&action=edit&redlink=1 518 | [2020-05-12 00:37:04,587] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.theverge.com/2017/9/12/16288806/apple-iphone-x-price-release-date-features-announced 519 | [2020-05-12 00:37:04,913] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wina 520 | [2020-05-12 00:37:05,201] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.irishtimes.com/business/economy/apple-s-irish-company-structure-key-to-eu-tax-finding-1.2775684 521 | [2020-05-12 00:37:06,497] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Singgahan_(komputasi)&action=edit&redlink=1 522 | [2020-05-12 00:37:07,395] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Singgahan_(komputasi)&action=edit&redlink=1 523 | HTTP Error 404: Not Found 524 | [2020-05-12 00:37:07,395] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=MacBook_(Retina)&action=edit&redlink=1 525 | 
[2020-05-12 00:37:07,955] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=MacBook_(Retina)&action=edit&redlink=1 526 | HTTP Error 404: Not Found 527 | [2020-05-12 00:37:07,956] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/720p 528 | [2020-05-12 00:37:08,141] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=IPad_Air_2&action=edit&redlink=1 529 | [2020-05-12 00:37:08,708] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPad_Air_2&action=edit&redlink=1 530 | HTTP Error 404: Not Found 531 | [2020-05-12 00:37:08,708] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Seleb_Apple 532 | [2020-05-12 00:37:08,893] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://fo.wikipedia.org/wiki/Apple 533 | [2020-05-12 00:37:09,061] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.climatecounts.org/scorecard_score.php?co=7 534 | [2020-05-12 00:37:14,099] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Google_Pixel&action=edit&redlink=1 535 | [2020-05-12 00:37:14,678] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Google_Pixel&action=edit&redlink=1 536 | HTTP Error 404: Not Found 537 | [2020-05-12 00:37:14,679] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://9to5mac.com/2017/02/17/ipad-pro-pc-ads/ 538 | [2020-05-12 00:37:14,918] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=NBCUniversal_News_Group&action=edit&redlink=1 539 | [2020-05-12 00:37:15,539] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=NBCUniversal_News_Group&action=edit&redlink=1 540 | HTTP Error 404: Not Found 541 | [2020-05-12 00:37:15,539] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Ron_Johnson_(pebisnis)&action=edit&redlink=1 542 | [2020-05-12 00:37:16,078] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Ron_Johnson_(pebisnis)&action=edit&redlink=1 543 | HTTP Error 404: Not Found 544 | [2020-05-12 00:37:16,078] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Penawaran_umum_perdana_tahun_1980-an&action=edit&redlink=1 545 | [2020-05-12 00:37:17,041] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Steven_Levy&action=edit&redlink=1 546 | [2020-05-12 00:37:17,825] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Steven_Levy&action=edit&redlink=1 547 | HTTP Error 404: Not Found 548 | [2020-05-12 00:37:17,826] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Wiko&action=edit&redlink=1 549 | [2020-05-12 00:37:19,637] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Wiko&action=edit&redlink=1 550 | HTTP Error 404: Not Found 551 | [2020-05-12 00:37:19,637] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.macrumors.com/2018/02/01/apple-now-has-1-3-billion-active-devices-worldwide/ 552 | [2020-05-12 00:37:19,875] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=The_Register&action=edit&redlink=1 553 | [2020-05-12 00:37:20,427] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=The_Register&action=edit&redlink=1 554 | 
HTTP Error 404: Not Found 555 | [2020-05-12 00:37:20,427] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_SUDOC 556 | [2020-05-12 00:37:20,621] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/IPhone_6S 557 | [2020-05-12 00:37:20,858] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Tingkat_pajak&action=edit&redlink=1 558 | [2020-05-12 00:37:21,647] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Tingkat_pajak&action=edit&redlink=1 559 | HTTP Error 404: Not Found 560 | [2020-05-12 00:37:21,647] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Actuate_Corporation&action=edit&redlink=1 561 | [2020-05-12 00:37:23,136] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Actuate_Corporation&action=edit&redlink=1 562 | HTTP Error 404: Not Found 563 | [2020-05-12 00:37:23,136] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Herzliya 564 | [2020-05-12 00:37:23,347] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Perusahaan_teknologi_yang_berpusat_di_Wilayah_Teluk_San_Francisco&action=edit&redlink=1 565 | [2020-05-12 00:37:24,058] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:Apple_store_fifth_avenue.jpg 566 | [2020-05-12 00:37:24,269] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.youtube.com/watch?v=CW0DUg63lqU&hd=1 567 | [2020-05-12 00:37:24,509] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Artikel_yang_mengandung_pernyataan_berpotensi_usang_sejak_Juli_2018&action=edit&redlink=1 568 | 
[2020-05-12 00:37:25,129] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.cultofmac.com/194455/apple-posts-steve-jobs-tribute-his-spirit-will-forever-be-the-foundation-of-apple/ 569 | [2020-05-12 00:37:26,424] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/HomePod 570 | [2020-05-12 00:37:26,611] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/PCI_Express 571 | [2020-05-12 00:37:26,798] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20101104233204/http://www.wired.com/gadgetlab/2010/11/foxconn-photo-gallery/?pid=731&viewall=true 572 | [2020-05-12 00:37:32,993] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Hak_pekerja 573 | [2020-05-12 00:37:33,187] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/IPad_2 574 | [2020-05-12 00:37:33,396] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Halaman_dengan_kesalahan_referensi 575 | [2020-05-12 00:37:33,647] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Produksi_nirpabrik&action=edit&redlink=1 576 | [2020-05-12 00:37:34,193] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Produksi_nirpabrik&action=edit&redlink=1 577 | HTTP Error 404: Not Found 578 | [2020-05-12 00:37:34,194] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://finance.yahoo.com/quote/AAPL/holders?p=AAPL 579 | [2020-05-12 00:37:35,356] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://ga.wikipedia.org/wiki/Apple_Inc. 
580 | [2020-05-12 00:37:35,516] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20100130142222/http://www.macrumors.com/2010/01/27/apple-tablet-media-event-today-come-see-our-latest-creation/ 581 | [2020-05-12 00:37:37,366] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&veaction=edit&section=25 582 | [2020-05-12 00:37:39,364] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mac_OS_X 583 | [2020-05-12 00:37:39,653] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Biblioth%C3%A8que_nationale_de_France 584 | [2020-05-12 00:37:39,875] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Liberty_Global&action=edit&redlink=1 585 | [2020-05-12 00:37:40,522] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Liberty_Global&action=edit&redlink=1 586 | HTTP Error 404: Not Found 587 | [2020-05-12 00:37:40,522] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20080725082132/http://www.apple.com/hotnews/agreenerapple/ 588 | [2020-05-12 00:37:41,807] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Layar_LCD_bercahaya_latar_LED&action=edit&redlink=1 589 | [2020-05-12 00:37:42,377] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Layar_LCD_bercahaya_latar_LED&action=edit&redlink=1 590 | HTTP Error 404: Not Found 591 | [2020-05-12 00:37:42,378] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://nla.gov.au/anbd.aut-an36551832 592 | [2020-05-12 00:37:44,111] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://kn.wikipedia.org/wiki/%E0%B2%86%E0%B2%AA%E0%B2%B2%E0%B3%8D 593 | [2020-05-12 00:37:44,305] 
INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://vi.wikipedia.org/wiki/Apple_Inc. 594 | [2020-05-12 00:37:44,729] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Adobe_Systems 595 | [2020-05-12 00:37:44,967] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Palo_Alto_Networks&action=edit&redlink=1 596 | [2020-05-12 00:37:45,511] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Palo_Alto_Networks&action=edit&redlink=1 597 | HTTP Error 404: Not Found 598 | [2020-05-12 00:37:45,511] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Fortune_500 599 | [2020-05-12 00:37:45,696] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Telepon_genggam#Produsen 600 | [2020-05-12 00:37:45,942] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=American_Airlines_Group&action=edit&redlink=1 601 | [2020-05-12 00:37:46,480] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=American_Airlines_Group&action=edit&redlink=1 602 | HTTP Error 404: Not Found 603 | [2020-05-12 00:37:46,481] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Oath_Inc.&action=edit&redlink=1 604 | [2020-05-12 00:37:47,018] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Oath_Inc.&action=edit&redlink=1 605 | HTTP Error 404: Not Found 606 | [2020-05-12 00:37:47,018] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Media_pemutaran_mengalir&action=edit&redlink=1 607 | [2020-05-12 00:37:47,624] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Media_pemutaran_mengalir&action=edit&redlink=1 608 | HTTP Error 404: Not Found 609 | [2020-05-12 00:37:47,625] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=HGST&action=edit&redlink=1 610 | [2020-05-12 00:37:48,438] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=HGST&action=edit&redlink=1 611 | HTTP Error 404: Not Found 612 | [2020-05-12 00:37:48,438] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://9to5mac.com/2017/05/17/iphone-made-in-india-2/ 613 | [2020-05-12 00:37:49,411] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://tech.fortune.cnn.com/2012/10/29/inside-apples-major-shakeup/ 614 | [2020-05-12 00:37:49,589] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://tech.fortune.cnn.com/2012/10/29/inside-apples-major-shakeup/ 615 | HTTP Error 502: Bad Gateway 616 | [2020-05-12 00:37:49,590] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Signetics&action=edit&redlink=1 617 | [2020-05-12 00:37:50,254] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Signetics&action=edit&redlink=1 618 | HTTP Error 404: Not Found 619 | [2020-05-12 00:37:50,255] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/USA_Today 620 | [2020-05-12 00:37:50,440] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/newsroom/2001/05/15Apple-to-Open-25-Retail-Stores-in-2001/ 621 | [2020-05-12 00:37:50,616] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://id.wikivoyage.org/wiki/Special:Search/Cupertino 622 | [2020-05-12 00:37:52,311] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/pr/library/2009/04/24appstore.html 623 | [2020-05-12 
00:37:52,782] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Robin_Williams 624 | [2020-05-12 00:37:53,009] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://appleinsider.com/articles/15/12/14/apple-buys-former-maxim-chip-fab-in-north-san-jose-neighboring-samsung-semiconductor- 625 | [2020-05-12 00:37:55,131] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Greenpeace 626 | [2020-05-12 00:37:55,362] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.macrumors.com/2015/09/24/iphone-6s-apple-store-lineups/ 627 | [2020-05-12 00:37:55,612] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Nelson_Mandela 628 | [2020-05-12 00:37:56,050] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/PDF 629 | [2020-05-12 00:37:56,282] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/PDB 630 | [2020-05-12 00:37:56,524] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/United_Technologies 631 | [2020-05-12 00:37:56,750] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Moskwa 632 | [2020-05-12 00:37:57,158] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Maps_(aplikasi)&action=edit&redlink=1 633 | [2020-05-12 00:37:57,908] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Maps_(aplikasi)&action=edit&redlink=1 634 | HTTP Error 404: Not Found 635 | [2020-05-12 00:37:57,908] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Dan_Riccio&action=edit&redlink=1 636 | [2020-05-12 00:37:58,746] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Dan_Riccio&action=edit&redlink=1 637 | HTTP Error 404: Not Found 638 | [2020-05-12 00:37:58,747] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/TechCrunch 639 | [2020-05-12 00:37:58,961] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.docketalarm.com/cases/AllNaturesOfSuit/Apple%2C%20Inc./ 640 | [2020-05-12 00:38:00,481] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20140304092046/http://www.wired.com/science/discoveries/news/2002/01/49652 641 | [2020-05-12 00:38:02,994] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://hr.wikipedia.org/wiki/Apple_Inc. 642 | [2020-05-12 00:38:03,181] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://pa.wikipedia.org/wiki/%E0%A8%90%E0%A8%AA%E0%A8%B2 643 | [2020-05-12 00:38:03,388] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/AppleTalk 644 | [2020-05-12 00:38:03,569] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Airport 645 | [2020-05-12 00:38:03,748] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://zh-min-nan.wikipedia.org/wiki/Apple_Inc. 646 | [2020-05-12 00:38:03,905] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/CNET 647 | [2020-05-12 00:38:04,093] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/MediaTek 648 | [2020-05-12 00:38:04,326] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Fujitsu 649 | [2020-05-12 00:38:04,575] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://id.wikinews.org/wiki/Special:Search/Apple_Inc. 
650 | [2020-05-12 00:38:06,145] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/The_Washington_Post 651 | [2020-05-12 00:38:06,355] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Pengelompokan_OpenCorporates 652 | [2020-05-12 00:38:06,533] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Yahoo! 653 | [2020-05-12 00:38:06,794] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/San_Jos%C3%A9,_California 654 | [2020-05-12 00:38:07,061] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=New_Town_Plaza&action=edit&redlink=1 655 | [2020-05-12 00:38:07,671] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=New_Town_Plaza&action=edit&redlink=1 656 | HTTP Error 404: Not Found 657 | [2020-05-12 00:38:07,671] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=IPad_(generasi_ke-4)&action=edit&redlink=1 658 | [2020-05-12 00:38:08,265] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPad_(generasi_ke-4)&action=edit&redlink=1 659 | HTTP Error 404: Not Found 660 | [2020-05-12 00:38:08,266] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wikipedia:Warung_Kopi 661 | [2020-05-12 00:38:08,510] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_MusicBrainz 662 | [2020-05-12 00:38:08,740] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kimball_International&action=edit&redlink=1 663 | [2020-05-12 00:38:09,274] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Kimball_International&action=edit&redlink=1 664 | HTTP Error 404: Not Found 665 | [2020-05-12 00:38:09,274] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/NEC 666 | [2020-05-12 00:38:09,514] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=LSI_Corporation&action=edit&redlink=1 667 | [2020-05-12 00:38:10,092] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=LSI_Corporation&action=edit&redlink=1 668 | HTTP Error 404: Not Found 669 | [2020-05-12 00:38:10,093] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/STMicroelectronics 670 | [2020-05-12 00:38:10,312] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Commodore_International 671 | [2020-05-12 00:38:10,501] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://arstechnica.com/apple/2015/09/apples-new-ipad-2-pro-is-an-expansive-12-9-inches/ 672 | [2020-05-12 00:38:13,114] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.greenpeace.org/international/en/campaigns/toxics/electronics/Guide-to-Greener-Electronics/which-companies-really-sell-gr/ 673 | [2020-05-12 00:38:15,634] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://www.greenpeace.org/international/en/campaigns/toxics/electronics/Guide-to-Greener-Electronics/which-companies-really-sell-gr/ 674 | HTTP Error 404: Not Found 675 | [2020-05-12 00:38:15,634] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Mark_Papermaster&action=edit&redlink=1 676 | [2020-05-12 00:38:16,179] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Mark_Papermaster&action=edit&redlink=1 677 | HTTP Error 404: Not Found 678 | [2020-05-12 
00:38:16,180] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://so.wikipedia.org/wiki/Apple_Inc 679 | [2020-05-12 00:38:16,337] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Alto 680 | [2020-05-12 00:38:16,516] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://google.brand.edgar-online.com/?sym=AAPL 681 | [2020-05-12 00:38:18,472] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Magnavox 682 | [2020-05-12 00:38:18,652] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=KEMET_Corporation&action=edit&redlink=1 683 | [2020-05-12 00:38:19,212] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=KEMET_Corporation&action=edit&redlink=1 684 | HTTP Error 404: Not Found 685 | [2020-05-12 00:38:19,213] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Cirque_Corporation&action=edit&redlink=1 686 | [2020-05-12 00:38:19,794] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Cirque_Corporation&action=edit&redlink=1 687 | HTTP Error 404: Not Found 688 | [2020-05-12 00:38:19,795] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/JD.com 689 | [2020-05-12 00:38:20,026] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.zdnet.com/blog/semantic-web/siri-acquired-by-apple-iphone-becomes-the-virtual-personal-assistant/371 690 | [2020-05-12 00:38:23,052] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: https://www.zdnet.com/blog/semantic-web/siri-acquired-by-apple-iphone-becomes-the-virtual-personal-assistant/371 691 | HTTP Error 404: Not Found 692 | [2020-05-12 00:38:23,052] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/wiki/Pemegang_saham 693 | [2020-05-12 00:38:23,231] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20100201014134/https://www.pcworld.com/article/188149/atandt_beefing_up_network_for_ipad_and_iphone.html 694 | [2020-05-12 00:38:25,503] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Jony_Ive&action=edit&redlink=1 695 | [2020-05-12 00:38:26,476] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Jony_Ive&action=edit&redlink=1 696 | HTTP Error 404: Not Found 697 | [2020-05-12 00:38:26,476] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.abc.net.au/news/stories/2010/10/26/3048024.htm 698 | [2020-05-12 00:38:27,170] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/ASML_Holding 699 | [2020-05-12 00:38:27,450] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=BlackBerry_Mobile&action=edit&redlink=1 700 | [2020-05-12 00:38:28,060] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=BlackBerry_Mobile&action=edit&redlink=1 701 | HTTP Error 404: Not Found 702 | [2020-05-12 00:38:28,061] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Procter_%26_Gamble 703 | [2020-05-12 00:38:28,305] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/PC_Magazine 704 | [2020-05-12 00:38:28,485] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://ja.wikipedia.org/wiki/%E3%82%A2%E3%83%83%E3%83%97%E3%83%AB_(%E4%BC%81%E6%A5%AD) 705 | [2020-05-12 00:38:29,004] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/iwork/ 706 | [2020-05-12 00:38:29,199] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Layanan_penyimpanan_berkas&action=edit&redlink=1 707 | [2020-05-12 00:38:29,767] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Layanan_penyimpanan_berkas&action=edit&redlink=1 708 | HTTP Error 404: Not Found 709 | [2020-05-12 00:38:29,767] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/LCD 710 | [2020-05-12 00:38:30,489] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/newsroom/2016/08/apple-announces-environmental-progress-in-china/ 711 | [2020-05-12 00:38:33,182] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Dana_cadangan&action=edit&redlink=1 712 | [2020-05-12 00:38:33,756] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Dana_cadangan&action=edit&redlink=1 713 | HTTP Error 404: Not Found 714 | [2020-05-12 00:38:33,757] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Inc.#1989_hingga_1991:_Periode_Emas 715 | [2020-05-12 00:38:34,364] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/environment/ 716 | [2020-05-12 00:38:34,606] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://is.wikipedia.org/wiki/Apple 717 | [2020-05-12 00:38:35,806] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://d-nb.info/gnd/1095305-X 718 | [2020-05-12 00:38:42,084] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=NeXTstep&action=edit&redlink=1 719 | [2020-05-12 00:38:42,636] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=NeXTstep&action=edit&redlink=1 720 | HTTP Error 404: Not Found 721 | [2020-05-12 00:38:42,637] INFO::(P:29811 T:139773745317696)::email_crawler - 
Crawling http://id.wikipedia.org/w/index.php?title=Halaman_depan&action=edit&redlink=1 722 | [2020-05-12 00:38:43,449] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Halaman_depan&action=edit&redlink=1 723 | HTTP Error 404: Not Found 724 | [2020-05-12 00:38:43,450] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=IPhone_6_Plus&action=edit&redlink=1 725 | [2020-05-12 00:38:43,991] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPhone_6_Plus&action=edit&redlink=1 726 | HTTP Error 404: Not Found 727 | [2020-05-12 00:38:43,992] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://an.wikipedia.org/wiki/Apple 728 | [2020-05-12 00:38:45,652] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Magellan_Navigation&action=edit&redlink=1 729 | [2020-05-12 00:38:46,552] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Magellan_Navigation&action=edit&redlink=1 730 | HTTP Error 404: Not Found 731 | [2020-05-12 00:38:46,553] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Silicon_Valley 732 | [2020-05-12 00:38:46,787] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Samsung 733 | [2020-05-12 00:38:47,068] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Nothing_Real&action=edit&redlink=1 734 | [2020-05-12 00:38:47,677] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Nothing_Real&action=edit&redlink=1 735 | HTTP Error 404: Not Found 736 | [2020-05-12 00:38:47,678] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
https://techcrunch.com/2017/09/12/the-new-apple-watch-series-3-has-cellular-built-in/ 737 | [2020-05-12 00:38:49,577] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20080907003704/http://billday.com/2007/06/29/say-hello-to-iphone/ 738 | [2020-05-12 00:38:58,398] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Aqua_(antarmuka_pengguna)&action=edit&redlink=1 739 | [2020-05-12 00:38:59,294] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Aqua_(antarmuka_pengguna)&action=edit&redlink=1 740 | HTTP Error 404: Not Found 741 | [2020-05-12 00:38:59,295] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.reuters.com/article/us-apple-india/apple-india-wrangle-over-import-tax-on-mobile-parts-sources-idUSKBN1E50WK 742 | [2020-05-12 00:39:00,219] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Campbell,_California&action=edit&redlink=1 743 | [2020-05-12 00:39:00,749] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Campbell,_California&action=edit&redlink=1 744 | HTTP Error 404: Not Found 745 | [2020-05-12 00:39:00,749] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://eu.wikipedia.org/wiki/Apple_Inc. 
746 | [2020-05-12 00:39:01,863] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:IPhone_montage.png 747 | [2020-05-12 00:39:03,445] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Climate_Counts&action=edit&redlink=1 748 | [2020-05-12 00:39:03,992] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Climate_Counts&action=edit&redlink=1 749 | HTTP Error 404: Not Found 750 | [2020-05-12 00:39:03,993] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Jam_pintar&action=edit&redlink=1 751 | [2020-05-12 00:39:04,748] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Jam_pintar&action=edit&redlink=1 752 | HTTP Error 404: Not Found 753 | [2020-05-12 00:39:04,748] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Take-Two_Interactive&action=edit&redlink=1 754 | [2020-05-12 00:39:05,321] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Take-Two_Interactive&action=edit&redlink=1 755 | HTTP Error 404: Not Found 756 | [2020-05-12 00:39:05,321] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Gempa_bumi_Haiti_2010 757 | [2020-05-12 00:39:05,572] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Eric_Schmidt 758 | [2020-05-12 00:39:05,820] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Advanced_Micro_Devices 759 | [2020-05-12 00:39:06,048] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Inc.#1981_hingga_1989:_Lisa_dan_Macintosh 760 | [2020-05-12 00:39:06,657] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
https://arstechnica.com/gadgets/2014/09/apple-expands-data-encryption-under-ios-8-making-handover-to-cops-moot/ 761 | [2020-05-12 00:39:08,371] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://support.apple.com/en-us/HT202186 762 | [2020-05-12 00:39:08,689] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://mn.wikipedia.org/wiki/Apple_%D0%BA%D0%BE%D0%BC%D0%BF%D0%B0%D0%BD%D0%B8 763 | [2020-05-12 00:39:10,696] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Steve_Wozniak 764 | [2020-05-12 00:39:11,780] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Templat:Perusahaan_perangkat_bergerak_besar 765 | [2020-05-12 00:39:12,410] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kepulauan_Virgin_Britania 766 | [2020-05-12 00:39:13,727] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Muhammad_Ali 767 | [2020-05-12 00:39:14,047] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Republik_Irlandia 768 | [2020-05-12 00:39:14,358] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Final_Cut_Pro_X&action=edit&redlink=1 769 | [2020-05-12 00:39:14,887] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Final_Cut_Pro_X&action=edit&redlink=1 770 | HTTP Error 404: Not Found 771 | [2020-05-12 00:39:14,887] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.wap.org/tours/macworldny/ithink.html 772 | [2020-05-12 00:39:18,211] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Tony_Fadell 773 | [2020-05-12 00:39:18,409] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://creatingcustomerevangelists.com/resources/evangelists/guy_kawasaki.asp 774 | [2020-05-12 00:39:19,617] 
ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://creatingcustomerevangelists.com/resources/evangelists/guy_kawasaki.asp 775 | 776 | [2020-05-12 00:39:19,617] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Santa_Clara,_California 777 | [2020-05-12 00:39:19,890] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Integrated_Authority_File 778 | [2020-05-12 00:39:20,721] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Badai_Sandy 779 | [2020-05-12 00:39:21,303] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://sl.wikipedia.org/wiki/Apple_Inc. 780 | [2020-05-12 00:39:22,768] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Inc.#Privasi 781 | [2020-05-12 00:39:23,380] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Magic_Mouse&action=edit&redlink=1 782 | [2020-05-12 00:39:23,994] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Magic_Mouse&action=edit&redlink=1 783 | HTTP Error 404: Not Found 784 | [2020-05-12 00:39:23,994] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Motif_desain_Apple_Inc.&action=edit&redlink=1 785 | [2020-05-12 00:39:24,547] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Motif_desain_Apple_Inc.&action=edit&redlink=1 786 | HTTP Error 404: Not Found 787 | [2020-05-12 00:39:24,547] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Achronix&action=edit&redlink=1 788 | [2020-05-12 00:39:25,167] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Achronix&action=edit&redlink=1 789 | HTTP Error 404: Not 
Found 790 | [2020-05-12 00:39:25,167] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Refurbishment_(elektronik)&action=edit&redlink=1 791 | [2020-05-12 00:39:25,703] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Refurbishment_(elektronik)&action=edit&redlink=1 792 | HTTP Error 404: Not Found 793 | [2020-05-12 00:39:25,703] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Comcast 794 | [2020-05-12 00:39:25,893] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/NDTV 795 | [2020-05-12 00:39:26,907] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Electronic_Arts 796 | [2020-05-12 00:39:27,152] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.abc.net.au/foreign/content/2010/s3044840.htm 797 | [2020-05-12 00:39:27,584] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://www.abc.net.au/foreign/content/2010/s3044840.htm 798 | HTTP Error 404: Not Found 799 | [2020-05-12 00:39:27,585] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://sr.wikipedia.org/wiki/Apple_Inc. 800 | [2020-05-12 00:39:29,034] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Perusahaan_komputer_yang_didirikan_tahun_1976&action=edit&redlink=1 801 | [2020-05-12 00:39:29,685] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Visa_Inc. 
802 | [2020-05-12 00:39:31,961] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/SAP_SE 803 | [2020-05-12 00:39:32,199] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/hotnews/agreenerapple/ 804 | [2020-05-12 00:39:32,926] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Watch_Series_2&action=edit&redlink=1 805 | [2020-05-12 00:39:33,502] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Watch_Series_2&action=edit&redlink=1 806 | HTTP Error 404: Not Found 807 | [2020-05-12 00:39:33,502] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:Steve_Jobs_Theater_-_Auditorium.jpg 808 | [2020-05-12 00:39:34,174] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Mylan&action=edit&redlink=1 809 | [2020-05-12 00:39:35,111] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Mylan&action=edit&redlink=1 810 | HTTP Error 404: Not Found 811 | [2020-05-12 00:39:35,112] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Perusahaan_perangkat_keras_komputer&action=edit&redlink=1 812 | [2020-05-12 00:39:36,037] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.theverge.com/2015/9/9/9287861/apple-ipad-mini-4-specs-price-release-date-announced 813 | [2020-05-12 00:39:37,364] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_A6X&action=edit&redlink=1 814 | [2020-05-12 00:39:37,978] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_A6X&action=edit&redlink=1 815 | HTTP Error 404: Not Found 816 | [2020-05-12 00:39:37,979] INFO::(P:29811 
T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Fruitarian 817 | [2020-05-12 00:39:38,607] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Menlo_Park,_California 818 | [2020-05-12 00:39:38,854] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Final_Cut_Pro&action=edit&redlink=1 819 | [2020-05-12 00:39:40,366] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Final_Cut_Pro&action=edit&redlink=1 820 | HTTP Error 404: Not Found 821 | [2020-05-12 00:39:40,366] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Micromax_Informatics 822 | [2020-05-12 00:39:41,574] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.nytimes.com/1985/03/19/science/personal-computers-apple-might-learn-a-thing-or-two-from-ibm.html 823 | [2020-05-12 00:39:43,504] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Omnicom_Group&action=edit&redlink=1 824 | [2020-05-12 00:39:44,467] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Omnicom_Group&action=edit&redlink=1 825 | HTTP Error 404: Not Found 826 | [2020-05-12 00:39:44,468] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://mashable.com/2017/06/14/apple-store-wedding-pics/ 827 | [2020-05-12 00:39:45,383] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Boy_Genius_Report&action=edit&redlink=1 828 | [2020-05-12 00:39:45,923] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Boy_Genius_Report&action=edit&redlink=1 829 | HTTP Error 404: Not Found 830 | [2020-05-12 00:39:45,924] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Budaya_perusahaan&action=edit&redlink=1 831 | [2020-05-12 00:39:46,462] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Budaya_perusahaan&action=edit&redlink=1 832 | HTTP Error 404: Not Found 833 | [2020-05-12 00:39:46,462] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Britania_Raya 834 | [2020-05-12 00:39:47,109] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Tweeter&action=edit&redlink=1 835 | [2020-05-12 00:39:47,696] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Tweeter&action=edit&redlink=1 836 | HTTP Error 404: Not Found 837 | [2020-05-12 00:39:47,696] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Badai_Irma 838 | [2020-05-12 00:39:48,362] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/LinkedIn 839 | [2020-05-12 00:39:48,586] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=FileMaker&action=edit&redlink=1 840 | [2020-05-12 00:39:49,224] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=FileMaker&action=edit&redlink=1 841 | HTTP Error 404: Not Found 842 | [2020-05-12 00:39:49,225] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/VGA 843 | [2020-05-12 00:39:49,455] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/W._W._Norton_%26_Company 844 | [2020-05-12 00:39:50,440] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Ronald_Sugar 845 | [2020-05-12 00:39:51,515] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://www.bizjournals.com/sanjose/news/2015/12/14/exclusive-apple-buys-former-chip-fab-in-north-san.html 846 | [2020-05-12 00:39:52,181] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Video_komponen&action=edit&redlink=1 847 | [2020-05-12 00:39:52,776] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Video_komponen&action=edit&redlink=1 848 | HTTP Error 404: Not Found 849 | [2020-05-12 00:39:52,777] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://no.wikipedia.org/wiki/Apple 850 | [2020-05-12 00:39:53,261] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/World_Wide_Fund_for_Nature 851 | [2020-05-12 00:39:54,189] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Keyboard&action=edit&redlink=1 852 | [2020-05-12 00:39:54,758] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Keyboard&action=edit&redlink=1 853 | HTTP Error 404: Not Found 854 | [2020-05-12 00:39:54,759] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Fred_D._Anderson&action=edit&redlink=1 855 | [2020-05-12 00:39:55,555] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Fred_D._Anderson&action=edit&redlink=1 856 | HTTP Error 404: Not Found 857 | [2020-05-12 00:39:55,556] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.wsj.com/articles/SB118677584137994489 858 | [2020-05-12 00:39:58,236] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:Regent_Street_Apple_Store,_London_12297897574_o.jpg 859 | [2020-05-12 00:39:59,085] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Kasus_bantuan_pemerintah_ilegal_UE_melawan_Apple_di_Irlandia&action=edit&redlink=1 860 | [2020-05-12 00:40:00,008] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Kasus_bantuan_pemerintah_ilegal_UE_melawan_Apple_di_Irlandia&action=edit&redlink=1 861 | HTTP Error 404: Not Found 862 | [2020-05-12 00:40:00,009] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Ketua 863 | [2020-05-12 00:40:00,212] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Redmi 864 | [2020-05-12 00:40:00,976] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://et.wikipedia.org/wiki/Apple_Inc. 865 | [2020-05-12 00:40:02,589] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/newsroom/2007/01/09Apple-Reinvents-the-Phone-with-iPhone/ 866 | [2020-05-12 00:40:03,210] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Xerox_PARC 867 | [2020-05-12 00:40:04,400] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Glassdoor&action=edit&redlink=1 868 | [2020-05-12 00:40:05,004] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Glassdoor&action=edit&redlink=1 869 | HTTP Error 404: Not Found 870 | [2020-05-12 00:40:05,004] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Chief_executive_officer 871 | [2020-05-12 00:40:05,232] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://ia.wikipedia.org/wiki/Apple_Computer 872 | [2020-05-12 00:40:06,144] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Verizon_Communications 873 | [2020-05-12 00:40:06,411] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=IPad_(generasi_ke-1)&action=edit&redlink=1 874 | [2020-05-12 00:40:07,023] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPad_(generasi_ke-1)&action=edit&redlink=1 875 | HTTP Error 404: Not Found 876 | [2020-05-12 00:40:07,024] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=IPad_(4th_generation)&action=edit&redlink=1 877 | [2020-05-12 00:40:07,963] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPad_(4th_generation)&action=edit&redlink=1 878 | HTTP Error 404: Not Found 879 | [2020-05-12 00:40:07,963] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Bertrand_Serlet&action=edit&redlink=1 880 | [2020-05-12 00:40:08,528] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Bertrand_Serlet&action=edit&redlink=1 881 | HTTP Error 404: Not Found 882 | [2020-05-12 00:40:08,529] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.macrumors.com/2016/02/08/apple-retail-locations-in-india/ 883 | [2020-05-12 00:40:09,448] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=BYTE&action=edit&redlink=1 884 | [2020-05-12 00:40:10,372] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=BYTE&action=edit&redlink=1 885 | HTTP Error 404: Not Found 886 | [2020-05-12 00:40:10,372] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://or.wikipedia.org/wiki/%E0%AC%86%E0%AC%AA%E0%AC%B2_%E0%AC%95%E0%AC%AE%E0%AD%8D%E0%AC%AA%E0%AC%BE%E0%AC%A8%E0%AD%80 887 | [2020-05-12 00:40:11,261] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://www.nydailynews.com/entertainment/music/2008/03/11/2008-03-11_apple_ad_creates_recognition_for_yael_na.html 888 | [2020-05-12 00:40:12,450] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://www.nydailynews.com/entertainment/music/2008/03/11/2008-03-11_apple_ad_creates_recognition_for_yael_na.html 889 | HTTP Error 404: Not Found 890 | [2020-05-12 00:40:12,450] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Card&action=edit&redlink=1 891 | [2020-05-12 00:40:13,995] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Card&action=edit&redlink=1 892 | HTTP Error 404: Not Found 893 | [2020-05-12 00:40:13,995] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wayback_Machine 894 | [2020-05-12 00:40:14,194] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://9to5mac.com/2010/01/14/apple-sets-up-haiti-donation-page-in-itunes/ 895 | [2020-05-12 00:40:15,103] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://finance.google.com/finance?cid=AAPL 896 | [2020-05-12 00:40:16,730] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Terminal_komputer&action=edit&redlink=1 897 | [2020-05-12 00:40:17,366] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Terminal_komputer&action=edit&redlink=1 898 | HTTP Error 404: Not Found 899 | [2020-05-12 00:40:17,366] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=39 900 | [2020-05-12 00:40:19,513] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=38 901 | [2020-05-12 00:40:21,323] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
https://web.archive.org/web/20080618233109/http://traveldk.com/san-francisco/bay-area/member/apple-inc 902 | [2020-05-12 00:40:24,841] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=33 903 | [2020-05-12 00:40:27,094] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=32 904 | [2020-05-12 00:40:29,148] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=31 905 | [2020-05-12 00:40:30,988] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=30 906 | [2020-05-12 00:40:33,374] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=37 907 | [2020-05-12 00:40:36,071] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=36 908 | [2020-05-12 00:40:38,081] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=35 909 | [2020-05-12 00:40:40,338] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Patent&action=edit&redlink=1 910 | [2020-05-12 00:40:40,913] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Patent&action=edit&redlink=1 911 | HTTP Error 404: Not Found 912 | [2020-05-12 00:40:40,913] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Lenovo 913 | [2020-05-12 00:40:41,133] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.cnet.com/news/apple-by-the-numbers-84m-ipads-400m-ios-devices-350m-ipods-sold/ 914 | [2020-05-12 00:40:44,839] 
INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.macrumors.com/2014/12/17/apple-product-red-20-million/ 915 | [2020-05-12 00:40:45,723] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Semikonduktor 916 | [2020-05-12 00:40:45,957] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.oazoo.com 917 | [2020-05-12 00:40:46,903] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Fidelity_Investments&action=edit&redlink=1 918 | [2020-05-12 00:40:47,568] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Fidelity_Investments&action=edit&redlink=1 919 | HTTP Error 404: Not Found 920 | [2020-05-12 00:40:47,568] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=BMC_Software&action=edit&redlink=1 921 | [2020-05-12 00:40:48,091] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=BMC_Software&action=edit&redlink=1 922 | HTTP Error 404: Not Found 923 | [2020-05-12 00:40:48,091] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit 924 | [2020-05-12 00:40:50,240] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.mediawiki.org/wiki/Special:MyLanguage/How_to_contribute 925 | [2020-05-12 00:40:51,412] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://sa.wikipedia.org/wiki/%E0%A4%8F%E0%A4%AA%E0%A5%8D%E0%A4%AA%E0%A4%B2%E0%A5%8D 926 | [2020-05-12 00:40:53,016] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Honeywell 927 | [2020-05-12 00:40:55,574] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Fortune_00&action=edit&redlink=1 928 | [2020-05-12 00:40:56,174] ERROR::(P:29811 
T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Fortune_00&action=edit&redlink=1 929 | HTTP Error 404: Not Found 930 | [2020-05-12 00:40:56,175] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=No_Starch_Press&action=edit&redlink=1 931 | [2020-05-12 00:40:56,816] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=No_Starch_Press&action=edit&redlink=1 932 | HTTP Error 404: Not Found 933 | [2020-05-12 00:40:56,816] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Pendapatan_nasional_bruto_disesuaikan&action=edit&redlink=1 934 | [2020-05-12 00:40:57,411] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Pendapatan_nasional_bruto_disesuaikan&action=edit&redlink=1 935 | HTTP Error 404: Not Found 936 | [2020-05-12 00:40:57,412] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Capcom 937 | [2020-05-12 00:40:57,646] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org//id.wikinews.org/wiki/Special:Search/Apple_Inc. 938 | [2020-05-12 00:40:58,090] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org//id.wikinews.org/wiki/Special:Search/Apple_Inc. 
939 | HTTP Error 404: Not Found 940 | [2020-05-12 00:40:58,091] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.theguardian.com/technology/2015/apr/08/confessions-of-an-apple-fanboy-im-going-to-miss-the-queues 941 | [2020-05-12 00:41:04,703] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Mondelez_International 942 | [2020-05-12 00:41:04,968] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=FingerWorks&action=edit&redlink=1 943 | [2020-05-12 00:41:05,530] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=FingerWorks&action=edit&redlink=1 944 | HTTP Error 404: Not Found 945 | [2020-05-12 00:41:05,530] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Istimewa:Daftar_kategori 946 | [2020-05-12 00:41:06,384] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Perusahaan_multinasional_yang_berpusat_di_Amerika_Serikat&action=edit&redlink=1 947 | [2020-05-12 00:41:07,023] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Virtual_International_Authority_File 948 | [2020-05-12 00:41:07,248] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.usatoday.com/story/tech/2016/12/21/nokia-sues-apple-patent-infringement/95709378/ 949 | [2020-05-12 00:41:07,839] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/2002 950 | [2020-05-12 00:41:08,067] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Pencil&action=edit&redlink=1 951 | [2020-05-12 00:41:08,822] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Pencil&action=edit&redlink=1 952 | HTTP Error 404: Not Found 953 | [2020-05-12 
00:41:08,823] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Rambus&action=edit&redlink=1 954 | [2020-05-12 00:41:09,436] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Rambus&action=edit&redlink=1 955 | HTTP Error 404: Not Found 956 | [2020-05-12 00:41:09,437] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:An_Apple_HomePod_speaker_.png 957 | [2020-05-12 00:41:10,423] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/pr/library/2011/10/17iPhone-4S-First-Weekend-Sales-Top-Four-Million.html 958 | [2020-05-12 00:41:11,149] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Universitas_California,_Santa_Cruz 959 | [2020-05-12 00:41:11,383] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=IPod_Classic&action=edit&redlink=1 960 | [2020-05-12 00:41:11,954] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=IPod_Classic&action=edit&redlink=1 961 | HTTP Error 404: Not Found 962 | [2020-05-12 00:41:11,954] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://tt.wikipedia.org/wiki/Apple_Inc. 
963 | [2020-05-12 00:41:13,035] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Campus&action=edit&redlink=1 964 | [2020-05-12 00:41:14,503] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Campus&action=edit&redlink=1 965 | HTTP Error 404: Not Found 966 | [2020-05-12 00:41:14,503] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Game_Center&action=edit&redlink=1 967 | [2020-05-12 00:41:15,984] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Game_Center&action=edit&redlink=1 968 | HTTP Error 404: Not Found 969 | [2020-05-12 00:41:15,985] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Lingkungan_kerja_tidak_sehat&action=edit&redlink=1 970 | [2020-05-12 00:41:16,530] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Lingkungan_kerja_tidak_sehat&action=edit&redlink=1 971 | HTTP Error 404: Not Found 972 | [2020-05-12 00:41:16,530] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://dty.wikipedia.org/wiki/%E0%A4%8F%E0%A4%AA%E0%A5%8D%E0%A4%AA%E0%A4%B2_%E0%A4%95%E0%A4%AE%E0%A5%8D%E0%A4%AA%E0%A4%A8%E0%A5%80 973 | [2020-05-12 00:41:18,343] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Artikel_Wikipedia_dengan_penanda_SNAC-ID 974 | [2020-05-12 00:41:19,989] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Tenaga_surya 975 | [2020-05-12 00:41:21,258] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Statista&action=edit&redlink=1 976 | [2020-05-12 00:41:21,849] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Statista&action=edit&redlink=1 977 | HTTP Error 404: Not Found 978 | [2020-05-12 00:41:21,849] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Elektronik_konsumen 979 | [2020-05-12 00:41:22,051] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Paul_Krugman 980 | [2020-05-12 00:41:23,713] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Store_(daring)&action=edit&redlink=1 981 | [2020-05-12 00:41:24,322] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Store_(daring)&action=edit&redlink=1 982 | HTTP Error 404: Not Found 983 | [2020-05-12 00:41:24,323] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.nytimes.com/2009/12/06/technology/06apps.html?pagewanted=all 984 | [2020-05-12 00:41:27,397] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/3M 985 | [2020-05-12 00:41:27,593] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20111008110802/http://media.theage.com.au/news/world-news/tearful-memories-for-apple-cofounder-2675550.html 986 | [2020-05-12 00:41:32,423] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:Flag_of_the_United_States.svg 987 | [2020-05-12 00:41:32,678] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.irishexaminer.com/ireland/kfgbsnsnmhoj/rss2/ 988 | [2020-05-12 00:41:35,221] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/3G 989 | [2020-05-12 00:41:35,433] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Angela_Ahrendts&action=edit&redlink=1 990 | [2020-05-12 00:41:36,043] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: 
http://id.wikipedia.org/w/index.php?title=Angela_Ahrendts&action=edit&redlink=1 991 | HTTP Error 404: Not Found 992 | [2020-05-12 00:41:36,043] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Berkas:IPod_line_as_of_2014.png 993 | [2020-05-12 00:41:37,100] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Sichuan 994 | [2020-05-12 00:41:37,342] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.pcworld.com/article/188149/atandt_beefing_up_network_for_ipad_and_iphone.html 995 | [2020-05-12 00:41:42,515] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kesetiaan_merek 996 | [2020-05-12 00:41:43,370] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=NVM_Express&action=edit&redlink=1 997 | [2020-05-12 00:41:44,019] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=NVM_Express&action=edit&redlink=1 998 | HTTP Error 404: Not Found 999 | [2020-05-12 00:41:44,019] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.theverge.com/2017/9/30/16390098/apple-iphone-8-plus-ad-portrait-lighting 1000 | [2020-05-12 00:41:45,280] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/newsroom/2008/02/05Apple-Adds-New-iPhone-iPod-touch-Models/ 1001 | [2020-05-12 00:41:45,927] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Alcatel_Mobile&action=edit&redlink=1 1002 | [2020-05-12 00:41:46,752] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Alcatel_Mobile&action=edit&redlink=1 1003 | HTTP Error 404: Not Found 1004 | [2020-05-12 00:41:46,753] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Verisk_Analytics&action=edit&redlink=1 1005 | [2020-05-12 00:41:47,295] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Verisk_Analytics&action=edit&redlink=1 1006 | HTTP Error 404: Not Found 1007 | [2020-05-12 00:41:47,295] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Ohlone_College&action=edit&redlink=1 1008 | [2020-05-12 00:41:48,882] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Ohlone_College&action=edit&redlink=1 1009 | HTTP Error 404: Not Found 1010 | [2020-05-12 00:41:48,882] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Portal:Apple_Inc.&action=edit&redlink=1 1011 | [2020-05-12 00:41:50,741] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Portal:Apple_Inc.&action=edit&redlink=1 1012 | HTTP Error 404: Not Found 1013 | [2020-05-12 00:41:50,741] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://arstechnica.com/apple/2015/09/apple-announces-the-iphone-6s-and-6s-plus/ 1014 | [2020-05-12 00:41:53,728] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Distribusi_digital 1015 | [2020-05-12 00:41:54,305] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.marketwatch.com/investing/stock/aapl/financials 1016 | [2020-05-12 00:41:56,726] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Accelerated_Graphics_Port 1017 | [2020-05-12 00:41:57,305] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/NPR 1018 | [2020-05-12 00:41:58,661] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Google 1019 | [2020-05-12 00:41:58,972] INFO::(P:29811 
T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Irlandia_ganda_dengan_roti_lapis_Belanda&action=edit&redlink=1 1020 | [2020-05-12 00:41:59,530] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Irlandia_ganda_dengan_roti_lapis_Belanda&action=edit&redlink=1 1021 | HTTP Error 404: Not Found 1022 | [2020-05-12 00:41:59,531] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/ZTE 1023 | [2020-05-12 00:41:59,754] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.forbes.com/sites/susanadams/2013/09/27/is-apple-the-worlds-most-innovative-company-still/ 1024 | [2020-05-12 00:42:01,530] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20150818101000/http://money.cnn.com/2015/07/22/investing/apple-stock-cash-earnings/ 1025 | [2020-05-12 00:42:05,992] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Activision_Blizzard&action=edit&redlink=1 1026 | [2020-05-12 00:42:06,598] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Activision_Blizzard&action=edit&redlink=1 1027 | HTTP Error 404: Not Found 1028 | [2020-05-12 00:42:06,599] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/SanDisk 1029 | [2020-05-12 00:42:06,826] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Forbes 1030 | [2020-05-12 00:42:07,048] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/ie/contact/ 1031 | [2020-05-12 00:42:07,909] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.apple.com/newsroom/2008/06/09Apple-Introduces-the-New-iPhone-3G/ 1032 | [2020-05-12 00:42:08,495] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/wiki/Templat:Industri_elektronik_di_Amerika_Serikat 1033 | [2020-05-12 00:42:09,280] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://thebrainfever.com/apple/the-lost-apple-logos-you-ve-never-seen 1034 | [2020-05-12 00:42:10,529] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=PRISM_(program_pengintaian)&action=edit&redlink=1 1035 | [2020-05-12 00:42:11,126] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=PRISM_(program_pengintaian)&action=edit&redlink=1 1036 | HTTP Error 404: Not Found 1037 | [2020-05-12 00:42:11,126] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.theverge.com/2016/9/7/12828846/apple-s-greatest-product-is-its-ecosystem 1038 | [2020-05-12 00:42:16,518] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Produsen_telepon_genggam&action=edit&redlink=1 1039 | [2020-05-12 00:42:17,438] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Woofer&action=edit&redlink=1 1040 | [2020-05-12 00:42:18,019] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Woofer&action=edit&redlink=1 1041 | HTTP Error 404: Not Found 1042 | [2020-05-12 00:42:18,020] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Jabil&action=edit&redlink=1 1043 | [2020-05-12 00:42:18,656] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Jabil&action=edit&redlink=1 1044 | HTTP Error 404: Not Found 1045 | [2020-05-12 00:42:18,656] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Monster_Beverage&action=edit&redlink=1 1046 | [2020-05-12 00:42:19,263] ERROR::(P:29811 
T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Monster_Beverage&action=edit&redlink=1 1047 | HTTP Error 404: Not Found 1048 | [2020-05-12 00:42:19,264] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://biz.yahoo.com/ic/AAPL.html 1049 | [2020-05-12 00:42:20,799] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Beats_Electronics&action=edit&redlink=1 1050 | [2020-05-12 00:42:21,379] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Beats_Electronics&action=edit&redlink=1 1051 | HTTP Error 404: Not Found 1052 | [2020-05-12 00:42:21,380] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/19990827174523/http://www.apple.com/ 1053 | [2020-05-12 00:42:31,153] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/1984_(novel) 1054 | [2020-05-12 00:42:32,026] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/ZDNet 1055 | [2020-05-12 00:42:32,261] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Qualcomm 1056 | [2020-05-12 00:42:32,497] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Lululemon_Athletica&action=edit&redlink=1 1057 | [2020-05-12 00:42:34,114] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Lululemon_Athletica&action=edit&redlink=1 1058 | HTTP Error 404: Not Found 1059 | [2020-05-12 00:42:34,114] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=MercadoLibre&action=edit&redlink=1 1060 | [2020-05-12 00:42:34,750] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=MercadoLibre&action=edit&redlink=1 1061 | HTTP 
Error 404: Not Found 1062 | [2020-05-12 00:42:34,750] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Microsoft 1063 | [2020-05-12 00:42:35,191] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Time_Capsule&action=edit&redlink=1 1064 | [2020-05-12 00:42:35,804] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Time_Capsule&action=edit&redlink=1 1065 | HTTP Error 404: Not Found 1066 | [2020-05-12 00:42:35,804] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/North_Carolina 1067 | [2020-05-12 00:42:36,561] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.wired.com/wiredenterprise/2012/05/apple_coal/ 1068 | [2020-05-12 00:42:49,498] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://hyw.wikipedia.org/wiki/%D4%B7%D6%83%D5%A8%D5%AC 1069 | [2020-05-12 00:42:52,001] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://pfl.wikipedia.org/wiki/Apple_Inc. 
1070 | [2020-05-12 00:42:53,020] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/American_Express 1071 | [2020-05-12 00:42:53,254] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://9to5mac.com/2018/01/15/martin-luther-king-jr-day/ 1072 | [2020-05-12 00:42:54,201] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://techcrunch.com/2016/06/13/apple-overhauls-watchos-with-new-ui-and-faster-app-launching/ 1073 | [2020-05-12 00:42:56,289] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Wireless_LAN 1074 | [2020-05-12 00:42:56,507] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Jangkauan_dinamis_tinggi&action=edit&redlink=1 1075 | [2020-05-12 00:42:57,414] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Jangkauan_dinamis_tinggi&action=edit&redlink=1 1076 | HTTP Error 404: Not Found 1077 | [2020-05-12 00:42:57,414] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=28 1078 | [2020-05-12 00:42:59,201] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=29 1079 | [2020-05-12 00:43:01,497] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=20 1080 | [2020-05-12 00:43:04,678] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=21 1081 | [2020-05-12 00:43:07,449] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=22 1082 | [2020-05-12 00:43:11,136] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling 
http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=23 1083 | [2020-05-12 00:43:14,781] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=24 1084 | [2020-05-12 00:43:17,118] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=25 1085 | [2020-05-12 00:43:19,412] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=26 1086 | [2020-05-12 00:43:21,204] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Inc.&action=edit&section=27 1087 | [2020-05-12 00:43:23,613] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Semiconductor_Manufacturing_International_Corporation&action=edit&redlink=1 1088 | [2020-05-12 00:43:24,225] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Semiconductor_Manufacturing_International_Corporation&action=edit&redlink=1 1089 | HTTP Error 404: Not Found 1090 | [2020-05-12 00:43:24,226] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.macrumors.com/roundup/apple-music/ 1091 | [2020-05-12 00:43:25,139] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20080921095608/http://www.operating-system.org/betriebssystem/_english/fa-apple.htm 1092 | [2020-05-12 00:43:28,533] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Henry_Schein&action=edit&redlink=1 1093 | [2020-05-12 00:43:29,078] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Henry_Schein&action=edit&redlink=1 1094 | HTTP Error 404: Not Found 1095 | [2020-05-12 00:43:29,078] INFO::(P:29811 
T:139773745317696)::email_crawler - Crawling http://uk.reuters.com/article/2012/04/20/us-apple-ireland-idUKBRE83J0PI20120420 1096 | [2020-05-12 00:43:29,537] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://uk.reuters.com/article/2012/04/20/us-apple-ireland-idUKBRE83J0PI20120420 1097 | HTTP Error 404: Not Found 1098 | [2020-05-12 00:43:29,537] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20070602180903/http://www.apple.com/macbookpro/graphics.html 1099 | [2020-05-12 00:43:32,404] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://el.wikipedia.org/wiki/Apple 1100 | [2020-05-12 00:43:35,427] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Menlo_College&action=edit&redlink=1 1101 | [2020-05-12 00:43:36,002] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Menlo_College&action=edit&redlink=1 1102 | HTTP Error 404: Not Found 1103 | [2020-05-12 00:43:36,002] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=David_Nagel&action=edit&redlink=1 1104 | [2020-05-12 00:43:37,775] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=David_Nagel&action=edit&redlink=1 1105 | HTTP Error 404: Not Found 1106 | [2020-05-12 00:43:37,775] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.folklore.org/StoryView.py?project=Macintosh&story=Credit_Where_Due.txt 1107 | [2020-05-12 00:43:39,493] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Garis_waktu_produk_Apple_Inc.&action=edit&redlink=1 1108 | [2020-05-12 00:43:40,107] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Garis_waktu_produk_Apple_Inc.&action=edit&redlink=1 1109 | HTTP Error 404: 
Not Found 1110 | [2020-05-12 00:43:40,107] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/App_Store_(iOS) 1111 | [2020-05-12 00:43:40,357] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Bangalore 1112 | [2020-05-12 00:43:41,265] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://9to5mac.com/2017/06/20/apple-product-secrecy-leaks-leaked-meeting/ 1113 | [2020-05-12 00:43:42,280] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Kategori:Artikel_yang_mengandung_pernyataan_berpotensi_usang_sejak_September_2012&action=edit&redlink=1 1114 | [2020-05-12 00:43:43,227] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.microprocessor.sscc.ru/comphist/ 1115 | [2020-05-12 00:43:45,117] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://www.microprocessor.sscc.ru/comphist/ 1116 | HTTP Error 404: Not Found 1117 | [2020-05-12 00:43:45,118] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Pembelajaran_mesin 1118 | [2020-05-12 00:43:46,222] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Peter_Oppenheimer&action=edit&redlink=1 1119 | [2020-05-12 00:43:46,819] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Peter_Oppenheimer&action=edit&redlink=1 1120 | HTTP Error 404: Not Found 1121 | [2020-05-12 00:43:46,819] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://9to5mac.com/2015/01/19/apple-commemorates-martin-luther-king-on-its-homepage-encouraging-employees-to-volunteer-through-gift-matching/ 1122 | [2020-05-12 00:43:47,838] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.engadget.com/2010/09/29/apple-tv-teardown-reveals-8gb-flash-storage-256mb-ram/ 1123 | [2020-05-12 00:43:49,302] 
INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Daftar_perusahaan_perangkat_lunak_terbesar&action=edit&redlink=1 1124 | [2020-05-12 00:43:50,066] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Daftar_perusahaan_perangkat_lunak_terbesar&action=edit&redlink=1 1125 | HTTP Error 404: Not Found 1126 | [2020-05-12 00:43:50,067] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Newton_(platform)&action=edit&redlink=1 1127 | [2020-05-12 00:43:51,628] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Newton_(platform)&action=edit&redlink=1 1128 | HTTP Error 404: Not Found 1129 | [2020-05-12 00:43:51,628] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Alih_daya 1130 | [2020-05-12 00:43:51,830] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Daftar_produsen_perangkat_keras_komputer&action=edit&redlink=1 1131 | [2020-05-12 00:43:52,384] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Daftar_produsen_perangkat_keras_komputer&action=edit&redlink=1 1132 | HTTP Error 404: Not Found 1133 | [2020-05-12 00:43:52,384] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://inhabitat.com/apples-new-headquarters-will-be-designed-by-norman-foster/ 1134 | [2020-05-12 00:43:53,922] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Silicon_Valley&action=edit&redlink=1 1135 | [2020-05-12 00:43:55,379] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Pembicaraan_Templat:Silicon_Valley&action=edit&redlink=1 1136 | HTTP Error 404: Not Found 1137 | [2020-05-12 
00:43:55,380] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20080602225734/http://www.microprocessor.sscc.ru/comphist/ 1138 | [2020-05-12 00:43:58,803] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://ku.wikipedia.org/wiki/Apple,_Inc. 1139 | [2020-05-12 00:43:59,621] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Walgreens_Boots_Alliance&action=edit&redlink=1 1140 | [2020-05-12 00:44:00,210] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Walgreens_Boots_Alliance&action=edit&redlink=1 1141 | HTTP Error 404: Not Found 1142 | [2020-05-12 00:44:00,211] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Pembicaraan:Apple_Inc. 1143 | [2020-05-12 00:44:00,851] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Amadeus_IT_Group&action=edit&redlink=1 1144 | [2020-05-12 00:44:01,864] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Amadeus_IT_Group&action=edit&redlink=1 1145 | HTTP Error 404: Not Found 1146 | [2020-05-12 00:44:01,865] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://arstechnica.com/tech-policy/2016/01/virnetx-kicks-off-final-massive-patent-trolling-attempt-vs-apple/ 1147 | [2020-05-12 00:44:03,304] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Sanmina_Corporation&action=edit&redlink=1 1148 | [2020-05-12 00:44:03,900] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Sanmina_Corporation&action=edit&redlink=1 1149 | HTTP Error 404: Not Found 1150 | [2020-05-12 00:44:03,900] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Telepon_pintar 1151 | [2020-05-12 
00:44:05,371] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Cisco_Webex
[2020-05-12 00:44:05,604] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=FileMaker_Inc.&action=edit&redlink=1
[2020-05-12 00:44:06,380] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=FileMaker_Inc.&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:06,380] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/IWoz
[2020-05-12 00:44:06,977] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Martin_Luther_King_Jr.
[2020-05-12 00:44:07,286] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://9to5mac.com/2016/06/10/apple-energy-landfill-gas-electricity/
[2020-05-12 00:44:08,211] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/ExxonMobil
[2020-05-12 00:44:08,428] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Logo&action=edit&redlink=1
[2020-05-12 00:44:09,399] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Logo&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:09,399] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_v._Samsung&action=edit&redlink=1
[2020-05-12 00:44:09,987] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_v._Samsung&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:09,988] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Kategori:Perusahaan_teknologi_Amerika_Serikat
[2020-05-12 00:44:10,533] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/ITunes
[2020-05-12 00:44:10,759] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Apple_Developer&action=edit&redlink=1
[2020-05-12 00:44:11,308] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Apple_Developer&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:11,309] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Automatic_Data_Processing&action=edit&redlink=1
[2020-05-12 00:44:11,851] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Automatic_Data_Processing&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:11,852] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Fastenal
[2020-05-12 00:44:12,130] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=TiVo_Corporation&action=edit&redlink=1
[2020-05-12 00:44:12,693] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=TiVo_Corporation&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:12,694] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://aaplinvestors.net/stats/acquisitions/
[2020-05-12 00:44:14,032] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Apple_Inc.
[2020-05-12 00:44:14,633] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Acer_Inc.
[2020-05-12 00:44:14,855] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/IPhone_3GS
[2020-05-12 00:44:16,023] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Siri
[2020-05-12 00:44:16,221] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Penske_Media_Corporation&action=edit&redlink=1
[2020-05-12 00:44:16,761] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Penske_Media_Corporation&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:16,762] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Tempat_pembuangan_akhir
[2020-05-12 00:44:16,962] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/YouTube
[2020-05-12 00:44:17,343] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://id.wikipedia.org/w/index.php?title=Templat:Seleb_Apple&action=edit
[2020-05-12 00:44:18,277] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.businessinsider.com/qa-with-an-apple-store-worker-2016-5
[2020-05-12 00:44:19,612] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Millard_Drexler&action=edit&redlink=1
[2020-05-12 00:44:20,176] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Millard_Drexler&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:20,176] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/National_Library_of_Australia
[2020-05-12 00:44:21,522] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://www.truth-out.org/news/item/32208-why-is-apple-lying-about-powering-its-data-centers-with-renewable-energy
[2020-05-12 00:44:23,606] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://web.archive.org/web/20141224223906/http://www.globallabourrights.org/reports/exhaustion-has-no-limit-at-apple-supplier-in-china
[2020-05-12 00:44:28,787] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/w/index.php?title=Agilent_Technologies&action=edit&redlink=1
[2020-05-12 00:44:29,399] ERROR::(P:29811 T:139773745317696)::email_crawler - Exception at url: http://id.wikipedia.org/w/index.php?title=Agilent_Technologies&action=edit&redlink=1
HTTP Error 404: Not Found
[2020-05-12 00:44:29,400] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling http://id.wikipedia.org/wiki/Avaya
[2020-05-12 00:44:29,638] INFO::(P:29811 T:139773745317696)::email_crawler - Crawling https://www.wsj.com/articles/apple-assembles-first-iphones-in-india-1495016276
--------------------------------------------------------------------------------