├── .gitignore ├── data └── py_instaloader.rda ├── tests └── testthat.R ├── .Rbuildignore ├── NAMESPACE ├── cran-comments.md ├── R ├── utils-pipe.R ├── login.R ├── get_posts.R └── utils.R ├── man ├── pipe.Rd ├── from_unix.Rd ├── init_instaloadeR.Rd ├── install_instaloadeR.Rd ├── insta_login.Rd └── insta_posts.Rd ├── instaloadeR.Rproj ├── .github ├── ISSUE_TEMPLATE.md ├── CODE_OF_CONDUCT.md ├── SUPPORT.md └── CONTRIBUTING.md ├── DESCRIPTION ├── README.md ├── README.Rmd ├── script.py ├── script2.py └── LICENSE.md /.gitignore: -------------------------------------------------------------------------------- 1 | .Rproj.user 2 | .Rhistory 3 | .RData 4 | -------------------------------------------------------------------------------- /data/py_instaloader.rda: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/favstats/instaloadeR/HEAD/data/py_instaloader.rda -------------------------------------------------------------------------------- /tests/testthat.R: -------------------------------------------------------------------------------- 1 | library(testthat) 2 | library(instaloadeR) 3 | 4 | test_check("instaloadeR") 5 | -------------------------------------------------------------------------------- /.Rbuildignore: -------------------------------------------------------------------------------- 1 | ^instaloadeR\.Rproj$ 2 | ^\.Rproj\.user$ 3 | ^LICENSE\.md$ 4 | ^README\.Rmd$ 5 | ^cran-comments\.md$ 6 | ^\.github$ 7 | -------------------------------------------------------------------------------- /NAMESPACE: -------------------------------------------------------------------------------- 1 | # Generated by roxygen2: do not edit by hand 2 | 3 | export("%>%") 4 | export(from_unix) 5 | export(init_instaloadeR) 6 | export(insta_login) 7 | export(insta_posts) 8 | export(install_instaloadeR) 9 | importFrom(magrittr,"%>%") 10 | -------------------------------------------------------------------------------- /cran-comments.md: -------------------------------------------------------------------------------- 1 | ## Test environments 2 | * local OS X install, R 3.6.3 3 | * ubuntu 14.04 (on travis-ci), R 3.6.3 4 | * win-builder (devel and release) 5 | 6 | ## R CMD check results 7 | 8 | 0 errors | 0 warnings | 1 note 9 | 10 | * This is a new release. 11 | -------------------------------------------------------------------------------- /R/utils-pipe.R: -------------------------------------------------------------------------------- 1 | #' Pipe operator 2 | #' 3 | #' See \code{magrittr::\link[magrittr:pipe]{\%>\%}} for details. 4 | #' 5 | #' @name %>% 6 | #' @rdname pipe 7 | #' @keywords internal 8 | #' @export 9 | #' @importFrom magrittr %>% 10 | #' @usage lhs \%>\% rhs 11 | NULL 12 | -------------------------------------------------------------------------------- /man/pipe.Rd: -------------------------------------------------------------------------------- 1 | % Generated by roxygen2: do not edit by hand 2 | % Please edit documentation in R/utils-pipe.R 3 | \name{\%>\%} 4 | \alias{\%>\%} 5 | \title{Pipe operator} 6 | \usage{ 7 | lhs \%>\% rhs 8 | } 9 | \description{ 10 | See \code{magrittr::\link[magrittr:pipe]{\%>\%}} for details. 11 | } 12 | \keyword{internal} 13 | -------------------------------------------------------------------------------- /man/from_unix.Rd: -------------------------------------------------------------------------------- 1 | % Generated by roxygen2: do not edit by hand 2 | % Please edit documentation in R/utils.R 3 | \name{from_unix} 4 | \alias{from_unix} 5 | \title{from_unix} 6 | \usage{ 7 | from_unix(x) 8 | } 9 | \arguments{ 10 | \item{x}{UNIX timestamp to be converted to datetime} 11 | } 12 | \description{ 13 | Converts UNIX timestamp to datetime format 14 | } 15 | -------------------------------------------------------------------------------- /man/init_instaloadeR.Rd: -------------------------------------------------------------------------------- 1 | % Generated by roxygen2: do not edit by hand 2 | % Please edit documentation in R/utils.R 3 | \name{init_instaloadeR} 4 | \alias{init_instaloadeR} 5 | \title{init_instaloadeR} 6 | \usage{ 7 | init_instaloadeR() 8 | } 9 | \description{ 10 | Initializes the instaloader python functions. Only works when \code{install_instaloadeR()} has ben executed. 11 | } 12 | -------------------------------------------------------------------------------- /man/install_instaloadeR.Rd: -------------------------------------------------------------------------------- 1 | % Generated by roxygen2: do not edit by hand 2 | % Please edit documentation in R/utils.R 3 | \name{install_instaloadeR} 4 | \alias{install_instaloadeR} 5 | \title{install_instaloadeR} 6 | \usage{ 7 | install_instaloadeR(envname = NULL) 8 | } 9 | \arguments{ 10 | \item{envname}{specify Python environment name for module installation} 11 | } 12 | \description{ 13 | Installs the instaloader Python module 14 | } 15 | -------------------------------------------------------------------------------- /instaloadeR.Rproj: -------------------------------------------------------------------------------- 1 | Version: 1.0 2 | 3 | RestoreWorkspace: No 4 | SaveWorkspace: No 5 | AlwaysSaveHistory: Default 6 | 7 | EnableCodeIndexing: Yes 8 | UseSpacesForTab: Yes 9 | NumSpacesForTab: 2 10 | Encoding: UTF-8 11 | 12 | RnwWeave: Sweave 13 | LaTeX: pdfLaTeX 14 | 15 | AutoAppendNewline: Yes 16 | StripTrailingWhitespace: Yes 17 | 18 | BuildType: Package 19 | PackageUseDevtools: Yes 20 | PackageInstallArgs: --no-multiarch --with-keep.source 21 | PackageRoxygenize: rd,collate,namespace 22 | -------------------------------------------------------------------------------- /.github/ISSUE_TEMPLATE.md: -------------------------------------------------------------------------------- 1 | Please briefly describe your problem and what output you expect. If you have a question, please don't use this form. Instead, ask on or . 2 | 3 | Please include a minimal reproducible example (AKA a reprex). If you've never heard of a [reprex](https://reprex.tidyverse.org/) before, start by reading . 4 | 5 | --- 6 | 7 | Brief description of the problem 8 | 9 | ```r 10 | # insert reprex here 11 | ``` 12 | -------------------------------------------------------------------------------- /man/insta_login.Rd: -------------------------------------------------------------------------------- 1 | % Generated by roxygen2: do not edit by hand 2 | % Please edit documentation in R/login.R 3 | \name{insta_login} 4 | \alias{insta_login} 5 | \title{insta_login} 6 | \usage{ 7 | insta_login(passwd = "", save = F, load = F) 8 | } 9 | \arguments{ 10 | \item{passwd}{Specify your Instagram password. If you leave empty a prompt will ask you for input.} 11 | 12 | \item{save}{save your session so you don't need to specify password again} 13 | 14 | \item{load}{load the instagram session by just setting INSTAGRAM_LOGIN environment variable. Will only work if you have saved your credentials before.} 15 | } 16 | \description{ 17 | insta_login 18 | } 19 | -------------------------------------------------------------------------------- /DESCRIPTION: -------------------------------------------------------------------------------- 1 | Package: instaloadeR 2 | Title: Wraps the instaloader python module 3 | Version: 0.0.0.9001 4 | Authors@R: 5 | person(given = "Fabio", 6 | family = "Votta", 7 | role = c("aut", "cre"), 8 | email = "fabio.votta@gmail.com.com", 9 | comment = c(ORCID = "YOUR-ORCID-ID")) 10 | Description: Wraps the instaloader python module. 11 | License: GPL-3 12 | Suggests: 13 | testthat 14 | Encoding: UTF-8 15 | LazyData: true 16 | Roxygen: list(markdown = TRUE) 17 | RoxygenNote: 7.1.0 18 | Imports: 19 | magrittr, 20 | dplyr, 21 | reticulate, 22 | readr, 23 | purrr, 24 | rstudioapi 25 | URL: https://github.com/favstats/instaloadeR 26 | BugReports: https://github.com/favstats/instaloadeR/issues 27 | -------------------------------------------------------------------------------- /R/login.R: -------------------------------------------------------------------------------- 1 | #' insta_login 2 | #' 3 | #' @param passwd Specify your Instagram password. If you leave empty a prompt will ask you for input. 4 | #' @param save save your session so you don't need to specify password again 5 | #' @param load load the instagram session by just setting INSTAGRAM_LOGIN environment variable. Will only work if you have saved your credentials before. 6 | #' @export 7 | insta_login <- function(passwd = "", save = F, load = F) { 8 | 9 | if(Sys.getenv("INSTAGRAM_LOGIN") != ""){ 10 | user <- Sys.getenv("INSTAGRAM_LOGIN") 11 | } else { 12 | stop("You need to set a INSTAGRAM_LOGIN environment variable.") 13 | } 14 | if (!load){ 15 | if(Sys.getenv("INSTAGRAM_PW") != ""){ 16 | passwd <- Sys.getenv("INSTAGRAM_PW") 17 | } else if (passwd == ""){ 18 | passwd <- rstudioapi::askForPassword("Please enter your Instagram PW") 19 | } 20 | } 21 | 22 | 23 | py$insta_login_py(user, passwd, save) 24 | } 25 | -------------------------------------------------------------------------------- /R/get_posts.R: -------------------------------------------------------------------------------- 1 | #' insta_posts 2 | #' 3 | #' @param query Specify hashtag or username 4 | #' @param scope takes two values, either \code{hashtag} or \code{username} 5 | #' @param max_posts what is the maximum amount of posts to scrape 6 | #' @param scrape_comments get all comments from the retrieved posts (defaults to \code{FALSE}`) 7 | #' @param save_path specify a path to stream the Instagram data to (defaults to \code{""} in which case no csv file is saved) 8 | #' @param since optional. get posts in specific timerange (latest date) 9 | #' @param until optional. get posts in specific timerange (earliest date) 10 | #' @export 11 | insta_posts <- function(query, scope, max_posts, scrape_comments, save_path = "", since = "", until = "") { 12 | 13 | py$insta_posts_py(query, scope, max_posts, scrape_comments, save_path, since, until) %>% 14 | purrr::flatten() %>% 15 | dplyr::bind_rows() %>% 16 | unique()# %>% 17 | # dplyr::mutate(timestamp = from_unix(timestamp)) 18 | } 19 | -------------------------------------------------------------------------------- /man/insta_posts.Rd: -------------------------------------------------------------------------------- 1 | % Generated by roxygen2: do not edit by hand 2 | % Please edit documentation in R/get_posts.R 3 | \name{insta_posts} 4 | \alias{insta_posts} 5 | \title{insta_posts} 6 | \usage{ 7 | insta_posts( 8 | query, 9 | scope, 10 | max_posts, 11 | scrape_comments, 12 | save_path = "", 13 | since = "", 14 | until = "" 15 | ) 16 | } 17 | \arguments{ 18 | \item{query}{Specify hashtag or username} 19 | 20 | \item{scope}{takes two values, either \code{hashtag} or \code{username}} 21 | 22 | \item{max_posts}{what is the maximum amount of posts to scrape} 23 | 24 | \item{scrape_comments}{get all comments from the retrieved posts (defaults to \code{FALSE}`)} 25 | 26 | \item{save_path}{specify a path to stream the Instagram data to (defaults to \code{""} in which case no csv file is saved)} 27 | 28 | \item{since}{optional. get posts in specific timerange (latest date)} 29 | 30 | \item{until}{optional. get posts in specific timerange (earliest date)} 31 | } 32 | \description{ 33 | insta_posts 34 | } 35 | -------------------------------------------------------------------------------- /.github/CODE_OF_CONDUCT.md: -------------------------------------------------------------------------------- 1 | # Contributor Code of Conduct 2 | 3 | As contributors and maintainers of this project, we pledge to respect all people who 4 | contribute through reporting issues, posting feature requests, updating documentation, 5 | submitting pull requests or patches, and other activities. 6 | 7 | We are committed to making participation in this project a harassment-free experience for 8 | everyone, regardless of level of experience, gender, gender identity and expression, 9 | sexual orientation, disability, personal appearance, body size, race, ethnicity, age, or religion. 10 | 11 | Examples of unacceptable behavior by participants include the use of sexual language or 12 | imagery, derogatory comments or personal attacks, trolling, public or private harassment, 13 | insults, or other unprofessional conduct. 14 | 15 | Project maintainers have the right and responsibility to remove, edit, or reject comments, 16 | commits, code, wiki edits, issues, and other contributions that are not aligned to this 17 | Code of Conduct. Project maintainers who do not follow the Code of Conduct may be removed 18 | from the project team. 19 | 20 | Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by 21 | opening an issue or contacting one or more of the project maintainers. 22 | 23 | This Code of Conduct is adapted from the Contributor Covenant 24 | (https://www.contributor-covenant.org), version 1.0.0, available at 25 | https://contributor-covenant.org/version/1/0/0/. 26 | -------------------------------------------------------------------------------- /.github/SUPPORT.md: -------------------------------------------------------------------------------- 1 | # Getting help with instaloadeR 2 | 3 | Thanks for using instaloadeR. Before filing an issue, there are a few places 4 | to explore and pieces to put together to make the process as smooth as possible. 5 | 6 | Start by making a minimal **repr**oducible **ex**ample using the 7 | [reprex](https://reprex.tidyverse.org/) package. If you haven't heard of or used 8 | reprex before, you're in for a treat! Seriously, reprex will make all of your 9 | R-question-asking endeavors easier (which is a pretty insane ROI for the five to 10 | ten minutes it'll take you to learn what it's all about). For additional reprex 11 | pointers, check out the [Get help!](https://www.tidyverse.org/help/) section of 12 | the tidyverse site. 13 | 14 | Armed with your reprex, the next step is to figure out [where to ask](https://www.tidyverse.org/help/#where-to-ask). 15 | 16 | * If it's a question: start with [community.rstudio.com](https://community.rstudio.com/), 17 | and/or StackOverflow. There are more people there to answer questions. 18 | * If it's a bug: you're in the right place, file an issue. 19 | * If you're not sure: let the community help you figure it out! If your 20 | problem _is_ a bug or a feature request, you can easily return here and 21 | report it. 22 | 23 | Before opening a new issue, be sure to [search issues and pull requests](https://github.com/tidyverse/instaloadeR/issues) to make sure the 24 | bug hasn't been reported and/or already fixed in the development version. By 25 | default, the search will be pre-populated with `is:issue is:open`. You can 26 | [edit the qualifiers](https://help.github.com/articles/searching-issues-and-pull-requests/) 27 | (e.g. `is:pr`, `is:closed`) as needed. For example, you'd simply 28 | remove `is:open` to search _all_ issues in the repo, open or closed. 29 | 30 | 31 | If you _are_ in the right place, and need to file an issue, please review the 32 | ["File issues"](https://www.tidyverse.org/contribute/#issues) paragraph from 33 | the tidyverse contributing guidelines. 34 | 35 | Thanks for your help! 36 | -------------------------------------------------------------------------------- /.github/CONTRIBUTING.md: -------------------------------------------------------------------------------- 1 | # Contributing to instaloadeR 2 | 3 | This outlines how to propose a change to instaloadeR. For more detailed 4 | info about contributing to this, and other tidyverse packages, please see the 5 | [**development contributing guide**](https://rstd.io/tidy-contrib). 6 | 7 | ### Fixing typos 8 | 9 | Small typos or grammatical errors in documentation may be edited directly using 10 | the GitHub web interface, so long as the changes are made in the _source_ file. 11 | 12 | * YES: you edit a roxygen comment in a `.R` file below `R/`. 13 | * NO: you edit an `.Rd` file below `man/`. 14 | 15 | ### Prerequisites 16 | 17 | Before you make a substantial pull request, you should always file an issue and 18 | make sure someone from the team agrees that it’s a problem. If you’ve found a 19 | bug, create an associated issue and illustrate the bug with a minimal 20 | [reprex](https://www.tidyverse.org/help/#reprex). 21 | 22 | ### Pull request process 23 | 24 | * We recommend that you create a Git branch for each pull request (PR). 25 | * Look at the Travis and AppVeyor build status before and after making changes. 26 | The `README` should contain badges for any continuous integration services used 27 | by the package. 28 | * New code should follow the tidyverse [style guide](https://style.tidyverse.org). 29 | You can use the [styler](https://CRAN.R-project.org/package=styler) package to 30 | apply these styles, but please don't restyle code that has nothing to do with 31 | your PR. 32 | * We use [roxygen2](https://cran.r-project.org/package=roxygen2), with 33 | [Markdown syntax](https://cran.r-project.org/web/packages/roxygen2/vignettes/markdown.html), 34 | for documentation. 35 | * We use [testthat](https://cran.r-project.org/package=testthat). Contributions 36 | with test cases included are easier to accept. 37 | * For user-facing changes, add a bullet to the top of `NEWS.md` below the 38 | current development version header describing the changes made followed by your 39 | GitHub username, and links to relevant issue(s)/PR(s). 40 | 41 | ### Code of Conduct 42 | 43 | Please note that the instaloadeR project is released with a 44 | [Contributor Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this 45 | project you agree to abide by its terms. 46 | 47 | ### See tidyverse [development contributing guide](https://rstd.io/tidy-contrib) 48 | for further details. 49 | -------------------------------------------------------------------------------- /R/utils.R: -------------------------------------------------------------------------------- 1 | 2 | #' init_instaloadeR 3 | #' 4 | #' @description Initializes the instaloader python functions. Only works when \code{install_instaloadeR()} has ben executed. 5 | #' @export 6 | init_instaloadeR <- function(){ 7 | 8 | reticulate::source_python("https://raw.githubusercontent.com/favstats/instaloadeR/master/script.py") 9 | 10 | message("instaloader initialized") 11 | } 12 | 13 | #' install_instaloadeR 14 | #' 15 | #' 16 | #' @description Installs the instaloader Python module 17 | #' @param envname specify Python environment name for module installation 18 | #' @export 19 | install_instaloadeR <- function(envname = NULL){ 20 | reticulate::py_install(c("instaloader"), pip = T, envname = envname) 21 | } 22 | 23 | #' from_unix 24 | #' 25 | #' @description Converts UNIX timestamp to datetime format 26 | #' @param x UNIX timestamp to be converted to datetime 27 | #' @export 28 | from_unix <- function(x) { 29 | as.POSIXct(as.numeric(x), origin = '1970-01-01', tz = 'UTC') 30 | } 31 | 32 | 33 | 34 | hashies <- c("2a", "2ndamendment", "acab", "amerikkka", "anarchy", "aoc", 35 | "barbz4bernie", "bernie_", "bernie2020", "berniesanders", 36 | "biden", "biden2020", "blackconservative", 37 | "blacklivesmatter", "blackrepublican", "blexit", 38 | "blm", "climatestrike", "communism", "communist", 39 | "conservativegirls", "conservativehypehouse", "conservativerepublican", 40 | "conservatives", "creepyjoebiden", 41 | "democraticsocialism", "democrats", "donaldjtrump", "donaldtrump", 42 | "donaldtrump2020", "dumptrump", "eattherich", "fakenews", 43 | "feelthebern", "feminism", "feminist", "georgefloyd", "guncontrol", 44 | "impeach", "impeachment", "impeachtrump", 45 | "joebiden", "justiceforgeorgefloyd", "kag2020", 46 | "kavanaugh", "keepamericagreat", "leftist", "leftists", "lgbtrights", 47 | "liberal", "liberalhypehouse", "liberalism", "liberals", "liberalsaredumb", 48 | "libertarian", "libsoc", "lockhimup", "maga", "maga2020", "magachallenge", 49 | "notmeus", "obama", "openyoureyes_", "political", "politicalhumor", 50 | "politicallyincorrect", "potus", "presidenttrump", "pro2a", "prochoice", 51 | "progressive", "prolife", "protest", 52 | "republican", "republicanhypehouse", 53 | "republicans", "slavery", "soc", "socialism", 54 | "socialist", "tedcruz", "thanksobama", "tpusa", 55 | "transrights", "triggeredliberal", "trump", "trump10to20", 56 | "trump2020", "trump2020kag", "trumppence", "trumpsupporters", 57 | "trumptards", "trumpteam", "trumptrain", "trumptrain2020", 58 | "trumpvoters", "twogenders", "usgovernment", "uspolitics", "vote", 59 | "votebiden", "voteblue", "voteblue2020", "votebluenomatterwho", 60 | "votetrump", "warren2020", "whiteprivilege", "woke", 61 | "wokeposting", "women4trump", "womenfortrump", "ww3", 62 | "yang", "yang2020", "yanggang") %>% unique %>% unlist %>% rev 63 | 64 | c("republicans", "slavery", "soc", "socialism", 65 | "socialist", "tedcruz", "thanksobama", "tpusa", 66 | "2a", "2ndamendment", "acab", "amerikkka", "anarchy", "aoc", 67 | "barbz4bernie", "bernie_", "bernie2020", "berniesanders", 68 | "biden", "biden2020", "blackconservative", 69 | "blacklivesmatter", "blackrepublican", "blexit", 70 | "blm", "climatestrike", "communism", "communist", 71 | "conservativegirls", "conservativehypehouse", "conservativerepublican", 72 | "conservatives", "creepyjoebiden") -> ww 73 | 74 | c( "democraticsocialism", "democrats", "donaldjtrump", "donaldtrump", 75 | "donaldtrump2020", "dumptrump", "eattherich", "fakenews", 76 | "feelthebern", "feminism", "feminist", "georgefloyd", "guncontrol", 77 | "impeach", "impeachment", "impeachtrump") -> rr 78 | 79 | c("joebiden", "justiceforgeorgefloyd", "kag2020", 80 | "kavanaugh", "keepamericagreat", "leftist", "leftists", "lgbtrights", 81 | "liberal", "liberalhypehouse", "liberalism", "liberals", "liberalsaredumb", 82 | "libertarian", "libsoc", "lockhimup", "maga", "maga2020", "magachallenge", 83 | "notmeus", "obama", "openyoureyes_", "political", "politicalhumor", 84 | "politicallyincorrect", "potus", "presidenttrump", "pro2a", "prochoice", 85 | "progressive", "prolife", "protest", 86 | "republican", "republicanhypehouse") -> tt 87 | 88 | c("transrights", "triggeredliberal", "trump", "trump10to20", 89 | "trump2020", "trump2020kag", "trumppence", "trumpsupporters", 90 | "trumptards", "trumpteam", "trumptrain", "trumptrain2020", 91 | "trumpvoters", "twogenders", "usgovernment", "uspolitics", "vote") -> oo 92 | 93 | 94 | hashies %>% 95 | purrr::discard(~magrittr::is_in(.x, ww)) %>% 96 | purrr::discard(~magrittr::is_in(.x, rr)) %>% 97 | purrr::discard(~magrittr::is_in(.x, tt)) %>% 98 | purrr::discard(~magrittr::is_in(.x, oo)) 99 | 100 | c("conservatives", 101 | "impeachment", 102 | "republican", 103 | "uspolitics", 104 | "voteblue") 105 | 106 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | 2 | 3 | 4 | # instaloadeR 5 | 6 | 7 | 8 | [![Lifecycle: 9 | experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://www.tidyverse.org/lifecycle/#experimental) 10 | [![CRAN 11 | status](https://www.r-pkg.org/badges/version/instaloadeR)](https://CRAN.R-project.org/package=instaloadeR) 12 | 13 | 14 | The goal of `instaloadeR` is to provide a scraper for the image-sharing 15 | social networking service 16 | [Instagram](http://https://www.instagram.com/). Mostly inspired by this 17 | Scraping Tool: 18 | [digitalmethodsinitiative/4cat](https://github.com/digitalmethodsinitiative/4cat). 19 | Wraps this Python module 20 | [instaloader](https://github.com/instaloader/instaloader). You will need 21 | Python 3.6 or higher to use `instaloader`. 22 | 23 | ## Installation 24 | 25 | You can install the development version from 26 | [GitHub](https://github.com/) with: 27 | 28 | ``` r 29 | # install.packages("devtools") 30 | devtools::install_github("favstats/instaloadeR") 31 | ``` 32 | 33 | Load library 34 | 35 | ``` r 36 | library(instaloadeR) 37 | ``` 38 | 39 | Make sure to use your preferred Python installation 40 | 41 | ``` r 42 | library(reticulate) 43 | 44 | use_python(py_config()$python) 45 | ``` 46 | 47 | Install necessary Python libraries 48 | 49 | ``` r 50 | install_instaloadeR() 51 | ``` 52 | 53 | ## Example 54 | 55 | This is a basic example which shows you how to solve a common problem: 56 | 57 | Initialize `instaloadeR` 58 | 59 | ``` r 60 | init_instaloadeR() 61 | ``` 62 | 63 | ### Get Posts with Hashtag 64 | 65 | Return a tibble with 10 Instagram posts that use `#coronavirus`. 66 | 67 | ``` r 68 | 69 | corona <- insta_posts(query = "coronavirus", 70 | scope = "hashtag", 71 | max_posts = 10, 72 | scrape_comments = F) 73 | 74 | 75 | corona 76 | ``` 77 | 78 | Also return comments and replies to comments for the 10 last 79 | `#coronavirus` posts. 80 | 81 | ``` r 82 | corona_comments <- insta_posts(query = "coronavirus", 83 | scope = "hashtag", 84 | max_posts = 10, 85 | scrape_comments = T) 86 | 87 | corona_comments 88 | ``` 89 | 90 | ### Get Posts from a Specific User 91 | 92 | Return a tibble with the 10 Instagram posts by `francediplo`. 93 | 94 | ``` r 95 | 96 | francediplo <- insta_posts(query = "francediplo", 97 | scope = "username", 98 | max_posts = 10, 99 | scrape_comments = F) 100 | 101 | 102 | francediplo 103 | ``` 104 | 105 | Retrieve comments as well: 106 | 107 | ``` r 108 | 109 | francediplo_comments <- insta_posts(query = "francediplo", 110 | scope = "username", 111 | max_posts = 10, 112 | scrape_comments = T) 113 | 114 | 115 | francediplo_comments 116 | ``` 117 | 118 | ## Save output to csv 119 | 120 | As function scrapes, the data is saved and continously appended to a 121 | `.csv` file (for when you have long scraping tasks). Just specify a path 122 | for the `save_path` argument. 123 | 124 | ``` r 125 | francediplo_comments <- insta_posts(query = "francediplo", 126 | scope = "username", 127 | max_posts = 10, 128 | scrape_comments = T, 129 | save_path = "francediplo.csv") 130 | 131 | 132 | readr::read_csv("francediplo.csv") 133 | ``` 134 | 135 | ## Get User or Hashtag posts in specific timerange 136 | 137 | Just specify both an argument for `since` (latest date) and `until` 138 | (earliest date) and get Instagram posts in specific timerange. 139 | 140 | ``` r 141 | 142 | covidgoodnews <- insta_posts(query = "covidgoodnews", 143 | scope = "username", 144 | max_posts = 10, 145 | scrape_comments = F, 146 | since = "2020-06-01", 147 | until = "2020-01-01") 148 | 149 | 150 | 151 | covidgoodnews 152 | ``` 153 | 154 | ## Login with your Instagram account 155 | 156 | Some functionalities (like scraping followers from an account) are only 157 | available if you login with your Instagram account. For this purpose, 158 | `instaloadeR` functions will read from your environment variables. 159 | 160 | In order to add the info to your environment file, you can use the 161 | function `edit_r_environ()` from the [`usethis` 162 | package](https://usethis.r-lib.org/). 163 | 164 | ``` r 165 | usethis::edit_r_environ() 166 | ``` 167 | 168 | This will open your .Renviron file in your text editor. Now, you can add 169 | the following line(s) to it: 170 | 171 | ``` r 172 | INSTAGRAM_LOGIN=YOUR_USERNAME 173 | INSTAGRAM_PW=YOUR_PW 174 | ``` 175 | 176 | Save the file and restart R for the changes to take effect. 177 | 178 | The password line is optional and can also be provided via a prompt 179 | which will appear if you run `insta_login` without specifying the 180 | `passwd` argument (if you are icky about typing out your password which 181 | you should always be). 182 | 183 | Now you can use `insta_login` in the following way: 184 | 185 | ``` r 186 | insta_login(save = T) 187 | ``` 188 | 189 | `save = T` will save your credentials, so next time you can just do 190 | `load = T` and you don’t need to specify your password again: 191 | 192 | ``` r 193 | insta_login(load = T) 194 | ``` 195 | 196 | ## Get followers of an account 197 | 198 | You can all followers of an Instagram account with `get_followers` but 199 | only if you are logged in as an Instagram user. The function 200 | `get_followers` retrieves a character vector with all usernames that 201 | follow the specified account. 202 | 203 | ``` r 204 | insta_login(load = T) 205 | 206 | 207 | get_followers("willsmith_fan_") 208 | ``` 209 | 210 | ## Get similar or suggested accounts 211 | 212 | ``` r 213 | insta_login(load = T) 214 | 215 | 216 | similar_accounts <- get_similar_accounts("willsmith_fan_") 217 | 218 | similar_accounts 219 | ``` 220 | 221 | ## TODO 222 | 223 | - Geotags 224 | - Download Images/videos 225 | - … 226 | -------------------------------------------------------------------------------- /README.Rmd: -------------------------------------------------------------------------------- 1 | --- 2 | output: github_document 3 | --- 4 | 5 | 6 | 7 | ```{r, include = FALSE} 8 | knitr::opts_chunk$set( 9 | collapse = TRUE, 10 | comment = "#>", 11 | fig.path = "man/figures/README-", 12 | out.width = "100%" 13 | ) 14 | ``` 15 | 16 | # instaloadeR 17 | 18 | 19 | [![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://www.tidyverse.org/lifecycle/#experimental) 20 | [![CRAN status](https://www.r-pkg.org/badges/version/instaloadeR)](https://CRAN.R-project.org/package=instaloadeR) 21 | 22 | 23 | The goal of `instaloadeR` is to provide a scraper for the image-sharing social networking service [Instagram](http://https://www.instagram.com/). Mostly inspired by this Scraping Tool: [digitalmethodsinitiative/4cat](https://github.com/digitalmethodsinitiative/4cat). Wraps this Python module [instaloader](https://github.com/instaloader/instaloader). You will need Python 3.6 or higher to use `instaloader`. 24 | 25 | ## Installation 26 | 27 | You can install the development version from [GitHub](https://github.com/) with: 28 | 29 | ```{r eval = F} 30 | # install.packages("devtools") 31 | devtools::install_github("favstats/instaloadeR") 32 | ``` 33 | 34 | Load library 35 | 36 | ```{r example} 37 | library(instaloadeR) 38 | 39 | ``` 40 | 41 | Make sure to use your preferred Python installation 42 | 43 | ```{r} 44 | library(reticulate) 45 | 46 | use_python(py_config()$python) 47 | 48 | ``` 49 | 50 | Install necessary Python libraries 51 | 52 | ```{r, eval = F} 53 | install_instaloadeR() 54 | ``` 55 | 56 | 57 | ## Example 58 | 59 | This is a basic example which shows you how to solve a common problem: 60 | 61 | 62 | 63 | Initialize `instaloadeR` 64 | 65 | ```{r, eval=F} 66 | init_instaloadeR() 67 | ``` 68 | 69 | ### Get Posts with Hashtag 70 | 71 | Return a tibble with 10 Instagram posts that use `#coronavirus`. 72 | 73 | 74 | ```{r, eval = F} 75 | 76 | corona <- insta_posts(query = "coronavirus", 77 | scope = "hashtag", 78 | max_posts = 10, 79 | scrape_comments = F) 80 | 81 | 82 | corona 83 | ``` 84 | 85 | Also return comments and replies to comments for the 10 last `#coronavirus` posts. 86 | 87 | ```{r, eval = F} 88 | corona_comments <- insta_posts(query = "coronavirus", 89 | scope = "hashtag", 90 | max_posts = 10, 91 | scrape_comments = T) 92 | 93 | corona_comments 94 | ``` 95 | 96 | 97 | ### Get Posts from a Specific User 98 | 99 | Return a tibble with the 10 Instagram posts by `francediplo`. 100 | 101 | 102 | ```{r, eval = F} 103 | 104 | francediplo <- insta_posts(query = "francediplo", 105 | scope = "username", 106 | max_posts = 10, 107 | scrape_comments = F) 108 | 109 | 110 | francediplo 111 | ``` 112 | 113 | Retrieve comments as well: 114 | 115 | ```{r, eval = F} 116 | 117 | francediplo_comments <- insta_posts(query = "francediplo", 118 | scope = "username", 119 | max_posts = 10, 120 | scrape_comments = T) 121 | 122 | 123 | francediplo_comments 124 | ``` 125 | 126 | 127 | ## Save output to csv 128 | 129 | As function scrapes, the data is saved and continously appended to a `.csv` file (for when you have long scraping tasks). Just specify a path for the `save_path` argument. 130 | 131 | ```{r, eval = F} 132 | francediplo_comments <- insta_posts(query = "francediplo", 133 | scope = "username", 134 | max_posts = 10, 135 | scrape_comments = T, 136 | save_path = "francediplo.csv") 137 | 138 | 139 | readr::read_csv("francediplo.csv") 140 | 141 | 142 | 143 | ``` 144 | 145 | 146 | ## Get User or Hashtag posts in specific timerange 147 | 148 | Just specify both an argument for `since` (latest date) and `until` (earliest date) and get Instagram posts in specific timerange. 149 | 150 | ```{r, eval = F} 151 | 152 | covidgoodnews <- insta_posts(query = "covidgoodnews", 153 | scope = "username", 154 | max_posts = 10, 155 | scrape_comments = F, 156 | since = "2020-06-01", 157 | until = "2020-01-01") 158 | 159 | 160 | 161 | covidgoodnews 162 | ``` 163 | 164 | 165 | ## Login with your Instagram account 166 | 167 | Some functionalities (like scraping followers from an account) are only available if you login with your Instagram account. For this purpose, `instaloadeR` functions will read from your environment variables. 168 | 169 | 170 | 171 | In order to add the info to your environment file, you can use the function `edit_r_environ()` from the [`usethis` package](https://usethis.r-lib.org/). 172 | 173 | ```{r, eval = F} 174 | usethis::edit_r_environ() 175 | ``` 176 | 177 | 178 | This will open your .Renviron file in your text editor. Now, you can add the following line(s) to it: 179 | 180 | ```{r, eval = F} 181 | INSTAGRAM_LOGIN=YOUR_USERNAME 182 | INSTAGRAM_PW=YOUR_PW 183 | ``` 184 | 185 | Save the file and restart R for the changes to take effect. 186 | 187 | The password line is optional and can also be provided via a prompt which will appear if you run `insta_login` without specifying the `passwd` argument (if you are icky about typing out your password which you should always be). 188 | 189 | Now you can use `insta_login` in the following way: 190 | 191 | ```{r, eval=F} 192 | insta_login(save = T) 193 | ``` 194 | 195 | `save = T` will save your credentials, so next time you can just do `load = T` and you don't need to specify your password again: 196 | 197 | ```{r, eval=F} 198 | insta_login(load = T) 199 | ``` 200 | 201 | ## Get followers of an account 202 | 203 | You can all followers of an Instagram account with `get_followers` but only if you are logged in as an Instagram user. The function `get_followers` retrieves a character vector with all usernames that follow the specified account. 204 | 205 | ```{r, eval=F} 206 | insta_login(load = T) 207 | 208 | 209 | get_followers("willsmith_fan_") 210 | ``` 211 | 212 | ## Get similar or suggested accounts 213 | 214 | 215 | ```{r, eval=F} 216 | insta_login(load = T) 217 | 218 | 219 | similar_accounts <- get_similar_accounts("willsmith_fan_") 220 | 221 | similar_accounts 222 | ``` 223 | 224 | 225 | ## TODO 226 | 227 | + Geotags 228 | + Download Images/videos 229 | + ... 230 | 231 | 232 | -------------------------------------------------------------------------------- /script.py: -------------------------------------------------------------------------------- 1 | import instaloader 2 | from datetime import datetime 3 | import os.path 4 | from os import path 5 | from itertools import dropwhile, takewhile 6 | import re 7 | import csv 8 | from random import randint 9 | from time import sleep 10 | 11 | 12 | 13 | instagram = instaloader.Instaloader( 14 | quiet=False, 15 | download_pictures=False, 16 | download_videos=False, 17 | download_comments=True, 18 | download_geotags=False, 19 | download_video_thumbnails=False, 20 | compress_json=False, 21 | save_metadata=True 22 | ) 23 | 24 | def insta_login_py(user, passwd = "", save = False): 25 | if passwd == "": 26 | instagram.load_session_from_file(user) 27 | if user != "" and passwd != "": 28 | instagram.login(user = user, passwd = passwd) 29 | if save: 30 | instagram.save_session_to_file() 31 | 32 | def save_csv(save_path, results_posts): 33 | if not path.exists(save_path): 34 | with open(save_path, 'w', newline='', encoding='utf-8') as csvfile: 35 | fieldnames = [*results_posts[0].keys()] 36 | writer = csv.DictWriter(csvfile, fieldnames=fieldnames) 37 | writer.writeheader() 38 | if path.exists(save_path): 39 | with open(save_path, 'a', newline='', encoding='utf-8') as csvfile: 40 | fieldnames = [*results_posts[0].keys()] 41 | writer = csv.DictWriter(csvfile, fieldnames=fieldnames) 42 | for dictrow in results_posts: 43 | writer.writerow(dictrow) 44 | 45 | def insta_posts_py(query, scope, max_posts, scrape_comments, save_path = "", since = "", until = ""): 46 | """ 47 | Run custom search 48 | 49 | Fetches data from Instagram via instaloader. 50 | """ 51 | # this is useful to include in the results because researchers are 52 | # always thirsty for them hashtags 53 | hashtag = re.compile(r"#([^\s,.+=-]+)") 54 | mention = re.compile(r"@([a-zA-Z0-9_]+)") 55 | 56 | queries = query.split(",") 57 | 58 | if since != "" and until != "": 59 | since = since.split("-") 60 | until = until.split("-") 61 | 62 | for item in range(len(since)): 63 | since[item] = int(since[item]) 64 | 65 | for item in range(len(until)): 66 | until[item] = int(until[item]) 67 | 68 | since = datetime(since[0], since[1], since[2]) 69 | until = datetime(until[0], until[1], until[2]) 70 | 71 | # return queries 72 | posts = [] 73 | 74 | # for each query, get items 75 | for query in queries: 76 | chunk_size = 0 77 | print("Retrieving posts ('%s')" % query) 78 | try: 79 | if scope == "hashtag": 80 | query = query.replace("#", "") 81 | hashtag_obj = instaloader.Hashtag.from_name(instagram.context, query) 82 | chunk = hashtag_obj.get_posts() 83 | elif scope == "username": 84 | query = query.replace("@", "") 85 | profile = instaloader.Profile.from_username(instagram.context, query) 86 | chunk = profile.get_posts() 87 | else: 88 | print("Invalid search scope for instagram scraper: %s" % repr(scope)) 89 | return [] 90 | 91 | # "chunk" is a generator so actually retrieve the posts next 92 | posts_processed = 0 93 | for post in chunk: 94 | 95 | chunk_size += 1 96 | print("Retrieving posts ('%s', %i posts)" % (query, chunk_size)) 97 | if posts_processed >= max_posts: 98 | break 99 | try: 100 | posts.append(chunk.__next__()) 101 | posts_processed += 1 102 | except StopIteration: 103 | break 104 | except instaloader.InstaloaderException as e: 105 | print("Error while retrieving posts for query '%s'" % query) 106 | 107 | # go through posts, and retrieve comments 108 | results = [] 109 | posts_processed = 0 110 | comments_bit = " and comments" if scrape_comments==True else "" 111 | 112 | if since != "" and until != "": 113 | posts = takewhile(lambda p: p.date > until, dropwhile(lambda p: p.date > since, posts)) 114 | 115 | 116 | 117 | for post in posts: 118 | 119 | results_posts = [] 120 | 121 | posts_processed += 1 122 | print("Retrieving metadata%s for post %i" % (comments_bit, posts_processed)) 123 | 124 | thread_id = post.shortcode 125 | 126 | try: 127 | results_posts.append({ 128 | "id": str(thread_id), 129 | "thread_id": str(thread_id), 130 | "parent_id": str(thread_id), 131 | "body": post.caption if post.caption is not None else "", 132 | "author": post.owner_username, 133 | "timestamp": post.date_utc.timestamp(), 134 | "type": "video" if post.is_video else "picture", 135 | "url": post.video_url if post.is_video else post.url, 136 | "thumbnail_url": post.url, 137 | "hashtags": ",".join(post.caption_hashtags), 138 | "usertags": ",".join(post.tagged_users), 139 | "mentioned": ",".join(mention.findall(post.caption) if post.caption else ""), 140 | "num_likes": post.likes, 141 | "num_comments": post.comments, 142 | "level": "post", 143 | "query": query 144 | }) 145 | except (instaloader.QueryReturnedNotFoundException, instaloader.ConnectionException): 146 | pass 147 | 148 | if not scrape_comments==True: 149 | if save_path != "": 150 | save_csv(save_path, results_posts) 151 | results.append(results_posts) 152 | continue 153 | 154 | if(posts_processed % 10 == 0): 155 | wait_time = randint(300,500) 156 | print("Wating for " + str(wait_time) + " seconds.") 157 | sleep(wait_time) 158 | else: 159 | wait_time = randint(20,30) 160 | print("Wating for " + str(wait_time) + " seconds.") 161 | sleep(wait_time) 162 | 163 | try: 164 | for comment in post.get_comments(): 165 | answers = [answer for answer in comment.answers] 166 | 167 | try: 168 | results_posts.append({ 169 | "id": str(comment.id), 170 | "thread_id": str(thread_id), 171 | "parent_id": str(thread_id), 172 | "body": comment.text, 173 | "author": comment.owner.username, 174 | "timestamp": comment.created_at_utc.timestamp(), 175 | "type": "comment", 176 | "url": "", 177 | "hashtags": ",".join(hashtag.findall(comment.text)), 178 | "usertags": "", 179 | "mentioned": ",".join(mention.findall(comment.text)), 180 | "num_likes": comment.likes_count if hasattr(comment, "likes_count") else 0, 181 | "num_comments": len(answers), 182 | "level": "comment", 183 | "query": query 184 | }) 185 | except instaloader.QueryReturnedNotFoundException: 186 | pass 187 | 188 | 189 | # instagram only has one reply depth level at the time of 190 | # writing, represented here 191 | for answer in answers: 192 | try: 193 | results_posts.append({ 194 | "id": str(answer.id), 195 | "thread_id": str(thread_id), 196 | "parent_id": str(comment.id), 197 | "body": answer.text, 198 | "author": answer.owner.username, 199 | "timestamp": answer.created_at_utc.timestamp(), 200 | "type": "comment", 201 | "url": "", 202 | "hashtags": ",".join(hashtag.findall(answer.text)), 203 | "usertags": "", 204 | "mentioned": ",".join(mention.findall(answer.text)), 205 | "num_likes": answer.likes_count if hasattr(answer, "likes_count") else 0, 206 | "num_comments": 0, 207 | "level": "answer", 208 | "query": query 209 | }) 210 | except instaloader.QueryReturnedNotFoundException: 211 | pass 212 | 213 | except (instaloader.QueryReturnedNotFoundException, instaloader.ConnectionException): 214 | # data not available...? this happens sometimes, not clear why 215 | pass 216 | 217 | if save_path != "": 218 | save_csv(save_path, results_posts) 219 | 220 | results.append(results_posts) 221 | 222 | return results 223 | 224 | 225 | def get_followers(username): 226 | 227 | profile = instaloader.Profile.from_username(instagram.context, username) 228 | 229 | follower_list = [] 230 | for follower in profile.get_followers(): 231 | # print(follower.username) 232 | follower_list.append(follower.username) 233 | 234 | return(follower_list) 235 | 236 | 237 | def get_similar_accounts(username): 238 | 239 | profile = instaloader.Profile.from_username(instagram.context, username) 240 | 241 | account_list = [] 242 | for account in profile.get_similar_accounts(): 243 | # print(account.username) 244 | account_list.append(account.username) 245 | 246 | return(account_list) 247 | -------------------------------------------------------------------------------- /script2.py: -------------------------------------------------------------------------------- 1 | import instaloader 2 | from datetime import datetime 3 | import os.path 4 | from os import path 5 | from itertools import dropwhile, takewhile 6 | import re 7 | import csv 8 | from random import randint 9 | from time import sleep 10 | 11 | 12 | 13 | instagram = instaloader.Instaloader( 14 | quiet=False, 15 | download_pictures=False, 16 | download_videos=False, 17 | download_comments=True, 18 | download_geotags=False, 19 | download_video_thumbnails=False, 20 | compress_json=False, 21 | save_metadata=True 22 | ) 23 | 24 | def insta_login_py(user, passwd = "", save = False): 25 | if passwd == "": 26 | instagram.load_session_from_file(user) 27 | if user != "" and passwd != "": 28 | instagram.login(user = user, passwd = passwd) 29 | if save: 30 | instagram.save_session_to_file() 31 | 32 | def save_csv(save_path, results_posts): 33 | if not path.exists(save_path): 34 | with open(save_path, 'w', newline='', encoding='utf-8') as csvfile: 35 | fieldnames = [*results_posts[0].keys()] 36 | writer = csv.DictWriter(csvfile, fieldnames=fieldnames) 37 | writer.writeheader() 38 | if path.exists(save_path): 39 | with open(save_path, 'a', newline='', encoding='utf-8') as csvfile: 40 | fieldnames = [*results_posts[0].keys()] 41 | writer = csv.DictWriter(csvfile, fieldnames=fieldnames) 42 | for dictrow in results_posts: 43 | writer.writerow(dictrow) 44 | 45 | def insta_posts_py(query, scope, max_posts, scrape_comments, save_path = "", since = "", until = ""): 46 | """ 47 | Run custom search 48 | 49 | Fetches data from Instagram via instaloader. 50 | """ 51 | # this is useful to include in the results because researchers are 52 | # always thirsty for them hashtags 53 | hashtag = re.compile(r"#([^\s,.+=-]+)") 54 | mention = re.compile(r"@([a-zA-Z0-9_]+)") 55 | 56 | queries = query.split(",") 57 | 58 | if since != "" and until != "": 59 | since = since.split("-") 60 | until = until.split("-") 61 | 62 | for item in range(len(since)): 63 | since[item] = int(since[item]) 64 | 65 | for item in range(len(until)): 66 | until[item] = int(until[item]) 67 | 68 | since = datetime(since[0], since[1], since[2]) 69 | until = datetime(until[0], until[1], until[2]) 70 | 71 | # return queries 72 | posts = [] 73 | 74 | # for each query, get items 75 | for query in queries: 76 | chunk_size = 0 77 | print("Retrieving posts ('%s')" % query) 78 | try: 79 | if scope == "hashtag": 80 | query = query.replace("#", "") 81 | hashtag_obj = instaloader.Hashtag.from_name(instagram.context, query) 82 | chunk = hashtag_obj.get_posts() 83 | elif scope == "username": 84 | query = query.replace("@", "") 85 | profile = instaloader.Profile.from_username(instagram.context, query) 86 | chunk = profile.get_posts() 87 | else: 88 | print("Invalid search scope for instagram scraper: %s" % repr(scope)) 89 | return [] 90 | 91 | # "chunk" is a generator so actually retrieve the posts next 92 | posts_processed = 0 93 | # go through posts, and retrieve comments 94 | results = [] 95 | results_posts = [] 96 | for post in chunk: 97 | 98 | chunk_size += 1 99 | print("Retrieving posts ('%s', %i posts)" % (query, chunk_size)) 100 | if posts_processed >= max_posts: 101 | break 102 | try: 103 | posts.append(chunk.__next__()) 104 | posts_processed += 1 105 | 106 | comments_bit = " and comments" if scrape_comments==True else "" 107 | 108 | if since != "" and until != "": 109 | posts = takewhile(lambda p: p.date > until, dropwhile(lambda p: p.date > since, posts)) 110 | 111 | for post in posts: 112 | 113 | print("Retrieving metadata%s for post %i" % (comments_bit, posts_processed)) 114 | 115 | thread_id = post.shortcode 116 | try: 117 | results_posts.append({ 118 | "id": str(thread_id), 119 | "thread_id": str(thread_id), 120 | "parent_id": str(thread_id), 121 | "body": post.caption if post.caption is not None else "", 122 | "author": post.owner_username, 123 | "timestamp": post.date_utc.timestamp(), 124 | "type": "video" if post.is_video else "picture", 125 | "url": post.video_url if post.is_video else post.url, 126 | "thumbnail_url": post.url, 127 | "hashtags": ",".join(post.caption_hashtags), 128 | "usertags": ",".join(post.tagged_users), 129 | "mentioned": ",".join(mention.findall(post.caption) if post.caption else ""), 130 | "num_likes": post.likes, 131 | "num_comments": post.comments, 132 | "level": "post", 133 | "query": query 134 | }) 135 | except (instaloader.QueryReturnedNotFoundException, instaloader.ConnectionException): 136 | pass 137 | 138 | if not scrape_comments==True: 139 | if save_path != "": 140 | save_csv(save_path, results_posts) 141 | results.append(results_posts) 142 | continue 143 | 144 | if(posts_processed % 10 == 0): 145 | wait_time = randint(300,500) 146 | print("Wating for " + str(wait_time) + " seconds.") 147 | sleep(wait_time) 148 | else: 149 | wait_time = randint(20,30) 150 | print("Wating for " + str(wait_time) + " seconds.") 151 | sleep(wait_time) 152 | 153 | try: 154 | for comment in post.get_comments(): 155 | answers = [answer for answer in comment.answers] 156 | 157 | try: 158 | results_posts.append({ 159 | "id": str(comment.id), 160 | "thread_id": str(thread_id), 161 | "parent_id": str(thread_id), 162 | "body": comment.text, 163 | "author": comment.owner.username, 164 | "timestamp": comment.created_at_utc.timestamp(), 165 | "type": "comment", 166 | "url": "", 167 | "hashtags": ",".join(hashtag.findall(comment.text)), 168 | "usertags": "", 169 | "mentioned": ",".join(mention.findall(comment.text)), 170 | "num_likes": comment.likes_count if hasattr(comment, "likes_count") else 0, 171 | "num_comments": len(answers), 172 | "level": "comment", 173 | "query": query 174 | }) 175 | except instaloader.QueryReturnedNotFoundException: 176 | pass 177 | 178 | 179 | # instagram only has one reply depth level at the time of 180 | # writing, represented here 181 | for answer in answers: 182 | try: 183 | results_posts.append({ 184 | "id": str(answer.id), 185 | "thread_id": str(thread_id), 186 | "parent_id": str(comment.id), 187 | "body": answer.text, 188 | "author": answer.owner.username, 189 | "timestamp": answer.created_at_utc.timestamp(), 190 | "type": "comment", 191 | "url": "", 192 | "hashtags": ",".join(hashtag.findall(answer.text)), 193 | "usertags": "", 194 | "mentioned": ",".join(mention.findall(answer.text)), 195 | "num_likes": answer.likes_count if hasattr(answer, "likes_count") else 0, 196 | "num_comments": 0, 197 | "level": "answer", 198 | "query": query 199 | }) 200 | except instaloader.QueryReturnedNotFoundException: 201 | pass 202 | except instaloader.QueryReturnedNotFoundException: 203 | pass 204 | if save_path != "": 205 | save_csv(save_path, results_posts) 206 | except instaloader.QueryReturnedNotFoundException: 207 | pass 208 | except (instaloader.QueryReturnedNotFoundException, instaloader.ConnectionException): 209 | # data not available...? this happens sometimes, not clear why 210 | pass 211 | results.append(results_posts) 212 | return results 213 | 214 | 215 | def get_followers(username): 216 | 217 | profile = instaloader.Profile.from_username(instagram.context, username) 218 | 219 | follower_list = [] 220 | for follower in profile.get_followers(): 221 | # print(follower.username) 222 | follower_list.append(follower.username) 223 | 224 | return(follower_list) 225 | 226 | 227 | def get_similar_accounts(username): 228 | 229 | profile = instaloader.Profile.from_username(instagram.context, username) 230 | 231 | account_list = [] 232 | for account in profile.get_similar_accounts(): 233 | # print(account.username) 234 | account_list.append(account.username) 235 | 236 | return(account_list) 237 | -------------------------------------------------------------------------------- /LICENSE.md: -------------------------------------------------------------------------------- 1 | GNU General Public License 2 | ========================== 3 | 4 | _Version 3, 29 June 2007_ 5 | _Copyright © 2007 Free Software Foundation, Inc. <>_ 6 | 7 | Everyone is permitted to copy and distribute verbatim copies of this license 8 | document, but changing it is not allowed. 9 | 10 | ## Preamble 11 | 12 | The GNU General Public License is a free, copyleft license for software and other 13 | kinds of works. 14 | 15 | The licenses for most software and other practical works are designed to take away 16 | your freedom to share and change the works. By contrast, the GNU General Public 17 | License is intended to guarantee your freedom to share and change all versions of a 18 | program--to make sure it remains free software for all its users. We, the Free 19 | Software Foundation, use the GNU General Public License for most of our software; it 20 | applies also to any other work released this way by its authors. You can apply it to 21 | your programs, too. 22 | 23 | When we speak of free software, we are referring to freedom, not price. Our General 24 | Public Licenses are designed to make sure that you have the freedom to distribute 25 | copies of free software (and charge for them if you wish), that you receive source 26 | code or can get it if you want it, that you can change the software or use pieces of 27 | it in new free programs, and that you know you can do these things. 28 | 29 | To protect your rights, we need to prevent others from denying you these rights or 30 | asking you to surrender the rights. Therefore, you have certain responsibilities if 31 | you distribute copies of the software, or if you modify it: responsibilities to 32 | respect the freedom of others. 33 | 34 | For example, if you distribute copies of such a program, whether gratis or for a fee, 35 | you must pass on to the recipients the same freedoms that you received. You must make 36 | sure that they, too, receive or can get the source code. And you must show them these 37 | terms so they know their rights. 38 | 39 | Developers that use the GNU GPL protect your rights with two steps: **(1)** assert 40 | copyright on the software, and **(2)** offer you this License giving you legal permission 41 | to copy, distribute and/or modify it. 42 | 43 | For the developers' and authors' protection, the GPL clearly explains that there is 44 | no warranty for this free software. For both users' and authors' sake, the GPL 45 | requires that modified versions be marked as changed, so that their problems will not 46 | be attributed erroneously to authors of previous versions. 47 | 48 | Some devices are designed to deny users access to install or run modified versions of 49 | the software inside them, although the manufacturer can do so. This is fundamentally 50 | incompatible with the aim of protecting users' freedom to change the software. The 51 | systematic pattern of such abuse occurs in the area of products for individuals to 52 | use, which is precisely where it is most unacceptable. Therefore, we have designed 53 | this version of the GPL to prohibit the practice for those products. If such problems 54 | arise substantially in other domains, we stand ready to extend this provision to 55 | those domains in future versions of the GPL, as needed to protect the freedom of 56 | users. 57 | 58 | Finally, every program is threatened constantly by software patents. States should 59 | not allow patents to restrict development and use of software on general-purpose 60 | computers, but in those that do, we wish to avoid the special danger that patents 61 | applied to a free program could make it effectively proprietary. To prevent this, the 62 | GPL assures that patents cannot be used to render the program non-free. 63 | 64 | The precise terms and conditions for copying, distribution and modification follow. 65 | 66 | ## TERMS AND CONDITIONS 67 | 68 | ### 0. Definitions 69 | 70 | “This License” refers to version 3 of the GNU General Public License. 71 | 72 | “Copyright” also means copyright-like laws that apply to other kinds of 73 | works, such as semiconductor masks. 74 | 75 | “The Program” refers to any copyrightable work licensed under this 76 | License. Each licensee is addressed as “you”. “Licensees” and 77 | “recipients” may be individuals or organizations. 78 | 79 | To “modify” a work means to copy from or adapt all or part of the work in 80 | a fashion requiring copyright permission, other than the making of an exact copy. The 81 | resulting work is called a “modified version” of the earlier work or a 82 | work “based on” the earlier work. 83 | 84 | A “covered work” means either the unmodified Program or a work based on 85 | the Program. 86 | 87 | To “propagate” a work means to do anything with it that, without 88 | permission, would make you directly or secondarily liable for infringement under 89 | applicable copyright law, except executing it on a computer or modifying a private 90 | copy. Propagation includes copying, distribution (with or without modification), 91 | making available to the public, and in some countries other activities as well. 92 | 93 | To “convey” a work means any kind of propagation that enables other 94 | parties to make or receive copies. Mere interaction with a user through a computer 95 | network, with no transfer of a copy, is not conveying. 96 | 97 | An interactive user interface displays “Appropriate Legal Notices” to the 98 | extent that it includes a convenient and prominently visible feature that **(1)** 99 | displays an appropriate copyright notice, and **(2)** tells the user that there is no 100 | warranty for the work (except to the extent that warranties are provided), that 101 | licensees may convey the work under this License, and how to view a copy of this 102 | License. If the interface presents a list of user commands or options, such as a 103 | menu, a prominent item in the list meets this criterion. 104 | 105 | ### 1. Source Code 106 | 107 | The “source code” for a work means the preferred form of the work for 108 | making modifications to it. “Object code” means any non-source form of a 109 | work. 110 | 111 | A “Standard Interface” means an interface that either is an official 112 | standard defined by a recognized standards body, or, in the case of interfaces 113 | specified for a particular programming language, one that is widely used among 114 | developers working in that language. 115 | 116 | The “System Libraries” of an executable work include anything, other than 117 | the work as a whole, that **(a)** is included in the normal form of packaging a Major 118 | Component, but which is not part of that Major Component, and **(b)** serves only to 119 | enable use of the work with that Major Component, or to implement a Standard 120 | Interface for which an implementation is available to the public in source code form. 121 | A “Major Component”, in this context, means a major essential component 122 | (kernel, window system, and so on) of the specific operating system (if any) on which 123 | the executable work runs, or a compiler used to produce the work, or an object code 124 | interpreter used to run it. 125 | 126 | The “Corresponding Source” for a work in object code form means all the 127 | source code needed to generate, install, and (for an executable work) run the object 128 | code and to modify the work, including scripts to control those activities. However, 129 | it does not include the work's System Libraries, or general-purpose tools or 130 | generally available free programs which are used unmodified in performing those 131 | activities but which are not part of the work. For example, Corresponding Source 132 | includes interface definition files associated with source files for the work, and 133 | the source code for shared libraries and dynamically linked subprograms that the work 134 | is specifically designed to require, such as by intimate data communication or 135 | control flow between those subprograms and other parts of the work. 136 | 137 | The Corresponding Source need not include anything that users can regenerate 138 | automatically from other parts of the Corresponding Source. 139 | 140 | The Corresponding Source for a work in source code form is that same work. 141 | 142 | ### 2. Basic Permissions 143 | 144 | All rights granted under this License are granted for the term of copyright on the 145 | Program, and are irrevocable provided the stated conditions are met. This License 146 | explicitly affirms your unlimited permission to run the unmodified Program. The 147 | output from running a covered work is covered by this License only if the output, 148 | given its content, constitutes a covered work. This License acknowledges your rights 149 | of fair use or other equivalent, as provided by copyright law. 150 | 151 | You may make, run and propagate covered works that you do not convey, without 152 | conditions so long as your license otherwise remains in force. You may convey covered 153 | works to others for the sole purpose of having them make modifications exclusively 154 | for you, or provide you with facilities for running those works, provided that you 155 | comply with the terms of this License in conveying all material for which you do not 156 | control copyright. Those thus making or running the covered works for you must do so 157 | exclusively on your behalf, under your direction and control, on terms that prohibit 158 | them from making any copies of your copyrighted material outside their relationship 159 | with you. 160 | 161 | Conveying under any other circumstances is permitted solely under the conditions 162 | stated below. Sublicensing is not allowed; section 10 makes it unnecessary. 163 | 164 | ### 3. Protecting Users' Legal Rights From Anti-Circumvention Law 165 | 166 | No covered work shall be deemed part of an effective technological measure under any 167 | applicable law fulfilling obligations under article 11 of the WIPO copyright treaty 168 | adopted on 20 December 1996, or similar laws prohibiting or restricting circumvention 169 | of such measures. 170 | 171 | When you convey a covered work, you waive any legal power to forbid circumvention of 172 | technological measures to the extent such circumvention is effected by exercising 173 | rights under this License with respect to the covered work, and you disclaim any 174 | intention to limit operation or modification of the work as a means of enforcing, 175 | against the work's users, your or third parties' legal rights to forbid circumvention 176 | of technological measures. 177 | 178 | ### 4. Conveying Verbatim Copies 179 | 180 | You may convey verbatim copies of the Program's source code as you receive it, in any 181 | medium, provided that you conspicuously and appropriately publish on each copy an 182 | appropriate copyright notice; keep intact all notices stating that this License and 183 | any non-permissive terms added in accord with section 7 apply to the code; keep 184 | intact all notices of the absence of any warranty; and give all recipients a copy of 185 | this License along with the Program. 186 | 187 | You may charge any price or no price for each copy that you convey, and you may offer 188 | support or warranty protection for a fee. 189 | 190 | ### 5. Conveying Modified Source Versions 191 | 192 | You may convey a work based on the Program, or the modifications to produce it from 193 | the Program, in the form of source code under the terms of section 4, provided that 194 | you also meet all of these conditions: 195 | 196 | * **a)** The work must carry prominent notices stating that you modified it, and giving a 197 | relevant date. 198 | * **b)** The work must carry prominent notices stating that it is released under this 199 | License and any conditions added under section 7. This requirement modifies the 200 | requirement in section 4 to “keep intact all notices”. 201 | * **c)** You must license the entire work, as a whole, under this License to anyone who 202 | comes into possession of a copy. This License will therefore apply, along with any 203 | applicable section 7 additional terms, to the whole of the work, and all its parts, 204 | regardless of how they are packaged. This License gives no permission to license the 205 | work in any other way, but it does not invalidate such permission if you have 206 | separately received it. 207 | * **d)** If the work has interactive user interfaces, each must display Appropriate Legal 208 | Notices; however, if the Program has interactive interfaces that do not display 209 | Appropriate Legal Notices, your work need not make them do so. 210 | 211 | A compilation of a covered work with other separate and independent works, which are 212 | not by their nature extensions of the covered work, and which are not combined with 213 | it such as to form a larger program, in or on a volume of a storage or distribution 214 | medium, is called an “aggregate” if the compilation and its resulting 215 | copyright are not used to limit the access or legal rights of the compilation's users 216 | beyond what the individual works permit. Inclusion of a covered work in an aggregate 217 | does not cause this License to apply to the other parts of the aggregate. 218 | 219 | ### 6. Conveying Non-Source Forms 220 | 221 | You may convey a covered work in object code form under the terms of sections 4 and 222 | 5, provided that you also convey the machine-readable Corresponding Source under the 223 | terms of this License, in one of these ways: 224 | 225 | * **a)** Convey the object code in, or embodied in, a physical product (including a 226 | physical distribution medium), accompanied by the Corresponding Source fixed on a 227 | durable physical medium customarily used for software interchange. 228 | * **b)** Convey the object code in, or embodied in, a physical product (including a 229 | physical distribution medium), accompanied by a written offer, valid for at least 230 | three years and valid for as long as you offer spare parts or customer support for 231 | that product model, to give anyone who possesses the object code either **(1)** a copy of 232 | the Corresponding Source for all the software in the product that is covered by this 233 | License, on a durable physical medium customarily used for software interchange, for 234 | a price no more than your reasonable cost of physically performing this conveying of 235 | source, or **(2)** access to copy the Corresponding Source from a network server at no 236 | charge. 237 | * **c)** Convey individual copies of the object code with a copy of the written offer to 238 | provide the Corresponding Source. This alternative is allowed only occasionally and 239 | noncommercially, and only if you received the object code with such an offer, in 240 | accord with subsection 6b. 241 | * **d)** Convey the object code by offering access from a designated place (gratis or for 242 | a charge), and offer equivalent access to the Corresponding Source in the same way 243 | through the same place at no further charge. You need not require recipients to copy 244 | the Corresponding Source along with the object code. If the place to copy the object 245 | code is a network server, the Corresponding Source may be on a different server 246 | (operated by you or a third party) that supports equivalent copying facilities, 247 | provided you maintain clear directions next to the object code saying where to find 248 | the Corresponding Source. Regardless of what server hosts the Corresponding Source, 249 | you remain obligated to ensure that it is available for as long as needed to satisfy 250 | these requirements. 251 | * **e)** Convey the object code using peer-to-peer transmission, provided you inform 252 | other peers where the object code and Corresponding Source of the work are being 253 | offered to the general public at no charge under subsection 6d. 254 | 255 | A separable portion of the object code, whose source code is excluded from the 256 | Corresponding Source as a System Library, need not be included in conveying the 257 | object code work. 258 | 259 | A “User Product” is either **(1)** a “consumer product”, which 260 | means any tangible personal property which is normally used for personal, family, or 261 | household purposes, or **(2)** anything designed or sold for incorporation into a 262 | dwelling. In determining whether a product is a consumer product, doubtful cases 263 | shall be resolved in favor of coverage. For a particular product received by a 264 | particular user, “normally used” refers to a typical or common use of 265 | that class of product, regardless of the status of the particular user or of the way 266 | in which the particular user actually uses, or expects or is expected to use, the 267 | product. A product is a consumer product regardless of whether the product has 268 | substantial commercial, industrial or non-consumer uses, unless such uses represent 269 | the only significant mode of use of the product. 270 | 271 | “Installation Information” for a User Product means any methods, 272 | procedures, authorization keys, or other information required to install and execute 273 | modified versions of a covered work in that User Product from a modified version of 274 | its Corresponding Source. The information must suffice to ensure that the continued 275 | functioning of the modified object code is in no case prevented or interfered with 276 | solely because modification has been made. 277 | 278 | If you convey an object code work under this section in, or with, or specifically for 279 | use in, a User Product, and the conveying occurs as part of a transaction in which 280 | the right of possession and use of the User Product is transferred to the recipient 281 | in perpetuity or for a fixed term (regardless of how the transaction is 282 | characterized), the Corresponding Source conveyed under this section must be 283 | accompanied by the Installation Information. But this requirement does not apply if 284 | neither you nor any third party retains the ability to install modified object code 285 | on the User Product (for example, the work has been installed in ROM). 286 | 287 | The requirement to provide Installation Information does not include a requirement to 288 | continue to provide support service, warranty, or updates for a work that has been 289 | modified or installed by the recipient, or for the User Product in which it has been 290 | modified or installed. Access to a network may be denied when the modification itself 291 | materially and adversely affects the operation of the network or violates the rules 292 | and protocols for communication across the network. 293 | 294 | Corresponding Source conveyed, and Installation Information provided, in accord with 295 | this section must be in a format that is publicly documented (and with an 296 | implementation available to the public in source code form), and must require no 297 | special password or key for unpacking, reading or copying. 298 | 299 | ### 7. Additional Terms 300 | 301 | “Additional permissions” are terms that supplement the terms of this 302 | License by making exceptions from one or more of its conditions. Additional 303 | permissions that are applicable to the entire Program shall be treated as though they 304 | were included in this License, to the extent that they are valid under applicable 305 | law. If additional permissions apply only to part of the Program, that part may be 306 | used separately under those permissions, but the entire Program remains governed by 307 | this License without regard to the additional permissions. 308 | 309 | When you convey a copy of a covered work, you may at your option remove any 310 | additional permissions from that copy, or from any part of it. (Additional 311 | permissions may be written to require their own removal in certain cases when you 312 | modify the work.) You may place additional permissions on material, added by you to a 313 | covered work, for which you have or can give appropriate copyright permission. 314 | 315 | Notwithstanding any other provision of this License, for material you add to a 316 | covered work, you may (if authorized by the copyright holders of that material) 317 | supplement the terms of this License with terms: 318 | 319 | * **a)** Disclaiming warranty or limiting liability differently from the terms of 320 | sections 15 and 16 of this License; or 321 | * **b)** Requiring preservation of specified reasonable legal notices or author 322 | attributions in that material or in the Appropriate Legal Notices displayed by works 323 | containing it; or 324 | * **c)** Prohibiting misrepresentation of the origin of that material, or requiring that 325 | modified versions of such material be marked in reasonable ways as different from the 326 | original version; or 327 | * **d)** Limiting the use for publicity purposes of names of licensors or authors of the 328 | material; or 329 | * **e)** Declining to grant rights under trademark law for use of some trade names, 330 | trademarks, or service marks; or 331 | * **f)** Requiring indemnification of licensors and authors of that material by anyone 332 | who conveys the material (or modified versions of it) with contractual assumptions of 333 | liability to the recipient, for any liability that these contractual assumptions 334 | directly impose on those licensors and authors. 335 | 336 | All other non-permissive additional terms are considered “further 337 | restrictions” within the meaning of section 10. If the Program as you received 338 | it, or any part of it, contains a notice stating that it is governed by this License 339 | along with a term that is a further restriction, you may remove that term. If a 340 | license document contains a further restriction but permits relicensing or conveying 341 | under this License, you may add to a covered work material governed by the terms of 342 | that license document, provided that the further restriction does not survive such 343 | relicensing or conveying. 344 | 345 | If you add terms to a covered work in accord with this section, you must place, in 346 | the relevant source files, a statement of the additional terms that apply to those 347 | files, or a notice indicating where to find the applicable terms. 348 | 349 | Additional terms, permissive or non-permissive, may be stated in the form of a 350 | separately written license, or stated as exceptions; the above requirements apply 351 | either way. 352 | 353 | ### 8. Termination 354 | 355 | You may not propagate or modify a covered work except as expressly provided under 356 | this License. Any attempt otherwise to propagate or modify it is void, and will 357 | automatically terminate your rights under this License (including any patent licenses 358 | granted under the third paragraph of section 11). 359 | 360 | However, if you cease all violation of this License, then your license from a 361 | particular copyright holder is reinstated **(a)** provisionally, unless and until the 362 | copyright holder explicitly and finally terminates your license, and **(b)** permanently, 363 | if the copyright holder fails to notify you of the violation by some reasonable means 364 | prior to 60 days after the cessation. 365 | 366 | Moreover, your license from a particular copyright holder is reinstated permanently 367 | if the copyright holder notifies you of the violation by some reasonable means, this 368 | is the first time you have received notice of violation of this License (for any 369 | work) from that copyright holder, and you cure the violation prior to 30 days after 370 | your receipt of the notice. 371 | 372 | Termination of your rights under this section does not terminate the licenses of 373 | parties who have received copies or rights from you under this License. If your 374 | rights have been terminated and not permanently reinstated, you do not qualify to 375 | receive new licenses for the same material under section 10. 376 | 377 | ### 9. Acceptance Not Required for Having Copies 378 | 379 | You are not required to accept this License in order to receive or run a copy of the 380 | Program. Ancillary propagation of a covered work occurring solely as a consequence of 381 | using peer-to-peer transmission to receive a copy likewise does not require 382 | acceptance. However, nothing other than this License grants you permission to 383 | propagate or modify any covered work. These actions infringe copyright if you do not 384 | accept this License. Therefore, by modifying or propagating a covered work, you 385 | indicate your acceptance of this License to do so. 386 | 387 | ### 10. Automatic Licensing of Downstream Recipients 388 | 389 | Each time you convey a covered work, the recipient automatically receives a license 390 | from the original licensors, to run, modify and propagate that work, subject to this 391 | License. You are not responsible for enforcing compliance by third parties with this 392 | License. 393 | 394 | An “entity transaction” is a transaction transferring control of an 395 | organization, or substantially all assets of one, or subdividing an organization, or 396 | merging organizations. If propagation of a covered work results from an entity 397 | transaction, each party to that transaction who receives a copy of the work also 398 | receives whatever licenses to the work the party's predecessor in interest had or 399 | could give under the previous paragraph, plus a right to possession of the 400 | Corresponding Source of the work from the predecessor in interest, if the predecessor 401 | has it or can get it with reasonable efforts. 402 | 403 | You may not impose any further restrictions on the exercise of the rights granted or 404 | affirmed under this License. For example, you may not impose a license fee, royalty, 405 | or other charge for exercise of rights granted under this License, and you may not 406 | initiate litigation (including a cross-claim or counterclaim in a lawsuit) alleging 407 | that any patent claim is infringed by making, using, selling, offering for sale, or 408 | importing the Program or any portion of it. 409 | 410 | ### 11. Patents 411 | 412 | A “contributor” is a copyright holder who authorizes use under this 413 | License of the Program or a work on which the Program is based. The work thus 414 | licensed is called the contributor's “contributor version”. 415 | 416 | A contributor's “essential patent claims” are all patent claims owned or 417 | controlled by the contributor, whether already acquired or hereafter acquired, that 418 | would be infringed by some manner, permitted by this License, of making, using, or 419 | selling its contributor version, but do not include claims that would be infringed 420 | only as a consequence of further modification of the contributor version. For 421 | purposes of this definition, “control” includes the right to grant patent 422 | sublicenses in a manner consistent with the requirements of this License. 423 | 424 | Each contributor grants you a non-exclusive, worldwide, royalty-free patent license 425 | under the contributor's essential patent claims, to make, use, sell, offer for sale, 426 | import and otherwise run, modify and propagate the contents of its contributor 427 | version. 428 | 429 | In the following three paragraphs, a “patent license” is any express 430 | agreement or commitment, however denominated, not to enforce a patent (such as an 431 | express permission to practice a patent or covenant not to sue for patent 432 | infringement). To “grant” such a patent license to a party means to make 433 | such an agreement or commitment not to enforce a patent against the party. 434 | 435 | If you convey a covered work, knowingly relying on a patent license, and the 436 | Corresponding Source of the work is not available for anyone to copy, free of charge 437 | and under the terms of this License, through a publicly available network server or 438 | other readily accessible means, then you must either **(1)** cause the Corresponding 439 | Source to be so available, or **(2)** arrange to deprive yourself of the benefit of the 440 | patent license for this particular work, or **(3)** arrange, in a manner consistent with 441 | the requirements of this License, to extend the patent license to downstream 442 | recipients. “Knowingly relying” means you have actual knowledge that, but 443 | for the patent license, your conveying the covered work in a country, or your 444 | recipient's use of the covered work in a country, would infringe one or more 445 | identifiable patents in that country that you have reason to believe are valid. 446 | 447 | If, pursuant to or in connection with a single transaction or arrangement, you 448 | convey, or propagate by procuring conveyance of, a covered work, and grant a patent 449 | license to some of the parties receiving the covered work authorizing them to use, 450 | propagate, modify or convey a specific copy of the covered work, then the patent 451 | license you grant is automatically extended to all recipients of the covered work and 452 | works based on it. 453 | 454 | A patent license is “discriminatory” if it does not include within the 455 | scope of its coverage, prohibits the exercise of, or is conditioned on the 456 | non-exercise of one or more of the rights that are specifically granted under this 457 | License. You may not convey a covered work if you are a party to an arrangement with 458 | a third party that is in the business of distributing software, under which you make 459 | payment to the third party based on the extent of your activity of conveying the 460 | work, and under which the third party grants, to any of the parties who would receive 461 | the covered work from you, a discriminatory patent license **(a)** in connection with 462 | copies of the covered work conveyed by you (or copies made from those copies), or **(b)** 463 | primarily for and in connection with specific products or compilations that contain 464 | the covered work, unless you entered into that arrangement, or that patent license 465 | was granted, prior to 28 March 2007. 466 | 467 | Nothing in this License shall be construed as excluding or limiting any implied 468 | license or other defenses to infringement that may otherwise be available to you 469 | under applicable patent law. 470 | 471 | ### 12. No Surrender of Others' Freedom 472 | 473 | If conditions are imposed on you (whether by court order, agreement or otherwise) 474 | that contradict the conditions of this License, they do not excuse you from the 475 | conditions of this License. If you cannot convey a covered work so as to satisfy 476 | simultaneously your obligations under this License and any other pertinent 477 | obligations, then as a consequence you may not convey it at all. For example, if you 478 | agree to terms that obligate you to collect a royalty for further conveying from 479 | those to whom you convey the Program, the only way you could satisfy both those terms 480 | and this License would be to refrain entirely from conveying the Program. 481 | 482 | ### 13. Use with the GNU Affero General Public License 483 | 484 | Notwithstanding any other provision of this License, you have permission to link or 485 | combine any covered work with a work licensed under version 3 of the GNU Affero 486 | General Public License into a single combined work, and to convey the resulting work. 487 | The terms of this License will continue to apply to the part which is the covered 488 | work, but the special requirements of the GNU Affero General Public License, section 489 | 13, concerning interaction through a network will apply to the combination as such. 490 | 491 | ### 14. Revised Versions of this License 492 | 493 | The Free Software Foundation may publish revised and/or new versions of the GNU 494 | General Public License from time to time. Such new versions will be similar in spirit 495 | to the present version, but may differ in detail to address new problems or concerns. 496 | 497 | Each version is given a distinguishing version number. If the Program specifies that 498 | a certain numbered version of the GNU General Public License “or any later 499 | version” applies to it, you have the option of following the terms and 500 | conditions either of that numbered version or of any later version published by the 501 | Free Software Foundation. If the Program does not specify a version number of the GNU 502 | General Public License, you may choose any version ever published by the Free 503 | Software Foundation. 504 | 505 | If the Program specifies that a proxy can decide which future versions of the GNU 506 | General Public License can be used, that proxy's public statement of acceptance of a 507 | version permanently authorizes you to choose that version for the Program. 508 | 509 | Later license versions may give you additional or different permissions. However, no 510 | additional obligations are imposed on any author or copyright holder as a result of 511 | your choosing to follow a later version. 512 | 513 | ### 15. Disclaimer of Warranty 514 | 515 | THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW. 516 | EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES 517 | PROVIDE THE PROGRAM “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER 518 | EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF 519 | MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS TO THE 520 | QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU. SHOULD THE PROGRAM PROVE 521 | DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION. 522 | 523 | ### 16. Limitation of Liability 524 | 525 | IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING WILL ANY 526 | COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MODIFIES AND/OR CONVEYS THE PROGRAM AS 527 | PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, 528 | INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE 529 | PROGRAM (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE 530 | OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE 531 | WITH ANY OTHER PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE 532 | POSSIBILITY OF SUCH DAMAGES. 533 | 534 | ### 17. Interpretation of Sections 15 and 16 535 | 536 | If the disclaimer of warranty and limitation of liability provided above cannot be 537 | given local legal effect according to their terms, reviewing courts shall apply local 538 | law that most closely approximates an absolute waiver of all civil liability in 539 | connection with the Program, unless a warranty or assumption of liability accompanies 540 | a copy of the Program in return for a fee. 541 | 542 | _END OF TERMS AND CONDITIONS_ 543 | 544 | ## How to Apply These Terms to Your New Programs 545 | 546 | If you develop a new program, and you want it to be of the greatest possible use to 547 | the public, the best way to achieve this is to make it free software which everyone 548 | can redistribute and change under these terms. 549 | 550 | To do so, attach the following notices to the program. It is safest to attach them 551 | to the start of each source file to most effectively state the exclusion of warranty; 552 | and each file should have at least the “copyright” line and a pointer to 553 | where the full notice is found. 554 | 555 | 556 | Copyright (C) 2020 instaloadeR 557 | 558 | This program is free software: you can redistribute it and/or modify 559 | it under the terms of the GNU General Public License as published by 560 | the Free Software Foundation, either version 3 of the License, or 561 | (at your option) any later version. 562 | 563 | This program is distributed in the hope that it will be useful, 564 | but WITHOUT ANY WARRANTY; without even the implied warranty of 565 | MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the 566 | GNU General Public License for more details. 567 | 568 | You should have received a copy of the GNU General Public License 569 | along with this program. If not, see . 570 | 571 | Also add information on how to contact you by electronic and paper mail. 572 | 573 | If the program does terminal interaction, make it output a short notice like this 574 | when it starts in an interactive mode: 575 | 576 | instaloadeR Copyright (C) 2020 instaloadeR 577 | This program comes with ABSOLUTELY NO WARRANTY; for details type 'show w'. 578 | This is free software, and you are welcome to redistribute it 579 | under certain conditions; type 'show c' for details. 580 | 581 | The hypothetical commands `show w` and `show c` should show the appropriate parts of 582 | the General Public License. Of course, your program's commands might be different; 583 | for a GUI interface, you would use an “about box”. 584 | 585 | You should also get your employer (if you work as a programmer) or school, if any, to 586 | sign a “copyright disclaimer” for the program, if necessary. For more 587 | information on this, and how to apply and follow the GNU GPL, see 588 | <>. 589 | 590 | The GNU General Public License does not permit incorporating your program into 591 | proprietary programs. If your program is a subroutine library, you may consider it 592 | more useful to permit linking proprietary applications with the library. If this is 593 | what you want to do, use the GNU Lesser General Public License instead of this 594 | License. But first, please read 595 | <>. 596 | --------------------------------------------------------------------------------