├── AUTHORS.md
├── CONTRIBUTING.md
├── LICENSE
├── README.md
├── README
    └── inputs.conf.spec
├── appserver
    └── static
    │   └── screenshot.png
├── bin
    └── mail.py
├── default
    ├── app.conf
    ├── authorize.conf
    ├── inputs.conf
    ├── props.conf
    └── transforms.conf
├── lib
    ├── file_parser
    │   ├── __init__.py
    │   ├── docx.py
    │   ├── email_mime.py
    │   ├── utils.py
    │   └── zip.py
    ├── mail_constants.py
    ├── mail_exceptions.py
    ├── mail_utils.py
    ├── six.py
    └── splunklib
    │   ├── __init__.py
    │   ├── binding.py
    │   ├── client.py
    │   ├── data.py
    │   ├── modularinput
    │       ├── __init__.py
    │       ├── argument.py
    │       ├── event.py
    │       ├── event_writer.py
    │       ├── input_definition.py
    │       ├── scheme.py
    │       ├── script.py
    │       ├── utils.py
    │       └── validation_definition.py
    │   ├── ordereddict.py
    │   ├── results.py
    │   ├── searchcommands
    │       ├── __init__.py
    │       ├── decorators.py
    │       ├── environment.py
    │       ├── eventing_command.py
    │       ├── external_search_command.py
    │       ├── generating_command.py
    │       ├── internals.py
    │       ├── reporting_command.py
    │       ├── search_command.py
    │       ├── streaming_command.py
    │       └── validators.py
    │   └── six.py
├── metadata
    └── default.meta
└── static
    ├── appIcon.png
    └── appIcon_2x.png


/AUTHORS.md:
--------------------------------------------------------------------------------
 1 | =======
 2 | Credits
 3 | =======
 4 | 
 5 | Development Lead
 6 | ----------------
 7 | 
 8 | * [Oluwaseun Remi-Omosowon](mailto:seunomosowon@gmail.com)
 9 | 
10 | Contributors
11 | ------------
12 | 
13 | * [François Lacombe](mailto:flacombe@adista.fr)
14 | * [Nathan Worsham](mailto:nworsham@gmail.com)
15 | * [Lowell Alleman](mailto:lowell@kintyre.co)
16 | 


--------------------------------------------------------------------------------
/CONTRIBUTING.md:
--------------------------------------------------------------------------------
 1 | ---
 2 | 
 3 | Contributing
 4 | 
 5 | ---
 6 | 
 7 | Contributions are welcome, and they are greatly appreciated! Every little bit helps, and credit will always be given.
 8 | 
 9 | You can contribute in many ways:
10 | 
11 | Types of Contributions
12 | 
13 | 1. Report Bugs
14 | 
15 | Report bugs at [TA-mailclient repo via on Github](https://github.com/seunomosowon/TA-mailclient/issues).
16 | 
17 | If you are reporting a bug, please include:
18 | 
19 | * Your operating system name and version.
20 | * Any details about your local setup that might be helpful in troubleshooting.
21 | * Detailed steps to reproduce the bug.
22 | 
23 | 2. Fix Bugs
24 | 
25 | Look through the GitHub issues for bugs. Anything tagged with "bug" is open to whoever wants to implement it.
26 | 
27 | 3. Implement Features
28 | 
29 | Look through the GitHub issues for features. Anything tagged with "feature" is open to whoever wants to implement it.
30 | 
31 | 4. Write Documentation
32 | 
33 | TA-mailclient could always use more documentation. Feel free to add documentation for an undocumented feature.
34 | 
35 | 5. Submit Feedback
36 | 
37 | Please rate the app on [Splunkbase](https:://splunkbase.splunk.com/app/3200/)
38 | You can also send feedback or submit an issue on [Github](https://github.com/seunomosowon/TA-mailclient/issues).
39 | 
40 | Feature requests can also be submitted in the same way.
41 | Remember that this is a volunteer-driven project, and that contributions are welcome :)
42 | 
43 | This has been tested with Gmail, gmx.com, and a few other mail servers. You can also send a list of public mail servers that you use this without issues.
44 | 
45 | Feature requests are yet to be added to Github include the following:
46 | * Oath support for imap
47 | * Additional mailbox folder support for IMAP
48 | * Parameterization of mailbox limits for each run (currenlty set to 25)
49 | 
50 | I'm also working on integrating with Travis CI to allow automatic tests and continuous integration.
51 | 
52 | #Guidelines:
53 | 
54 | Please fork the repo on [Github](https://github.com/seunomosowon/TA-mailclient/) and create a branch for local changes. Create a pull request to the development branch.
55 | 
56 | Thanks again for volunteering :smiley:
57 | 
58 | Also remember to add your name to the list of contributors in AUTHORs.md
59 | 
60 | 


--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
  1 | 
  2 |                                  Apache License
  3 |                            Version 2.0, January 2004
  4 |                         http://www.apache.org/licenses/
  5 | 
  6 |    TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
  7 | 
  8 |    1. Definitions.
  9 | 
 10 |       "License" shall mean the terms and conditions for use, reproduction,
 11 |       and distribution as defined by Sections 1 through 9 of this document.
 12 | 
 13 |       "Licensor" shall mean the copyright owner or entity authorized by
 14 |       the copyright owner that is granting the License.
 15 | 
 16 |       "Legal Entity" shall mean the union of the acting entity and all
 17 |       other entities that control, are controlled by, or are under common
 18 |       control with that entity. For the purposes of this definition,
 19 |       "control" means (i) the power, direct or indirect, to cause the
 20 |       direction or management of such entity, whether by contract or
 21 |       otherwise, or (ii) ownership of fifty percent (50%) or more of the
 22 |       outstanding shares, or (iii) beneficial ownership of such entity.
 23 | 
 24 |       "You" (or "Your") shall mean an individual or Legal Entity
 25 |       exercising permissions granted by this License.
 26 | 
 27 |       "Source" form shall mean the preferred form for making modifications,
 28 |       including but not limited to software source code, documentation
 29 |       source, and configuration files.
 30 | 
 31 |       "Object" form shall mean any form resulting from mechanical
 32 |       transformation or translation of a Source form, including but
 33 |       not limited to compiled object code, generated documentation,
 34 |       and conversions to other media types.
 35 | 
 36 |       "Work" shall mean the work of authorship, whether in Source or
 37 |       Object form, made available under the License, as indicated by a
 38 |       copyright notice that is included in or attached to the work
 39 |       (an example is provided in the Appendix below).
 40 | 
 41 |       "Derivative Works" shall mean any work, whether in Source or Object
 42 |       form, that is based on (or derived from) the Work and for which the
 43 |       editorial revisions, annotations, elaborations, or other modifications
 44 |       represent, as a whole, an original work of authorship. For the purposes
 45 |       of this License, Derivative Works shall not include works that remain
 46 |       separable from, or merely link (or bind by name) to the interfaces of,
 47 |       the Work and Derivative Works thereof.
 48 | 
 49 |       "Contribution" shall mean any work of authorship, including
 50 |       the original version of the Work and any modifications or additions
 51 |       to that Work or Derivative Works thereof, that is intentionally
 52 |       submitted to Licensor for inclusion in the Work by the copyright owner
 53 |       or by an individual or Legal Entity authorized to submit on behalf of
 54 |       the copyright owner. For the purposes of this definition, "submitted"
 55 |       means any form of electronic, verbal, or written communication sent
 56 |       to the Licensor or its representatives, including but not limited to
 57 |       communication on electronic mailing lists, source code control systems,
 58 |       and issue tracking systems that are managed by, or on behalf of, the
 59 |       Licensor for the purpose of discussing and improving the Work, but
 60 |       excluding communication that is conspicuously marked or otherwise
 61 |       designated in writing by the copyright owner as "Not a Contribution."
 62 | 
 63 |       "Contributor" shall mean Licensor and any individual or Legal Entity
 64 |       on behalf of whom a Contribution has been received by Licensor and
 65 |       subsequently incorporated within the Work.
 66 | 
 67 |    2. Grant of Copyright License. Subject to the terms and conditions of
 68 |       this License, each Contributor hereby grants to You a perpetual,
 69 |       worldwide, non-exclusive, no-charge, royalty-free, irrevocable
 70 |       copyright license to reproduce, prepare Derivative Works of,
 71 |       publicly display, publicly perform, sublicense, and distribute the
 72 |       Work and such Derivative Works in Source or Object form.
 73 | 
 74 |    3. Grant of Patent License. Subject to the terms and conditions of
 75 |       this License, each Contributor hereby grants to You a perpetual,
 76 |       worldwide, non-exclusive, no-charge, royalty-free, irrevocable
 77 |       (except as stated in this section) patent license to make, have made,
 78 |       use, offer to sell, sell, import, and otherwise transfer the Work,
 79 |       where such license applies only to those patent claims licensable
 80 |       by such Contributor that are necessarily infringed by their
 81 |       Contribution(s) alone or by combination of their Contribution(s)
 82 |       with the Work to which such Contribution(s) was submitted. If You
 83 |       institute patent litigation against any entity (including a
 84 |       cross-claim or counterclaim in a lawsuit) alleging that the Work
 85 |       or a Contribution incorporated within the Work constitutes direct
 86 |       or contributory patent infringement, then any patent licenses
 87 |       granted to You under this License for that Work shall terminate
 88 |       as of the date such litigation is filed.
 89 | 
 90 |    4. Redistribution. You may reproduce and distribute copies of the
 91 |       Work or Derivative Works thereof in any medium, with or without
 92 |       modifications, and in Source or Object form, provided that You
 93 |       meet the following conditions:
 94 | 
 95 |       (a) You must give any other recipients of the Work or
 96 |           Derivative Works a copy of this License; and
 97 | 
 98 |       (b) You must cause any modified files to carry prominent notices
 99 |           stating that You changed the files; and
100 | 
101 |       (c) You must retain, in the Source form of any Derivative Works
102 |           that You distribute, all copyright, patent, trademark, and
103 |           attribution notices from the Source form of the Work,
104 |           excluding those notices that do not pertain to any part of
105 |           the Derivative Works; and
106 | 
107 |       (d) If the Work includes a "NOTICE" text file as part of its
108 |           distribution, then any Derivative Works that You distribute must
109 |           include a readable copy of the attribution notices contained
110 |           within such NOTICE file, excluding those notices that do not
111 |           pertain to any part of the Derivative Works, in at least one
112 |           of the following places: within a NOTICE text file distributed
113 |           as part of the Derivative Works; within the Source form or
114 |           documentation, if provided along with the Derivative Works; or,
115 |           within a display generated by the Derivative Works, if and
116 |           wherever such third-party notices normally appear. The contents
117 |           of the NOTICE file are for informational purposes only and
118 |           do not modify the License. You may add Your own attribution
119 |           notices within Derivative Works that You distribute, alongside
120 |           or as an addendum to the NOTICE text from the Work, provided
121 |           that such additional attribution notices cannot be construed
122 |           as modifying the License.
123 | 
124 |       You may add Your own copyright statement to Your modifications and
125 |       may provide additional or different license terms and conditions
126 |       for use, reproduction, or distribution of Your modifications, or
127 |       for any such Derivative Works as a whole, provided Your use,
128 |       reproduction, and distribution of the Work otherwise complies with
129 |       the conditions stated in this License.
130 | 
131 |    5. Submission of Contributions. Unless You explicitly state otherwise,
132 |       any Contribution intentionally submitted for inclusion in the Work
133 |       by You to the Licensor shall be under the terms and conditions of
134 |       this License, without any additional terms or conditions.
135 |       Notwithstanding the above, nothing herein shall supersede or modify
136 |       the terms of any separate license agreement you may have executed
137 |       with Licensor regarding such Contributions.
138 | 
139 |    6. Trademarks. This License does not grant permission to use the trade
140 |       names, trademarks, service marks, or product names of the Licensor,
141 |       except as required for reasonable and customary use in describing the
142 |       origin of the Work and reproducing the content of the NOTICE file.
143 | 
144 |    7. Disclaimer of Warranty. Unless required by applicable law or
145 |       agreed to in writing, Licensor provides the Work (and each
146 |       Contributor provides its Contributions) on an "AS IS" BASIS,
147 |       WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
148 |       implied, including, without limitation, any warranties or conditions
149 |       of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
150 |       PARTICULAR PURPOSE. You are solely responsible for determining the
151 |       appropriateness of using or redistributing the Work and assume any
152 |       risks associated with Your exercise of permissions under this License.
153 | 
154 |    8. Limitation of Liability. In no event and under no legal theory,
155 |       whether in tort (including negligence), contract, or otherwise,
156 |       unless required by applicable law (such as deliberate and grossly
157 |       negligent acts) or agreed to in writing, shall any Contributor be
158 |       liable to You for damages, including any direct, indirect, special,
159 |       incidental, or consequential damages of any character arising as a
160 |       result of this License or out of the use or inability to use the
161 |       Work (including but not limited to damages for loss of goodwill,
162 |       work stoppage, computer failure or malfunction, or any and all
163 |       other commercial damages or losses), even if such Contributor
164 |       has been advised of the possibility of such damages.
165 | 
166 |    9. Accepting Warranty or Additional Liability. While redistributing
167 |       the Work or Derivative Works thereof, You may choose to offer,
168 |       and charge a fee for, acceptance of support, warranty, indemnity,
169 |       or other liability obligations and/or rights consistent with this
170 |       License. However, in accepting such obligations, You may act only
171 |       on Your own behalf and on Your sole responsibility, not on behalf
172 |       of any other Contributor, and only if You agree to indemnify,
173 |       defend, and hold each Contributor harmless for any liability
174 |       incurred by, or claims asserted against, such Contributor by reason
175 |       of your accepting any such warranty or additional liability.
176 | 
177 |    END OF TERMS AND CONDITIONS
178 | 
179 |    APPENDIX: How to apply the Apache License to your work.
180 | 
181 |       To apply the Apache License to your work, attach the following
182 |       boilerplate notice, with the fields enclosed by brackets "[]"
183 |       replaced with your own identifying information. (Don't include
184 |       the brackets!)  The text should be enclosed in the appropriate
185 |       comment syntax for the file format. We also recommend that a
186 |       file or class name and description of purpose be included on the
187 |       same "printed page" as the copyright notice for easier
188 |       identification within third-party archives.
189 | 
190 |    Copyright [yyyy] [name of copyright owner]
191 | 
192 |    Licensed under the Apache License, Version 2.0 (the "License");
193 |    you may not use this file except in compliance with the License.
194 |    You may obtain a copy of the License at
195 | 
196 |        http://www.apache.org/licenses/LICENSE-2.0
197 | 
198 |    Unless required by applicable law or agreed to in writing, software
199 |    distributed under the License is distributed on an "AS IS" BASIS,
200 |    WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
201 |    See the License for the specific language governing permissions and
202 |    limitations under the License.
203 | 


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
  1 | 
  2 | ## Table of Contents
  3 | 
  4 | ### OVERVIEW
  5 | 
  6 | - About the TA-mailclient
  7 | - Release notes
  8 |     - About this release
  9 |     - New features
 10 |     - To Do
 11 |     - Known issues
 12 |     - Third-party software attributions
 13 |     - Older Releases
 14 | - Support and resources
 15 | 
 16 | ### INSTALLATION AND CONFIGURATION
 17 | 
 18 | - Hardware and software requirements
 19 | - Splunk Enterprise system requirements
 20 | - Download
 21 | - Installation steps
 22 |     - Deploy to single server instance
 23 |     - Deploy to distributed deployment
 24 |     - Deploy to Splunk Cloud
 25 |     - Configure TA-mailclient
 26 |         - Parameters
 27 | - Upgrade
 28 | - Copyright & License
 29 | 
 30 | ### USER GUIDE
 31 | 
 32 | - Data types
 33 | - Troubleshooting
 34 | - Diagnostic & Debug Logs
 35 | 
 36 | 
 37 | ---
 38 | ### OVERVIEW
 39 | 
 40 | #### About the TA-mailclient
 41 | 
 42 | | Author | Oluwaseun Remi-Omosowon |
 43 | | --- | --- |
 44 | | App Version | 1.6.0 |
 45 | | Vendor Products | <ul><li>poplib</li><li>imaplib</li><li>SDK for Python 1.6.14</li></ul> |
 46 | 
 47 | The TA-mailclient add-on fetches emails for Splunk to index from mailboxes
 48 | using either POP3 or IMAP, with or without SSL.
 49 | 
 50 | The modular input also stores takes the password from inputs.conf in plain text,
 51 | and replaces it with a place holder, while storing it encrypted within Splunk.
 52 | This is built using the Splunk SDK for Python, should work on any Splunk
 53 | installation with Python available including SHC.
 54 | Passwords should also get replicated between search heard peer members.
 55 | 
 56 | This only fetches emails from the 'inbox' folder when using POP3. Additional mailbox folders can be indexed when using IMAP.
 57 | 
 58 | Be sure to set the interval to run this as frequently as required.
 59 | 
 60 | It supports all 'text/\*' content types and several well known scripts (.bat, .js, .sh) detailed below:
 61 | 
 62 | ```
 63 | 'application/xml'
 64 | 'application/xhtml'
 65 | 'application/x-sh'
 66 | 'application/x-csh',
 67 | 'application/javascript'
 68 | 'application/bat'
 69 | 'application/x-bat'
 70 | 'application/x-msdos-program'
 71 | 'application/textedit'
 72 | ```
 73 | Images, videos and executables are not indexed.
 74 | 
 75 | ##### Scripts and binaries
 76 | 
 77 | Includes:
 78 | - Splunk SDK for Python (1.6.14)
 79 | - Six python 2/3 compatibility (1.15.0)
 80 | - mail_lib - supports the calculation of vincenty distances which is used by default
 81 |     - constants.py - A number of constants / defaults used throughout the mail_lib module.
 82 |     - mail_common.py - Shared functions used to parse emails and attachments
 83 |     - exceptions raised by functions used in the mail_lib module.
 84 | 
 85 | #### Release notes
 86 | 
 87 | ##### About this release
 88 | 
 89 | Version 1.6.0 of the TA-mailclient is compatible with:
 90 | 
 91 | | Splunk Enterprise versions | 8.x, 7.x |
 92 | | --- | --- |
 93 | | CIM | Not Applicable |
 94 | | Platforms | Platform independent |
 95 | | Lookup file changes | No lookups included in this app |
 96 | 
 97 | This version removes support for unencrypted connections to mailboxes to allow the app pass Splunk Certification. 
 98 | The _is_secure_ is no longer required and should be removed from the config.
 99 | 
100 | The administrator is responsible for setting the sourcetype to whatever is desired,
101 | as well as extracting CIM fields for the sourcetype.
102 | This app already includes several extractions for different parts of the message that can be reused.
103 | 
104 | This app will not work on a universal forwarder,
105 | as it requires Python which comes with an HF or a full Splunk install.
106 | 
107 | **Note:** Travis CI includes tests for both secure versions of POP3 / IMAP. 
108 | 
109 | ##### New features
110 | 
111 | TA-mailclient includes the following new features:
112 | 
113 | - Added support for Python 3
114 | - Added six 1.15.0
115 | - Upgraded Splunk SDK to 1.6.14
116 | - Fix CI/CD tests to work for POP3 on v7.3, fix testing
117 | - Added Fix for working with Zips and docx with python2/python3
118 | - Added support for indexing emails from additional folders when using IMAP
119 | 
120 | ##### To Do
121 | 
122 | - Add attachment file hash to Splunk
123 | - Add support for doc / ppt / pptx
124 | 
125 | ##### Known issues
126 | 
127 | This is currently tested against 7.3, 8.0 and the latest version of Splunk Enterprise (v8.1 as at the time of this writing).
128 | Issues can be reported and tracked on Github at this time.
129 | 
130 | 
131 | ##### Third-party software attributions
132 | 
133 | This uses the inbuilt poplib and imaplib that comes with Python by default.
134 | 
135 | Contributions on github are welcome and will be incorporated into the main release.
136 | Current contributors are listed in AUTHORS.md.
137 | 
138 | 
139 | ##### Older Releases
140 | * v1.6.0
141 |     * Includes support for dropping attachments
142 |     * Migrated CICD to CircleCI
143 |     * Added appinspect testing to CI/CD pipeline
144 | * v1.5.5
145 |     * Updated Improved support for Python3
146 |     * Improved coding style to match new Splunk standards
147 |     * Fixed bugs related to indexing zip and docx as a result of Python 2-3 compatibility
148 | * v1.4.0
149 |     * Included support for Splunk v8.0
150 | * v1.3.5
151 |     * Fixed bug introduced  in v1.3.0
152 | * v1.3.0
153 |     * Made it more modular to supporting more file types in zips and in emails
154 |     * Added support for zips and files within zips
155 |     * Fixed unicode conversion of emails following contributions from Francois Lacombe on GitHub
156 |         - Also added static mail preamble for line break. Event breaking configuration may not be
157 |           required since the modular input writes individual events separately, but it's always a good idea.
158 |     *  Additional logging from pop3 / imap 
159 |     *  Removed interval from inputs.conf.spec
160 |     *  Upgraded Splunk SDK to 1.6.2
161 |     *  Added additional test cases on Travis CI to test that functionality work
162 |     *  modularized storage/password functions to make them reusable and simpler
163 |     *  Also fixed exception handling when dealing with storage/password
164 |     *  Fixed type casting for boolean parameters (is\_secure, include\_headers) and port validation
165 |     *  Rewrote sections of mail\_common
166 |     *  Merged functions from poputils / imaputils into main code and added additional logs from connection
167 | 
168 | * v0.5.1
169 |     * encoding corrections
170 |     * deduplicate Date and MessageId from indexed headers
171 |     * correction of MessageID extraction
172 |     * changed the separator to a predefined one instead of Date and MessageID
173 |     * activated and changed label for unsupported attachment
174 | 
175 | * v0.5.0
176 |     * Fixed UTF-8 encoding of mails before indexing. (Supporting Gmail and others)
177 | 
178 | * v0.4.9
179 |     * Changed encoding to support reading gmail.
180 | 
181 | * v0.4.8
182 |     * removed error introduced in v0.4.7
183 | 
184 | * v0.4.7
185 |     * Removed password field validation to allow users have complex or easy passwords however long
186 |     * Handled all mail exceptions
187 | 
188 | * v0.4.6
189 |     * Fixed bug.
190 |     * Fixed header inclusion
191 | 
192 | * v0.4.5
193 |     * Fixed bug. Removed line which caused v0.4.4 to fail
194 |     * Fixed header inclusion
195 | 
196 | * v0.4.4
197 |     * Updated app to ignore case of file attachment extension
198 | 
199 | * v0.4.3
200 |     * Made extensions case insensitive
201 |     * Added support for indexing _.docx_ extensions
202 |     * Generalised ```Mail.save_password()``` to allow reuse of code when writing other modular inputs.
203 |     * Optimized python import statements
204 |     * Fixed deleting of mails in poplib which was broken in 0.4
205 | 
206 | * v0.4.2
207 |     * Added support for indexing mail headers
208 | 
209 | * v0.4.1
210 |     * Fixed bug with 0.4.0
211 |     * Made updates to fix unneeded else statement which introduced bug in 0.4.0.
212 | 
213 | * v0.4
214 |     * Added support for decoding unicode characters in other languages or and removing the unicode identifier in the header.
215 |     * Improved support for indexing some file types even if the content-type is not set correctly. (as with Microsoft sending some files as binaries instead of text)
216 |     * Added fundamental code to support indexing of attachment as a configurable option in future release by the user.
217 |     * Added multiple field extractions for the email header and file attachments.
218 |     * Introduced a bug which was corrected in 0.4.1 **Faulty version**
219 | 
220 | **Note:** _filename_ and _filecontent_ are multi-valve fields.
221 | 
222 | * v0.3
223 |     * Adds support for mailbox cleanup options
224 | 
225 | * v0.2
226 |     * Adds support for base64 encoded emails.
227 | 
228 | 
229 | #### Support and resources
230 | 
231 | **Questions and answers**
232 | 
233 | Access questions and answers specific to the TA-mailclient at (https://answers.splunk.com/).
234 | 
235 | **Support**
236 | 
237 | This Splunk support add-on is community / developer supported.
238 | 
239 | Questions asked on Splunk answers will be answered either by the community of users or by the developer when available.
240 | All support questions should include the version of Splunk and OS.
241 | 
242 | You can also contact the developer directly via [Splunkbase](https://splunkbase.splunk.com/app/3200/).
243 | Feedback and feature requests can also be sent via Splunkbase.
244 | 
245 | Issues can also be submitted at the [TA-mailclient repo via on Github](https://github.com/seunomosowon/TA-mailclient/issues)
246 | 
247 | Future release will support
248 | 1. Support for configuration of mail limits in inputs.conf
249 | 2. Recursive option to read all folders inside Inbox, and not just emails within inbox.
250 | 3. Support indexing mails from additional folders in a mailbox
251 | 
252 | **Note** : This has not been tested against an exhaustive list of mail servers, so I'll welcome the feedback.
253 | 
254 | Also, feel free to send me a list of well known servers that you 're using this with without problems.
255 | 
256 | Rate the add-on on [Splunkbase](https://splunkbase.splunk.com/app/3200/) if you use it and are happy with it, 
257 | and share your feedback. Thanks!
258 | 
259 | 
260 | ## INSTALLATION AND CONFIGURATION
261 | ### Hardware and software requirements
262 | 
263 | #### Hardware requirements
264 | 
265 | TA-mailclient supports the following server platforms in the versions supported by Splunk Enterprise:
266 | 
267 | - Linux
268 | - Windows
269 | 
270 | The app was developed to be platform agnostic, but tests are mostly run on Linix.
271 | 
272 | Please contact the developer with issues running this on Windows. See the Splunk documentation for hardware
273 | requirements for running a heavy forwarder.
274 | 
275 | #### Software requirements
276 | 
277 | To function properly, TA-mailclient has no external requirements but needs to be installed on a full Splunk
278 | install which provides python and the required libraries (poplib and imaplib).
279 | 
280 | #### Splunk Enterprise system requirements
281 | 
282 | Because this add-on runs on Splunk Enterprise, all of the [Splunk Enterprise system requirements](http://docs.splunk.com/Documentation/Splunk/latest/Installation/Systemrequirements) apply.
283 | 
284 | #### Download
285 | 
286 | Download the TA-mailclient at one of the following locaitons:
287 | - [Splunkbase](https://splunkbase.splunk.com/app/3200/#/details)
288 | - [Github](https://github.com/seunomosowon/TA-mailclient)
289 | 
290 | #### Installation steps
291 | 
292 | ##### Deploy to single server instance
293 | 
294 | To install and configure this app on your supported standalone platform, do one of the following:
295 | 
296 | - Install on a standalone Splunk Enterprise install via the GUI. [See Link](https://docs.splunk.com/Documentation/AddOns/released/Overview/Singleserverinstall)
297 | - Extract the technology add-on to ```$SPLUNK_HOME/etc/apps/``` and restart Splunk
298 | 
299 | ##### Deploy to distributed deployment
300 | 
301 | **Install to search head** - (Standalone or Search head cluster)
302 | 
303 | - Deploy the props.conf and transforms.conf from TA-mailclient to the search head. 
304 | If using search head cluster, deploy the props.conf and transforms.conf via a search head deployer.
305 | 
306 | 
307 | **Install to indexers**
308 | 
309 | - No App needs to be installed on indexers
310 | 
311 | **Install to forwarders**
312 | 
313 | - Follow the steps to install the TA-mailclient on a heavy forwarder.
314 | More instructions available at the following [URL](https://docs.splunk.com/Documentation/AddOns/released/Overview/Distributedinstall#Heavy_forwarders)
315 | 
316 | - Configure an email input by going to the setup page or configuring inputs.conf.
317 | 
318 | ##### Deploy to Splunk Cloud
319 | 
320 | For Splunk cloud installations, install TA-mailclient on a heavy forwarder that has been configured to forward
321 | events to your Splunk Cloud instance. 
322 | The sourcetype is set by the administrator of the heavy forwarder when configuring the inputs.
323 | 
324 | You can work with Splunk Support on installing the Support add-on on Splunk Cloud for parsing the mails collected.
325 | 
326 | 
327 | #### Configure TA-mailclient
328 | 
329 | This app adds a mail:// modular input and supports a variety of parameters in inputs.conf.
330 | 
331 | ```
332 | [mail://email_address@domain.com]
333 | interval = 600
334 | mailserver = imap.domain.com
335 | password = mypassword
336 | protocol = IMAP|POP3
337 | disabled = 0
338 | mailbox_cleanup = delete
339 | additional_folder = test,rfc,spam
340 | 
341 | ```
342 | 
343 | Once the input is read, the password gets replaced and shows as 'encrypted'.
344 | As such, the password for the mailbox must not be set to 'encrypted'.
345 | 
346 | The input can be edited if the password needs to be updated, and the password stored in a password
347 | storage endpoint would get updated automatically. Passwords are never stored in clear text.
348 | 
349 | A different sourcetype can be specified for each input, thus making it possible to have different sourcetypes
350 | for every mailbox. Mailbox cleanup is also managed automatically, and emails are deleted once it has been
351 | indexed.
352 | 
353 | ##### Parameters
354 | 
355 | **mailserver** - This is a mandatory field and should be the hostname or
356 | IP address for the mail server or client access server with support for retrieving emails via POP3 or IMAP
357 | 
358 | **protocol** - This must be set to either POP3 or IMAP
359 | 
360 | **password** - Passwords must be set for every account,
361 | or the input will get disabled.
362 | 
363 | **mailbox_cleanup** = This indicates if every email should be deleted as it is read,
364 |   or delayed until the next interval.
365 |   Setting this to ```readonly``` prevents mails from being deleted.
366 | 
367 |   The default is ```readonly```. Supported options are:
368 | ```delayed|delete|readonly```
369 | 
370 | **interval** - This should be configured to run as frequent as required
371 | to retreive emails. This modular input retrieves up to 20 emails at each run.
372 | A future release to this input might allow the limit to be configured as a parameter to the modular input.
373 | 
374 | This modular input supports multiple instances, and each input runs at separate intervals.
375 | 
376 | **include_headers** -  This determines if email headers should be included.
377 | 
378 | **additional_folders** - This is an optional parameter containing a comma-separated list of additional folders to be indexed if IMAP is configured for the mailbox.
379 | 
380 | **drop_attachment** -  This is an optional parameter to determine if email attachment should be discarded.
381 | 
382 | ### Copyright & License
383 | 
384 | A copy of the Creative Commons Legal code has been added to the add-on detailing its license.
385 | 
386 | 
387 | ## USER GUIDE
388 | 
389 | ### Data types
390 | 
391 | Data is indexed using a sourcetype specified by the administrator when configuring the inputs.
392 | If nothing is specified, events will get indexed with a sourcetype of `mail`. 
393 | 
394 | ### Troubleshooting
395 | 
396 | Once an email is indexed, it will not be re-indexed except the checkpoint directory is emptied.
397 | This can be achieved by running the following command:
398 | ```
399 | splunk clean inputdata mail
400 | ```
401 | 
402 | #### Diagnostic & Debug Logs
403 | 
404 | Logs can be found by searching Splunk internal logs
405 | 
406 | ```index=_internal sourcetype=splunkd (component=ModularInputs OR component=ExecProcessor) mail.py```
407 | 
408 | 
409 | Additional logging can be enabled by turning on debug logging for ExecProcessor and ModInputs.
410 | set the logging level of the ExecProcessor to Debug
411 | 
412 | /opt/splunk/bin/splunk set log-level ExecProcessor -level DEBUG
413 | /opt/splunk/bin/splunk set log-level ModInputs -level DEBUG
414 | 
415 | You can find additional ways to enable debug logging on
416 | [here](http://docs.splunk.com/Documentation/Splunk/latest/Troubleshooting/Enabledebuglogging).
417 | 


--------------------------------------------------------------------------------
/README/inputs.conf.spec:
--------------------------------------------------------------------------------
 1 | [mail://<name>]
 2 | * The name of the stanza should be an email address which would be used to connect to the server.
 3 | 
 4 | protocol = [POP3|IMAP]
 5 | * The protocol to be used to fetch emails from the server
 6 | 
 7 | mailserver = <value>
 8 | * This is the mailserver to fetch mails from
 9 | 
10 | password = <value>
11 | * The password for the account provided in the stanza name
12 | 
13 | mailbox_cleanup = [delete,delayed,readonly]
14 | * This determines if the mails should be one of the following:
15 |  * delete: deleted as they are indexed
16 |  * delayed: deleted on next connection to the mailbox after verifying that the mail was indexed
17 |  * readonly: mails will not be deleted. It will be read and left in the mailbox.
18 | * If this is not set, the default option used will be readonly
19 | 
20 | include_headers =  <bool>
21 | * This determines if email headers should be included.
22 | 
23 | maintain_rfc =  <bool>
24 | * This determines if email will still maintain RFC compatability for parsing tools
25 | 
26 | attach_message_primary =  <bool>
27 | * This determines if an attached message will instead be the indexed email (assuming the outer message was just the delivery mechanism)
28 | 
29 | additional_folders = <value>
30 | * This suggests additional folders to read messages via IMAP
31 | 
32 | drop_attachment = <bool>
33 | * This determines if an email attachment will be indexed


--------------------------------------------------------------------------------
/appserver/static/screenshot.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/seunomosowon/TA-mailclient/b4745263d53f03e06edf098a665a5597d40fe449/appserver/static/screenshot.png


--------------------------------------------------------------------------------
/default/app.conf:
--------------------------------------------------------------------------------
 1 | [install]
 2 | is_configured = 0
 3 | 
 4 | [ui]
 5 | is_visible = 0
 6 | label = Technology Add-on for Mail retrieval
 7 | 
 8 | [launcher]
 9 | author = seunomosowon
10 | description = Get mails from a mail server via POP3 or IMAP
11 | version = 1.6.0
12 | 
13 | [package]
14 | id = TA-mailclient
15 | check_for_updates = true
16 | 


--------------------------------------------------------------------------------
/default/authorize.conf:
--------------------------------------------------------------------------------
1 | [capability::edit_modinput_mail]
2 | # Capability required to add mail inputs and edit settings.
3 | 
4 | [role_admin]
5 | edit_modinput_mail = enabled
6 | 


--------------------------------------------------------------------------------
/default/inputs.conf:
--------------------------------------------------------------------------------
1 | [mail]
2 | python.version = python3
3 | 


--------------------------------------------------------------------------------
/default/props.conf:
--------------------------------------------------------------------------------
 1 | [source::mail:\/\/...]
 2 | KV_MODE = auto
 3 | SHOULD_LINEMERGE=false
 4 | MAX_EVENTS=5000
 5 | LINE_BREAKER=(VGhpcyBpcyBhIG1haWwgc2VwYXJhdG9yIGluIGJhc2U2NCBmb3Igb3VyIFNwbHVuayBpbmRleGluZwo=[\r\n]+)
 6 | TIME_PREFIX= \nDate: 
 7 | MAX_TIMESTAMP_LOOKAHEAD = 32
 8 | TIME_FORMAT= %a, %d %b %Y %H:%M:%S %z
 9 | TRUNCATE=200000
10 | REPORT-file_attachments = file_attachment
11 | REPORT-multi_part = multi_part
12 | REPORT-attachment_filename = attachment_filename:kvextraction
13 | REPORT-attachment_md5 = attachment_md5:kvextraction
14 | REPORT-attachment_sha256 = attachment_sha256:kvextraction
15 | EXTRACT-Message_ID = (?i)^Message-ID:\h+<?(?<message_id>[^\r\n>]+?)>?$
16 | EXTRACT-From = ^From:\h+(?<from>(?:"?(?<from_name>[^<\r\n]+)"?\h+)?<?(?<from_email>[^\r\n]+?)>?)$
17 | EXTRACT-Subject = ^Subject:\h+(?<subject>[^\r\n]+)$
18 | EXTRACT-TO = ^To:\h+(?<to>(?:"?(?<to_name>[^<\r\n]+)"?\h+)?<?(?<to_email>[^\r\n]+?)<?)$
19 | FIELDALIAS-dest = host AS dest
20 | FIELDALIAS-mid = MessageID AS message_id
21 | FIELDALIAS-src_user = from AS src_user
22 | FIELDALIAS-sender = from_email AS sender
23 | FIELDALIAS-recipient = to AS recipient
24 | FIELDALIAS-file_hash = sha256 AS file_hash
25 | ANNOTATE_PUNCT = false
26 | 
27 | 


--------------------------------------------------------------------------------
/default/transforms.conf:
--------------------------------------------------------------------------------
 1 | [file_attachment]
 2 | REGEX=(?ms)#BEGIN_ATTACHMENT:\s(?<file_name>[^\r\n]+)[\r\n]+(?<file_content>.*)#END_ATTACHMENT:\s*\g{file_name}
 3 | MV_ADD=true
 4 | 
 5 | [multi_part]
 6 | REGEX=(?ms)[\r\n]#START_OF_MULTIPART_(\d+)[\r\n](?<multipart>.*)[\r\n]#END_OF_MULTIPART_\1[\r\n]*
 7 | MV_ADD=true
 8 | 
 9 | [attachment_md5:kvextraction]
10 | FORMAT = md5::$1
11 | REGEX = md5\s=\s(\w+)
12 | MV_ADD = true
13 | 
14 | [attachment_sha256:kvextraction]
15 | FORMAT = sha256::$1
16 | REGEX = sha256\s=\s(\w+)
17 | MV_ADD = true
18 | 
19 | [attachment_filename:kvextraction]
20 | FORMAT = file_name::$1
21 | REGEX = file_name\s=\s((?!None\s)[^\.]+(?:\.\w+)?)\s
22 | MV_ADD = true
23 | 


--------------------------------------------------------------------------------
/lib/file_parser/__init__.py:
--------------------------------------------------------------------------------
1 | from .utils import *
2 | 
3 | __version_info__ = (1, 3, 0)
4 | __version__ = ".".join(map(str, __version_info__))
5 | __all__ = ['ZIP_EXTENSIONS', 'TEXT_FILE_EXTENSIONS', 'SUPPORTED_CONTENT_TYPES',
6 |            'email_mime', 'docx', 'zip']
7 | 


--------------------------------------------------------------------------------
/lib/file_parser/docx.py:
--------------------------------------------------------------------------------
 1 | """ Parse .docx files """
 2 | from __future__ import unicode_literals
 3 | 
 4 | from .utils import *
 5 | from xml.dom.minidom import parse as parsexml
 6 | from six import text_type, binary_type, BytesIO
 7 | from six import ensure_binary, ensure_str
 8 | import zipfile
 9 | 
10 | 
11 | def parse_docx(part, part_name):
12 |     """
13 |     This reads a docx file form a string and outputs just the text from the document
14 |     along with the document's internal structure
15 |     :param part: This is a MIME part from an email that contains a docx file
16 |     :type part: Union[email.message.Message, basestring]
17 |     :param part_name: This can be either a file name or string $EMAIL$
18 |     :type part_name basestring
19 |     :return: This returns the texts from the word document.
20 |     :rtype: list
21 |     """
22 |     if part_name == EMAIL_PART:
23 |         decoded_payload = part.get_payload(decode=True)
24 |         zip_name = part.get_filename() or ''
25 |     else:
26 |         decoded_payload = part
27 |         zip_name = part_name
28 |     fp = BytesIO(decoded_payload)
29 |     try:
30 |         zfp = zipfile.ZipFile(fp)
31 |     except zipfile.BadZipfile:
32 |         return ['#UNSUPPORTED_ATTACHMENT: %s' % zip_name]
33 |     return_doc = []
34 |     if zfp:
35 |         return_doc.append(parsexml(zfp.open('[Content_Types].xml', 'r')).documentElement.toprettyxml())
36 |         """
37 |         I can check for Macros here
38 |         if zfp.getinfo('word/vbaData.xml'):
39 |         openXML standard supports any name for xml file. Need to check all files.
40 |         Add the contents pages to the top of word file for visual inspection of macros
41 |         """
42 |         if zfp.getinfo('word/document.xml'):
43 |             doc_xml = parsexml(zfp.open('word/document.xml', 'r'))
44 |             return_doc.append(''.join([ensure_str(node.firstChild.nodeValue) for node in doc_xml.getElementsByTagName('w:t')]))
45 |         else:
46 |             return_doc.append('#UNSUPPORTED_DOCX_FILE: file_name = %s' % zip_name)
47 |     else:
48 |         return_doc.append('#INVALID_DOCX_FILE: file_name = %s' % zip_name)
49 |     return return_doc
50 | 
51 | 
52 | def parse_docx_from_mail(message):
53 |     """
54 | 
55 |     :param message: string representation of docx file
56 |     :type message: email.message.Message
57 |     :return:
58 |     """
59 |     parse_docx(message, EMAIL_PART)
60 | 
61 | 
62 | def parse_docx_from_string(docx_as_string, file_name):
63 |     """
64 | 
65 |     :param docx_as_string: string representation of docx file
66 |     :type docx_as_string: basestring
67 |     :param file_name: docx file name
68 |     :type file_name: basestring
69 |     :return:
70 |     """
71 |     parse_docx(docx_as_string, file_name)
72 | 


--------------------------------------------------------------------------------
/lib/file_parser/email_mime.py:
--------------------------------------------------------------------------------
  1 | """ Parse emails files """
  2 | from __future__ import unicode_literals
  3 | from six import text_type, binary_type
  4 | 
  5 | import email
  6 | import re
  7 | import os
  8 | from . import zip
  9 | import hashlib
 10 | import quopri
 11 | # noinspection PyUnresolvedReferences
 12 | from base64 import b64decode
 13 | try:
 14 |     from email.parser import Parser
 15 | except ImportError:
 16 |     # Python 2
 17 |     from email.Parser import Parser
 18 | 
 19 | from email.utils import mktime_tz, parsedate_tz
 20 | from .utils import *
 21 | 
 22 | 
 23 | def parse_email(email_as_string, include_headers, maintain_rfc, attach_message_primary):
 24 |     """
 25 |     This function parses an email and returns an array with different parts of the message.
 26 |     :param email_as_string: This represents the email in a bytearray to be processed
 27 |     :type email_as_string: basestring
 28 |     :param include_headers: This parameter specifies if all headers should be included.
 29 |     :type include_headers: bool
 30 |     :param maintain_rfc: This parameter specifies if RFC format for email stays intact
 31 |     :type maintain_rfc: bool
 32 |     :param attach_message_primary: This parameter specifies if first attached email should
 33 |       be used as the message for indexing instead of the carrier email
 34 |     :type attach_message_primary: bool
 35 |     :return: Returns a list with the [date, Message-id, mail_message]
 36 |       :rtype: list
 37 |     """
 38 |     message = email.message_from_string(email_as_string.strip()) or None
 39 |     if message is None:
 40 |         return [None, None, None]
 41 |     if attach_message_primary:
 42 |         message = change_primary_message(message)   
 43 |     if maintain_rfc:
 44 |         index_mail = maintain_rfc_parse(message)
 45 |     else:
 46 |         mailheaders = Parser().parsestr(message.as_string(), True)
 47 |         headers = ["%s: %s" % (k, getheader(v)) for k, v in mailheaders.items() if k in MAIN_HEADERS]
 48 |         if include_headers:
 49 |             other_headers = ["%s: %s" % (k, getheader(v)) for k, v in mailheaders.items() if k not in MAIN_HEADERS]
 50 |             headers.extend(other_headers)
 51 |         body = []
 52 |         if message.is_multipart():
 53 |             part_number = 1
 54 |             for part in message.walk():
 55 |                 content_type = part.get_content_type()
 56 |                 content_disposition = part.get('Content-Disposition')
 57 |                 if content_type in ['multipart/alternative', 'multipart/mixed']:
 58 |                     # The multipart/alternative part is usually empty.
 59 |                     body.append("Multipart envelope header: %s" % str(part.get_payload(decode=True)))
 60 |                     continue
 61 |                 body.append("#START_OF_MULTIPART_%d" % part_number)
 62 |                 extension = str(os.path.splitext(part.get_filename() or '')[1]).lower()
 63 |                 if extension in TEXT_FILE_EXTENSIONS or content_type in SUPPORTED_CONTENT_TYPES or \
 64 |                    part.get_content_maintype() == 'text' or extension in ZIP_EXTENSIONS:
 65 |                     if part.get_filename():
 66 |                         body.append("#BEGIN_ATTACHMENT: %s" % str(part.get_filename()))
 67 |                         if extension in ZIP_EXTENSIONS:
 68 |                             body.append("\n".join(zip.parse_zip(part, EMAIL_PART)))
 69 |                         else:
 70 |                             body.append(recode_mail(part))
 71 |                         body.append("#END_ATTACHMENT: %s" % str(part.get_filename()))
 72 |                     else:
 73 |                         body.append(recode_mail(part))
 74 |                 else:
 75 |                     body.append("#UNSUPPORTED_ATTACHMENT: file_name = %s - type = %s ; disposition=%s" % (
 76 |                         part.get_filename(), content_type, content_disposition))
 77 |                 body.append("#END_OF_MULTIPART_%d" % part_number)
 78 |                 part_number += 1
 79 |         else:
 80 |             body.append(recode_mail(message))
 81 |         """mail_for_index = [MESSAGE_PREAMBLE]"""
 82 |         mail_for_index = []
 83 |         mail_for_index.extend(headers + body)
 84 |         index_mail = '\n'.join(s.decode('utf-8', 'ignore') if isinstance(s,  binary_type) else s for s in mail_for_index)
 85 |     message_time = float(mktime_tz(parsedate_tz(message['Date'])))
 86 |     return [message_time, message['Message-ID'], index_mail]
 87 | 
 88 | def change_primary_message(message):
 89 |     """
 90 |     This function will look for an attached email and return it. This is inteded to use 
 91 |     the attached email as the email to be indexed instead of the carrier email.
 92 |     It checks if the message is already in message format or in a binary format and also
 93 |     only the first attached email will become the primary if there are more than one.
 94 |     :param message: This represents the email to be checked for attached email.
 95 |     :type message: email message object
 96 |     :return: Returns a email message object
 97 |       :rtype: email message object
 98 |     """
 99 |     for i in message.walk():
100 |         if i.get_content_maintype()=='message':
101 |             return i.get_payload()[0]
102 |         elif i.get_content_subtype()=='octet-stream' and i.get_filename().lower().endswith('.eml'):
103 |             if i['Content-Transfer-Encoding'].lower()=='base64': 
104 |                 return email.message_from_string(b64decode(i.get_payload()))
105 |             else:
106 |                 return email.message_from_string(i.get_payload())
107 | 
108 | def maintain_rfc_parse(message):
109 |     """
110 |     This function parses an email and returns an array with different parts of the message
111 |     but leaves the email still RFC compliant so that it works with Mail-Parser Plus app.
112 |     Attachment headers are left in tact.
113 |     :param message: This represents the email to be checked for attached email.
114 |     :type message: email message object
115 |     :return: Returns a email message formatted as a string
116 |       :rtype: str
117 |     """
118 |     if not message.is_multipart():
119 |         reformatted_message = quopri.decodestring(
120 |                                 message.as_string().encode('ascii', 'ignore')
121 |                             ).decode("utf-8", 'ignore')
122 |         return reformatted_message
123 |     boundary = message.get_boundary()
124 |     new_payload = '--' + boundary
125 |     for i in message.get_payload():
126 |         content_type = i.get_content_type()
127 |         extension = str(os.path.splitext(i.get_filename() or '')[1]).lower()
128 |         if extension in TEXT_FILE_EXTENSIONS or content_type in SUPPORTED_CONTENT_TYPES or \
129 |            i.get_content_maintype() == 'text':
130 |             text_content = i.as_string().encode('ascii', 'ignore')
131 |             text_content = quopri.decodestring(text_content).decode("utf-8", 'ignore')
132 |             new_payload += '\n' + text_content
133 |         else:
134 |             replace = re.sub(r'(?:\n\n)[\s\S]+',r'\n\n#UNSUPPORTED_ATTACHMENT:',i.as_string())
135 |             filename = i.get_filename()
136 |             charset = i.get_content_charset()
137 |             try:
138 |                 md5 = hashlib.md5(i.get_payload(None,True)).hexdigest()
139 |                 sha256 = hashlib.sha256(i.get_payload(None,True)).hexdigest()
140 |             except:
141 |                 md5 = ''
142 |                 sha256 = ''
143 |             replace_string = """
144 | file_name = %(filename)s
145 | type = %(content_type)s
146 | charset = %(charset)s
147 | md5 = %(md5)s
148 | sha256 = %(sha256)s
149 | """
150 |             metadata = replace_string % dict(
151 |                 content_type=content_type, 
152 |                 filename=filename, 
153 |                 charset=charset,
154 |                 md5=md5,
155 |                 sha256=sha256,
156 |             )
157 |             new_payload += '\n' \
158 |                 + replace \
159 |                 + metadata
160 |         new_payload += '\n--' + boundary
161 |     new_payload += '--'
162 |     message.set_payload(new_payload)
163 |     return message.as_string()
164 | 


--------------------------------------------------------------------------------
/lib/file_parser/utils.py:
--------------------------------------------------------------------------------
 1 | """
 2 | This includes common functions that are required when dealing with mails
 3 | """
 4 | from __future__ import unicode_literals
 5 | 
 6 | from email.header import decode_header
 7 | from six import text_type, binary_type
 8 | 
 9 | MAIN_HEADERS = ('Date', 'Message-Id', 'Message-ID', 'From', 'To', 'Subject')
10 | ZIP_EXTENSIONS = {'.zip', '.docx'}
11 | EMAIL_PART = '$EMAIL$'
12 | SUPPORTED_CONTENT_TYPES = {'application/xml', 'application/xhtml', 'application/x-sh', 'application/x-csh',
13 |                            'application/javascript', 'application/bat', 'application/x-bat',
14 |                            'application/x-msdos-program', 'application/textedit',
15 |                            'application/vnd.openxmlformats-officedocument.wordprocessingml.document'}
16 | TEXT_FILE_EXTENSIONS = {'.csv', '.txt', '.md', '.py', '.bat', '.sh', '.rb', '.js', '.asm', '.log'}
17 | """
18 | It already indexes all text/* including:
19 |     'text/plain', 'text/html', 'text/x-asm', 'text/x-c','text/x-python-script','text/x-python'
20 | No need to add this to the supported types list
21 | """
22 | 
23 | 
24 | def getheader(header_text, default="ascii"):
25 |     """ This decodes sections of the email header which could be represented in utf8 or other iso languages"""
26 |     headers = decode_header(header_text)
27 |     header_sections = [text if isinstance(text, text_type) else text_type(text, charset or default, "ignore") for text, charset in headers]
28 |     return "".join(header_sections)
29 | 
30 | 
31 | def recode_mail(part):
32 |     cset = part.get_content_charset()
33 |     if cset == "None":
34 |         cset = "ascii"
35 |     try:
36 |         if not part.get_payload(decode=True):
37 |             result = ""
38 |         else:
39 |             result = text_type(part.get_payload(decode=True), cset, "ignore").encode('utf8', 'xmlcharrefreplace').strip()
40 |     except TypeError:
41 |         result = part.get_payload(decode=True)
42 |         if isinstance(result, text_type):
43 |             result = result.encode('utf8', 'xmlcharrefreplace').strip()
44 |     return result
45 | 


--------------------------------------------------------------------------------
/lib/file_parser/zip.py:
--------------------------------------------------------------------------------
 1 | """Parse zip files"""
 2 | from __future__ import unicode_literals
 3 | from six import text_type, binary_type, BytesIO
 4 | from six import ensure_binary, ensure_str
 5 | from .utils import *
 6 | from . import docx
 7 | import os
 8 | import zipfile
 9 | 
10 | 
11 | def parse_zip(part, part_name):
12 |     """
13 |     This reads a docx file form a string and outputs just the text from the document
14 |     along with the document's internal structure
15 |     :param part: This is a MIME message part from an email that contains a docx file
16 |     :type part: Union[email.message.Message, basestring]
17 |     :param part_name: This can be either file or email
18 |     :type part_name basestring
19 |     :return: This returns the texts from the word document.
20 |     :rtype: list
21 |     """
22 |     if EMAIL_PART == part_name:
23 |         decoded_payload = part.get_payload(decode=True)
24 |         zip_name = part.get_filename() or ''
25 |     else:
26 |         decoded_payload = part
27 |         zip_name = part_name
28 |     fp = BytesIO(decoded_payload)
29 |     try:
30 |         zfp = zipfile.ZipFile(fp)
31 |     except zipfile.BadZipfile:
32 |         return ['#UNSUPPORTED_ATTACHMENT: %s' % zip_name]
33 |     extension = os.path.splitext(zip_name)[1].lower()
34 |     unzip_content = []
35 |     if zfp:
36 |         ziplist = ['#BEGIN_ZIP_FILELIST: %s' % zip_name]
37 |         ziplist.extend(zfp.namelist())
38 |         ziplist.append('#END_ZIP_FILELIST: %s' % zip_name)
39 |         unzip_content.append("\n".join(ziplist))
40 |         if '.docx' == extension:
41 |             unzip_content.extend(docx.parse_docx(part, part_name))
42 |         else:
43 |             for each_compressedfile in zfp.namelist():
44 |                 zipped_file = []
45 |                 if not each_compressedfile.endswith('/'):
46 |                     zipped_fextension = text_type(os.path.splitext(each_compressedfile)[1]).lower()
47 |                     zipped_file = ["#BEGIN_ATTACHMENT: %s/%s" % (zip_name, each_compressedfile)]
48 |                     if zipped_fextension in TEXT_FILE_EXTENSIONS:
49 |                         f = zfp.open(each_compressedfile)
50 |                         for line in f:
51 |                             zipped_file.append(ensure_str(line).rstrip('\n'))
52 |                     elif zipped_fextension in ZIP_EXTENSIONS:
53 |                         file_buff = zfp.open(each_compressedfile).read()
54 |                         zipped_file.extend(parse_zip(file_buff, each_compressedfile))
55 |                     else:
56 |                         zipped_file.append("#UNSUPPORTED_CONTENT: file_name = %s" % each_compressedfile)
57 |                     zipped_file.append("#END_ATTACHMENT: %s/%s" % (zip_name, each_compressedfile))
58 |                 unzip_content.append("\n".join(zipped_file))
59 |     return unzip_content
60 | 
61 | 
62 | def parse_zip_from_mail(message):
63 |     """
64 | 
65 |     :param message: string representation of docx file
66 |     :type message: email.message.Message
67 |     :return:
68 |     """
69 |     parse_zip(message, EMAIL_PART)
70 | 
71 | 
72 | def parse_zip_from_string(file_as_string, file_name):
73 |     """
74 | 
75 |         :param file_as_string: string representation of docx file
76 |         :type file_as_string: basestring
77 |         :param file_name: docx file name
78 |         :type file_name: basestring
79 |         :return:
80 |         """
81 |     parse_zip(file_as_string, file_name)
82 | 


--------------------------------------------------------------------------------
/lib/mail_constants.py:
--------------------------------------------------------------------------------
 1 | # DEFAULTS
 2 | from __future__ import unicode_literals
 3 | 
 4 | IMAP_READONLY_FLAG = True
 5 | INDEX_ATTACHMENT_DEFAULT = True
 6 | DEFAULT_INCLUDE_HEADERS = True
 7 | DEFAULT_INCLUDE_INBOX = True
 8 | DEFAULT_MAINTAIN_RFC = False
 9 | DEFAULT_ATTACH_MESSAGE_PRIMARY = False
10 | DEFAULT_MAILBOX_CLEANUP = 'readonly'
11 | DEFAULT_DROP_ATTACHMENT = False
12 | MAX_FETCH_COUNT = 25
13 | REALM = 'mail'
14 | PASSWORD_PLACEHOLDER = 'encrypted'
15 | REGEX_EMAIL = r'^[_a-z0-9-]+(\.[_a-z0-9-]+)*@[a-z0-9-]+(\.[a-z0-9-]+)*(\.[a-z]{2,4})$'
16 | REGEX_PASSWORD = r'^([\w!@#$%-]+)$'
17 | REGEX_HOSTNAME = r'^((25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?)(\.(25[0-5]|2[0-4][0-9]|' \
18 |                  r'[01]?[0-9][0-9]?)){3})$|^((([a-zA-Z0-9]|[a-zA-Z0-9][a-zA-Z0-9\-]*[a-zA-Z0-9])' \
19 |                  r'\.)*([A-Za-z0-9]|[A-Za-z0-9][A-Za-z0-9\-]*[A-Za-z0-9]))$'
20 | MESSAGE_PREAMBLE = "VGhpcyBpcyBhIG1haWwgc2VwYXJhdG9yIGluIGJhc2U2NCBmb3Igb3VyIFNwbHVuayBpbmRleGluZwo=\n"
21 | 


--------------------------------------------------------------------------------
/lib/mail_exceptions.py:
--------------------------------------------------------------------------------
 1 | from __future__ import unicode_literals
 2 | 
 3 | """This contains exceptions defined for the Mail scheme"""
 4 | 
 5 | 
 6 | class MailException(Exception):
 7 |     """
 8 |     Exception raised for errors in the mail modular input.
 9 |     """
10 | 
11 | 
12 | class MailExceptionInvalidProtocol(MailException):
13 |     """
14 |     Raised if an invalid mail protocol is defined.
15 |     This requires POP3 or IMAP
16 |     """
17 | 
18 |     def __init__(self):
19 |         MailException.__init__(self, 'protocol must be set to either POP3 or IMAP')
20 | 
21 | 
22 | class MailExceptionStanzaNotEmail(MailException):
23 |     """
24 |     Raised if the stanza is not an email address
25 |     """
26 | 
27 |     def __init__(self, message):
28 |         self.input = message
29 |         MailException.__init__(self, 'Input stanza must be an email address. Error parsing %s' % message)
30 | 
31 | 
32 | class MailProtocolError(MailException):
33 |     """
34 |     Raised when a Poplib exception is thrown and caught
35 |     """
36 | 
37 |     def __init__(self, message):
38 |         self.message = message
39 |         MailException.__init__(self, 'Exception thrown by Poplib or Imaplib, %s' % message)
40 | 
41 | 
42 | class MailConnectionError(MailException):
43 |     """
44 |     Raised when there's a connection error
45 |     """
46 | 
47 |     def __init__(self, message):
48 |         self.message = message
49 |         MailException.__init__(self, 'Mail connection error: %s' % message)
50 | 
51 | 
52 | class MailLoginFailed(MailException):
53 |     """
54 |     Raised when there's a login failure
55 |     """
56 | 
57 |     def __init__(self, server, username):
58 |         self.user = username
59 |         MailException.__init__(self, 'Login failed on %s for username: %s' % (server, username))
60 | 
61 | 
62 | 


--------------------------------------------------------------------------------
/lib/mail_utils.py:
--------------------------------------------------------------------------------
  1 | from __future__ import unicode_literals
  2 | 
  3 | from six import text_type, binary_type
  4 | 
  5 | import hashlib
  6 | import os
  7 | import socket
  8 | import re
  9 | 
 10 | 
 11 | def mail_connectivity_test(server, protocol):
 12 |     """
 13 |     This validates connectivity to given hostname and port
 14 |     :param server: This is the remote hostname or IP to be used for the test.
 15 |     :type server: basestring
 16 |     :param protocol: The protocol to be used to fetch emails - IMAPS or POP3S
 17 |     :type protocol: basestring
 18 |     :return: Raises an exception back to the modinput validation if connectivity test fails
 19 |     """
 20 |     try:
 21 |         captive_dns_addr = socket.gethostbyname(server)
 22 |         s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
 23 |         s.settimeout(1)
 24 |         s.connect((captive_dns_addr, get_mail_port(protocol=protocol)))
 25 |         s.close()
 26 |     except socket.error as e:
 27 |         raise socket.error("Socket error : %s" % e)
 28 | 
 29 | 
 30 | def save_checkpoint(checkpoint_dir, msg):
 31 |     """
 32 |     This creates a checkpoint file in the checkpoint directory for the message.
 33 |     :param checkpoint_dir: This contains the path where checkpoint files will be saved
 34 |     :type checkpoint_dir: basestring
 35 |     :param msg: Contains a message that needs to indexed and
 36 |      :type msg: basestring
 37 |     """
 38 |     filename = os.path.join(checkpoint_dir, hashlib.sha256(msg.encode("utf8", "backslashreplace")).hexdigest())
 39 |     f = open(filename, 'w')
 40 |     f.close()
 41 | 
 42 | 
 43 | def locate_checkpoint(checkpoint_dir, msg):
 44 |     """
 45 |     This checks if a message has already been indexed by using a digest of the first 300 characters,
 46 |     which includes a date, message id, source and destination email addresses.
 47 |     :param checkpoint_dir: This contains the path where checkpoint files will be saved
 48 |     :type checkpoint_dir: basestring
 49 |     :param msg: Contains a message that needs to indexed and
 50 |      :type msg: basestring
 51 |     :return: Returns true if the message has been indexed previously, and false if not.
 52 |      :rtype: bool
 53 |     """
 54 |     filename = os.path.join(checkpoint_dir, hashlib.sha256(msg.encode("utf8", "backslashreplace")).hexdigest())
 55 |     try:
 56 |         open(filename, 'r').close()
 57 |     except (OSError, IOError):
 58 |         return False
 59 |     return True
 60 | 
 61 | 
 62 | def bool_variable(x):
 63 |     """
 64 | 
 65 |     :param x: variable to be converted to boolean. This defaults to true if unsupported values are passed to this
 66 |     :return:
 67 |     """
 68 |     if x == "enabled":
 69 |         x = True
 70 |     elif x == "disabled":
 71 |         x = False
 72 |     elif x == "True":
 73 |         x = True
 74 |     elif x == "False":
 75 |         x = False
 76 |     elif x == "1" or x == "0":
 77 |         x = bool(int(x))
 78 |     else:
 79 |         x = True
 80 |     return x
 81 | 
 82 | 
 83 | def get_mail_port(protocol):
 84 |     """
 85 |     This returns the server port to use for POP retrieval of mails
 86 |     :param protocol: The protocol to be used to fetch emails - IMAP or POP3
 87 |     :type protocol: basestring
 88 |     :return: Returns the correct port for either POP3 or POP3 over SSL
 89 |     :rtype: int
 90 |     """
 91 |     if protocol == 'POP3':
 92 |         port = 995
 93 |     elif 'IMAP' == protocol:
 94 |         port = 993
 95 |     else:
 96 |         raise Exception("Invalid options passed to get_mail_port")
 97 |     return port
 98 | 
 99 | def drop_attachment_from_event(message):
100 |     """
101 |     This prevent the attachment content to be ingested in clear text by
102 |     dropping its content. If attachment is unsupported, nothing done.
103 |     :param message: Email message to be ingested as event in Splunk
104 |     :type message: basestring
105 |     :return: Return the email message with no attachment content
106 |     :rtype: basestring
107 |     """
108 |     pattern = r'^#BEGIN_ATTACHMENT:\s(.*)#END_ATTACHMENT:\s'
109 |     return re.sub(pattern, "", message, flags=re.DOTALL|re.MULTILINE)


--------------------------------------------------------------------------------
/lib/splunklib/__init__.py:
--------------------------------------------------------------------------------
 1 | # Copyright 2011-2015 Splunk, Inc.
 2 | #
 3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
 4 | # not use this file except in compliance with the License. You may obtain
 5 | # a copy of the License at
 6 | #
 7 | #     http://www.apache.org/licenses/LICENSE-2.0
 8 | #
 9 | # Unless required by applicable law or agreed to in writing, software
10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
12 | # License for the specific language governing permissions and limitations
13 | # under the License.
14 | 
15 | """Python library for Splunk."""
16 | 
17 | from __future__ import absolute_import
18 | from splunklib.six.moves import map
19 | __version_info__ = (1, 6, 14)
20 | __version__ = ".".join(map(str, __version_info__))
21 | 


--------------------------------------------------------------------------------
/lib/splunklib/data.py:
--------------------------------------------------------------------------------
  1 | # Copyright 2011-2015 Splunk, Inc.
  2 | #
  3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  4 | # not use this file except in compliance with the License. You may obtain
  5 | # a copy of the License at
  6 | #
  7 | #     http://www.apache.org/licenses/LICENSE-2.0
  8 | #
  9 | # Unless required by applicable law or agreed to in writing, software
 10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 12 | # License for the specific language governing permissions and limitations
 13 | # under the License.
 14 | 
 15 | """The **splunklib.data** module reads the responses from splunkd in Atom Feed 
 16 | format, which is the format used by most of the REST API.
 17 | """
 18 | 
 19 | from __future__ import absolute_import
 20 | import sys
 21 | from xml.etree.ElementTree import XML
 22 | from splunklib import six
 23 | 
 24 | __all__ = ["load"]
 25 | 
 26 | # LNAME refers to element names without namespaces; XNAME is the same
 27 | # name, but with an XML namespace.
 28 | LNAME_DICT = "dict"
 29 | LNAME_ITEM = "item"
 30 | LNAME_KEY = "key"
 31 | LNAME_LIST = "list"
 32 | 
 33 | XNAMEF_REST = "{http://dev.splunk.com/ns/rest}%s"
 34 | XNAME_DICT = XNAMEF_REST % LNAME_DICT
 35 | XNAME_ITEM = XNAMEF_REST % LNAME_ITEM
 36 | XNAME_KEY = XNAMEF_REST % LNAME_KEY
 37 | XNAME_LIST = XNAMEF_REST % LNAME_LIST
 38 | 
 39 | # Some responses don't use namespaces (eg: search/parse) so we look for
 40 | # both the extended and local versions of the following names.
 41 | 
 42 | def isdict(name):
 43 |     return name == XNAME_DICT or name == LNAME_DICT
 44 | 
 45 | def isitem(name):
 46 |     return name == XNAME_ITEM or name == LNAME_ITEM
 47 | 
 48 | def iskey(name):
 49 |     return name == XNAME_KEY or name == LNAME_KEY
 50 | 
 51 | def islist(name):
 52 |     return name == XNAME_LIST or name == LNAME_LIST
 53 | 
 54 | def hasattrs(element):
 55 |     return len(element.attrib) > 0
 56 | 
 57 | def localname(xname):
 58 |     rcurly = xname.find('}')
 59 |     return xname if rcurly == -1 else xname[rcurly+1:]
 60 | 
 61 | def load(text, match=None):
 62 |     """This function reads a string that contains the XML of an Atom Feed, then 
 63 |     returns the 
 64 |     data in a native Python structure (a ``dict`` or ``list``). If you also 
 65 |     provide a tag name or path to match, only the matching sub-elements are 
 66 |     loaded.
 67 | 
 68 |     :param text: The XML text to load.
 69 |     :type text: ``string``
 70 |     :param match: A tag name or path to match (optional).
 71 |     :type match: ``string``
 72 |     """
 73 |     if text is None: return None
 74 |     text = text.strip()
 75 |     if len(text) == 0: return None
 76 |     nametable = {
 77 |         'namespaces': [],
 78 |         'names': {}
 79 |     }
 80 | 
 81 |     # Convert to unicode encoding in only python 2 for xml parser
 82 |     if(sys.version_info < (3, 0, 0) and isinstance(text, unicode)):
 83 |         text = text.encode('utf-8')
 84 | 
 85 |     root = XML(text)
 86 |     items = [root] if match is None else root.findall(match)
 87 |     count = len(items)
 88 |     if count == 0: 
 89 |         return None
 90 |     elif count == 1: 
 91 |         return load_root(items[0], nametable)
 92 |     else:
 93 |         return [load_root(item, nametable) for item in items]
 94 | 
 95 | # Load the attributes of the given element.
 96 | def load_attrs(element):
 97 |     if not hasattrs(element): return None
 98 |     attrs = record()
 99 |     for key, value in six.iteritems(element.attrib): 
100 |         attrs[key] = value
101 |     return attrs
102 | 
103 | # Parse a <dict> element and return a Python dict
104 | def load_dict(element, nametable = None):
105 |     value = record()
106 |     children = list(element)
107 |     for child in children:
108 |         assert iskey(child.tag)
109 |         name = child.attrib["name"]
110 |         value[name] = load_value(child, nametable)
111 |     return value
112 | 
113 | # Loads the given elements attrs & value into single merged dict.
114 | def load_elem(element, nametable=None):
115 |     name = localname(element.tag)
116 |     attrs = load_attrs(element)
117 |     value = load_value(element, nametable)
118 |     if attrs is None: return name, value
119 |     if value is None: return name, attrs
120 |     # If value is simple, merge into attrs dict using special key
121 |     if isinstance(value, six.string_types):
122 |         attrs["$text"] = value
123 |         return name, attrs
124 |     # Both attrs & value are complex, so merge the two dicts, resolving collisions.
125 |     collision_keys = []
126 |     for key, val in six.iteritems(attrs):
127 |         if key in value and key in collision_keys:
128 |             value[key].append(val)
129 |         elif key in value and key not in collision_keys:
130 |             value[key] = [value[key], val]
131 |             collision_keys.append(key)
132 |         else:
133 |             value[key] = val
134 |     return name, value
135 | 
136 | # Parse a <list> element and return a Python list
137 | def load_list(element, nametable=None):
138 |     assert islist(element.tag)
139 |     value = []
140 |     children = list(element)
141 |     for child in children:
142 |         assert isitem(child.tag)
143 |         value.append(load_value(child, nametable))
144 |     return value
145 | 
146 | # Load the given root element.
147 | def load_root(element, nametable=None):
148 |     tag = element.tag
149 |     if isdict(tag): return load_dict(element, nametable)
150 |     if islist(tag): return load_list(element, nametable)
151 |     k, v = load_elem(element, nametable)
152 |     return Record.fromkv(k, v)
153 | 
154 | # Load the children of the given element.
155 | def load_value(element, nametable=None):
156 |     children = list(element)
157 |     count = len(children)
158 | 
159 |     # No children, assume a simple text value
160 |     if count == 0:
161 |         text = element.text
162 |         if text is None: 
163 |             return None
164 |         text = text.strip()
165 |         if len(text) == 0: 
166 |             return None
167 |         return text
168 | 
169 |     # Look for the special case of a single well-known structure
170 |     if count == 1:
171 |         child = children[0]
172 |         tag = child.tag
173 |         if isdict(tag): return load_dict(child, nametable)
174 |         if islist(tag): return load_list(child, nametable)
175 | 
176 |     value = record()
177 |     for child in children:
178 |         name, item = load_elem(child, nametable)
179 |         # If we have seen this name before, promote the value to a list
180 |         if name in value:
181 |             current = value[name]
182 |             if not isinstance(current, list): 
183 |                 value[name] = [current]
184 |             value[name].append(item)
185 |         else:
186 |             value[name] = item
187 | 
188 |     return value
189 | 
190 | # A generic utility that enables "dot" access to dicts
191 | class Record(dict):
192 |     """This generic utility class enables dot access to members of a Python 
193 |     dictionary.
194 | 
195 |     Any key that is also a valid Python identifier can be retrieved as a field. 
196 |     So, for an instance of ``Record`` called ``r``, ``r.key`` is equivalent to 
197 |     ``r['key']``. A key such as ``invalid-key`` or ``invalid.key`` cannot be 
198 |     retrieved as a field, because ``-`` and ``.`` are not allowed in 
199 |     identifiers.
200 | 
201 |     Keys of the form ``a.b.c`` are very natural to write in Python as fields. If 
202 |     a group of keys shares a prefix ending in ``.``, you can retrieve keys as a 
203 |     nested dictionary by calling only the prefix. For example, if ``r`` contains
204 |     keys ``'foo'``, ``'bar.baz'``, and ``'bar.qux'``, ``r.bar`` returns a record
205 |     with the keys ``baz`` and ``qux``. If a key contains multiple ``.``, each 
206 |     one is placed into a nested dictionary, so you can write ``r.bar.qux`` or 
207 |     ``r['bar.qux']`` interchangeably.
208 |     """
209 |     sep = '.'
210 | 
211 |     def __call__(self, *args):
212 |         if len(args) == 0: return self
213 |         return Record((key, self[key]) for key in args)
214 | 
215 |     def __getattr__(self, name):
216 |         try:
217 |             return self[name]
218 |         except KeyError: 
219 |             raise AttributeError(name)
220 | 
221 |     def __delattr__(self, name):
222 |         del self[name]
223 | 
224 |     def __setattr__(self, name, value):
225 |         self[name] = value
226 | 
227 |     @staticmethod
228 |     def fromkv(k, v):
229 |         result = record()
230 |         result[k] = v
231 |         return result
232 | 
233 |     def __getitem__(self, key):
234 |         if key in self:
235 |             return dict.__getitem__(self, key)
236 |         key += self.sep
237 |         result = record()
238 |         for k,v in six.iteritems(self):
239 |             if not k.startswith(key):
240 |                 continue
241 |             suffix = k[len(key):]
242 |             if '.' in suffix:
243 |                 ks = suffix.split(self.sep)
244 |                 z = result
245 |                 for x in ks[:-1]:
246 |                     if x not in z:
247 |                         z[x] = record()
248 |                     z = z[x]
249 |                 z[ks[-1]] = v
250 |             else:
251 |                 result[suffix] = v
252 |         if len(result) == 0:
253 |             raise KeyError("No key or prefix: %s" % key)
254 |         return result
255 |     
256 | 
257 | def record(value=None): 
258 |     """This function returns a :class:`Record` instance constructed with an 
259 |     initial value that you provide.
260 |     
261 |     :param `value`: An initial record value.
262 |     :type `value`: ``dict``
263 |     """
264 |     if value is None: value = {}
265 |     return Record(value)
266 | 
267 | 


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/__init__.py:
--------------------------------------------------------------------------------
 1 | """The following imports allow these classes to be imported via
 2 | the splunklib.modularinput package like so:
 3 | 
 4 | from splunklib.modularinput import *
 5 | """
 6 | from .argument import Argument
 7 | from .event import Event
 8 | from .event_writer import EventWriter
 9 | from .input_definition import InputDefinition
10 | from .scheme import Scheme
11 | from .script import Script
12 | from .validation_definition import ValidationDefinition
13 | 


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/argument.py:
--------------------------------------------------------------------------------
  1 | # Copyright 2011-2015 Splunk, Inc.
  2 | #
  3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  4 | # not use this file except in compliance with the License. You may obtain
  5 | # a copy of the License at
  6 | #
  7 | #     http://www.apache.org/licenses/LICENSE-2.0
  8 | #
  9 | # Unless required by applicable law or agreed to in writing, software
 10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 12 | # License for the specific language governing permissions and limitations
 13 | # under the License.
 14 | 
 15 | from __future__ import absolute_import
 16 | try:
 17 |     import xml.etree.ElementTree as ET
 18 | except ImportError:
 19 |     import xml.etree.cElementTree as ET
 20 | 
 21 | class Argument(object):
 22 |     """Class representing an argument to a modular input kind.
 23 | 
 24 |     ``Argument`` is meant to be used with ``Scheme`` to generate an XML 
 25 |     definition of the modular input kind that Splunk understands.
 26 | 
 27 |     ``name`` is the only required parameter for the constructor.
 28 | 
 29 |         **Example with least parameters**::
 30 | 
 31 |             arg1 = Argument(name="arg1")
 32 | 
 33 |         **Example with all parameters**::
 34 | 
 35 |             arg2 = Argument(
 36 |                 name="arg2",
 37 |                 description="This is an argument with lots of parameters",
 38 |                 validation="is_pos_int('some_name')",
 39 |                 data_type=Argument.data_type_number,
 40 |                 required_on_edit=True,
 41 |                 required_on_create=True
 42 |             )
 43 |     """
 44 | 
 45 |     # Constant values, do not change.
 46 |     # These should be used for setting the value of an Argument object's data_type field.
 47 |     data_type_boolean = "BOOLEAN"
 48 |     data_type_number = "NUMBER"
 49 |     data_type_string = "STRING"
 50 | 
 51 |     def __init__(self, name, description=None, validation=None,
 52 |                  data_type=data_type_string, required_on_edit=False, required_on_create=False, title=None):
 53 |         """
 54 |         :param name: ``string``, identifier for this argument in Splunk.
 55 |         :param description: ``string``, human-readable description of the argument.
 56 |         :param validation: ``string`` specifying how the argument should be validated, if using internal validation.
 57 |                If using external validation, this will be ignored.
 58 |         :param data_type: ``string``, data type of this field; use the class constants.
 59 |                "data_type_boolean", "data_type_number", or "data_type_string".
 60 |         :param required_on_edit: ``Boolean``, whether this arg is required when editing an existing modular input of this kind.
 61 |         :param required_on_create: ``Boolean``, whether this arg is required when creating a modular input of this kind.
 62 |         :param title: ``String``, a human-readable title for the argument.
 63 |         """
 64 |         self.name = name
 65 |         self.description = description
 66 |         self.validation = validation
 67 |         self.data_type = data_type
 68 |         self.required_on_edit = required_on_edit
 69 |         self.required_on_create = required_on_create
 70 |         self.title = title
 71 | 
 72 |     def add_to_document(self, parent):
 73 |         """Adds an ``Argument`` object to this ElementTree document.
 74 | 
 75 |         Adds an <arg> subelement to the parent element, typically <args>
 76 |         and sets up its subelements with their respective text.
 77 | 
 78 |         :param parent: An ``ET.Element`` to be the parent of a new <arg> subelement
 79 |         :returns: An ``ET.Element`` object representing this argument.
 80 |         """
 81 |         arg = ET.SubElement(parent, "arg")
 82 |         arg.set("name", self.name)
 83 | 
 84 |         if self.title is not None:
 85 |             ET.SubElement(arg, "title").text = self.title
 86 | 
 87 |         if self.description is not None:
 88 |             ET.SubElement(arg, "description").text = self.description
 89 | 
 90 |         if self.validation is not None:
 91 |             ET.SubElement(arg, "validation").text = self.validation
 92 | 
 93 |         # add all other subelements to this Argument, represented by (tag, text)
 94 |         subelements = [
 95 |             ("data_type", self.data_type),
 96 |             ("required_on_edit", self.required_on_edit),
 97 |             ("required_on_create", self.required_on_create)
 98 |         ]
 99 | 
100 |         for name, value in subelements:
101 |             ET.SubElement(arg, name).text = str(value).lower()
102 | 
103 |         return arg


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/event.py:
--------------------------------------------------------------------------------
  1 | # Copyright 2011-2015 Splunk, Inc.
  2 | #
  3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  4 | # not use this file except in compliance with the License. You may obtain
  5 | # a copy of the License at
  6 | #
  7 | #     http://www.apache.org/licenses/LICENSE-2.0
  8 | #
  9 | # Unless required by applicable law or agreed to in writing, software
 10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 12 | # License for the specific language governing permissions and limitations
 13 | # under the License.
 14 | 
 15 | from __future__ import absolute_import
 16 | from io import TextIOBase
 17 | from splunklib.six import ensure_text
 18 | 
 19 | try:
 20 |     import xml.etree.cElementTree as ET
 21 | except ImportError as ie:
 22 |     import xml.etree.ElementTree as ET
 23 | 
 24 | class Event(object):
 25 |     """Represents an event or fragment of an event to be written by this modular input to Splunk.
 26 | 
 27 |     To write an input to a stream, call the ``write_to`` function, passing in a stream.
 28 |     """
 29 |     def __init__(self, data=None, stanza=None, time=None, host=None, index=None, source=None,
 30 |                  sourcetype=None, done=True, unbroken=True):
 31 |         """There are no required parameters for constructing an Event
 32 | 
 33 |         **Example with minimal configuration**::
 34 | 
 35 |             my_event = Event(
 36 |                 data="This is a test of my new event.",
 37 |                 stanza="myStanzaName",
 38 |                 time="%.3f" % 1372187084.000
 39 |             )
 40 | 
 41 |         **Example with full configuration**::
 42 | 
 43 |             excellent_event = Event(
 44 |                 data="This is a test of my excellent event.",
 45 |                 stanza="excellenceOnly",
 46 |                 time="%.3f" % 1372274622.493,
 47 |                 host="localhost",
 48 |                 index="main",
 49 |                 source="Splunk",
 50 |                 sourcetype="misc",
 51 |                 done=True,
 52 |                 unbroken=True
 53 |             )
 54 | 
 55 |         :param data: ``string``, the event's text.
 56 |         :param stanza: ``string``, name of the input this event should be sent to.
 57 |         :param time: ``float``, time in seconds, including up to 3 decimal places to represent milliseconds.
 58 |         :param host: ``string``, the event's host, ex: localhost.
 59 |         :param index: ``string``, the index this event is specified to write to, or None if default index.
 60 |         :param source: ``string``, the source of this event, or None to have Splunk guess.
 61 |         :param sourcetype: ``string``, source type currently set on this event, or None to have Splunk guess.
 62 |         :param done: ``boolean``, is this a complete ``Event``? False if an ``Event`` fragment.
 63 |         :param unbroken: ``boolean``, Is this event completely encapsulated in this ``Event`` object?
 64 |         """
 65 |         self.data = data
 66 |         self.done = done
 67 |         self.host = host
 68 |         self.index = index
 69 |         self.source = source
 70 |         self.sourceType = sourcetype
 71 |         self.stanza = stanza
 72 |         self.time = time
 73 |         self.unbroken = unbroken
 74 | 
 75 |     def write_to(self, stream):
 76 |         """Write an XML representation of self, an ``Event`` object, to the given stream.
 77 | 
 78 |         The ``Event`` object will only be written if its data field is defined,
 79 |         otherwise a ``ValueError`` is raised.
 80 | 
 81 |         :param stream: stream to write XML to.
 82 |         """
 83 |         if self.data is None:
 84 |             raise ValueError("Events must have at least the data field set to be written to XML.")
 85 | 
 86 |         event = ET.Element("event")
 87 |         if self.stanza is not None:
 88 |             event.set("stanza", self.stanza)
 89 |         event.set("unbroken", str(int(self.unbroken)))
 90 | 
 91 |         # if a time isn't set, let Splunk guess by not creating a <time> element
 92 |         if self.time is not None:
 93 |             ET.SubElement(event, "time").text = str(self.time)
 94 | 
 95 |         # add all other subelements to this Event, represented by (tag, text)
 96 |         subelements = [
 97 |             ("source", self.source),
 98 |             ("sourcetype", self.sourceType),
 99 |             ("index", self.index),
100 |             ("host", self.host),
101 |             ("data", self.data)
102 |         ]
103 |         for node, value in subelements:
104 |             if value is not None:
105 |                 ET.SubElement(event, node).text = value
106 | 
107 |         if self.done:
108 |             ET.SubElement(event, "done")
109 | 
110 |         if isinstance(stream, TextIOBase):
111 |             stream.write(ensure_text(ET.tostring(event)))
112 |         else:
113 |             stream.write(ET.tostring(event))
114 |         stream.flush()


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/event_writer.py:
--------------------------------------------------------------------------------
 1 | # Copyright 2011-2015 Splunk, Inc.
 2 | #
 3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
 4 | # not use this file except in compliance with the License. You may obtain
 5 | # a copy of the License at
 6 | #
 7 | #     http://www.apache.org/licenses/LICENSE-2.0
 8 | #
 9 | # Unless required by applicable law or agreed to in writing, software
10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
12 | # License for the specific language governing permissions and limitations
13 | # under the License.
14 | 
15 | from __future__ import absolute_import
16 | import sys
17 | 
18 | from io import TextIOWrapper, TextIOBase
19 | from splunklib.six import ensure_str
20 | from .event import ET
21 | 
22 | try:
23 |     from splunklib.six.moves import cStringIO as StringIO
24 | except ImportError:
25 |     from splunklib.six import StringIO
26 | 
27 | class EventWriter(object):
28 |     """``EventWriter`` writes events and error messages to Splunk from a modular input.
29 |     Its two important methods are ``writeEvent``, which takes an ``Event`` object,
30 |     and ``log``, which takes a severity and an error message.
31 |     """
32 | 
33 |     # Severities that Splunk understands for log messages from modular inputs.
34 |     # Do not change these
35 |     DEBUG = "DEBUG"
36 |     INFO = "INFO"
37 |     WARN = "WARN"
38 |     ERROR = "ERROR"
39 |     FATAL = "FATAL"
40 | 
41 |     def __init__(self, output = sys.stdout, error = sys.stderr):
42 |         """
43 |         :param output: Where to write the output; defaults to sys.stdout.
44 |         :param error: Where to write any errors; defaults to sys.stderr.
45 |         """
46 |         self._out = output
47 |         self._err = error
48 | 
49 |         # has the opening <stream> tag been written yet?
50 |         self.header_written = False
51 | 
52 |     def write_event(self, event):
53 |         """Writes an ``Event`` object to Splunk.
54 | 
55 |         :param event: An ``Event`` object.
56 |         """
57 | 
58 |         if not self.header_written:
59 |             self._out.write("<stream>")
60 |             self.header_written = True
61 | 
62 |         event.write_to(self._out)
63 | 
64 |     def log(self, severity, message):
65 |         """Logs messages about the state of this modular input to Splunk.
66 |         These messages will show up in Splunk's internal logs.
67 | 
68 |         :param severity: ``string``, severity of message, see severities defined as class constants.
69 |         :param message: ``string``, message to log.
70 |         """
71 | 
72 |         self._err.write("%s %s\n" % (severity, message))
73 |         self._err.flush()
74 | 
75 |     def write_xml_document(self, document):
76 |         """Writes a string representation of an
77 |         ``ElementTree`` object to the output stream.
78 | 
79 |         :param document: An ``ElementTree`` object.
80 |         """
81 |         self._out.write(ensure_str(ET.tostring(document)))
82 |         self._out.flush()
83 | 
84 |     def close(self):
85 |         """Write the closing </stream> tag to make this XML well formed."""
86 |         self._out.write("</stream>")
87 |         self._out.flush()
88 | 


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/input_definition.py:
--------------------------------------------------------------------------------
 1 | # Copyright 2011-2015 Splunk, Inc.
 2 | #
 3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
 4 | # not use this file except in compliance with the License. You may obtain
 5 | # a copy of the License at
 6 | #
 7 | #     http://www.apache.org/licenses/LICENSE-2.0
 8 | #
 9 | # Unless required by applicable law or agreed to in writing, software
10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
12 | # License for the specific language governing permissions and limitations
13 | # under the License.
14 | 
15 | from __future__ import absolute_import
16 | try:
17 |     import xml.etree.cElementTree as ET
18 | except ImportError as ie:
19 |     import xml.etree.ElementTree as ET
20 | 
21 | from .utils import parse_xml_data
22 | 
23 | class InputDefinition:
24 |     """``InputDefinition`` encodes the XML defining inputs that Splunk passes to
25 |     a modular input script.
26 | 
27 |      **Example**::
28 | 
29 |         i = InputDefinition()
30 | 
31 |     """
32 |     def __init__ (self):
33 |         self.metadata = {}
34 |         self.inputs = {}
35 | 
36 |     def __eq__(self, other):
37 |         if not isinstance(other, InputDefinition):
38 |             return False
39 |         return self.metadata == other.metadata and self.inputs == other.inputs
40 | 
41 |     @staticmethod
42 |     def parse(stream):
43 |         """Parse a stream containing XML into an ``InputDefinition``.
44 | 
45 |         :param stream: stream containing XML to parse.
46 |         :return: definition: an ``InputDefinition`` object.
47 |         """
48 |         definition = InputDefinition()
49 | 
50 |         # parse XML from the stream, then get the root node
51 |         root = ET.parse(stream).getroot()
52 | 
53 |         for node in root:
54 |             if node.tag == "configuration":
55 |                 # get config for each stanza
56 |                 definition.inputs = parse_xml_data(node, "stanza")
57 |             else:
58 |                 definition.metadata[node.tag] = node.text
59 | 
60 |         return definition


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/scheme.py:
--------------------------------------------------------------------------------
 1 | # Copyright 2011-2015 Splunk, Inc.
 2 | #
 3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
 4 | # not use this file except in compliance with the License. You may obtain
 5 | # a copy of the License at
 6 | #
 7 | #     http://www.apache.org/licenses/LICENSE-2.0
 8 | #
 9 | # Unless required by applicable law or agreed to in writing, software
10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
12 | # License for the specific language governing permissions and limitations
13 | # under the License.
14 | 
15 | from __future__ import absolute_import
16 | try:
17 |     import xml.etree.cElementTree as ET
18 | except ImportError:
19 |     import xml.etree.ElementTree as ET
20 | 
21 | class Scheme(object):
22 |     """Class representing the metadata for a modular input kind.
23 | 
24 |     A ``Scheme`` specifies a title, description, several options of how Splunk should run modular inputs of this
25 |     kind, and a set of arguments which define a particular modular input's properties.
26 | 
27 |     The primary use of ``Scheme`` is to abstract away the construction of XML to feed to Splunk.
28 |     """
29 | 
30 |     # Constant values, do not change
31 |     # These should be used for setting the value of a Scheme object's streaming_mode field.
32 |     streaming_mode_simple = "SIMPLE"
33 |     streaming_mode_xml = "XML"
34 | 
35 |     def __init__(self, title):
36 |         """
37 |         :param title: ``string`` identifier for this Scheme in Splunk.
38 |         """
39 |         self.title = title
40 |         self.description = None
41 |         self.use_external_validation = True
42 |         self.use_single_instance = False
43 |         self.streaming_mode = Scheme.streaming_mode_xml
44 | 
45 |         # list of Argument objects, each to be represented by an <arg> tag
46 |         self.arguments = []
47 | 
48 |     def add_argument(self, arg):
49 |         """Add the provided argument, ``arg``, to the ``self.arguments`` list.
50 | 
51 |         :param arg: An ``Argument`` object to add to ``self.arguments``.
52 |         """
53 |         self.arguments.append(arg)
54 | 
55 |     def to_xml(self):
56 |         """Creates an ``ET.Element`` representing self, then returns it.
57 | 
58 |         :returns: an ``ET.Element`` representing this scheme.
59 |         """
60 |         root = ET.Element("scheme")
61 | 
62 |         ET.SubElement(root, "title").text = self.title
63 | 
64 |         # add a description subelement if it's defined
65 |         if self.description is not None:
66 |             ET.SubElement(root, "description").text = self.description
67 | 
68 |         # add all other subelements to this Scheme, represented by (tag, text)
69 |         subelements = [
70 |             ("use_external_validation", self.use_external_validation),
71 |             ("use_single_instance", self.use_single_instance),
72 |             ("streaming_mode", self.streaming_mode)
73 |         ]
74 |         for name, value in subelements:
75 |             ET.SubElement(root, name).text = str(value).lower()
76 | 
77 |         endpoint = ET.SubElement(root, "endpoint")
78 | 
79 |         args = ET.SubElement(endpoint, "args")
80 | 
81 |         # add arguments as subelements to the <args> element
82 |         for arg in self.arguments:
83 |             arg.add_to_document(args)
84 | 
85 |         return root


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/script.py:
--------------------------------------------------------------------------------
  1 | # Copyright 2011-2015 Splunk, Inc.
  2 | #
  3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  4 | # not use this file except in compliance with the License. You may obtain
  5 | # a copy of the License at
  6 | #
  7 | #     http://www.apache.org/licenses/LICENSE-2.0
  8 | #
  9 | # Unless required by applicable law or agreed to in writing, software
 10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 12 | # License for the specific language governing permissions and limitations
 13 | # under the License.
 14 | 
 15 | from __future__ import absolute_import
 16 | from abc import ABCMeta, abstractmethod
 17 | from splunklib.six.moves.urllib.parse import urlsplit
 18 | import sys
 19 | 
 20 | from ..client import Service
 21 | from .event_writer import EventWriter
 22 | from .input_definition import InputDefinition
 23 | from .validation_definition import ValidationDefinition
 24 | from splunklib import six
 25 | 
 26 | try:
 27 |     import xml.etree.cElementTree as ET
 28 | except ImportError:
 29 |     import xml.etree.ElementTree as ET
 30 | 
 31 | 
 32 | class Script(six.with_metaclass(ABCMeta, object)):
 33 |     """An abstract base class for implementing modular inputs.
 34 | 
 35 |     Subclasses should override ``get_scheme``, ``stream_events``,
 36 |     and optionally ``validate_input`` if the modular input uses
 37 |     external validation.
 38 | 
 39 |     The ``run`` function is used to run modular inputs; it typically should
 40 |     not be overridden.
 41 |     """
 42 | 
 43 |     def __init__(self):
 44 |         self._input_definition = None
 45 |         self._service = None
 46 | 
 47 |     def run(self, args):
 48 |         """Runs this modular input
 49 | 
 50 |         :param args: List of command line arguments passed to this script.
 51 |         :returns: An integer to be used as the exit value of this program.
 52 |         """
 53 | 
 54 |         # call the run_script function, which handles the specifics of running
 55 |         # a modular input
 56 |         return self.run_script(args, EventWriter(), sys.stdin)
 57 | 
 58 |     def run_script(self, args, event_writer, input_stream):
 59 |         """Handles all the specifics of running a modular input
 60 | 
 61 |         :param args: List of command line arguments passed to this script.
 62 |         :param event_writer: An ``EventWriter`` object for writing events.
 63 |         :param input_stream: An input stream for reading inputs.
 64 |         :returns: An integer to be used as the exit value of this program.
 65 |         """
 66 | 
 67 |         try:
 68 |             if len(args) == 1:
 69 |                 # This script is running as an input. Input definitions will be
 70 |                 # passed on stdin as XML, and the script will write events on
 71 |                 # stdout and log entries on stderr.
 72 |                 self._input_definition = InputDefinition.parse(input_stream)
 73 |                 self.stream_events(self._input_definition, event_writer)
 74 |                 event_writer.close()
 75 |                 return 0
 76 | 
 77 |             elif str(args[1]).lower() == "--scheme":
 78 |                 # Splunk has requested XML specifying the scheme for this
 79 |                 # modular input Return it and exit.
 80 |                 scheme = self.get_scheme()
 81 |                 if scheme is None:
 82 |                     event_writer.log(
 83 |                         EventWriter.FATAL,
 84 |                         "Modular input script returned a null scheme.")
 85 |                     return 1
 86 |                 else:
 87 |                     event_writer.write_xml_document(scheme.to_xml())
 88 |                     return 0
 89 | 
 90 |             elif args[1].lower() == "--validate-arguments":
 91 |                 validation_definition = ValidationDefinition.parse(input_stream)
 92 |                 try:
 93 |                     self.validate_input(validation_definition)
 94 |                     return 0
 95 |                 except Exception as e:
 96 |                     root = ET.Element("error")
 97 |                     ET.SubElement(root, "message").text = str(e)
 98 |                     event_writer.write_xml_document(root)
 99 | 
100 |                     return 1
101 |             else:
102 |                 err_string = "ERROR Invalid arguments to modular input script:" + ' '.join(
103 |                     args)
104 |                 event_writer._err.write(err_string)
105 |                 return 1
106 | 
107 |         except Exception as e:
108 |             event_writer.log(EventWriter.ERROR, str(e))
109 |             return 1
110 | 
111 |     @property
112 |     def service(self):
113 |         """ Returns a Splunk service object for this script invocation.
114 | 
115 |         The service object is created from the Splunkd URI and session key
116 |         passed to the command invocation on the modular input stream. It is
117 |         available as soon as the :code:`Script.stream_events` method is
118 |         called.
119 | 
120 |         :return: :class:`splunklib.client.Service`. A value of None is returned,
121 |             if you call this method before the :code:`Script.stream_events` method
122 |             is called.
123 | 
124 |         """
125 |         if self._service is not None:
126 |             return self._service
127 | 
128 |         if self._input_definition is None:
129 |             return None
130 | 
131 |         splunkd_uri = self._input_definition.metadata["server_uri"]
132 |         session_key = self._input_definition.metadata["session_key"]
133 | 
134 |         splunkd = urlsplit(splunkd_uri, allow_fragments=False)
135 | 
136 |         self._service = Service(
137 |             scheme=splunkd.scheme,
138 |             host=splunkd.hostname,
139 |             port=splunkd.port,
140 |             token=session_key,
141 |         )
142 | 
143 |         return self._service
144 | 
145 |     @abstractmethod
146 |     def get_scheme(self):
147 |         """The scheme defines the parameters understood by this modular input.
148 | 
149 |         :return: a ``Scheme`` object representing the parameters for this modular input.
150 |         """
151 | 
152 |     def validate_input(self, definition):
153 |         """Handles external validation for modular input kinds.
154 | 
155 |         When Splunk calls a modular input script in validation mode, it will
156 |         pass in an XML document giving information about the Splunk instance (so
157 |         you can call back into it if needed) and the name and parameters of the
158 |         proposed input.
159 | 
160 |         If this function does not throw an exception, the validation is assumed
161 |         to succeed. Otherwise any errors thrown will be turned into a string and
162 |         logged back to Splunk.
163 | 
164 |         The default implementation always passes.
165 | 
166 |         :param definition: The parameters for the proposed input passed by splunkd.
167 |         """
168 |         pass
169 | 
170 |     @abstractmethod
171 |     def stream_events(self, inputs, ew):
172 |         """The method called to stream events into Splunk. It should do all of its output via
173 |         EventWriter rather than assuming that there is a console attached.
174 | 
175 |         :param inputs: An ``InputDefinition`` object.
176 |         :param ew: An object with methods to write events and log messages to Splunk.
177 |         """
178 | 


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/utils.py:
--------------------------------------------------------------------------------
 1 | # Copyright 2011-2015 Splunk, Inc.
 2 | #
 3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
 4 | # not use this file except in compliance with the License. You may obtain
 5 | # a copy of the License at
 6 | #
 7 | #     http://www.apache.org/licenses/LICENSE-2.0
 8 | #
 9 | # Unless required by applicable law or agreed to in writing, software
10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
12 | # License for the specific language governing permissions and limitations
13 | # under the License.
14 | 
15 | # File for utility functions
16 | 
17 | from __future__ import absolute_import
18 | from splunklib.six.moves import zip
19 | def xml_compare(expected, found):
20 |     """Checks equality of two ``ElementTree`` objects.
21 | 
22 |     :param expected: An ``ElementTree`` object.
23 |     :param found: An ``ElementTree`` object.
24 |     :return: ``Boolean``, whether the two objects are equal.
25 |     """
26 | 
27 |     # if comparing the same ET object
28 |     if expected == found:
29 |         return True
30 | 
31 |     # compare element attributes, ignoring order
32 |     if set(expected.items()) != set(found.items()):
33 |         return False
34 | 
35 |     # check for equal number of children
36 |     expected_children = list(expected)
37 |     found_children = list(found)
38 |     if len(expected_children) != len(found_children):
39 |         return False
40 | 
41 |     # compare children
42 |     if not all([xml_compare(a, b) for a, b in zip(expected_children, found_children)]):
43 |         return False
44 | 
45 |     # compare elements, if there is no text node, return True
46 |     if (expected.text is None or expected.text.strip() == "") \
47 |         and (found.text is None or found.text.strip() == ""):
48 |         return True
49 |     else:
50 |         return expected.tag == found.tag and expected.text == found.text \
51 |             and expected.attrib == found.attrib
52 | 
53 | def parse_parameters(param_node):
54 |     if param_node.tag == "param":
55 |         return param_node.text
56 |     elif param_node.tag == "param_list":
57 |         parameters = []
58 |         for mvp in param_node:
59 |             parameters.append(mvp.text)
60 |         return parameters
61 |     else:
62 |         raise ValueError("Invalid configuration scheme, %s tag unexpected." % param_node.tag)
63 | 
64 | def parse_xml_data(parent_node, child_node_tag):
65 |     data = {}
66 |     for child in parent_node:
67 |         if child.tag == child_node_tag:
68 |             if child_node_tag == "stanza":
69 |                 data[child.get("name")] = {}
70 |                 for param in child:
71 |                     data[child.get("name")][param.get("name")] = parse_parameters(param)
72 |         elif "item" == parent_node.tag:
73 |             data[child.get("name")] = parse_parameters(child)
74 |     return data
75 | 


--------------------------------------------------------------------------------
/lib/splunklib/modularinput/validation_definition.py:
--------------------------------------------------------------------------------
 1 | # Copyright 2011-2015 Splunk, Inc.
 2 | #
 3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
 4 | # not use this file except in compliance with the License. You may obtain
 5 | # a copy of the License at
 6 | #
 7 | #     http://www.apache.org/licenses/LICENSE-2.0
 8 | #
 9 | # Unless required by applicable law or agreed to in writing, software
10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
12 | # License for the specific language governing permissions and limitations
13 | # under the License.
14 | 
15 | 
16 | from __future__ import absolute_import
17 | try:
18 |     import xml.etree.cElementTree as ET
19 | except ImportError as ie:
20 |     import xml.etree.ElementTree as ET
21 | 
22 | from .utils import parse_xml_data
23 | 
24 | 
25 | class ValidationDefinition(object):
26 |     """This class represents the XML sent by Splunk for external validation of a
27 |     new modular input.
28 | 
29 |     **Example**::
30 | 
31 |         v = ValidationDefinition()
32 | 
33 |     """
34 |     def __init__(self):
35 |         self.metadata = {}
36 |         self.parameters = {}
37 | 
38 |     def __eq__(self, other):
39 |         if not isinstance(other, ValidationDefinition):
40 |             return False
41 |         return self.metadata == other.metadata and self.parameters == other.parameters
42 | 
43 |     @staticmethod
44 |     def parse(stream):
45 |         """Creates a ``ValidationDefinition`` from a provided stream containing XML.
46 | 
47 |         The XML typically will look like this:
48 | 
49 |         ..  code-block:: xml
50 | 
51 |             <items>
52 |                <server_host>myHost</server_host>
53 |                  <server_uri>https://127.0.0.1:8089</server_uri>
54 |                  <session_key>123102983109283019283</session_key>
55 |                  <checkpoint_dir>/opt/splunk/var/lib/splunk/modinputs</checkpoint_dir>
56 |                  <item name="myScheme">
57 |                    <param name="param1">value1</param>
58 |                    <param_list name="param2">
59 |                      <value>value2</value>
60 |                      <value>value3</value>
61 |                      <value>value4</value>
62 |                    </param_list>
63 |                  </item>
64 |             </items>
65 | 
66 |         :param stream: ``Stream`` containing XML to parse.
67 |         :return: A ``ValidationDefinition`` object.
68 | 
69 |         """
70 | 
71 |         definition = ValidationDefinition()
72 | 
73 |         # parse XML from the stream, then get the root node
74 |         root = ET.parse(stream).getroot()
75 | 
76 |         for node in root:
77 |             # lone item node
78 |             if node.tag == "item":
79 |                 # name from item node
80 |                 definition.metadata["name"] = node.get("name")
81 |                 definition.parameters = parse_xml_data(node, "")
82 |             else:
83 |                 # Store anything else in metadata
84 |                 definition.metadata[node.tag] = node.text
85 | 
86 |         return definition


--------------------------------------------------------------------------------
/lib/splunklib/ordereddict.py:
--------------------------------------------------------------------------------
  1 | # Copyright (c) 2009 Raymond Hettinger
  2 | #
  3 | # Permission is hereby granted, free of charge, to any person
  4 | # obtaining a copy of this software and associated documentation files
  5 | # (the "Software"), to deal in the Software without restriction,
  6 | # including without limitation the rights to use, copy, modify, merge,
  7 | # publish, distribute, sublicense, and/or sell copies of the Software,
  8 | # and to permit persons to whom the Software is furnished to do so,
  9 | # subject to the following conditions:
 10 | #
 11 | #     The above copyright notice and this permission notice shall be
 12 | #     included in all copies or substantial portions of the Software.
 13 | #
 14 | #     THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
 15 | #     EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES
 16 | #     OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
 17 | #     NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT
 18 | #     HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY,
 19 | #     WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING
 20 | #     FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
 21 | #     OTHER DEALINGS IN THE SOFTWARE.
 22 | 
 23 | from UserDict import DictMixin
 24 | 
 25 | 
 26 | class OrderedDict(dict, DictMixin):
 27 | 
 28 |     def __init__(self, *args, **kwds):
 29 |         if len(args) > 1:
 30 |             raise TypeError('expected at most 1 arguments, got %d' % len(args))
 31 |         try:
 32 |             self.__end
 33 |         except AttributeError:
 34 |             self.clear()
 35 |         self.update(*args, **kwds)
 36 | 
 37 |     def clear(self):
 38 |         self.__end = end = []
 39 |         end += [None, end, end]         # sentinel node for doubly linked list
 40 |         self.__map = {}                 # key --> [key, prev, next]
 41 |         dict.clear(self)
 42 | 
 43 |     def __setitem__(self, key, value):
 44 |         if key not in self:
 45 |             end = self.__end
 46 |             curr = end[1]
 47 |             curr[2] = end[1] = self.__map[key] = [key, curr, end]
 48 |         dict.__setitem__(self, key, value)
 49 | 
 50 |     def __delitem__(self, key):
 51 |         dict.__delitem__(self, key)
 52 |         key, prev, next = self.__map.pop(key)
 53 |         prev[2] = next
 54 |         next[1] = prev
 55 | 
 56 |     def __iter__(self):
 57 |         end = self.__end
 58 |         curr = end[2]
 59 |         while curr is not end:
 60 |             yield curr[0]
 61 |             curr = curr[2]
 62 | 
 63 |     def __reversed__(self):
 64 |         end = self.__end
 65 |         curr = end[1]
 66 |         while curr is not end:
 67 |             yield curr[0]
 68 |             curr = curr[1]
 69 | 
 70 |     def popitem(self, last=True):
 71 |         if not self:
 72 |             raise KeyError('dictionary is empty')
 73 |         if last:
 74 |             key = reversed(self).next()
 75 |         else:
 76 |             key = iter(self).next()
 77 |         value = self.pop(key)
 78 |         return key, value
 79 | 
 80 |     def __reduce__(self):
 81 |         items = [[k, self[k]] for k in self]
 82 |         tmp = self.__map, self.__end
 83 |         del self.__map, self.__end
 84 |         inst_dict = vars(self).copy()
 85 |         self.__map, self.__end = tmp
 86 |         if inst_dict:
 87 |             return (self.__class__, (items,), inst_dict)
 88 |         return self.__class__, (items,)
 89 | 
 90 |     def keys(self):
 91 |         return list(self)
 92 | 
 93 |     setdefault = DictMixin.setdefault
 94 |     update = DictMixin.update
 95 |     pop = DictMixin.pop
 96 |     values = DictMixin.values
 97 |     items = DictMixin.items
 98 |     iterkeys = DictMixin.iterkeys
 99 |     itervalues = DictMixin.itervalues
100 |     iteritems = DictMixin.iteritems
101 | 
102 |     def __repr__(self):
103 |         if not self:
104 |             return '%s()' % (self.__class__.__name__,)
105 |         return '%s(%r)' % (self.__class__.__name__, self.items())
106 | 
107 |     def copy(self):
108 |         return self.__class__(self)
109 | 
110 |     @classmethod
111 |     def fromkeys(cls, iterable, value=None):
112 |         d = cls()
113 |         for key in iterable:
114 |             d[key] = value
115 |         return d
116 | 
117 |     def __eq__(self, other):
118 |         if isinstance(other, OrderedDict):
119 |             if len(self) != len(other):
120 |                 return False
121 |             for p, q in  zip(self.items(), other.items()):
122 |                 if p != q:
123 |                     return False
124 |             return True
125 |         return dict.__eq__(self, other)
126 | 
127 |     def __ne__(self, other):
128 |         return not self == other
129 | 


--------------------------------------------------------------------------------
/lib/splunklib/results.py:
--------------------------------------------------------------------------------
  1 | # Copyright 2011-2015 Splunk, Inc.
  2 | #
  3 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  4 | # not use this file except in compliance with the License. You may obtain
  5 | # a copy of the License at
  6 | #
  7 | #     http://www.apache.org/licenses/LICENSE-2.0
  8 | #
  9 | # Unless required by applicable law or agreed to in writing, software
 10 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 11 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 12 | # License for the specific language governing permissions and limitations
 13 | # under the License.
 14 | 
 15 | """The **splunklib.results** module provides a streaming XML reader for Splunk
 16 | search results.
 17 | 
 18 | Splunk search results can be returned in a variety of formats including XML,
 19 | JSON, and CSV. To make it easier to stream search results in XML format, they
 20 | are returned as a stream of XML *fragments*, not as a single XML document. This
 21 | module supports incrementally reading one result record at a time from such a
 22 | result stream. This module also provides a friendly iterator-based interface for
 23 | accessing search results while avoiding buffering the result set, which can be
 24 | very large.
 25 | 
 26 | To use the reader, instantiate :class:`ResultsReader` on a search result stream
 27 | as follows:::
 28 | 
 29 |     reader = ResultsReader(result_stream)
 30 |     for item in reader:
 31 |         print(item)
 32 |     print "Results are a preview: %s" % reader.is_preview
 33 | """
 34 | 
 35 | from __future__ import absolute_import
 36 | 
 37 | from io import BytesIO
 38 | 
 39 | from splunklib import six
 40 | try:
 41 |     import xml.etree.cElementTree as et
 42 | except:
 43 |     import xml.etree.ElementTree as et
 44 | 
 45 | try:
 46 |     from collections import OrderedDict  # must be python 2.7
 47 | except ImportError:
 48 |     from .ordereddict import OrderedDict
 49 | 
 50 | try:
 51 |     from splunklib.six.moves import cStringIO as StringIO
 52 | except:
 53 |     from splunklib.six import StringIO
 54 | 
 55 | __all__ = [
 56 |     "ResultsReader",
 57 |     "Message"
 58 | ]
 59 | 
 60 | class Message(object):
 61 |     """This class represents informational messages that Splunk interleaves in the results stream.
 62 | 
 63 |     ``Message`` takes two arguments: a string giving the message type (e.g., "DEBUG"), and
 64 |     a string giving the message itself.
 65 | 
 66 |     **Example**::
 67 | 
 68 |         m = Message("DEBUG", "There's something in that variable...")
 69 |     """
 70 |     def __init__(self, type_, message):
 71 |         self.type = type_
 72 |         self.message = message
 73 | 
 74 |     def __repr__(self):
 75 |         return "%s: %s" % (self.type, self.message)
 76 | 
 77 |     def __eq__(self, other):
 78 |         return (self.type, self.message) == (other.type, other.message)
 79 | 
 80 |     def __hash__(self):
 81 |         return hash((self.type, self.message))
 82 | 
 83 | class _ConcatenatedStream(object):
 84 |     """Lazily concatenate zero or more streams into a stream.
 85 | 
 86 |     As you read from the concatenated stream, you get characters from
 87 |     each stream passed to ``_ConcatenatedStream``, in order.
 88 | 
 89 |     **Example**::
 90 | 
 91 |         from StringIO import StringIO
 92 |         s = _ConcatenatedStream(StringIO("abc"), StringIO("def"))
 93 |         assert s.read() == "abcdef"
 94 |     """
 95 |     def __init__(self, *streams):
 96 |         self.streams = list(streams)
 97 | 
 98 |     def read(self, n=None):
 99 |         """Read at most *n* characters from this stream.
100 | 
101 |         If *n* is ``None``, return all available characters.
102 |         """
103 |         response = b""
104 |         while len(self.streams) > 0 and (n is None or n > 0):
105 |             txt = self.streams[0].read(n)
106 |             response += txt
107 |             if n is not None:
108 |                 n -= len(txt)
109 |             if n is None or n > 0:
110 |                 del self.streams[0]
111 |         return response
112 | 
113 | class _XMLDTDFilter(object):
114 |     """Lazily remove all XML DTDs from a stream.
115 | 
116 |     All substrings matching the regular expression <?[^>]*> are
117 |     removed in their entirety from the stream. No regular expressions
118 |     are used, however, so everything still streams properly.
119 | 
120 |     **Example**::
121 | 
122 |         from StringIO import StringIO
123 |         s = _XMLDTDFilter("<?xml abcd><element><?xml ...></element>")
124 |         assert s.read() == "<element></element>"
125 |     """
126 |     def __init__(self, stream):
127 |         self.stream = stream
128 | 
129 |     def read(self, n=None):
130 |         """Read at most *n* characters from this stream.
131 | 
132 |         If *n* is ``None``, return all available characters.
133 |         """
134 |         response = b""
135 |         while n is None or n > 0:
136 |             c = self.stream.read(1)
137 |             if c == b"":
138 |                 break
139 |             elif c == b"<":
140 |                 c += self.stream.read(1)
141 |                 if c == b"<?":
142 |                     while True:
143 |                         q = self.stream.read(1)
144 |                         if q == b">":
145 |                             break
146 |                 else:
147 |                     response += c
148 |                     if n is not None:
149 |                         n -= len(c)
150 |             else:
151 |                 response += c
152 |                 if n is not None:
153 |                     n -= 1
154 |         return response
155 | 
156 | class ResultsReader(object):
157 |     """This class returns dictionaries and Splunk messages from an XML results
158 |     stream.
159 | 
160 |     ``ResultsReader`` is iterable, and returns a ``dict`` for results, or a
161 |     :class:`Message` object for Splunk messages. This class has one field,
162 |     ``is_preview``, which is ``True`` when the results are a preview from a
163 |     running search, or ``False`` when the results are from a completed search.
164 | 
165 |     This function has no network activity other than what is implicit in the
166 |     stream it operates on.
167 | 
168 |     :param `stream`: The stream to read from (any object that supports
169 |         ``.read()``).
170 | 
171 |     **Example**::
172 | 
173 |         import results
174 |         response = ... # the body of an HTTP response
175 |         reader = results.ResultsReader(response)
176 |         for result in reader:
177 |             if isinstance(result, dict):
178 |                 print "Result: %s" % result
179 |             elif isinstance(result, results.Message):
180 |                 print "Message: %s" % result
181 |         print "is_preview = %s " % reader.is_preview
182 |     """
183 |     # Be sure to update the docstrings of client.Jobs.oneshot,
184 |     # client.Job.results_preview and client.Job.results to match any
185 |     # changes made to ResultsReader.
186 |     #
187 |     # This wouldn't be a class, just the _parse_results function below,
188 |     # except that you cannot get the current generator inside the
189 |     # function creating that generator. Thus it's all wrapped up for
190 |     # the sake of one field.
191 |     def __init__(self, stream):
192 |         # The search/jobs/exports endpoint, when run with
193 |         # earliest_time=rt and latest_time=rt streams a sequence of
194 |         # XML documents, each containing a result, as opposed to one
195 |         # results element containing lots of results. Python's XML
196 |         # parsers are broken, and instead of reading one full document
197 |         # and returning the stream that follows untouched, they
198 |         # destroy the stream and throw an error. To get around this,
199 |         # we remove all the DTD definitions inline, then wrap the
200 |         # fragments in a fiction <doc> element to make the parser happy.
201 |         stream = _XMLDTDFilter(stream)
202 |         stream = _ConcatenatedStream(BytesIO(b"<doc>"), stream, BytesIO(b"</doc>"))
203 |         self.is_preview = None
204 |         self._gen = self._parse_results(stream)
205 | 
206 |     def __iter__(self):
207 |         return self
208 | 
209 |     def next(self):
210 |         return next(self._gen)
211 | 
212 |     __next__ = next
213 | 
214 |     def _parse_results(self, stream):
215 |         """Parse results and messages out of *stream*."""
216 |         result = None
217 |         values = None
218 |         try:
219 |             for event, elem in et.iterparse(stream, events=('start', 'end')):
220 |                 if elem.tag == 'results' and event == 'start':
221 |                     # The wrapper element is a <results preview="0|1">. We
222 |                     # don't care about it except to tell is whether these
223 |                     # are preview results, or the final results from the
224 |                     # search.
225 |                     is_preview = elem.attrib['preview'] == '1'
226 |                     self.is_preview = is_preview
227 |                 if elem.tag == 'result':
228 |                     if event == 'start':
229 |                         result = OrderedDict()
230 |                     elif event == 'end':
231 |                         yield result
232 |                         result = None
233 |                         elem.clear()
234 | 
235 |                 elif elem.tag == 'field' and result is not None:
236 |                     # We need the 'result is not None' check because
237 |                     # 'field' is also the element name in the <meta>
238 |                     # header that gives field order, which is not what we
239 |                     # want at all.
240 |                     if event == 'start':
241 |                         values = []
242 |                     elif event == 'end':
243 |                         field_name = elem.attrib['k']
244 |                         if len(values) == 1:
245 |                             result[field_name] = values[0]
246 |                         else:
247 |                             result[field_name] = values
248 |                         # Calling .clear() is necessary to let the
249 |                         # element be garbage collected. Otherwise
250 |                         # arbitrarily large results sets will use
251 |                         # arbitrarily large memory intead of
252 |                         # streaming.
253 |                         elem.clear()
254 | 
255 |                 elif elem.tag in ('text', 'v') and event == 'end':
256 |                     try:
257 |                         text = "".join(elem.itertext())
258 |                     except AttributeError:
259 |                         # Assume we're running in Python < 2.7, before itertext() was added
260 |                         # So we'll define it here
261 | 
262 |                         def __itertext(self):
263 |                           tag = self.tag
264 |                           if not isinstance(tag, six.string_types) and tag is not None:
265 |                               return
266 |                           if self.text:
267 |                               yield self.text
268 |                           for e in self:
269 |                               for s in __itertext(e):
270 |                                   yield s
271 |                               if e.tail:
272 |                                   yield e.tail
273 | 
274 |                         text = "".join(__itertext(elem))
275 |                     values.append(text)
276 |                     elem.clear()
277 | 
278 |                 elif elem.tag == 'msg':
279 |                     if event == 'start':
280 |                         msg_type = elem.attrib['type']
281 |                     elif event == 'end':
282 |                         text = elem.text if elem.text is not None else ""
283 |                         yield Message(msg_type, text)
284 |                         elem.clear()
285 |         except SyntaxError as pe:
286 |             # This is here to handle the same incorrect return from
287 |             # splunk that is described in __init__.
288 |             if 'no element found' in pe.msg:
289 |                 return
290 |             else:
291 |                 raise
292 | 
293 | 
294 | 
295 | 
296 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/__init__.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright © 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | """
 18 | 
 19 | .. topic:: Design Notes
 20 | 
 21 |   1. Commands are constrained to this ABNF grammar::
 22 | 
 23 |         command       = command-name *[wsp option] *[wsp [dquote] field-name [dquote]]
 24 |         command-name  = alpha *( alpha / digit )
 25 |         option        = option-name [wsp] "=" [wsp] option-value
 26 |         option-name   = alpha *( alpha / digit / "_" )
 27 |         option-value  = word / quoted-string
 28 |         word          = 1*( %01-%08 / %0B / %0C / %0E-1F / %21 / %23-%FF ) ; Any character but DQUOTE and WSP
 29 |         quoted-string = dquote *( word / wsp / "\" dquote / dquote dquote ) dquote
 30 |         field-name    = ( "_" / alpha ) *( alpha / digit / "_" / "." / "-" )
 31 | 
 32 |      It does not show that :code:`field-name` values may be comma-separated. This is because Splunk strips commas from
 33 |      the command line. A search command will never see them.
 34 | 
 35 |   2. Search commands targeting versions of Splunk prior to 6.3 must be statically configured as follows:
 36 | 
 37 |      .. code-block:: text
 38 |         :linenos:
 39 | 
 40 |         [command_name]
 41 |         filename = command_name.py
 42 |         supports_getinfo = true
 43 |         supports_rawargs = true
 44 | 
 45 |      No other static configuration is required or expected and may interfere with command execution.
 46 | 
 47 |   3. Commands support dynamic probing for settings.
 48 | 
 49 |      Splunk probes for settings dynamically when :code:`supports_getinfo=true`.
 50 |      You must add this line to the commands.conf stanza for each of your search
 51 |      commands.
 52 | 
 53 |   4. Commands do not support parsed arguments on the command line.
 54 | 
 55 |      Splunk parses arguments when :code:`supports_rawargs=false`. The
 56 |      :code:`SearchCommand` class sets this value unconditionally. You cannot
 57 |      override it.
 58 | 
 59 |      **Rationale**
 60 | 
 61 |      Splunk parses arguments by stripping quotes, nothing more. This may be useful
 62 |      in some cases, but doesn't work well with our chosen grammar.
 63 | 
 64 |   5. Commands consume input headers.
 65 | 
 66 |      An input header is provided by Splunk when :code:`enableheader=true`. The
 67 |      :class:`SearchCommand` class sets this value unconditionally. You cannot
 68 |      override it.
 69 | 
 70 |   6. Commands produce an output messages header.
 71 | 
 72 |      Splunk expects a command to produce an output messages header when
 73 |      :code:`outputheader=true`. The :class:`SearchCommand` class sets this value
 74 |      unconditionally. You cannot override it.
 75 | 
 76 |   7. Commands support multi-value fields.
 77 | 
 78 |      Multi-value fields are provided and consumed by Splunk when
 79 |      :code:`supports_multivalue=true`. This value is fixed. You cannot override
 80 |      it.
 81 | 
 82 |   8. This module represents all fields on the output stream in multi-value
 83 |      format.
 84 | 
 85 |      Splunk recognizes two kinds of data: :code:`value` and :code:`list(value)`.
 86 |      The multi-value format represents these data in field pairs. Given field
 87 |      :code:`name` the multi-value format calls for the creation of this pair of
 88 |      fields.
 89 | 
 90 |      ================= =========================================================
 91 |      Field name         Field data
 92 |      ================= =========================================================
 93 |      :code:`name`      Value or text from which a list of values was derived.
 94 | 
 95 |      :code:`__mv_name` Empty, if :code:`field` represents a :code:`value`;
 96 |                        otherwise, an encoded :code:`list(value)`. Values in the
 97 |                        list are wrapped in dollar signs ($) and separated by
 98 |                        semi-colons (;). Dollar signs ($) within a value are
 99 |                        represented by a pair of dollar signs ($$).
100 |      ================= =========================================================
101 | 
102 |      Serializing data in this format enables streaming and reduces a command's
103 |      memory footprint at the cost of one extra byte of data per field per record
104 |      and a small amount of extra processing time by the next command in the
105 |      pipeline.
106 | 
107 |   9. A :class:`ReportingCommand` must override :meth:`~ReportingCommand.reduce`
108 |      and may override :meth:`~ReportingCommand.map`. Map/reduce commands on the
109 |      Splunk processing pipeline are distinguished as this example illustrates.
110 | 
111 |      **Splunk command**
112 | 
113 |      .. code-block:: text
114 | 
115 |          sum total=total_date_hour date_hour
116 | 
117 |      **Map command line**
118 | 
119 |      .. code-block:: text
120 | 
121 |         sum __GETINFO__ __map__ total=total_date_hour date_hour
122 |         sum __EXECUTE__ __map__ total=total_date_hour date_hour
123 | 
124 |      **Reduce command line**
125 | 
126 |      .. code-block:: text
127 | 
128 |         sum __GETINFO__ total=total_date_hour date_hour
129 |         sum __EXECUTE__ total=total_date_hour date_hour
130 | 
131 |      The :code:`__map__` argument is introduced by
132 |      :meth:`ReportingCommand._execute`. Search command authors cannot influence
133 |      the contents of the command line in this release.
134 | 
135 | .. topic:: References
136 | 
137 |   1. `Search command style guide <http://docs.splunk.com/Documentation/Splunk/6.0/Search/Searchcommandstyleguide>`__
138 | 
139 |   2. `Commands.conf.spec <http://docs.splunk.com/Documentation/Splunk/5.0.5/Admin/Commandsconf>`_
140 | 
141 | """
142 | 
143 | from __future__ import absolute_import, division, print_function, unicode_literals
144 | 
145 | from .environment import *
146 | from .decorators import *
147 | from .validators import *
148 | 
149 | from .generating_command import GeneratingCommand
150 | from .streaming_command import StreamingCommand
151 | from .eventing_command import EventingCommand
152 | from .reporting_command import ReportingCommand
153 | 
154 | from .external_search_command import execute, ExternalSearchCommand
155 | from .search_command import dispatch, SearchMetric
156 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/decorators.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright © 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | from splunklib import six
 19 | 
 20 | try:
 21 |     from collections import OrderedDict  # must be python 2.7
 22 | except ImportError:
 23 |     from ..ordereddict import OrderedDict
 24 | 
 25 | from inspect import getmembers, isclass, isfunction
 26 | from splunklib.six.moves import map as imap
 27 | 
 28 | from .internals import ConfigurationSettingsType, json_encode_string
 29 | from .validators import OptionName
 30 | 
 31 | 
 32 | class Configuration(object):
 33 |     """ Defines the configuration settings for a search command.
 34 | 
 35 |     Documents, validates, and ensures that only relevant configuration settings are applied. Adds a :code:`name` class
 36 |     variable to search command classes that don't have one. The :code:`name` is derived from the name of the class.
 37 |     By convention command class names end with the word "Command". To derive :code:`name` the word "Command" is removed
 38 |     from the end of the class name and then converted to lower case for conformance with the `Search command style guide
 39 |     <http://docs.splunk.com/Documentation/Splunk/latest/Search/Searchcommandstyleguide>`__
 40 | 
 41 |     """
 42 |     def __init__(self, o=None, **kwargs):
 43 |         #
 44 |         # The o argument enables the configuration decorator to be used with or without parentheses. For example, it
 45 |         # enables you to write code that looks like this:
 46 |         #
 47 |         #   @Configuration
 48 |         #   class Foo(SearchCommand):
 49 |         #       ...
 50 |         #
 51 |         #   @Configuration()
 52 |         #   class Bar(SearchCommand):
 53 |         #       ...
 54 |         #
 55 |         # Without the o argument, the Python compiler will complain about the first form. With the o argument, both
 56 |         # forms work. The first form provides a value for o: Foo. The second form does does not provide a value for o.
 57 |         # The class or method decorated is not passed to the constructor. A value of None is passed instead.
 58 |         #
 59 |         self.settings = kwargs
 60 | 
 61 |     def __call__(self, o):
 62 | 
 63 |         if isfunction(o):
 64 |             # We must wait to finalize configuration as the class containing this function is under construction
 65 |             # at the time this call to decorate a member function. This will be handled in the call to
 66 |             # o.ConfigurationSettings.fix_up(o) in the elif clause of this code block.
 67 |             o._settings = self.settings
 68 |         elif isclass(o):
 69 | 
 70 |             # Set command name
 71 | 
 72 |             name = o.__name__
 73 |             if name.endswith('Command'):
 74 |                 name = name[:-len('Command')]
 75 |             o.name = six.text_type(name.lower())
 76 | 
 77 |             # Construct ConfigurationSettings instance for the command class
 78 | 
 79 |             o.ConfigurationSettings = ConfigurationSettingsType(
 80 |                 module=o.__module__ + '.' + o.__name__,
 81 |                 name='ConfigurationSettings',
 82 |                 bases=(o.ConfigurationSettings,))
 83 | 
 84 |             ConfigurationSetting.fix_up(o.ConfigurationSettings, self.settings)
 85 |             o.ConfigurationSettings.fix_up(o)
 86 |             Option.fix_up(o)
 87 |         else:
 88 |             raise TypeError('Incorrect usage: Configuration decorator applied to {0}'.format(type(o), o.__name__))
 89 | 
 90 |         return o
 91 | 
 92 | 
 93 | class ConfigurationSetting(property):
 94 |     """ Generates a :class:`property` representing the named configuration setting
 95 | 
 96 |     This is a convenience function designed to reduce the amount of boiler-plate code you must write; most notably for
 97 |     property setters.
 98 | 
 99 |     :param name: Configuration setting name.
100 |     :type name: str or unicode
101 | 
102 |     :param doc: A documentation string.
103 |     :type doc: bytes, unicode or NoneType
104 | 
105 |     :param readonly: If true, specifies that the configuration setting is fixed.
106 |     :type name: bool or NoneType
107 | 
108 |     :param value: Configuration setting value.
109 | 
110 |     :return: A :class:`property` instance representing the configuration setting.
111 |     :rtype: property
112 | 
113 |     """
114 |     def __init__(self, fget=None, fset=None, fdel=None, doc=None, name=None, readonly=None, value=None):
115 |         property.__init__(self, fget=fget, fset=fset, fdel=fdel, doc=doc)
116 |         self._readonly = readonly
117 |         self._value = value
118 |         self._name = name
119 | 
120 |     def __call__(self, function):
121 |         return self.getter(function)
122 | 
123 |     def deleter(self, function):
124 |         return self._copy_extra_attributes(property.deleter(self, function))
125 | 
126 |     def getter(self, function):
127 |         return self._copy_extra_attributes(property.getter(self, function))
128 | 
129 |     def setter(self, function):
130 |         return self._copy_extra_attributes(property.setter(self, function))
131 | 
132 |     @staticmethod
133 |     def fix_up(cls, values):
134 | 
135 |         is_configuration_setting = lambda attribute: isinstance(attribute, ConfigurationSetting)
136 |         definitions = getmembers(cls, is_configuration_setting)
137 |         i = 0
138 | 
139 |         for name, setting in definitions:
140 | 
141 |             if setting._name is None:
142 |                 setting._name = name = six.text_type(name)
143 |             else:
144 |                 name = setting._name
145 | 
146 |             validate, specification = setting._get_specification()
147 |             backing_field_name = '_' + name
148 | 
149 |             if setting.fget is None and setting.fset is None and setting.fdel is None:
150 | 
151 |                 value = setting._value
152 | 
153 |                 if setting._readonly or value is not None:
154 |                     validate(specification, name, value)
155 | 
156 |                 def fget(bfn, value):
157 |                     return lambda this: getattr(this, bfn, value)
158 | 
159 |                 setting = setting.getter(fget(backing_field_name, value))
160 | 
161 |                 if not setting._readonly:
162 | 
163 |                     def fset(bfn, validate, specification, name):
164 |                         return lambda this, value: setattr(this, bfn, validate(specification, name, value))
165 | 
166 |                     setting = setting.setter(fset(backing_field_name, validate, specification, name))
167 | 
168 |                 setattr(cls, name, setting)
169 | 
170 |             def is_supported_by_protocol(supporting_protocols):
171 | 
172 |                 def is_supported_by_protocol(version):
173 |                     return version in supporting_protocols
174 | 
175 |                 return is_supported_by_protocol
176 | 
177 |             del setting._name, setting._value, setting._readonly
178 | 
179 |             setting.is_supported_by_protocol = is_supported_by_protocol(specification.supporting_protocols)
180 |             setting.supporting_protocols = specification.supporting_protocols
181 |             setting.backing_field_name = backing_field_name
182 |             definitions[i] = setting
183 |             setting.name = name
184 | 
185 |             i += 1
186 | 
187 |             try:
188 |                 value = values[name]
189 |             except KeyError:
190 |                 continue
191 | 
192 |             if setting.fset is None:
193 |                 raise ValueError('The value of configuration setting {} is fixed'.format(name))
194 | 
195 |             setattr(cls, backing_field_name, validate(specification, name, value))
196 |             del values[name]
197 | 
198 |         if len(values) > 0:
199 |             settings = sorted(list(six.iteritems(values)))
200 |             settings = imap(lambda n_v: '{}={}'.format(n_v[0], repr(n_v[1])), settings)
201 |             raise AttributeError('Inapplicable configuration settings: ' + ', '.join(settings))
202 | 
203 |         cls.configuration_setting_definitions = definitions
204 | 
205 |     def _copy_extra_attributes(self, other):
206 |         other._readonly = self._readonly
207 |         other._value = self._value
208 |         other._name = self._name
209 |         return other
210 | 
211 |     def _get_specification(self):
212 | 
213 |         name = self._name
214 | 
215 |         try:
216 |             specification = ConfigurationSettingsType.specification_matrix[name]
217 |         except KeyError:
218 |             raise AttributeError('Unknown configuration setting: {}={}'.format(name, repr(self._value)))
219 | 
220 |         return ConfigurationSettingsType.validate_configuration_setting, specification
221 | 
222 | 
223 | class Option(property):
224 |     """ Represents a search command option.
225 | 
226 |     Required options must be specified on the search command line.
227 | 
228 |     **Example:**
229 | 
230 |     Short form (recommended). When you are satisfied with built-in or custom validation behaviors.
231 | 
232 |     ..  code-block:: python
233 |         :linenos:
234 | 
235 |         from splunklib.searchcommands.decorators import Option
236 |         from splunklib.searchcommands.validators import Fieldname
237 | 
238 |         total = Option(
239 |             doc=''' **Syntax:** **total=***<fieldname>*
240 |             **Description:** Name of the field that will hold the computed
241 |             sum''',
242 |             require=True, validate=Fieldname())
243 | 
244 |     **Example:**
245 | 
246 |     Long form. Useful when you wish to manage the option value and its deleter/getter/setter side-effects yourself. You
247 |     must provide a getter and a setter. If your :code:`Option` requires `destruction <https://docs.python.org/2/reference/datamodel.html#object.__del__>`_ you must
248 |     also provide a deleter. You must be prepared to accept a value of :const:`None` which indicates that your
249 |     :code:`Option` is unset.
250 | 
251 |     ..  code-block:: python
252 |         :linenos:
253 | 
254 |         from splunklib.searchcommands import Option
255 | 
256 |         @Option()
257 |         def logging_configuration(self):
258 |             \""" **Syntax:** logging_configuration=<path>
259 |             **Description:** Loads an alternative logging configuration file for a command invocation. The logging
260 |             configuration file must be in Python ConfigParser-format. The *<path>* name and all path names specified in
261 |             configuration are relative to the app root directory.
262 | 
263 |             \"""
264 |             return self._logging_configuration
265 | 
266 |         @logging_configuration.setter
267 |         def logging_configuration(self, value):
268 |             if value is not None
269 |                 logging.configure(value)
270 |                 self._logging_configuration = value
271 | 
272 |         def __init__(self)
273 |             self._logging_configuration = None
274 | 
275 |     """
276 |     def __init__(self, fget=None, fset=None, fdel=None, doc=None, name=None, default=None, require=None, validate=None):
277 |         property.__init__(self, fget, fset, fdel, doc)
278 |         self.name = name
279 |         self.default = default
280 |         self.validate = validate
281 |         self.require = bool(require)
282 | 
283 |     def __call__(self, function):
284 |         return self.getter(function)
285 | 
286 |     # region Methods
287 | 
288 |     def deleter(self, function):
289 |         return self._copy_extra_attributes(property.deleter(self, function))
290 | 
291 |     def getter(self, function):
292 |         return self._copy_extra_attributes(property.getter(self, function))
293 | 
294 |     def setter(self, function):
295 |         return self._copy_extra_attributes(property.setter(self, function))
296 | 
297 |     @classmethod
298 |     def fix_up(cls, command_class):
299 | 
300 |         is_option = lambda attribute: isinstance(attribute, Option)
301 |         definitions = getmembers(command_class, is_option)
302 |         validate_option_name = OptionName()
303 |         i = 0
304 | 
305 |         for name, option in definitions:
306 | 
307 |             if option.name is None:
308 |                 option.name = name  # no validation required
309 |             else:
310 |                 validate_option_name(option.name)
311 | 
312 |             if option.fget is None and option.fset is None and option.fdel is None:
313 |                 backing_field_name = '_' + name
314 | 
315 |                 def fget(bfn):
316 |                     return lambda this: getattr(this, bfn, None)
317 | 
318 |                 option = option.getter(fget(backing_field_name))
319 | 
320 |                 def fset(bfn, validate):
321 |                     if validate is None:
322 |                         return lambda this, value: setattr(this, bfn, value)
323 |                     return lambda this, value: setattr(this, bfn, validate(value))
324 | 
325 |                 option = option.setter(fset(backing_field_name, option.validate))
326 |                 setattr(command_class, name, option)
327 | 
328 |             elif option.validate is not None:
329 | 
330 |                 def fset(function, validate):
331 |                     return lambda this, value: function(this, validate(value))
332 | 
333 |                 option = option.setter(fset(option.fset, option.validate))
334 |                 setattr(command_class, name, option)
335 | 
336 |             definitions[i] = name, option
337 |             i += 1
338 | 
339 |         command_class.option_definitions = definitions
340 | 
341 |     def _copy_extra_attributes(self, other):
342 |         other.name = self.name
343 |         other.default = self.default
344 |         other.require = self.require
345 |         other.validate = self.validate
346 |         return other
347 | 
348 |     # endregion
349 | 
350 |     # region Types
351 | 
352 |     class Item(object):
353 |         """ Presents an instance/class view over a search command `Option`.
354 | 
355 |         This class is used by SearchCommand.process to parse and report on option values.
356 | 
357 |         """
358 |         def __init__(self, command, option):
359 |             self._command = command
360 |             self._option = option
361 |             self._is_set = False
362 |             validator = self.validator
363 |             self._format = six.text_type if validator is None else validator.format
364 | 
365 |         def __repr__(self):
366 |             return '(' + repr(self.name) + ', ' + repr(self._format(self.value)) + ')'
367 | 
368 |         def __str__(self):
369 |             value = self.value
370 |             value = 'None' if value is None else json_encode_string(self._format(value))
371 |             return self.name + '=' + value
372 | 
373 |         # region Properties
374 | 
375 |         @property
376 |         def is_required(self):
377 |             return bool(self._option.require)
378 | 
379 |         @property
380 |         def is_set(self):
381 |             """ Indicates whether an option value was provided as argument.
382 | 
383 |             """
384 |             return self._is_set
385 | 
386 |         @property
387 |         def name(self):
388 |             return self._option.name
389 | 
390 |         @property
391 |         def validator(self):
392 |             return self._option.validate
393 | 
394 |         @property
395 |         def value(self):
396 |             return self._option.__get__(self._command)
397 | 
398 |         @value.setter
399 |         def value(self, value):
400 |             self._option.__set__(self._command, value)
401 |             self._is_set = True
402 | 
403 |         # endregion
404 | 
405 |         # region Methods
406 | 
407 |         def reset(self):
408 |             self._option.__set__(self._command, self._option.default)
409 |             self._is_set = False
410 | 
411 |         pass
412 |         # endregion
413 | 
414 |     class View(OrderedDict):
415 |         """ Presents an ordered dictionary view of the set of :class:`Option` arguments to a search command.
416 | 
417 |         This class is used by SearchCommand.process to parse and report on option values.
418 | 
419 |         """
420 |         def __init__(self, command):
421 |             definitions = type(command).option_definitions
422 |             item_class = Option.Item
423 |             OrderedDict.__init__(self, ((option.name, item_class(command, option)) for (name, option) in definitions))
424 | 
425 |         def __repr__(self):
426 |             text = 'Option.View([' + ','.join(imap(lambda item: repr(item), six.itervalues(self))) + '])'
427 |             return text
428 | 
429 |         def __str__(self):
430 |             text = ' '.join([str(item) for item in six.itervalues(self) if item.is_set])
431 |             return text
432 | 
433 |         # region Methods
434 | 
435 |         def get_missing(self):
436 |             missing = [item.name for item in six.itervalues(self) if item.is_required and not item.is_set]
437 |             return missing if len(missing) > 0 else None
438 | 
439 |         def reset(self):
440 |             for value in six.itervalues(self):
441 |                 value.reset()
442 | 
443 |         pass
444 |         # endregion
445 | 
446 |     pass
447 |     # endregion
448 | 
449 | 
450 | __all__ = ['Configuration', 'Option']
451 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/environment.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright © 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | 
 19 | from logging import getLogger, root, StreamHandler
 20 | from logging.config import fileConfig
 21 | from os import chdir, environ, path
 22 | from splunklib.six.moves import getcwd
 23 | 
 24 | import sys
 25 | 
 26 | 
 27 | def configure_logging(logger_name, filename=None):
 28 |     """ Configure logging and return the named logger and the location of the logging configuration file loaded.
 29 | 
 30 |     This function expects a Splunk app directory structure::
 31 | 
 32 |         <app-root>
 33 |             bin
 34 |                 ...
 35 |             default
 36 |                 ...
 37 |             local
 38 |                 ...
 39 | 
 40 |     This function looks for a logging configuration file at each of these locations, loading the first, if any,
 41 |     logging configuration file that it finds::
 42 | 
 43 |         local/{name}.logging.conf
 44 |         default/{name}.logging.conf
 45 |         local/logging.conf
 46 |         default/logging.conf
 47 | 
 48 |     The current working directory is set to *<app-root>* before the logging configuration file is loaded. Hence, paths
 49 |     in the logging configuration file are relative to *<app-root>*. The current directory is reset before return.
 50 | 
 51 |     You may short circuit the search for a logging configuration file by providing an alternative file location in
 52 |     `path`. Logging configuration files must be in `ConfigParser format`_.
 53 | 
 54 |     #Arguments:
 55 | 
 56 |     :param logger_name: Logger name
 57 |     :type logger_name: bytes, unicode
 58 | 
 59 |     :param filename: Location of an alternative logging configuration file or `None`.
 60 |     :type filename: bytes, unicode or NoneType
 61 | 
 62 |     :returns: The named logger and the location of the logging configuration file loaded.
 63 |     :rtype: tuple
 64 | 
 65 |     .. _ConfigParser format: https://docs.python.org/2/library/logging.config.html#configuration-file-format
 66 | 
 67 |     """
 68 |     if filename is None:
 69 |         if logger_name is None:
 70 |             probing_paths = [path.join('local', 'logging.conf'), path.join('default', 'logging.conf')]
 71 |         else:
 72 |             probing_paths = [
 73 |                 path.join('local', logger_name + '.logging.conf'),
 74 |                 path.join('default', logger_name + '.logging.conf'),
 75 |                 path.join('local', 'logging.conf'),
 76 |                 path.join('default', 'logging.conf')]
 77 |         for relative_path in probing_paths:
 78 |             configuration_file = path.join(app_root, relative_path)
 79 |             if path.exists(configuration_file):
 80 |                 filename = configuration_file
 81 |                 break
 82 |     elif not path.isabs(filename):
 83 |         found = False
 84 |         for conf in 'local', 'default':
 85 |             configuration_file = path.join(app_root, conf, filename)
 86 |             if path.exists(configuration_file):
 87 |                 filename = configuration_file
 88 |                 found = True
 89 |                 break
 90 |         if not found:
 91 |             raise ValueError('Logging configuration file "{}" not found in local or default directory'.format(filename))
 92 |     elif not path.exists(filename):
 93 |         raise ValueError('Logging configuration file "{}" not found'.format(filename))
 94 | 
 95 |     if filename is not None:
 96 |         global _current_logging_configuration_file
 97 |         filename = path.realpath(filename)
 98 | 
 99 |         if filename != _current_logging_configuration_file:
100 |             working_directory = getcwd()
101 |             chdir(app_root)
102 |             try:
103 |                 fileConfig(filename, {'SPLUNK_HOME': splunk_home})
104 |             finally:
105 |                 chdir(working_directory)
106 |             _current_logging_configuration_file = filename
107 | 
108 |     if len(root.handlers) == 0:
109 |         root.addHandler(StreamHandler())
110 | 
111 |     return None if logger_name is None else getLogger(logger_name), filename
112 | 
113 | 
114 | _current_logging_configuration_file = None
115 | 
116 | splunk_home = path.abspath(path.join(getcwd(), environ.get('SPLUNK_HOME', '')))
117 | app_file = getattr(sys.modules['__main__'], '__file__', sys.executable)
118 | app_root = path.dirname(path.abspath(path.dirname(app_file)))
119 | 
120 | splunklib_logger, logging_configuration = configure_logging('splunklib')
121 | 
122 | 
123 | __all__ = ['app_file', 'app_root', 'logging_configuration', 'splunk_home', 'splunklib_logger']
124 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/eventing_command.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | 
 19 | from splunklib import six
 20 | from splunklib.six.moves import map as imap
 21 | 
 22 | from .decorators import ConfigurationSetting
 23 | from .search_command import SearchCommand
 24 | 
 25 | 
 26 | class EventingCommand(SearchCommand):
 27 |     """ Applies a transformation to search results as they travel through the events pipeline.
 28 | 
 29 |     Eventing commands typically filter, group, order, and/or or augment event records. Examples of eventing commands
 30 |     from Splunk's built-in command set include sort_, dedup_, and cluster_. Each execution of an eventing command
 31 |     should produce a set of event records that is independently usable by downstream processors.
 32 | 
 33 |     .. _sort: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Sort
 34 |     .. _dedup: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Dedup
 35 |     .. _cluster: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Cluster
 36 | 
 37 |     EventingCommand configuration
 38 |     ==============================
 39 | 
 40 |     You can configure your command for operation under Search Command Protocol (SCP) version 1 or 2. SCP 2 requires
 41 |     Splunk 6.3 or later.
 42 | 
 43 |     """
 44 |     # region Methods
 45 | 
 46 |     def transform(self, records):
 47 |         """ Generator function that processes and yields event records to the Splunk events pipeline.
 48 | 
 49 |         You must override this method.
 50 | 
 51 |         """
 52 |         raise NotImplementedError('EventingCommand.transform(self, records)')
 53 | 
 54 |     def _execute(self, ifile, process):
 55 |         SearchCommand._execute(self, ifile, self.transform)
 56 | 
 57 |     # endregion
 58 | 
 59 |     class ConfigurationSettings(SearchCommand.ConfigurationSettings):
 60 |         """ Represents the configuration settings that apply to a :class:`EventingCommand`.
 61 | 
 62 |         """
 63 |         # region SCP v1/v2 properties
 64 | 
 65 |         required_fields = ConfigurationSetting(doc='''
 66 |             List of required fields for this search which back-propagates to the generating search.
 67 | 
 68 |             Setting this value enables selected fields mode under SCP 2. Under SCP 1 you must also specify
 69 |             :code:`clear_required_fields=True` to enable selected fields mode. To explicitly select all fields,
 70 |             specify a value of :const:`['*']`. No error is generated if a specified field is missing.
 71 | 
 72 |             Default: :const:`None`, which implicitly selects all fields.
 73 | 
 74 |             ''')
 75 | 
 76 |         # endregion
 77 | 
 78 |         # region SCP v1 properties
 79 | 
 80 |         clear_required_fields = ConfigurationSetting(doc='''
 81 |             :const:`True`, if required_fields represent the *only* fields required.
 82 | 
 83 |             If :const:`False`, required_fields are additive to any fields that may be required by subsequent commands.
 84 |             In most cases, :const:`False` is appropriate for eventing commands.
 85 | 
 86 |             Default: :const:`False`
 87 | 
 88 |             ''')
 89 | 
 90 |         retainsevents = ConfigurationSetting(readonly=True, value=True, doc='''
 91 |             :const:`True`, if the command retains events the way the sort/dedup/cluster commands do.
 92 | 
 93 |             If :const:`False`, the command transforms events the way the stats command does.
 94 | 
 95 |             Fixed: :const:`True`
 96 | 
 97 |             ''')
 98 | 
 99 |         # endregion
100 | 
101 |         # region SCP v2 properties
102 | 
103 |         maxinputs = ConfigurationSetting(doc='''
104 |             Specifies the maximum number of events that can be passed to the command for each invocation.
105 | 
106 |             This limit cannot exceed the value of `maxresultrows` as defined in limits.conf_. Under SCP 1 you must
107 |             specify this value in commands.conf_.
108 | 
109 |             Default: The value of `maxresultrows`.
110 | 
111 |             Supported by: SCP 2
112 | 
113 |             .. _limits.conf: http://docs.splunk.com/Documentation/Splunk/latest/admin/Limitsconf
114 | 
115 |             ''')
116 | 
117 |         type = ConfigurationSetting(readonly=True, value='events', doc='''
118 |             Command type
119 | 
120 |             Fixed: :const:`'events'`.
121 | 
122 |             Supported by: SCP 2
123 | 
124 |             ''')
125 | 
126 |         # endregion
127 | 
128 |         # region Methods
129 | 
130 |         @classmethod
131 |         def fix_up(cls, command):
132 |             """ Verifies :code:`command` class structure.
133 | 
134 |             """
135 |             if command.transform == EventingCommand.transform:
136 |                 raise AttributeError('No EventingCommand.transform override')
137 |             SearchCommand.ConfigurationSettings.fix_up(command)
138 | 
139 |         # TODO: Stop looking like a dictionary because we don't obey the semantics
140 |         # N.B.: Does not use Python 2 dict copy semantics
141 |         def iteritems(self):
142 |             iteritems = SearchCommand.ConfigurationSettings.iteritems(self)
143 |             return imap(lambda name_value: (name_value[0], 'events' if name_value[0] == 'type' else name_value[1]), iteritems)
144 | 
145 |         # N.B.: Does not use Python 3 dict view semantics
146 |         if not six.PY2:
147 |             items = iteritems
148 | 
149 |         # endregion
150 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/external_search_command.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | 
 19 | from logging import getLogger
 20 | import os
 21 | import sys
 22 | import traceback
 23 | from splunklib import six
 24 | 
 25 | if sys.platform == 'win32':
 26 |     from signal import signal, CTRL_BREAK_EVENT, SIGBREAK, SIGINT, SIGTERM
 27 |     from subprocess import Popen
 28 |     import atexit
 29 | 
 30 | from . import splunklib_logger as logger
 31 | 
 32 | # P1 [ ] TODO: Add ExternalSearchCommand class documentation
 33 | 
 34 | 
 35 | class ExternalSearchCommand(object):
 36 |     """
 37 |     """
 38 |     def __init__(self, path, argv=None, environ=None):
 39 | 
 40 |         if not isinstance(path, (bytes, six.text_type)):
 41 |             raise ValueError('Expected a string value for path, not {}'.format(repr(path)))
 42 | 
 43 |         self._logger = getLogger(self.__class__.__name__)
 44 |         self._path = six.text_type(path)
 45 |         self._argv = None
 46 |         self._environ = None
 47 | 
 48 |         self.argv = argv
 49 |         self.environ = environ
 50 | 
 51 |     # region Properties
 52 | 
 53 |     @property
 54 |     def argv(self):
 55 |         return getattr(self, '_argv')
 56 | 
 57 |     @argv.setter
 58 |     def argv(self, value):
 59 |         if not (value is None or isinstance(value, (list, tuple))):
 60 |             raise ValueError('Expected a list, tuple or value of None for argv, not {}'.format(repr(value)))
 61 |         self._argv = value
 62 | 
 63 |     @property
 64 |     def environ(self):
 65 |         return getattr(self, '_environ')
 66 | 
 67 |     @environ.setter
 68 |     def environ(self, value):
 69 |         if not (value is None or isinstance(value, dict)):
 70 |             raise ValueError('Expected a dictionary value for environ, not {}'.format(repr(value)))
 71 |         self._environ = value
 72 | 
 73 |     @property
 74 |     def logger(self):
 75 |         return self._logger
 76 | 
 77 |     @property
 78 |     def path(self):
 79 |         return self._path
 80 | 
 81 |     # endregion
 82 | 
 83 |     # region Methods
 84 | 
 85 |     def execute(self):
 86 |         # noinspection PyBroadException
 87 |         try:
 88 |             if self._argv is None:
 89 |                 self._argv = os.path.splitext(os.path.basename(self._path))[0]
 90 |             self._execute(self._path, self._argv, self._environ)
 91 |         except:
 92 |             error_type, error, tb = sys.exc_info()
 93 |             message = 'Command execution failed: ' + six.text_type(error)
 94 |             self._logger.error(message + '\nTraceback:\n' + ''.join(traceback.format_tb(tb)))
 95 |             sys.exit(1)
 96 | 
 97 |     if sys.platform == 'win32':
 98 | 
 99 |         @staticmethod
100 |         def _execute(path, argv=None, environ=None):
101 |             """ Executes an external search command.
102 | 
103 |             :param path: Path to the external search command.
104 |             :type path: unicode
105 | 
106 |             :param argv: Argument list.
107 |             :type argv: list or tuple
108 |                 The arguments to the child process should start with the name of the command being run, but this is not
109 |                 enforced. A value of :const:`None` specifies that the base name of path name :param:`path` should be used.
110 | 
111 |             :param environ: A mapping which is used to define the environment variables for the new process.
112 |             :type environ: dict or None.
113 |                 This mapping is used instead of the current process’s environment. A value of :const:`None` specifies that
114 |                 the :data:`os.environ` mapping should be used.
115 | 
116 |             :return: None
117 | 
118 |             """
119 |             search_path = os.getenv('PATH') if environ is None else environ.get('PATH')
120 |             found = ExternalSearchCommand._search_path(path, search_path)
121 | 
122 |             if found is None:
123 |                 raise ValueError('Cannot find command on path: {}'.format(path))
124 | 
125 |             path = found
126 |             logger.debug('starting command="%s", arguments=%s', path, argv)
127 | 
128 |             def terminate(signal_number, frame):
129 |                 sys.exit('External search command is terminating on receipt of signal={}.'.format(signal_number))
130 | 
131 |             def terminate_child():
132 |                 if p.pid is not None and p.returncode is None:
133 |                     logger.debug('terminating command="%s", arguments=%d, pid=%d', path, argv, p.pid)
134 |                     os.kill(p.pid, CTRL_BREAK_EVENT)
135 | 
136 |             p = Popen(argv, executable=path, env=environ, stdin=sys.stdin, stdout=sys.stdout, stderr=sys.stderr)
137 |             atexit.register(terminate_child)
138 |             signal(SIGBREAK, terminate)
139 |             signal(SIGINT, terminate)
140 |             signal(SIGTERM, terminate)
141 | 
142 |             logger.debug('started command="%s", arguments=%s, pid=%d', path, argv, p.pid)
143 |             p.wait()
144 | 
145 |             logger.debug('finished command="%s", arguments=%s, pid=%d, returncode=%d', path, argv, p.pid, p.returncode)
146 | 
147 |             if p.returncode != 0:
148 |                 sys.exit(p.returncode)
149 | 
150 |         @staticmethod
151 |         def _search_path(executable, paths):
152 |             """ Locates an executable program file.
153 | 
154 |             :param executable: The name of the executable program to locate.
155 |             :type executable: unicode
156 | 
157 |             :param paths: A list of one or more directory paths where executable programs are located.
158 |             :type paths: unicode
159 | 
160 |             :return:
161 |             :rtype: Path to the executable program located or :const:`None`.
162 | 
163 |             """
164 |             directory, filename = os.path.split(executable)
165 |             extension = os.path.splitext(filename)[1].upper()
166 |             executable_extensions = ExternalSearchCommand._executable_extensions
167 | 
168 |             if directory:
169 |                 if len(extension) and extension in executable_extensions:
170 |                     return None
171 |                 for extension in executable_extensions:
172 |                     path = executable + extension
173 |                     if os.path.isfile(path):
174 |                         return path
175 |                 return None
176 | 
177 |             if not paths:
178 |                 return None
179 | 
180 |             directories = [directory for directory in paths.split(';') if len(directory)]
181 | 
182 |             if len(directories) == 0:
183 |                 return None
184 | 
185 |             if len(extension) and extension in executable_extensions:
186 |                 for directory in directories:
187 |                     path = os.path.join(directory, executable)
188 |                     if os.path.isfile(path):
189 |                         return path
190 |                 return None
191 | 
192 |             for directory in directories:
193 |                 path_without_extension = os.path.join(directory, executable)
194 |                 for extension in executable_extensions:
195 |                     path = path_without_extension + extension
196 |                     if os.path.isfile(path):
197 |                         return path
198 | 
199 |             return None
200 | 
201 |         _executable_extensions = ('.COM', '.EXE')
202 |     else:
203 |         @staticmethod
204 |         def _execute(path, argv, environ):
205 |             if environ is None:
206 |                 os.execvp(path, argv)
207 |             else:
208 |                 os.execvpe(path, argv, environ)
209 |             return
210 | 
211 |     # endregion
212 | 
213 | 
214 | def execute(path, argv=None, environ=None, command_class=ExternalSearchCommand):
215 |     """
216 |     :param path:
217 |     :type path: basestring
218 |     :param argv:
219 |     :type: argv: list, tuple, or None
220 |     :param environ:
221 |     :type environ: dict
222 |     :param command_class: External search command class to instantiate and execute.
223 |     :type command_class: type
224 |     :return:
225 |     :rtype: None
226 |     """
227 |     assert issubclass(command_class, ExternalSearchCommand)
228 |     command_class(path, argv, environ).execute()
229 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/generating_command.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright © 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | 
 19 | from .decorators import ConfigurationSetting
 20 | from .search_command import SearchCommand
 21 | 
 22 | from splunklib import six
 23 | from splunklib.six.moves import map as imap, filter as ifilter
 24 | 
 25 | # P1 [O] TODO: Discuss generates_timeorder in the class-level documentation for GeneratingCommand
 26 | 
 27 | 
 28 | class GeneratingCommand(SearchCommand):
 29 |     """ Generates events based on command arguments.
 30 | 
 31 |     Generating commands receive no input and must be the first command on a pipeline. There are three pipelines:
 32 |     streams, events, and reports. The streams pipeline generates or processes time-ordered event records on an
 33 |     indexer or search head.
 34 | 
 35 |     Streaming commands filter, modify, or augment event records and can be applied to subsets of index data in a
 36 |     parallel manner. An example of a streaming command from Splunk's built-in command set is rex_ which extracts and
 37 |     adds fields to event records at search time. Records that pass through the streams pipeline move on to the events
 38 |     pipeline.
 39 | 
 40 |     The events pipeline generates or processes records on a search head. Eventing commands typically filter, group,
 41 |     order, or augment event records. Examples of eventing commands from Splunk's built-in command set include sort_,
 42 |     dedup_, and cluster_. Each execution of an eventing command should produce a set of event records that is
 43 |     independently usable by downstream processors. Records that pass through the events pipeline move on to the reports
 44 |     pipeline.
 45 | 
 46 |     The reports pipeline also runs on a search head, but yields data structures for presentation, not event records.
 47 |     Examples of streaming from Splunk's built-in command set include chart_, stats_, and contingency_.
 48 | 
 49 |     GeneratingCommand configuration
 50 |     ===============================
 51 | 
 52 |     Configure your generating command based on the pipeline that it targets. How you configure your command depends on
 53 |     the Search Command Protocol (SCP) version.
 54 | 
 55 |     +----------+-------------------------------------+--------------------------------------------+
 56 |     | Pipeline | SCP 1                               | SCP 2                                      |
 57 |     +==========+=====================================+============================================+
 58 |     | streams  | streaming=True[,local=[True|False]] | type='streaming'[,distributed=[true|false] |
 59 |     +----------+-------------------------------------+--------------------------------------------+
 60 |     | events   | retainsevents=True, streaming=False | type='events'                              |
 61 |     +----------+-------------------------------------+--------------------------------------------+
 62 |     | reports  | streaming=False                     | type='reporting'                           |
 63 |     +----------+-------------------------------------+--------------------------------------------+
 64 | 
 65 |     Only streaming commands may be distributed to indexers. By default generating commands are configured to run
 66 |     locally in the streams pipeline and will run under either SCP 1 or SCP 2.
 67 | 
 68 |     .. code-block:: python
 69 | 
 70 |         @Configuration()
 71 |         class StreamingGeneratingCommand(GeneratingCommand)
 72 |             ...
 73 | 
 74 |     How you configure your command to run on a different pipeline or in a distributed fashion depends on what SCP
 75 |     protocol versions you wish to support. You must be sure to configure your command consistently for each protocol,
 76 |     if you wish to support both protocol versions correctly.
 77 | 
 78 |     .. _chart: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Chart
 79 |     .. _cluster: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Cluster
 80 |     .. _contingency: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Contingency
 81 |     .. _dedup: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Dedup
 82 |     .. _rex: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Rex
 83 |     .. _sort: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Sort
 84 |     .. _stats: http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Stats
 85 | 
 86 |     Distributed Generating command
 87 |     ==============================
 88 | 
 89 |     Commands configured like this will run as the first command on search heads and/or indexers on the streams pipeline.
 90 | 
 91 |     +----------+---------------------------------------------------+---------------------------------------------------+
 92 |     | Pipeline | SCP 1                                             | SCP 2                                             |
 93 |     +==========+===================================================+===================================================+
 94 |     | streams  | 1. Add this line to your command's stanza in      | 1. Add this configuration setting to your code:   |
 95 |     |          |                                                   |                                                   |
 96 |     |          |    default/commands.conf::                        |    ..  code-block:: python                        |
 97 |     |          |                                                   |                                                   |
 98 |     |          |        local = false                              |        @Configuration(distributed=True)           |
 99 |     |          |                                                   |        class SomeCommand(GeneratingCommand)       |
100 |     |          |                                                   |            ...                                    |
101 |     |          | 2. Restart splunk                                 |                                                   |
102 |     |          |                                                   | 2. You are good to go; no need to restart Splunk  |
103 |     +----------+---------------------------------------------------+---------------------------------------------------+
104 | 
105 |     Eventing Generating command
106 |     ===========================
107 | 
108 |     Generating commands configured like this will run as the first command on a search head on the events pipeline.
109 | 
110 |     +----------+---------------------------------------------------+---------------------------------------------------+
111 |     | Pipeline | SCP 1                                             | SCP 2                                             |
112 |     +==========+===================================================+===================================================+
113 |     | events   | You have a choice. Add these configuration        | Add this configuration setting to your command    |
114 |     |          | settings to your command class:                   | setting to your command class:                    |
115 |     |          |                                                   |                                                   |
116 |     |          | .. code-block:: python                            | .. code-block:: python                            |
117 |     |          |                                                   |                                                   |
118 |     |          |     @Configuration(                               |     @Configuration(type='events')                 |
119 |     |          |         retainsevents=True, streaming=False)      |     class SomeCommand(GeneratingCommand)          |
120 |     |          |     class SomeCommand(GeneratingCommand)          |         ...                                       |
121 |     |          |         ...                                       |                                                   |
122 |     |          |                                                   |                                                   |
123 |     |          | Or add these lines to default/commands.conf:      |                                                   |
124 |     |          |                                                   |                                                   |
125 |     |          | ..  code-block:: text                             |                                                   |
126 |     |          |                                                   |                                                   |
127 |     |          |     retainsevents = true                          |                                                   |
128 |     |          |     streaming = false                             |                                                   |
129 |     +----------+---------------------------------------------------+---------------------------------------------------+
130 | 
131 |     Configure your command class like this, if you wish to support both protocols:
132 | 
133 |     ..  code-block:: python
134 | 
135 |         @Configuration(type='events', retainsevents=True, streaming=False)
136 |         class SomeCommand(GeneratingCommand)
137 |             ...
138 | 
139 |     You might also consider adding these lines to commands.conf instead of adding them to your command class:
140 | 
141 |     ..  code-block:: python
142 | 
143 |         retainsevents = false
144 |         streaming = false
145 | 
146 |     Reporting Generating command
147 |     ============================
148 | 
149 |     Commands configured like this will run as the first command on a search head on the reports pipeline.
150 | 
151 |     +----------+---------------------------------------------------+---------------------------------------------------+
152 |     | Pipeline | SCP 1                                             | SCP 2                                             |
153 |     +==========+===================================================+===================================================+
154 |     | events   | You have a choice. Add these configuration        | Add this configuration setting to your command    |
155 |     |          | settings to your command class:                   | setting to your command class:                    |
156 |     |          |                                                   |                                                   |
157 |     |          | .. code-block:: python                            | .. code-block:: python                            |
158 |     |          |                                                   |                                                   |
159 |     |          |     @Configuration(retainsevents=False)           |     @Configuration(type='reporting')              |
160 |     |          |     class SomeCommand(GeneratingCommand)          |     class SomeCommand(GeneratingCommand)          |
161 |     |          |         ...                                       |         ...                                       |
162 |     |          |                                                   |                                                   |
163 |     |          | Or add this lines to default/commands.conf:       |                                                   |
164 |     |          |                                                   |                                                   |
165 |     |          | .. code-block:: text                              |                                                   |
166 |     |          |                                                   |                                                   |
167 |     |          |     retainsevents = false                         |                                                   |
168 |     |          |     streaming = false                             |                                                   |
169 |     +----------+---------------------------------------------------+---------------------------------------------------+
170 | 
171 |     Configure your command class like this, if you wish to support both protocols:
172 | 
173 |     ..  code-block:: python
174 | 
175 |         @Configuration(type='reporting', streaming=False)
176 |         class SomeCommand(GeneratingCommand)
177 |             ...
178 | 
179 |     You might also consider adding these lines to commands.conf instead of adding them to your command class:
180 | 
181 |     ..  code-block:: text
182 | 
183 |         retainsevents = false
184 |         streaming = false
185 | 
186 |     """
187 |     # region Methods
188 | 
189 |     def generate(self):
190 |         """ A generator that yields records to the Splunk processing pipeline
191 | 
192 |         You must override this method.
193 | 
194 |         """
195 |         raise NotImplementedError('GeneratingCommand.generate(self)')
196 | 
197 |     def _execute(self, ifile, process):
198 |         """ Execution loop
199 | 
200 |         :param ifile: Input file object. Unused.
201 |         :type ifile: file
202 | 
203 |         :return: `None`.
204 | 
205 |         """
206 |         if self._protocol_version == 2:
207 |             result = self._read_chunk(self._as_binary_stream(ifile))
208 | 
209 |             if not result:
210 |                 return
211 | 
212 |             metadata, body = result
213 |             action = getattr(metadata, 'action', None)
214 | 
215 |             if action != 'execute':
216 |                 raise RuntimeError('Expected execute action, not {}'.format(action))
217 | 
218 |         self._record_writer.write_records(self.generate())
219 |         self.finish()
220 | 
221 |     # endregion
222 | 
223 |     # region Types
224 | 
225 |     class ConfigurationSettings(SearchCommand.ConfigurationSettings):
226 |         """ Represents the configuration settings for a :code:`GeneratingCommand` class.
227 | 
228 |         """
229 |         # region SCP v1/v2 Properties
230 | 
231 |         generating = ConfigurationSetting(readonly=True, value=True, doc='''
232 |             Tells Splunk that this command generates events, but does not process inputs.
233 | 
234 |             Generating commands must appear at the front of the search pipeline identified by :meth:`type`.
235 | 
236 |             Fixed: :const:`True`
237 | 
238 |             Supported by: SCP 1, SCP 2
239 | 
240 |             ''')
241 | 
242 |         # endregion
243 | 
244 |         # region SCP v1 Properties
245 | 
246 |         generates_timeorder = ConfigurationSetting(doc='''
247 |             :const:`True`, if the command generates new events.
248 | 
249 |             Default: :const:`False`
250 | 
251 |             Supported by: SCP 1
252 | 
253 |             ''')
254 | 
255 |         local = ConfigurationSetting(doc='''
256 |             :const:`True`, if the command should run locally on the search head.
257 | 
258 |             Default: :const:`False`
259 | 
260 |             Supported by: SCP 1
261 | 
262 |             ''')
263 | 
264 |         retainsevents = ConfigurationSetting(doc='''
265 |             :const:`True`, if the command retains events the way the sort, dedup, and cluster commands do, or whether it
266 |             transforms them the way the stats command does.
267 | 
268 |             Default: :const:`False`
269 | 
270 |             Supported by: SCP 1
271 | 
272 |             ''')
273 | 
274 |         streaming = ConfigurationSetting(doc='''
275 |             :const:`True`, if the command is streamable.
276 | 
277 |             Default: :const:`True`
278 | 
279 |             Supported by: SCP 1
280 | 
281 |             ''')
282 | 
283 |         # endregion
284 | 
285 |         # region SCP v2 Properties
286 | 
287 |         distributed = ConfigurationSetting(value=False, doc='''
288 |             True, if this command should be distributed to indexers.
289 | 
290 |             This value is ignored unless :meth:`type` is equal to :const:`streaming`. It is only this command type that
291 |             may be distributed.
292 | 
293 |             Default: :const:`False`
294 | 
295 |             Supported by: SCP 2
296 | 
297 |             ''')
298 | 
299 |         type = ConfigurationSetting(value='streaming', doc='''
300 |             A command type name.
301 | 
302 |             ====================  ======================================================================================
303 |             Value                 Description
304 |             --------------------  --------------------------------------------------------------------------------------
305 |             :const:`'events'`     Runs as the first command in the Splunk events pipeline. Cannot be distributed.
306 |             :const:`'reporting'`  Runs as the first command in the Splunk reports pipeline. Cannot be distributed.
307 |             :const:`'streaming'`  Runs as the first command in the Splunk streams pipeline. May be distributed.
308 |             ====================  ======================================================================================
309 | 
310 |             Default: :const:`'streaming'`
311 | 
312 |             Supported by: SCP 2
313 | 
314 |             ''')
315 | 
316 |         # endregion
317 | 
318 |         # region Methods
319 | 
320 |         @classmethod
321 |         def fix_up(cls, command):
322 |             """ Verifies :code:`command` class structure.
323 | 
324 |             """
325 |             if command.generate == GeneratingCommand.generate:
326 |                 raise AttributeError('No GeneratingCommand.generate override')
327 | 
328 |         # TODO: Stop looking like a dictionary because we don't obey the semantics
329 |         # N.B.: Does not use Python 2 dict copy semantics
330 |         def iteritems(self):
331 |             iteritems = SearchCommand.ConfigurationSettings.iteritems(self)
332 |             version = self.command.protocol_version
333 |             if version == 2:
334 |                 iteritems = ifilter(lambda name_value1: name_value1[0] != 'distributed', iteritems)
335 |                 if not self.distributed and self.type == 'streaming':
336 |                     iteritems = imap(
337 |                         lambda name_value: (name_value[0], 'stateful') if name_value[0] == 'type' else (name_value[0], name_value[1]), iteritems)
338 |             return iteritems
339 | 
340 |         # N.B.: Does not use Python 3 dict view semantics
341 |         if not six.PY2:
342 |             items = iteritems
343 | 
344 |         pass
345 |         # endregion
346 | 
347 |     pass
348 |     # endregion
349 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/reporting_command.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright © 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | 
 19 | from itertools import chain
 20 | 
 21 | from .internals import ConfigurationSettingsType, json_encode_string
 22 | from .decorators import ConfigurationSetting, Option
 23 | from .streaming_command import StreamingCommand
 24 | from .search_command import SearchCommand
 25 | from .validators import Set
 26 | from splunklib import six
 27 | 
 28 | 
 29 | class ReportingCommand(SearchCommand):
 30 |     """ Processes search result records and generates a reporting data structure.
 31 | 
 32 |     Reporting search commands run as either reduce or map/reduce operations. The reduce part runs on a search head and
 33 |     is responsible for processing a single chunk of search results to produce the command's reporting data structure.
 34 |     The map part is called a streaming preop. It feeds the reduce part with partial results and by default runs on the
 35 |     search head and/or one or more indexers.
 36 | 
 37 |     You must implement a :meth:`reduce` method as a generator function that iterates over a set of event records and
 38 |     yields a reporting data structure. You may implement a :meth:`map` method as a generator function that iterates
 39 |     over a set of event records and yields :class:`dict` or :class:`list(dict)` instances.
 40 | 
 41 |     ReportingCommand configuration
 42 |     ==============================
 43 | 
 44 |     Configure the :meth:`map` operation using a Configuration decorator on your :meth:`map` method. Configure it like
 45 |     you would a :class:`StreamingCommand`. Configure the :meth:`reduce` operation using a Configuration decorator on
 46 |     your :meth:`ReportingCommand` class.
 47 | 
 48 |     You can configure your command for operation under Search Command Protocol (SCP) version 1 or 2. SCP 2 requires
 49 |     Splunk 6.3 or later.
 50 | 
 51 |     """
 52 |     # region Special methods
 53 | 
 54 |     def __init__(self):
 55 |         SearchCommand.__init__(self)
 56 | 
 57 |     # endregion
 58 | 
 59 |     # region Options
 60 | 
 61 |     phase = Option(doc='''
 62 |         **Syntax:** phase=[map|reduce]
 63 | 
 64 |         **Description:** Identifies the phase of the current map-reduce operation.
 65 | 
 66 |     ''', default='reduce', validate=Set('map', 'reduce'))
 67 | 
 68 |     # endregion
 69 | 
 70 |     # region Methods
 71 | 
 72 |     def map(self, records):
 73 |         """ Override this method to compute partial results.
 74 | 
 75 |         :param records:
 76 |         :type records:
 77 | 
 78 |         You must override this method, if :code:`requires_preop=True`.
 79 | 
 80 |         """
 81 |         return NotImplemented
 82 | 
 83 |     def prepare(self):
 84 | 
 85 |         phase = self.phase
 86 | 
 87 |         if phase == 'map':
 88 |             # noinspection PyUnresolvedReferences
 89 |             self._configuration = self.map.ConfigurationSettings(self)
 90 |             return
 91 | 
 92 |         if phase == 'reduce':
 93 |             streaming_preop = chain((self.name, 'phase="map"', str(self._options)), self.fieldnames)
 94 |             self._configuration.streaming_preop = ' '.join(streaming_preop)
 95 |             return
 96 | 
 97 |         raise RuntimeError('Unrecognized reporting command phase: {}'.format(json_encode_string(six.text_type(phase))))
 98 | 
 99 |     def reduce(self, records):
100 |         """ Override this method to produce a reporting data structure.
101 | 
102 |         You must override this method.
103 | 
104 |         """
105 |         raise NotImplementedError('reduce(self, records)')
106 | 
107 |     def _execute(self, ifile, process):
108 |         SearchCommand._execute(self, ifile, getattr(self, self.phase))
109 | 
110 |     # endregion
111 | 
112 |     # region Types
113 | 
114 |     class ConfigurationSettings(SearchCommand.ConfigurationSettings):
115 |         """ Represents the configuration settings for a :code:`ReportingCommand`.
116 | 
117 |         """
118 |         # region SCP v1/v2 Properties
119 | 
120 |         required_fields = ConfigurationSetting(doc='''
121 |             List of required fields for this search which back-propagates to the generating search.
122 | 
123 |             Setting this value enables selected fields mode under SCP 2. Under SCP 1 you must also specify
124 |             :code:`clear_required_fields=True` to enable selected fields mode. To explicitly select all fields,
125 |             specify a value of :const:`['*']`. No error is generated if a specified field is missing.
126 | 
127 |             Default: :const:`None`, which implicitly selects all fields.
128 | 
129 |             Supported by: SCP 1, SCP 2
130 | 
131 |             ''')
132 | 
133 |         requires_preop = ConfigurationSetting(doc='''
134 |             Indicates whether :meth:`ReportingCommand.map` is required for proper command execution.
135 | 
136 |             If :const:`True`, :meth:`ReportingCommand.map` is guaranteed to be called. If :const:`False`, Splunk
137 |             considers it to be an optimization that may be skipped.
138 | 
139 |             Default: :const:`False`
140 | 
141 |             Supported by: SCP 1, SCP 2
142 | 
143 |             ''')
144 | 
145 |         streaming_preop = ConfigurationSetting(doc='''
146 |             Denotes the requested streaming preop search string.
147 | 
148 |             Computed.
149 | 
150 |             Supported by: SCP 1, SCP 2
151 | 
152 |             ''')
153 | 
154 |         # endregion
155 | 
156 |         # region SCP v1 Properties
157 | 
158 |         clear_required_fields = ConfigurationSetting(doc='''
159 |             :const:`True`, if required_fields represent the *only* fields required.
160 | 
161 |             If :const:`False`, required_fields are additive to any fields that may be required by subsequent commands.
162 |             In most cases, :const:`True` is appropriate for reporting commands.
163 | 
164 |             Default: :const:`True`
165 | 
166 |             Supported by: SCP 1
167 | 
168 |             ''')
169 | 
170 |         retainsevents = ConfigurationSetting(readonly=True, value=False, doc='''
171 |             Signals that :meth:`ReportingCommand.reduce` transforms _raw events to produce a reporting data structure.
172 | 
173 |             Fixed: :const:`False`
174 | 
175 |             Supported by: SCP 1
176 | 
177 |             ''')
178 | 
179 |         streaming = ConfigurationSetting(readonly=True, value=False, doc='''
180 |             Signals that :meth:`ReportingCommand.reduce` runs on the search head.
181 | 
182 |             Fixed: :const:`False`
183 | 
184 |             Supported by: SCP 1
185 | 
186 |             ''')
187 | 
188 |         # endregion
189 | 
190 |         # region SCP v2 Properties
191 | 
192 |         maxinputs = ConfigurationSetting(doc='''
193 |             Specifies the maximum number of events that can be passed to the command for each invocation.
194 | 
195 |             This limit cannot exceed the value of `maxresultrows` in limits.conf_. Under SCP 1 you must specify this
196 |             value in commands.conf_.
197 | 
198 |             Default: The value of `maxresultrows`.
199 | 
200 |             Supported by: SCP 2
201 | 
202 |             .. _limits.conf: http://docs.splunk.com/Documentation/Splunk/latest/admin/Limitsconf
203 | 
204 |             ''')
205 | 
206 |         run_in_preview = ConfigurationSetting(doc='''
207 |             :const:`True`, if this command should be run to generate results for preview; not wait for final output.
208 | 
209 |             This may be important for commands that have side effects (e.g., outputlookup).
210 | 
211 |             Default: :const:`True`
212 | 
213 |             Supported by: SCP 2
214 | 
215 |             ''')
216 | 
217 |         type = ConfigurationSetting(readonly=True, value='reporting', doc='''
218 |             Command type name.
219 | 
220 |             Fixed: :const:`'reporting'`.
221 | 
222 |             Supported by: SCP 2
223 | 
224 |             ''')
225 | 
226 |         # endregion
227 | 
228 |         # region Methods
229 | 
230 |         @classmethod
231 |         def fix_up(cls, command):
232 |             """ Verifies :code:`command` class structure and configures the :code:`command.map` method.
233 | 
234 |             Verifies that :code:`command` derives from :class:`ReportingCommand` and overrides
235 |             :code:`ReportingCommand.reduce`. It then configures :code:`command.reduce`, if an overriding implementation
236 |             of :code:`ReportingCommand.reduce` has been provided.
237 | 
238 |             :param command: :code:`ReportingCommand` class
239 | 
240 |             Exceptions:
241 | 
242 |             :code:`TypeError` :code:`command` class is not derived from :code:`ReportingCommand`
243 |             :code:`AttributeError` No :code:`ReportingCommand.reduce` override
244 | 
245 |             """
246 |             if not issubclass(command, ReportingCommand):
247 |                 raise TypeError('{} is not a ReportingCommand'.format( command))
248 | 
249 |             if command.reduce == ReportingCommand.reduce:
250 |                 raise AttributeError('No ReportingCommand.reduce override')
251 | 
252 |             if command.map == ReportingCommand.map:
253 |                 cls._requires_preop = False
254 |                 return
255 | 
256 |             f = vars(command)['map']   # Function backing the map method
257 | 
258 |             # EXPLANATION OF PREVIOUS STATEMENT: There is no way to add custom attributes to methods. See [Why does
259 |             # setattr fail on a method](http://stackoverflow.com/questions/7891277/why-does-setattr-fail-on-a-bound-method) for a discussion of this issue.
260 | 
261 |             try:
262 |                 settings = f._settings
263 |             except AttributeError:
264 |                 f.ConfigurationSettings = StreamingCommand.ConfigurationSettings
265 |                 return
266 | 
267 |             # Create new StreamingCommand.ConfigurationSettings class
268 | 
269 |             module = command.__module__ + '.' + command.__name__ + '.map'
270 |             name = b'ConfigurationSettings'
271 |             bases = (StreamingCommand.ConfigurationSettings,)
272 | 
273 |             f.ConfigurationSettings = ConfigurationSettingsType(module, name, bases)
274 |             ConfigurationSetting.fix_up(f.ConfigurationSettings, settings)
275 |             del f._settings
276 | 
277 |         pass
278 |         # endregion
279 | 
280 |     pass
281 |     # endregion
282 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/streaming_command.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | 
 19 | from splunklib import six
 20 | from splunklib.six.moves import map as imap, filter as ifilter
 21 | 
 22 | from .decorators import ConfigurationSetting
 23 | from .search_command import SearchCommand
 24 | 
 25 | 
 26 | class StreamingCommand(SearchCommand):
 27 |     """ Applies a transformation to search results as they travel through the streams pipeline.
 28 | 
 29 |     Streaming commands typically filter, augment, or update, search result records. Splunk will send them in batches of
 30 |     up to 50,000 records. Hence, a search command must be prepared to be invoked many times during the course of
 31 |     pipeline processing. Each invocation should produce a set of results independently usable by downstream processors.
 32 | 
 33 |     By default Splunk may choose to run a streaming command locally on a search head and/or remotely on one or more
 34 |     indexers concurrently. The size and frequency of the search result batches sent to the command will vary based
 35 |     on scheduling considerations.
 36 | 
 37 |     StreamingCommand configuration
 38 |     ==============================
 39 | 
 40 |     You can configure your command for operation under Search Command Protocol (SCP) version 1 or 2. SCP 2 requires
 41 |     Splunk 6.3 or later.
 42 | 
 43 |     """
 44 |     # region Methods
 45 | 
 46 |     def stream(self, records):
 47 |         """ Generator function that processes and yields event records to the Splunk stream pipeline.
 48 | 
 49 |         You must override this method.
 50 | 
 51 |         """
 52 |         raise NotImplementedError('StreamingCommand.stream(self, records)')
 53 | 
 54 |     def _execute(self, ifile, process):
 55 |         SearchCommand._execute(self, ifile, self.stream)
 56 | 
 57 |     # endregion
 58 | 
 59 |     class ConfigurationSettings(SearchCommand.ConfigurationSettings):
 60 |         """ Represents the configuration settings that apply to a :class:`StreamingCommand`.
 61 | 
 62 |         """
 63 |         # region SCP v1/v2 properties
 64 | 
 65 |         required_fields = ConfigurationSetting(doc='''
 66 |             List of required fields for this search which back-propagates to the generating search.
 67 | 
 68 |             Setting this value enables selected fields mode under SCP 2. Under SCP 1 you must also specify
 69 |             :code:`clear_required_fields=True` to enable selected fields mode. To explicitly select all fields,
 70 |             specify a value of :const:`['*']`. No error is generated if a specified field is missing.
 71 | 
 72 |             Default: :const:`None`, which implicitly selects all fields.
 73 | 
 74 |             Supported by: SCP 1, SCP 2
 75 | 
 76 |             ''')
 77 | 
 78 |         # endregion
 79 | 
 80 |         # region SCP v1 properties
 81 | 
 82 |         clear_required_fields = ConfigurationSetting(doc='''
 83 |             :const:`True`, if required_fields represent the *only* fields required.
 84 | 
 85 |             If :const:`False`, required_fields are additive to any fields that may be required by subsequent commands.
 86 |             In most cases, :const:`False` is appropriate for streaming commands.
 87 | 
 88 |             Default: :const:`False`
 89 | 
 90 |             Supported by: SCP 1
 91 | 
 92 |             ''')
 93 | 
 94 |         local = ConfigurationSetting(doc='''
 95 |             :const:`True`, if the command should run locally on the search head.
 96 | 
 97 |             Default: :const:`False`
 98 | 
 99 |             Supported by: SCP 1
100 | 
101 |             ''')
102 | 
103 |         overrides_timeorder = ConfigurationSetting(doc='''
104 |             :const:`True`, if the command changes the order of events with respect to time.
105 | 
106 |             Default: :const:`False`
107 | 
108 |             Supported by: SCP 1
109 | 
110 |             ''')
111 | 
112 |         streaming = ConfigurationSetting(readonly=True, value=True, doc='''
113 |             Specifies that the command is streamable.
114 | 
115 |             Fixed: :const:`True`
116 | 
117 |             Supported by: SCP 1
118 | 
119 |             ''')
120 | 
121 |         # endregion
122 | 
123 |         # region SCP v2 Properties
124 | 
125 |         distributed = ConfigurationSetting(value=True, doc='''
126 |             :const:`True`, if this command should be distributed to indexers.
127 | 
128 |             Under SCP 1 you must either specify `local = False` or include this line in commands.conf_, if this command
129 |             should be distributed to indexers.
130 | 
131 |             ..code:
132 |                 local = true
133 | 
134 |             Default: :const:`True`
135 | 
136 |             Supported by: SCP 2
137 | 
138 |             .. commands.conf_: http://docs.splunk.com/Documentation/Splunk/latest/Admin/Commandsconf
139 | 
140 |             ''')
141 | 
142 |         maxinputs = ConfigurationSetting(doc='''
143 |             Specifies the maximum number of events that can be passed to the command for each invocation.
144 | 
145 |             This limit cannot exceed the value of `maxresultrows` in limits.conf. Under SCP 1 you must specify this
146 |             value in commands.conf_.
147 | 
148 |             Default: The value of `maxresultrows`.
149 | 
150 |             Supported by: SCP 2
151 | 
152 |             ''')
153 | 
154 |         type = ConfigurationSetting(readonly=True, value='streaming', doc='''
155 |             Command type name.
156 | 
157 |             Fixed: :const:`'streaming'`
158 | 
159 |             Supported by: SCP 2
160 | 
161 |             ''')
162 | 
163 |         # endregion
164 | 
165 |         # region Methods
166 | 
167 |         @classmethod
168 |         def fix_up(cls, command):
169 |             """ Verifies :code:`command` class structure.
170 | 
171 |             """
172 |             if command.stream == StreamingCommand.stream:
173 |                 raise AttributeError('No StreamingCommand.stream override')
174 |             return
175 | 
176 |         # TODO: Stop looking like a dictionary because we don't obey the semantics
177 |         # N.B.: Does not use Python 2 dict copy semantics
178 |         def iteritems(self):
179 |             iteritems = SearchCommand.ConfigurationSettings.iteritems(self)
180 |             version = self.command.protocol_version
181 |             if version == 1:
182 |                 if self.required_fields is None:
183 |                     iteritems = ifilter(lambda name_value: name_value[0] != 'clear_required_fields', iteritems)
184 |             else:
185 |                 iteritems = ifilter(lambda name_value2: name_value2[0] != 'distributed', iteritems)
186 |                 if not self.distributed:
187 |                     iteritems = imap(
188 |                         lambda name_value1: (name_value1[0], 'stateful') if name_value1[0] == 'type' else (name_value1[0], name_value1[1]), iteritems)
189 |             return iteritems
190 | 
191 |         # N.B.: Does not use Python 3 dict view semantics
192 |         if not six.PY2:
193 |             items = iteritems
194 | 
195 |         # endregion
196 | 


--------------------------------------------------------------------------------
/lib/splunklib/searchcommands/validators.py:
--------------------------------------------------------------------------------
  1 | # coding=utf-8
  2 | #
  3 | # Copyright 2011-2015 Splunk, Inc.
  4 | #
  5 | # Licensed under the Apache License, Version 2.0 (the "License"): you may
  6 | # not use this file except in compliance with the License. You may obtain
  7 | # a copy of the License at
  8 | #
  9 | #     http://www.apache.org/licenses/LICENSE-2.0
 10 | #
 11 | # Unless required by applicable law or agreed to in writing, software
 12 | # distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
 13 | # WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
 14 | # License for the specific language governing permissions and limitations
 15 | # under the License.
 16 | 
 17 | from __future__ import absolute_import, division, print_function, unicode_literals
 18 | 
 19 | from json.encoder import encode_basestring_ascii as json_encode_string
 20 | from collections import namedtuple
 21 | from splunklib.six.moves import StringIO
 22 | from io import open
 23 | import csv
 24 | import os
 25 | import re
 26 | from splunklib import six
 27 | from splunklib.six.moves import getcwd
 28 | 
 29 | 
 30 | class Validator(object):
 31 |     """ Base class for validators that check and format search command options.
 32 | 
 33 |     You must inherit from this class and override :code:`Validator.__call__` and
 34 |     :code:`Validator.format`. :code:`Validator.__call__` should convert the
 35 |     value it receives as argument and then return it or raise a
 36 |     :code:`ValueError`, if the value will not convert.
 37 | 
 38 |     :code:`Validator.format` should return a human readable version of the value
 39 |     it receives as argument the same way :code:`str` does.
 40 | 
 41 |     """
 42 |     def __call__(self, value):
 43 |         raise NotImplementedError()
 44 | 
 45 |     def format(self, value):
 46 |         raise NotImplementedError()
 47 | 
 48 | 
 49 | class Boolean(Validator):
 50 |     """ Validates Boolean option values.
 51 | 
 52 |     """
 53 |     truth_values = {
 54 |         '1': True, '0': False,
 55 |         't': True, 'f': False,
 56 |         'true': True, 'false': False,
 57 |         'y': True, 'n': False,
 58 |         'yes': True, 'no': False
 59 |     }
 60 | 
 61 |     def __call__(self, value):
 62 |         if not (value is None or isinstance(value, bool)):
 63 |             value = six.text_type(value).lower()
 64 |             if value not in Boolean.truth_values:
 65 |                 raise ValueError('Unrecognized truth value: {0}'.format(value))
 66 |             value = Boolean.truth_values[value]
 67 |         return value
 68 | 
 69 |     def format(self, value):
 70 |         return None if value is None else 't' if value else 'f'
 71 | 
 72 | 
 73 | class Code(Validator):
 74 |     """ Validates code option values.
 75 | 
 76 |     This validator compiles an option value into a Python code object that can be executed by :func:`exec` or evaluated
 77 |     by :func:`eval`. The value returned is a :func:`namedtuple` with two members: object, the result of compilation, and
 78 |     source, the original option value.
 79 | 
 80 |     """
 81 |     def __init__(self, mode='eval'):
 82 |         """
 83 |         :param mode: Specifies what kind of code must be compiled; it can be :const:`'exec'`, if source consists of a
 84 |             sequence of statements, :const:`'eval'`, if it consists of a single expression, or :const:`'single'` if it
 85 |             consists of a single interactive statement. In the latter case, expression statements that evaluate to
 86 |             something other than :const:`None` will be printed.
 87 |         :type mode: unicode or bytes
 88 | 
 89 |         """
 90 |         self._mode = mode
 91 | 
 92 |     def __call__(self, value):
 93 |         if value is None:
 94 |             return None
 95 |         try:
 96 |             return Code.object(compile(value, 'string', self._mode), six.text_type(value))
 97 |         except (SyntaxError, TypeError) as error:
 98 |             if six.PY2:
 99 |                 message = error.message
100 |             else:
101 |                 message = str(error)
102 | 
103 |             six.raise_from(ValueError(message), error)
104 | 
105 |     def format(self, value):
106 |         return None if value is None else value.source
107 | 
108 |     object = namedtuple('Code', ('object', 'source'))
109 | 
110 | 
111 | class Fieldname(Validator):
112 |     """ Validates field name option values.
113 | 
114 |     """
115 |     pattern = re.compile(r'''[_.a-zA-Z-][_.a-zA-Z0-9-]*$''')
116 | 
117 |     def __call__(self, value):
118 |         if value is not None:
119 |             value = six.text_type(value)
120 |             if Fieldname.pattern.match(value) is None:
121 |                 raise ValueError('Illegal characters in fieldname: {}'.format(value))
122 |         return value
123 | 
124 |     def format(self, value):
125 |         return value
126 | 
127 | 
128 | class File(Validator):
129 |     """ Validates file option values.
130 | 
131 |     """
132 |     def __init__(self, mode='rt', buffering=None, directory=None):
133 |         self.mode = mode
134 |         self.buffering = buffering
135 |         self.directory = File._var_run_splunk if directory is None else directory
136 | 
137 |     def __call__(self, value):
138 | 
139 |         if value is None:
140 |             return value
141 | 
142 |         path = six.text_type(value)
143 | 
144 |         if not os.path.isabs(path):
145 |             path = os.path.join(self.directory, path)
146 | 
147 |         try:
148 |             value = open(path, self.mode) if self.buffering is None else open(path, self.mode, self.buffering)
149 |         except IOError as error:
150 |             raise ValueError('Cannot open {0} with mode={1} and buffering={2}: {3}'.format(
151 |                 value, self.mode, self.buffering, error))
152 | 
153 |         return value
154 | 
155 |     def format(self, value):
156 |         return None if value is None else value.name
157 | 
158 |     _var_run_splunk = os.path.join(
159 |         os.environ['SPLUNK_HOME'] if 'SPLUNK_HOME' in os.environ else getcwd(), 'var', 'run', 'splunk')
160 | 
161 | 
162 | class Integer(Validator):
163 |     """ Validates integer option values.
164 | 
165 |     """
166 |     def __init__(self, minimum=None, maximum=None):
167 |         if minimum is not None and maximum is not None:
168 |             def check_range(value):
169 |                 if not (minimum <= value <= maximum):
170 |                     raise ValueError('Expected integer in the range [{0},{1}], not {2}'.format(minimum, maximum, value))
171 |                 return
172 |         elif minimum is not None:
173 |             def check_range(value):
174 |                 if value < minimum:
175 |                     raise ValueError('Expected integer in the range [{0},+∞], not {1}'.format(minimum, value))
176 |                 return
177 |         elif maximum is not None:
178 |             def check_range(value):
179 |                 if value > maximum:
180 |                     raise ValueError('Expected integer in the range [-∞,{0}], not {1}'.format(maximum, value))
181 |                 return
182 |         else:
183 |             def check_range(value):
184 |                 return
185 | 
186 |         self.check_range = check_range
187 |         return
188 | 
189 |     def __call__(self, value):
190 |         if value is None:
191 |             return None
192 |         try:
193 |             if six.PY2:
194 |                 value = long(value)
195 |             else:
196 |                 value = int(value)
197 |         except ValueError:
198 |             raise ValueError('Expected integer value, not {}'.format(json_encode_string(value)))
199 | 
200 |         self.check_range(value)
201 |         return value
202 | 
203 |     def format(self, value):
204 |         return None if value is None else six.text_type(int(value))
205 | 
206 | 
207 | class Duration(Validator):
208 |     """ Validates duration option values.
209 | 
210 |     """
211 |     def __call__(self, value):
212 | 
213 |         if value is None:
214 |             return None
215 | 
216 |         p = value.split(':', 2)
217 |         result = None
218 |         _60 = Duration._60
219 |         _unsigned = Duration._unsigned
220 | 
221 |         try:
222 |             if len(p) == 1:
223 |                 result = _unsigned(p[0])
224 |             if len(p) == 2:
225 |                 result = 60 * _unsigned(p[0]) + _60(p[1])
226 |             if len(p) == 3:
227 |                 result = 3600 * _unsigned(p[0]) + 60 * _60(p[1]) + _60(p[2])
228 |         except ValueError:
229 |             raise ValueError('Invalid duration value: {0}'.format(value))
230 | 
231 |         return result
232 | 
233 |     def format(self, value):
234 | 
235 |         if value is None:
236 |             return None
237 | 
238 |         value = int(value)
239 | 
240 |         s = value % 60
241 |         m = value // 60 % 60
242 |         h = value // (60 * 60)
243 | 
244 |         return '{0:02d}:{1:02d}:{2:02d}'.format(h, m, s)
245 | 
246 |     _60 = Integer(0, 59)
247 |     _unsigned = Integer(0)
248 | 
249 | 
250 | class List(Validator):
251 |     """ Validates a list of strings
252 | 
253 |     """
254 |     class Dialect(csv.Dialect):
255 |         """ Describes the properties of list option values. """
256 |         strict = True
257 |         delimiter = str(',')
258 |         quotechar = str('"')
259 |         doublequote = True
260 |         lineterminator = str('\n')
261 |         skipinitialspace = True
262 |         quoting = csv.QUOTE_MINIMAL
263 | 
264 |     def __init__(self, validator=None):
265 |         if not (validator is None or isinstance(validator, Validator)):
266 |             raise ValueError('Expected a Validator instance or None for validator, not {}', repr(validator))
267 |         self._validator = validator
268 | 
269 |     def __call__(self, value):
270 | 
271 |         if value is None or isinstance(value, list):
272 |             return value
273 | 
274 |         try:
275 |             value = next(csv.reader([value], self.Dialect))
276 |         except csv.Error as error:
277 |             raise ValueError(error)
278 | 
279 |         if self._validator is None:
280 |             return value
281 | 
282 |         try:
283 |             for index, item in enumerate(value):
284 |                 value[index] = self._validator(item)
285 |         except ValueError as error:
286 |             raise ValueError('Could not convert item {}: {}'.format(index, error))
287 | 
288 |         return value
289 | 
290 |     def format(self, value):
291 |         output = StringIO()
292 |         writer = csv.writer(output, List.Dialect)
293 |         writer.writerow(value)
294 |         value = output.getvalue()
295 |         return value[:-1]
296 | 
297 | 
298 | class Map(Validator):
299 |     """ Validates map option values.
300 | 
301 |     """
302 |     def __init__(self, **kwargs):
303 |         self.membership = kwargs
304 | 
305 |     def __call__(self, value):
306 | 
307 |         if value is None:
308 |             return None
309 | 
310 |         value = six.text_type(value)
311 | 
312 |         if value not in self.membership:
313 |             raise ValueError('Unrecognized value: {0}'.format(value))
314 | 
315 |         return self.membership[value]
316 | 
317 |     def format(self, value):
318 |         return None if value is None else list(self.membership.keys())[list(self.membership.values()).index(value)]
319 | 
320 | 
321 | class Match(Validator):
322 |     """ Validates that a value matches a regular expression pattern.
323 | 
324 |     """
325 |     def __init__(self, name, pattern, flags=0):
326 |         self.name = six.text_type(name)
327 |         self.pattern = re.compile(pattern, flags)
328 | 
329 |     def __call__(self, value):
330 |         if value is None:
331 |             return None
332 |         value = six.text_type(value)
333 |         if self.pattern.match(value) is None:
334 |             raise ValueError('Expected {}, not {}'.format(self.name, json_encode_string(value)))
335 |         return value
336 | 
337 |     def format(self, value):
338 |         return None if value is None else six.text_type(value)
339 | 
340 | 
341 | class OptionName(Validator):
342 |     """ Validates option names.
343 | 
344 |     """
345 |     pattern = re.compile(r'''(?=\w)[^\d]\w*$''', re.UNICODE)
346 | 
347 |     def __call__(self, value):
348 |         if value is not None:
349 |             value = six.text_type(value)
350 |             if OptionName.pattern.match(value) is None:
351 |                 raise ValueError('Illegal characters in option name: {}'.format(value))
352 |         return value
353 | 
354 |     def format(self, value):
355 |         return None if value is None else six.text_type(value)
356 | 
357 | 
358 | class RegularExpression(Validator):
359 |     """ Validates regular expression option values.
360 | 
361 |     """
362 |     def __call__(self, value):
363 |         if value is None:
364 |             return None
365 |         try:
366 |             value = re.compile(six.text_type(value))
367 |         except re.error as error:
368 |             raise ValueError('{}: {}'.format(six.text_type(error).capitalize(), value))
369 |         return value
370 | 
371 |     def format(self, value):
372 |         return None if value is None else value.pattern
373 | 
374 | 
375 | class Set(Validator):
376 |     """ Validates set option values.
377 | 
378 |     """
379 |     def __init__(self, *args):
380 |         self.membership = set(args)
381 | 
382 |     def __call__(self, value):
383 |         if value is None:
384 |             return None
385 |         value = six.text_type(value)
386 |         if value not in self.membership:
387 |             raise ValueError('Unrecognized value: {}'.format(value))
388 |         return value
389 | 
390 |     def format(self, value):
391 |         return self.__call__(value)
392 | 
393 | 
394 | __all__ = ['Boolean', 'Code', 'Duration', 'File', 'Integer', 'List', 'Map', 'RegularExpression', 'Set']
395 | 


--------------------------------------------------------------------------------
/metadata/default.meta:
--------------------------------------------------------------------------------
1 | []
2 | export = system
3 | 
4 | 


--------------------------------------------------------------------------------
/static/appIcon.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/seunomosowon/TA-mailclient/b4745263d53f03e06edf098a665a5597d40fe449/static/appIcon.png


--------------------------------------------------------------------------------
/static/appIcon_2x.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/seunomosowon/TA-mailclient/b4745263d53f03e06edf098a665a5597d40fe449/static/appIcon_2x.png


--------------------------------------------------------------------------------