manuka | A modular OSINT honeypot for blue teamers | Security library
kandi X-RAY | manuka Summary
Manuka is an Open-source intelligence (OSINT) honeypot that monitors reconnaissance attempts by threat actors and generates actionable intelligence for Blue Teamers. It creates a simulated environment consisting of staged OSINT sources, such as social media profiles and leaked credentials, and tracks signs of adversary interest, closely aligning to MITRE’s PRE-ATT&CK framework. Manuka gives Blue Teams additional visibility of the pre-attack reconnaissance phase and generates early-warning signals for defenders.

Although they vary in scale and sophistication, most traditional honeypots focus on networks. These honeypots uncover attackers at Stage 2 (Weaponization) to 7 (Actions on Objectives) of the cyber kill chain, with the assumption that attackers are already probing the network. Manuka conducts OSINT threat detection at Stage 1 (Reconnaissance) of the cyber kill chain.

Despite investing millions of dollars into network defenses, organisations can be easily compromised through a single Google search. One recent example is hackers exposing corporate meetings, therapy sessions, and college classes through Zoom calls left on the open Web. Enterprises need to detect these OSINT threats on their perimeter but lack the tools to do so.

Manuka is built to scale. Users can easily add new listener modules and plug them into the Dockerized environment. They can coordinate multiple campaigns and honeypots simultaneously to broaden the honeypot surface. Furthermore, users can quickly customize and deploy Manuka to match different use cases. Manuka’s data is designed to be easily ported to other third-party analysis and visualization tools in an organisation’s workflow.

Designing an OSINT honeypot presents a novel challenge due to the complexity and wide range of OSINT techniques. However, such a tool would allow Blue Teamers to “shift left” in their cyber threat intelligence strategy.
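The listener-module idea above can be pictured as a small plug-in pattern. The sketch below is a hypothetical illustration only, not Manuka's actual code or API: the Listener base class, the HoneytokenHit record, the DecoyEmailListener module, and the report_hit callback are invented names used to show how a new OSINT listener might be slotted into a campaign and forward sightings to a central collector.

```python
# Hypothetical sketch of a pluggable OSINT listener; names and interfaces are
# illustrative only and do not reflect Manuka's actual implementation.
from abc import ABC, abstractmethod
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class HoneytokenHit:
    """A single sighting of a staged OSINT asset (e.g. a decoy credential)."""
    campaign: str
    source: str
    detail: str
    seen_at: datetime


class Listener(ABC):
    """Base class a new listener module would implement."""

    def __init__(self, campaign: str, report_hit):
        self.campaign = campaign
        self.report_hit = report_hit  # callback to the central collector

    @abstractmethod
    def poll(self) -> None:
        """Check the staged source and report any new adversary interest."""


class DecoyEmailListener(Listener):
    """Example module: watches a decoy inbox tied to a fake employee profile."""

    def poll(self) -> None:
        # Placeholder logic; a real module would query a mailbox or API here.
        self.report_hit(HoneytokenHit(
            campaign=self.campaign,
            source="decoy-email",
            detail="login attempt against staged credential",
            seen_at=datetime.now(timezone.utc),
        ))


if __name__ == "__main__":
    # Wire the module into a "campaign" and print hits instead of storing them.
    listener = DecoyEmailListener("acme-q3-recon", report_hit=print)
    listener.poll()
```

In the real tool, modules of this kind run inside the Dockerized environment and report into a shared campaign, which is what allows multiple honeypots and campaigns to be coordinated at once.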
manuka Key Features
manuka Examples and Code Snippets
Community Discussions
Trending Discussions on manuka
QUESTION
Thank you for taking the time to read my question! Most likely I formulated my question wrong, so sorry for any confusion. It also might be a basic JavaScript question for some of you, but I cannot wrap my head around it. I will try my best to explain what I am doing.
My data looks like this:
...ANSWER
Answered 2021-Feb-19 at 09:42: Here is how you can split a text into a list.
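The code from this answer is not captured in the excerpt, and the original thread concerns JavaScript. Purely as an illustration of the idea of splitting a text into a list, here is a minimal Python sketch with a made-up sample string (in JavaScript the equivalent would be String.prototype.split):

```python
# Made-up sample text; the asker's actual data is not shown in this excerpt.
text = "manuka honey, clover honey, wildflower honey"

# Split on a delimiter and strip surrounding whitespace to get a clean list.
items = [part.strip() for part in text.split(",")]
print(items)  # ['manuka honey', 'clover honey', 'wildflower honey']
```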
QUESTION
I have a dataset similar to the following:
...ANSWER
Answered 2020-Jul-23 at 00:21: Create a list comprehension object m that compares values to .upper() to get all uppercase letters and .isalpha() to make sure you are not bringing in strings / numbers where .upper() doesn't do anything to them. Then, simply create new columns that utilize the list comprehension with .apply(m).
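The thread's dataset and exact code are not shown here, so the frame and column names below are placeholders. This is a minimal pandas sketch of the described approach: a list-comprehension helper that keeps characters which are alphabetic and unchanged by .upper(), applied to build a new column.

```python
import pandas as pd

# Placeholder data; the thread's actual dataset is not shown in this excerpt.
df = pd.DataFrame({"code": ["aB3cD", "xyZ", "12ab", "QRst"]})

# List-comprehension helper in the spirit of the answer: keep characters that
# are alphabetic and already equal to their .upper() form (i.e. uppercase letters).
m = lambda s: [ch for ch in s if ch.isalpha() and ch == ch.upper()]

# Build a new column from the helper with .apply().
df["upper_letters"] = df["code"].apply(m)
print(df)
```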
QUESTION
I have the following dataset
...ANSWER
Answered 2020-Jun-30 at 00:19: define your function with
QUESTION
I have two datasets. One dataset has about ~30k rows, and the second dataset has ~60k rows. The smaller dataset (df1) has a unique identifier (upc), which is critical to my analysis. The larger dataset (df2) does not have this unique identifier, but it does have a descriptive variable (product_title) that can be matched with a similar description variable in df1 and used to infer the unique identifier. I am trying to keep things simple, so I used expand.grid.
ANSWER
Answered 2018-Nov-27 at 19:26: Your idea is good. One realization of it then would be
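The answer's R code is not included in this excerpt. As a rough Python analogue of the overall idea (compare every df2 title against every df1 title and take the best match to infer the upc), with tiny made-up frames standing in for the thread's data and difflib as a stand-in similarity measure:

```python
import difflib
import pandas as pd

# Tiny made-up frames standing in for the thread's df1 (has upc) and df2 (no upc).
df1 = pd.DataFrame({
    "upc": [111111, 222222],
    "product_title": ["acme widget 10oz", "acme gadget 5oz"],
})
df2 = pd.DataFrame({"product_title": ["Acme Widget 10 oz", "Acme Gadget 5 oz"]})

def best_upc(title):
    # Score this df2 title against every df1 title and return the upc of the closest one.
    scores = df1["product_title"].apply(
        lambda ref: difflib.SequenceMatcher(None, title.lower(), ref.lower()).ratio()
    )
    return df1.loc[scores.idxmax(), "upc"]

# Infer the missing identifier for each row of the larger dataset.
df2["upc_inferred"] = df2["product_title"].apply(best_upc)
print(df2)
```

Like an expand.grid cross join, this scores every pair of titles, so with ~30k by ~60k rows it is worth blocking or indexing the comparisons rather than scoring all pairs.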
QUESTION
I am having a few issues with scaling a text matching program. I am using text2vec which provides very good and fast results.
The main problem I am having is manipulating a large matrix which is returned by the text2vec::sim2() function.
First, some details of my hardware / OS setup: Windows 7 with 12 cores at about 3.5 GHz and 128 GB of memory. It's a pretty good machine.
Second, some basic details of what my R program is trying to achieve.
We have a database of 10 million unique canonical addresses, one for every house / business. These reference addresses also have latitude and longitude information for each entry.
I am trying to match these reference addresses to customer addresses in our database. We have about 600,000 customer addresses. The quality of these customer addresses is not good. Not good at all! They are stored as a single string field with absolutely zero checks on input.
The technical strategy to match these addresses is quite simple: create two document-term matrices (DTMs) of the customer addresses and reference addresses, and use cosine similarity to find the reference address that is most similar to a specific customer address. Some customer addresses are so poor that they will result in a very low cosine similarity -- so, for these addresses, a "no match" would be assigned.
Despite being a pretty simple solution, the results obtained are very encouraging.
But, I am having problems scaling things....? And I am wondering if anyone has any suggestions.
There is a copy of my code below. It's pretty simple. Obviously, I cannot include real data, but it should provide readers a clear idea of what I am trying to do.
SECTION A - Works very well even on the full 600,000 * 10 million input data set.
SECTION B - The text2vec::sim2() function causes RStudio to shut down when the vocabulary exceeds about 140,000 tokens (i.e. columns). To avoid this, I process the customer addresses in chunks of about 200.
SECTION C - This is the most expensive section. When processing addresses in chunks of 200, SECTION A and SECTION B take about 2 minutes. But SECTION C, using what I would have thought to be super quick functions, takes about 5 minutes to process a 10 million row * 200 column matrix.
Combined, SECTIONS A to C take about 7 minutes to process 200 addresses. As there are 600,000 addresses to process, this will take about 14 days.
Are there any ideas to make this code run faster?
...ANSWER
Answered 2018-Feb-16 at 16:40: The issue in step C is that mat_sim is sparse and all the apply calls do column/row subsetting, which is super slow (and converts sparse vectors to dense).
There could be several solutions:
- if mat_sim is not very huge, convert it to dense with as.matrix and then use apply
- better, you can convert mat_sim to a sparse matrix in triplet format with as(mat_sim, "TsparseMatrix") and then use data.table to get the indices of the max elements. Here is an example:
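The R example promised at the end of the answer (using as(mat_sim, "TsparseMatrix") and data.table) is not captured in this excerpt. As a rough Python analogue of the same triplet-format idea, using scipy.sparse and pandas on a randomly generated stand-in for mat_sim:

```python
import pandas as pd
from scipy import sparse

# Placeholder similarity matrix standing in for mat_sim (reference rows x customer
# columns); the real matrix from the thread is not available in this excerpt.
mat_sim = sparse.random(1000, 200, density=0.01, format="csr", random_state=0)

# Triplet (COO) view exposes row index, column index, and value for stored entries only.
coo = mat_sim.tocoo()
triplets = pd.DataFrame({"row": coo.row, "col": coo.col, "sim": coo.data})

# For each customer column, keep the reference row with the highest similarity,
# touching only non-zero entries instead of densifying whole columns with apply().
best = triplets.loc[triplets.groupby("col")["sim"].idxmax()]
print(best.sort_values("col").head())
```

Working on the (row, col, value) triplets means only the stored non-zero similarities are touched, which is what avoids the slow dense column subsetting the answer describes.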
Community Discussions and Code Snippets contain sources that include the Stack Exchange Network.
Vulnerabilities
No vulnerabilities reported
Install manuka
Support