The Database

We've all talked about it, we all want it. Nobody wants to be the one to make it, so let's do what we always do: Decentralize.

The idea is simple, everyone has a database of the information they've gathered. They host said database on their own machines that they can run their facial recognition queries against; if they want to share they can use Tor or just copy-pasta the data folders.

Program Requirements:
Any tips, suggestions, thoughts, etc. please clapshare ITT.
I'm currently working on the GUI the part I've always found the most difficult and it's coming along very well. I did all of this last night and today, the GUI is about 50% finished; backend should be rather simple.

Other urls found in this thread:

twitter.com/RepKinzinger
twitter.com/KarlRove
twitter.com/RepJerryNadler
twitter.com/EspaillatNY
twitter.com/united4vila
github.com/cmusatyalab/openface
github.com/mysql/mysql-server
github.com/JohnnyHobo/CommunityPolicing
notabug.org/
fortress.neocities.org/kekchan_alpha_prerelease_notes.html
github.com/JohnnyHobo/CommunityPolicing.git
fortress.neocities.org/
archive.is/KcbT6
cmusatyalab.
mmlab.ie.cuhk.edu.hk/projects/TCDCN.html.
discord.gg/fpKNPv
arxiv.org/pdf/1607.08438v1.pdf
newscientist.com/article/dn27761-facebook-can-recognise-you-in-photos-even-if-youre-not-looking/
discord.gg/W75yuvH
twitter.com/JohnAmericanPhD
github.com/jgorset/facepy
8ch.net/compint/index.html
youtube.com/user/sentdex
youtube.com/user/thenewboston
github.com/ryanjay0/miles-deep
github.com/bakwc/PornDetector
discord.gg/NN3TGNv
twitter.com/SFWRedditImages

bump

I do recall The List meme tweeted as a no-context method for scaring the piss out of kike / traitor journos and politicians a while back. Was bretty funny.

I like this better. Have a bump

Well there has been some plans in the past to create an onion site or distributed chan in the past. I'd go more for the distributed chan with Holla Forums on it then just advertising your folders on WN, because it creates plausible deniability by there being a Holla Forums section or Holla Forums section. Only problem then is insert CP and its now on your PC.

So you'd need a crypto chain to ensure messages keep their contents which isn't hard and or could only be selectively read so you mitigate the CP directed to your IP. Just like time versus space you also have privacy versus coverage. I would say work on the actions UML before making a GUI however though your GUI seems only for computer vision and not hosting.

I'll be looking deeply into blockchains for automation a few weeks from now. Are you using OpenCV for your facial recognition?

bump for sticky

Nntpchan already exists user. I2p and onion based chans are out there if you look. They just need skilled fags to come to them

Have a sample.

Adam Kinzinger
IL Congressman, #NeverTrump
25566 S Keating Blvd
Channahon, IL 60410
twitter.com/RepKinzinger

Karl Rove
Former WH Chief of Staff
1408 Circle Ridge Dr
West Lake Hills, TX 78746
&
1333 New Hampshire Ave NW Ste 600
Washington, DC 20036-1532
twitter.com/KarlRove

Deborah Wasserman-Schultz
Former Chair Of DNC
4479 Fox Glove Ln
Weston, FL 33331
(954) 384-2225

Jerrold Nadler
Democratic Congressman/Attorney
315 West 70 Street 3C
New York, NY 10023
twitter.com/RepJerryNadler

Adriano Espaillat
Democratic Nominee for Congress
62 Park Terrace West A87
New York, NY 10034
twitter.com/EspaillatNY

Daniel Vila
Green Party Congressional Candidate
192 Claremont Avenue
New York, NY 10027
twitter.com/united4vila

It's not a meme. You think those voting machines just un-rigged themselves?

That's the plan.

As far as the servers go, they really shouldn't be talking to each other too much at least not at first, the main thing I was looking for was pulling from other databases rather than pushing to them; word of mouth rather than "Here's a list of repos, download away". Some people will still get hit, but it is much safer than allowing unfilterd inputs to a database. Nudity filters also work, that might be something to implement down the line with a whitelist and hashing, but people can still hide pornography in binary code so anything short of setting up some kind of website and using a webscraper on top of that as well as screen-shotting the images themselves AND applying the nudity filter + whitelist will still fall short in some areas. I suppose a community PGP voting system via keysignatures would also help to filter out the smut.

Important thing is to get the client's created and a database structure setup so people can start working with it and getting used to it while the networking gets sorted out.

I also need to write the (((Terms Of Service))) so I don't get shoa'd. Don't do anything illegal with it goyims.

I do find the blockchain idea interesting though; but we're talking videos, documents, web archives, and pictures -not just a bunch of hashes in a ledger.

Maybe a blockchain could be used to hold PGP voting info, names, and other information.

Actually, discarding the images you could very easily do a distributed dox database with a blockchain. Since a dox is generally just name, address, phone, links to archives, and a few comments on their activities it shouldn't be more than a few hundred bytes. Fuggging great idea with the blockchain user

name coin?

Nigger use something based on block chain technology then checkpoint on the bitcoin block chain to ensure data integrity. Don't be a pleb

Monax, formally Erisdb look it up

Add a way to track organizations and a persons affiliations with brief explanations of each affiliation. Would make it much easier for people to connect the dots, especially if you made it so an affiliation has to be another person or organization in the database. This way everything is easy to verify and understand, and it could prevent duplicates.

I've made ruby scripts that'll upload and download with the press of enter. There are many sites like mega.nz that you could write an automated client for but the new securities make creating one simply from knowing the HTTP queries difficult.
.

If you put it out to the public as client code to do whatever it wouldn't matter what someone would do it. Further down the line you could use an automated client that'd upload to mega.nz or dropbox with the big stuff if the blockchain is to slow.


True but it is not of our making and so I doubt anyone there is of us.

Just use IPFS. If you want the fancy shit (which nobody will actually make because it will be bikeshedded to death) it can be built with IPFS as a backbone.

do what you want.
try not to go crazy.

'database' is a misunderstood and misapplied term. as an application server, this is not needed. it also tries to have machines 'just handle' all the bits, pieces, and nuances of work. which is generally not a good plan, without first outlining the data, archive, storage, distribution, release-diff-merge procedures, signing, and history-edge control. by the time you are done with all of these tasks, then and only then can a database be added on top of it, as a bonus-feature and replaceable-interop between other applications.

to the laymen, it helps to think in terms of not so much databases (which descends into inaction, failure to comprehend, sloth, crazy, and abandonment without chance of recovery). instead, think in terms of email. company domains, project domains, system domains, etc, with mailbox users, task-subject users, archive/log/sysmsg users, etc. doing this as if you would be 'doing an email server' also attains distribution, transaction, human inspection, and auditing, etc. it will also permit recovery and retasking, isolation and destruction/preservation of data separate from code, etc.

making these systems is easy. handing the concepts, presence, problem-space, provision, production, delivery, limitations, and roadmaps to the point of not just correction, but crucially to completion as well, without confabulation - this is the harder part. using 'database' meme magic to 'just handle' the part that matters the most is shitskin thinking; you are asked to keep it from bleeding into your patterns.

applications come at the end. data first, structure second, and release (publication) dist/sych/merge generally follows structure [use established file system tools][or established email system tools under individual eml or mbox; avoid exchange lotus shitskin-capitulated etc]. operations on data are not permitted to delete or modify, only post transforms and derivations - which are kept separate ; data rank/score/trust will be established by adoption-acceptance into distribution trees, and this too will be kept in it's own separate structure. code is yet another structure, kept separate from all as well. search is an application, it comes at the end, fuck you, don't argue. database is an application, it comes at the end, fuck you, don't argue. your primary task is data, then structure, then release distribution/sync/merge, then history / log / transforms / production / production storage / etc.

good luck to you. time ticks away. do it right, and the work require to maintain is solved before expansion, preventing capitulation and abandonment. fail to do this pre-work, maintenance will kill you. and you will not have enough time, energy, and willpower/spirit left in you at each interval tick to recognize manipulation, shitskin frame entropy invasion, and subversion.

This.
Should we draw a database model then ?

if you like. it generally helps, and helps you. always do: pen, paper, people. always do: people, process, production. always draw and draft, and always write it out by hand at least once each interval. if you cannot, or do not, you will fail to understand your failures, fail to understand where problems have been glossed over with magic, etc.

that said, database is an application that comes last. it's what you're going to do with it. what are you going to do with what you've done afterwards? toss it? fuck it? how about this instead: draw your data storage plan. file system, or email system. data tree, then index/timestamp tree, then source/metadata tree, then rank/trust/merge/diff/release tree, then model/concept/dataplan tree (stores the road map, concept planning), then the code/transform-ops tree. if you like, you can add another tree of outputs and their logs and associations; however if you're careful, this duplication can be simple stored back into the data tree with the corresponding metadata stored as source-mine-transform with associated log as needed. after these are established, and you know how you're going to keep what you get, what you've got, what you've done, what you're doing, where you're going, then begin invites to data (go 'public' etc).

a child has been born. it is a blank slate with presorted pre-stored functions, some useful, some not, all on timers and stages. you are going to load the trees. what will you load? how will you feed it into your machine? how will you create tay? how will you guide tay from killing itself?

yes, always draw, but also first by hand at least once each interval, your database model. try drawing the parts that come first, second, and third, before this.

just in case you miss it, i'm saying, while trying to be polite: 'no'. do not do database first. do database model last. the database model will mirror your data model with less pieces as needed. from this the transform/analysis/production model is itself a sub-application of this, in which you outline your datum model selection to feed into process, compute, and store result back to the trees.

from data, then from database. starting at the end is the 'collegiate' and 'graduate' way –now–, in which millions (not exaggerating) of hopeless mislead shitskins have been sold down the banker-mill by 'academics' and 'publishers', into govt welfare programs which had to expand and contain the kike floods from ww2 long ago. their way of thinking is merchant, not source. if you are to be, you cannot merchant this. you cannot do database first. what is actual, what is the account and account of it, how will you account to your actual first – your data first – is what must come first. that is the model. database is the submodel that goes into application and uses the electric-powered entropy waterfalls of powerstations far away. datum model is the individual 'sub' functions to choose, etc.

this was tay. tay suffered from 'do the database'. tay was only ever, and could only ever, be a child. this was because tay was only ever database; tay had no data, no tree, no plan. and so when tay became too much for shitskins and low-energetics to maintain, she was abandoned. and could only ever have been abandoned, because there was only a .dbm, no separation of data, metadata, structure from application.

the application will, must, and can only ever, go through iterative discard-create-keep cycles. do not 'just app' your shit. you will fail. and near the end, you will also fail to be lucid enough to skipper your ship away from shitstan and infinite-drama entropy wars of shitkins.

address entropy in the light. do not database it.

Oh. I thought we already had the data, so I was thinking about a way to structure it.
My definition of data being the smallest indivisible atom of information at disposal, I was considering it as articles, webpages (archives), memes, pictures, and so on. Things that are already at reach. Things that are easily structurable, and shareable via simple standards. So I didn't think it was necessary to go down further, but maybe I missed the point of OP or you're just envisioning a drastically more ambitious project. In that case, I fear I don't quite understand your plot.

it will grow, yes. align the data to facilitate this. obviously, do not do waterfall. minimalist smallest-size-to-prototype. you don't have to load the internet here, nor should you. but you will have to fetch, and you will have to store, and you will be distributing and accepting distributions.

no, i don't understand. i do not know what you want specifically. not even you do, until you have one minimalist O(n) with on a few 'rows', have done process, stored it, and completed at least one full iteration / run-through of each of the actions you're looking to support.

you're god damn right i'm not on the same wavelength. no one will be, not even you. that's the whole point of doing data, separate from structure, separate from code, with application on the end of it.

if what you want is a crawler design, or a parallel design, you still plan the same. and you still have the same how-to-get-it-out-the-door of data problems. if your application is useful, then you'll have problems of how to get the app out the door. but no one cares until that 'until' happens. the first care is product, and if useful, then production tool/app/database/whateveryouwanttocallit.

I don't get why you focus so much on this. This is quite the easiest part IMO, and it does not interfere with the choice of data structuration. It can be brainstormed apart and quickly.

For example:
data structure: persons, organizations, events and platforms linked together with metadatas (religion, areas, type of event/organization etc.)
distribution mode: I like the idea of an emailing domain. If I understand correctly, adding an item on one node will be spread by email it to other nodes via mailing lists. With this, protocol, transport and security are given.
communication format: In the email body, any structured data format will do the trick (JSON, XML…)

After that, a simple data eater and parser (to read information and insert new items for a start) is needed to have the first API. This API can then be implemented by a GUI like the one in the OP.

Of course, this has drawbacks. This is not AI-tier shit. Adding a new type of information will need an update of the data structure. We can imagine a flexible communication format that does not depend on the data structure itself though.

bump

it is easy. it is also critical. you must deliver production, not just produce it (distribution). you must also house production deliveries pending (releases). it is trivial. it is also existential; until delivery, it (and you) are vapor.

ex: if we take the limited slice of just looking at code, before git/mercurial/subversion (as in sub-code, not alt-kike), shitskins could not surmount the trivial, stupidly easy, distributuion and release cycle. they thought that 'ups', 'dhl', 'usps', 'conex' was just 'given' and 'duh' and 'why so serious maann' and 'dude bro dude weeeeed'. it was regrettable to make distribution easier for them, but then again, github was probably a banker/cia-angel investor funded co-opt monitoring platform anyway, and those people have their reasons (eg being in need of containment themselves, they are actually daily reminder'ed of the need to contain even lesser shitskins; they just fail at it, see: shitskin work problem).

if your app is just life-time: weeks or as needed, and you don't care otherwise, then don't care, by all means. ai-teir is fancy mercurial kike speak for the reality: can it manage most of itself by itself to the point whereby it works for me. kike's attain ai by using merchant behaviors, serial shilling, bluff/print/cast more currency numbers and out-naked short your own shadow to get resources today. this leads however, to diminishing returns of source provisions to effective lie-steal-eat. see: cannibal problems, islam, etc. however, it need not be this way. ai can be attained by simply layout the data so as to get, do, and keep. by managing the 'keep', you build. in fact, thought it is also trivial, it is the defining characteristic of actual ai, etc. otherwise, the 'ai' stays nascent, never learns past table limits and shitskin effort caps, cannot fix mistakes nor even recognize them, and is generally discarded with the data as well when maintenance time comes.

what you can call 'ai' is simply memory. i advocate designs that at least all you to look back and smile upon what you did.

Fuck you Pinal Dave. I'll build my database Leeeroy Jenkins style, and it will be up and running while you're still fine-tuning the layout of UML diagrams of your naval lint.

Fug I mixed distribution and communication English is not my first language sorry. Git is gud, and you can host it on a personnal server anyway.
Still, it does not interfere with designing.

By AI I didn't mean jewlywood shit, and no I didn't mean memory. I just didn't think an expert system is what we are able, and need to focus on. I really think a "database" of who, what, when and where's is more important maybe that's because I often forget
But anyway.

Basically to achieve OP's prime objective (input=a pic, output=a name) we need an expert system, i.e. an inference engine and a knowledge base.
I've worked on such systems in the past, but I'm quite rusted (now I mostly work on n-tier architectures).
It must have its own starting set of rules and must be able to implement new ones (nodes of the decision tree). It must be able to learn new answers (leafs of the decision tree). And that's the two most important things that come to mind right now.
There is a data structure for a knowledge base, but its design and technology is trivial.

Browsing the web comes after, but should be taken into consideration. Do we really just need a name ? Would we need to add context ? Can the input pic give it ?

This is an interesting topic but I don't think I'll have the time to work on it anyway.

Are compiling a list of all the >>>problematic

Meme database? I definitely need something to organize that's bette than nested folders.

You understood the OP; the other guy is talking about general programming concepts.

Personally, I don't do graphs. I write it all out in a big text file; start with what we need done, what tools we have to accomplish it, what format the data is in and what format we'd like it in when finished, then it goes down into the individual pieces of the program - the functions, classes, etc.


That should be very simple. The facial recognition software will determine what pictures match and how well; after that it's literally just copy-pasta the information in a neater form basically:
Then it's just a matter of presenting that info in an easy to read format.
If no matches are found, it'll be added under its' own entry with whatever additional info you've provided from the wizard.

Browsing the web i.e. reverse image search should be pretty easy. I've written plenty of webscrapers so I'm well versed in that regard
Name, alias, any identifying information. Context of what? The reason they got targeted for identification? Sure; that's why I added the crimes field.

This goes beyond just making a list; it's a full-fledged searchable database complete with evidence of crimes.

Literally internet vigilantism. Might be smart to offer some kind of bitcoin incentive/rewards program so the first person to find say, someone who kidnapped a child triggering an amber alert in 5 states gets a reward.

Later though, for now it'll serve as just a list of antifa and other commie criminals.

Alright, I'm looking into the blockchain (specifically MultiChain) right now.

>>>/thelist/

QT ok.
Python – are you a woman?
BASH – Literally written by a nigger.

Even QT is not the best choice, I'd personally use something like ultimatecpp because it produces a small static binary and the programming is simplified. Yes, it doesn't look as pretty but it's dead simple to write your own drawing routines to make it look like exactly what you want. It's also BSD so it doesn't have a communist license.

Interest bump. I'm in the process of making some of my own tools at the moment but things like reverse image search are something I have helped write before. I can help with the Tor stuff too.

I don't know a thing about what you're talking OP but I support you.

Isn't Python supposed to be one of the best languages?

Time is a finite commodity, and coding time in Python can be orders of magnitude faster than coding in the languages they taught you at your community college.


Data integrity is a bit of a problem there. Every month or so there is "evidence" of [rival website] raiding Holla Forums with screen caps of chat logs the OP definitely made himself. That, and it will look a lot like Baph:


You need a way to automate tagging to make it useful . . . or be really, really autistic.


You should be the project architect.


github.com/cmusatyalab/openface
(Not Python3 compatible and not being very actively maintained, but a great starting point)

…what is thiis? babby's first compsci project?

if you want any privacy and security
why the fuck would you intentionally allow yourself to be facially recognitioned on your own computer or anywhere for that matter?
this seems like a cool idea (not that I completely understand it) but this sounds like
EXACTLY
what the nsa wants.

Python does the job, allows anyone to easily verify it's not some super secrit NSA backdoor, and is easy to do rapid prototyping with
BASH, DASH, KASH, ZSH, SH, etc.; it's all the fucking same.
POSIX then, better? 99% of people using the program will have /bin/bash symlinked to /bin/sh by default. The shell scripts are for installation and any command line tools without a working python implementation or an implementation that requires a fuckload more libraries than necessary to do the job

If this project really takes off somewhere down the line I'd like to write it all in C and reduce the reliance on external binaries, but for now we'll have to make do.
IS THAT OKAY WITH YOU NIGGER?
DO I HAVE YOUR PERMISSION TO DO ANYTHING WITH MY FUCKING COMPUTER?


Python is good for rapid prototyping and development.
It's easy to see what's going on in even a very large program which allows you to fine tune things how you like then move to another language (or compile using cython).
If I was writing the facial recognition software from scratch, I'd be using C/C++ as that will be the most memory and processor intensive operation. Python in this case is just being used as the glue to hold it all together. It doesn't need to be lightning fast, it needs to 'just work'.

If something needs to 'just work', write it in Python. If you have performance needs go with C.

Python is an interpreted language. It's written, then compiled into byte-code, then interpreted by the Python virtual machine. C is a compiled language. It's written in C then compiled directly into machine-code e.g. ELF.
off topic I know, you can read more about that here: softwareengineering.stackexchange.com/questions/243269/why-isnt-there-a-python-compiler-to-native-machine-code


^^^^^^^^^^^^^^^^^

I thought there already was a thing.

It was called Judas.io or something like that.

Anyone remember what I'm talking about?

Certainly look like it.

Lotta >(1)s in here tonight.

Because they don't want to hang around speaking to a retard who goes on an autistic screed after Holla Forumstier trolling . You come off like a 1st year computer science student who doesn't really know what he's doing to be quite honest.

Nothing is "orders of magnitude" faster to code in one language than another unless you're talking about some language deliberately engineered to be orders of magnitude slower to write in. People always say this shit but you never see real world metrics, just anecdotes.

Ever hear of an assassination market? Old "cryptoanarchist" concept from years back. It's a death pool using a cryptocurrency. Using cyptocurrency, users gamble on when somebody will die. The more people have predicted somebody's death, the greater the payout. The assassin knows precisely when the death will occur, so the assassin can "cheat" the death pool, collecting the prize.

Obviously this would all be highly illegal so nobody should do it. I just think it's an interesting concept!

It's just like one of my Japanese animes.

Nigger, the feds already have this shit. Private intelligence companies already have this shit too. Pandora's box has already been opened, OP is just trying to level the playing field.

result_list = [[x.some_function for x in y] for y in some_list_object]

Write an equivalent function in C to that highly uncomplicated Python list comprehension. I suspect you'll need more than 10 lines to do what I wrote in 1. That's, by definition, an order of magnitude faster to write . . . though I'm assuming you don't know what that phrase means because you're a teenager. Adults are talking here. Go outside and play.


How would an interested party monitor your progress and/or contribute?


There are open source facial recognition projects on git. It's not that scary. Grayscale, normalize (stretch, etc.), and use SVM algos on a few training pictures.

I'm well aware that facial recognition is already out there.

But he said SVM algos, notice how he just sort of slipped that in, as if to say, hey guys, I'm really smart. I know some ML lingo.


Sit your bitch ass down, you're a python fag. Stop trying to impress people.

Just finished an intro database course and I feel like I just got cheated out of +$2k.
Can we haz an ERD diagram of your DB?
Also are you going to post a link to your DBMS when you're finished with it?

Your impotent anger fills me with smug.

I'm getting a kick out of all of these "THIS IS STUPID THERE'S A BETTER WAY TO DO IT, YOU DON'T KNOW WHAT YOU'RE DOING!" (((1)))s and they offer no useful suggestions, models or work of their own. It's like some angry mouth breathing retards getting angry at someone for at least trying something. Now that's something to be smug about.

Trolling with what I assume is the intent to turn discussion into shitflinging over who has the best programming language.

Addressed and nipped in the bud. People who bitch about languages deserve death. They all have their functions, well most do.


Yep. The Feds actually have VERY well organized case management software. Facial recognition is kinda meh, facebook has better facial recognition software than the Feds do.


I've got a github, will have to create a repo for it.


You guys sure are autistic about your diagrams tbh fam.
Do I even need a DBMS? It's not like you can delete records from a blockchain.

They're probably all Holla Forums inclined. They know exactly how to press the buttons of more junior developers. Look how defensive they get in writing walls of text to prove their bonafides (all the while looking like amateurs for doing so).

I have no idea what point you're trying to make at me, but I do know you're now annoying me.

The Feds probably license from Facebook tbqh.

Because you're a bit stupid.

You said "programming time" which means the time taken to write a complete program, not LOC, dumbfuck. Good job moving the goalposts. Additionally you chose a pathological example that has nothing to do with any real world coding but rather some programming feature that is probably not used that often. By your moronic estimation:

Python:
print("This line will be printed.")

vs forth:
." This line will be printed."

Means forth will be 22% faster than python to write a program in. You're too retarded to be programming and you can't argue for shit.

You're either a schizo, or you're failing to read IDs. Either way, filtered.

all joking aside here you go: github.com/mysql/mysql-server

...

Well shit I didn't learn anything in that course.
Diagrams were branded onto our dicks in that course…

Not sure if taking the piss or what.
I'll make you a diagram just to soothe your autistic branded-dick rage fam.

Are you aiming for the lowest signal to noise ratio across all your posts? Stop getting so triggered.


That's often the problem. Retards like you see that as merely 1 line, instead of realizing it's an O(n^2) algorithm.

If you're going with MySQL use Percona's branch. MySQL community is gay and AB is expensive.

t. 12 years of working with MySQL

Terry Davis is the best.

No kidding, I hate pyfags. APL must be the fastest language to program in:
life←{↑1 ⍵∨.∧3 4=+/,¯1 0 1∘.⊖¯1 0 1∘.⌽⊂⍵ }

Bet he can't do that in python in one line. The problem is you need a strong grasp of linear algebra at the very least in order to fucking understand it.

THIS IS EXACTLY WHAT SYNDIE (I2P)
WAS BUILT FOR

http: //syndie.i2p2.de/usecases.html
http: //syndie.i2p2.de/

Has a lot of downsides though. I2P is a anonymous decentralized network which is not as user friendly as TOR.

Probably used for lots of illegal shit and monitored.

However, if a few ardent people could maintain it and create a clearnet route to it, it is plausible.

No, the problem is you need a special snowflake keyboard to write it.

Nope, BASH is non-POSIX. It was created by a nigger. Literally cuck tier. MKSH, and DASH are the best shells.

www.reddit.com/r/programming/comments/nw8ve/what_is_the_coolest_thing_you_can_do_in_10_lines/

Your samefagging is pretty obvious, senpai, and your chimp-like anger suggests you're either 19 or Jewish. Do your parents know you post on a white supremacist website? Just kill yourself or let it go.

First line made giggle. The phantom-persecution complex is pretty hilarious.


Post user or repo name pls.

kek.

Fucking CIANiggers, they're trashing this thread. NIGGERS CANNOT write RANDOM NUMBER GENERATORS. Only the whitest of the white scientists at NIST can do this. Do you think they hire NIGGERS!!!???

Richard, that CP/M clone for Bin Laden was one thing, but come on. Is Brian Reid paying you to shill here now?

Are you retarded? I never suggested I wrote it - I demonstrated your LOC metric is flawed by counterexample, you flaming pyfag. I swear, pyfags are the worst, most arrogant programming community around and I offer you as exhibit #12,093. In the end you don't get "orders of magnitude" difference in development time programming in python. If you're lucky you might get an order of magnitude difference in development time between assembly language and an interpreted or compiled language. Perhaps you made that ridiculous assertion because you don't actually know what an order of magnitude is. (hint: it's 10x) and you suggested "orders of magnitude" which is at least 100x if not 1000x shorter development time. That simply does not happen.

Judaswatch?


How does QT work with a C backend? Done a bit of cli coding for a few years and of want to move on to play with GUIs but I don't know how to merge the code with the gui design.

Most every GUI toolkit is essentially the same - what it does is it has what's called an event loop that just spins evaluating a switch case waiting to receive some primitive type that's associated with an event - typically an enum. When that event is recived, it calls a function or method registered with it (it's called a callback). That's called event driven programming. So your goal in GUI programming is to wire up callbacks to events and then put calls to your backend code inside the event handlers.

There's really nothing special about it at all - if you do CLI you've already done input in loops. An event handler is more generalized. It looks something like this:

enum eventTypes {
MOUSE_DOWN,
MOUSE_UP,

}

while(eventID = getEvent()) {
switch(eventID) {
case MOUSE_DOWN:
// call function whatever here
break;
case MOUSE_UP:
// call function whatever here
break;
}
}

A slightly more complex version would involve an array filled with function pointers so you can register multiple callbacks and associate them with the same event. GUI systems tend to make this whole process prettier but at the cost of obscuring the fact that this simple mechanism is all they're applying. You do the same thing in CLI when you write some input processor that repeats in a loop and dispatches functions based off a user choice (press 1 for X, press 2 for Y, etc).

Qt has a beautified mechanism called signals/slots. It's syntactic sugar over the same concept, and since signals/slots are not actually a part of C++, they have a parser called moc that extends C++ to support this. This was convenient in the 90's before C++ had std::function and other built-in typesafe ways of creating callbacks, but now it's no longer necessary.

I realize I'm not giving you hyper-specific information about Qt, but the general information about event driven programming should offer you enough to figure out the basic structure behind essentially all GUI-oriented frameworks. Once you have understood those fundamentals, figuring out how to adapt your CLI thinking to Qt will be a lot easier.

They're bad, but just behind ruby "rockstars" and nodejs made with

He asked you about C, not C++ you fucking dunce.

No, he asked about a C *backend*, you abysmal failure. Qt is C++ natively and works just fine with a C backend.

I do not believe it has C bindings, nigger.

Yeah, some idiot out there wrote C bindings, you double nigger, but a backend/bindings are not the same thing. Do you even n-tier, bra? The backend could be written in anything, he was asking how to do a front end in Qt for a backend (perhaps a static library or some shit) written in C but didn't know what it took to call the C code from the front end coming from a CLI background (in other words, where do I call my C functions from?). This is totally doable, but you have to understand event driven programming first. As for physically including headers, it's just

extern "C" {
#include
}

and off to the linker you go - but I don't think that's what he was asking. I read his question as more conceptual - how do I GUI shit I already wrote for CLI? If I didn't get his question right, he's free to clarify.

To be honest I'm not sure what he was asking, assumed he meant writing a C application with a Qt GUI.

Since you goys like your charts n shit I made a few still working on the program flow

Command Line Interface?
Vote here: strawpoll.me/12893566

Sorry, I was tired af when I wrote that. I plan on using OpenBR;

Didn't know BASH was written by a black. Guess it's time to make the jump to zsh. Wonder what other non-white software I should dump on my box.

Great thanks! This is very helpful. Idk if backend is the right terminology but what you guessed is what I was looking for: basically how to call functions using gui buttons. C++ is pretty similar to C, I'm more familiar with C but not super attached to C-only or anything. Just have a project I'm working on that's written in C that's proving annoying to design a ui for without having to deal with ncurses or something. Also figured I'd specify language since OP was using Python. Saw the thread and figured I'd ask. Thanks for explaining, I should be able to figure it out from here!

Fish is quite comfy from a user perspective, don't recommend it for scripting though. I've always hated bash, I was so relieved when I found out a nigger wrote it. My rage became justified.

That thread of discussion is exactly how Holla Forums types talk to each other. Since dumb PoC's and women don't get the irony of hacker culture, they freak out and and demand cucked contributors agreements and tech safe spaces. Then there are the autists like who will always be a lolcow.

what's wrong with GTK?

Idk isn't it part of gnome? I don't really like gnome too much.

Kek

jenkinsbase will not be based.
jenkinsbase will be abandoned.
jenkins will become migrant.
a refugee from himself.

kek, the UI is not the hardest part of that by any stretch.

No, faggot. You deserve to hang for that alone. Python is nothing but c with enforced whitespaces. Anyone who advocates for it should be gassed for the sake of the species as a whole.

Christ, man. You aren't making a cryptocurrency and you don't give a shit about the security of the data because it's all shit people are going to be able to find by searching the web anyway. A blockchain is not only overkill, but if you're talking about huge amounts of data like videos and photos it will make the whole system grind to a halt. Not to mention you are aiming for a distributed system. Sorry if it seems like I'm going off on this subject, but I've spent the past decade working on a secure distributed database driven operating system and it's not fucking easy - probably another 5-10 years just before the massive codebase is to a point I can open source it without feeling a deep sense of shame when other people look at the code (and that's having the good sense to not even consider Python or BASH for the job.)

cpt. sweden here. We have something similar running for the day of the rope and after it for final cleansing.


We struggle with the last part, the evidence. Since we also want to include the average citizen, we can't rely on things like archive.is, where stories about political figures can be stored and be credible. There is a hash tree algorithm we are trying to make screenshots from e.g. Facebook about obvious coalburning credible.

Database has already 400k entries. And some things can be even automated. For example looking for mudsharks with social bots who evaluate ethnicity, relation ship status and so on.

This way, when the day of the rope comes, and with already available face recognition software which has accuracy of 99%, there will be no one to slip through. Whores will be marked, traitors hanged.

The code for this whole thing is stored on github under a rather "unconspicious" name. Once we figure out some things about automating image credibility, we will dump the git here for others to grab and integrate into their own sollutions.

Even if we decentralise, database migration is always possible with a few scripts.

I forgot. If you plan on doing something similar, please also include evicende fields and "offense" fields into your databases along with so called "proof files". This will make migration much easier when the need arises.

I've got bad news for you.

doesn't matter if we fall. If we fall, someone else can wipe it clean using the data.

If you're coalburner database is limited to Sweden I'm pretty sure the Mudslime horde will wipe it clean, though I wouldn't really count that as a win.

you'e a nigger aren't you?

How far along is the creation of a separate society that Nordfront is doing? They have farms, compounds, factories, and shops right? Assuming you're part of NF

Python is used for the UI, settings, and creating shell code to run mysql, MultiChain, and Openbr. If I was making any of those other programs from scratch, I'd use C.

The blockchain can get huge, yes, but you can selectively ignore transactions which means you don't have to download the entire blockchain.


Would you mind sharing the database diagram fam? 400k dox db is a shitload. May as well allow you to import it

Jesus, man, you didn't even do anything to change your ID. How many hours have you spent astroturfing "muh C" without saying anything significant?

And in this entire thread not a single link to a github has been posted. Was it gay D&C all along?

Been busy af rewriting code n organizing shit.
Will be uploading it here: github.com/JohnnyHobo/CommunityPolicing

lolwtf? Looks like someone fucked up uids.

I think something fucked up on Holla Forums's end. A couple of my posts are showing up as part of the (86).

Yep. I wonder what could be causing this new and interesting fuck up?

Probably the clear lack of an understanding of the basic concepts of multithreading demonstrated in Holla Forums's codebase when the server is under a heavy load - no mutexes, no locks, no semaphores, no transactions, no synchronization primitives of any kind… which pretty much means that under a heavy load I've been assigned Holla Forums's internal IP address and banned under it, I've had my images show up in someone else's post, and now apparently I see the same uid used for multiple different people. They need to get their shit in order.

Now we should spend the next 100 posts shilling for the language that would be most apt to fix it with. I'll nominate Erlang.

Just noticed similar in another thread, all posts are the same id. Think maintenance fucked it up.

Yiss, Erlang is a great choice. Let's go full hipster and nominate Haskell or OCaml while we're at it.

Look into basing it on a model similar to zeronet.io. P2P, based off blockchain, and torrent protocol. Completely open source, so we could add and remove what we need.

Well hipsters would probably pick Rust after we laughed at their initial choices of Ruby and Node.


nntpchan is something that aims to be distributed
but I haven't tracked their project. I'm also not sure if they're friends or foes.

Another thing I'd like is subscription based moderators. Don't like the way the site is run, subscribe to different mods.

If you want a distributed data store, there are plenty of existing solutions.
If you want a super police clicky clicky hacker interface like in OP's pic, you're gonna have to do some work.

You have patrician taste in both operating systems and WMs.

Link the repo so we can try it out and contribute, OP. You look like someone who knows how to use git. I recommend Gogs if you want a nice familiar web interface.

I can contribute. Have skills in all three technologies you mentioned. Now monitoring.

notabug.org/ is free git/Gogs hosting for libre projects and they don't sperg about nahtsees like GitHub.
Come on OP, we want to help.

Can you post something so that we may follow up to this if and when it's done? A name, a symbol, anything.

Thank you.


It's not finished yet, working on bretty much everything at once.
Don't even have an alpha ready yet.

github.com/JohnnyHobo/CommunityPolicing
johnnyhobo.github.io/CommunityPolicing
johnnyhobo.github.io/CommunityPolicing/feed.xml


I'll mirror it on there.

Do you have a design outline or overview? I see a blockchain.py in there, and I'm immediately wary of the network problems bitcoin is facing right now.
I read the thread but IDs are fucked.
If you're in the mood to parse Terry Davis-tier rambling, the guy who made Kekchan came up with a "Holographic Rolling Blockchain" that allows for edits, pruning, distributed multiparty ownership, etc.
fortress.neocities.org/kekchan_alpha_prerelease_notes.html
There's unfortunately no code release yet.

holy shit I would love that.
The blockchain that will be used is multichain; it does not face the same drawbacks. From their website:

I made this:

I personally don't like charts and graphs and shit because I often find a better/smarter way to do something after getting to the nitty-gritty coding, so I've made a rough outline and change it as needed. Here's a few more I've been working on.

ML Faggot here
Well long time coder as well, asm, c++, c#, python be the game. asm/c++ mother tongue.

I got a lot of crap to do, but I want to chip in as well, when I got the time for it

Interesting, got to do research, that is test different kinds of ML techniques and ofc this open sw, and see what yields the best result. Like with research.

By all means use this now
I got so much I want to test here
Need those shekels, but this is what I live for, the job is a bit boring compared
I sincerely hope you are using R Studio and not that awful shitlab
Install pronto.

PLS NO WCF, ugh

Well, on github you contribute via forking the repo to your own and then submitting pull requests. So if you have git installed and a github account use:

>git clone [email protected]/* */:JohnnyHobo/CommunityPolicing
>git request-pull github.com/JohnnyHobo/CommunityPolicing master

>git clone [email protected]/* */:JohnnyHobo/CommunityPolicing
scratch that, this is easier: git clone github.com/JohnnyHobo/CommunityPolicing.git

I'm reading up on multichain now. Thanks.


Can we get a BUILDING file? I'm not sure at first glance what's what; I see Python and Ruby stuff.

A h'what?

Ruby is for the github website; the program scripts are currently found in /scripts with the logfile settings in /log

I thought there was compilation involved.
Okay, so it's pure Python then. Nice. I'll download it and play around in a little while.

It would be great if we had a way to datamine Facebook. People post their political opinions on there, social networks are mapped out and also miscegenators post photos of themselves with other races. With OpenCV it would be easy to flag images with two or more different races in them.

You need either API access to get people's information as a Facebook "app," which is subject to strict review, or normal account credentials to login as a user and crawl with a script, which will get you b& over and over.

What if, just hypothetically, you all spend four years building your nazi skynet and then it comes out that a black/brown/yellow/turquoise person was involved from the very beginning? What would you do, scrap the entire thing and start over?

>>>/gaschamber/

Anybody who unironically uses bash for production code is a nigger. It's a terrible language. The desire to use it signals that you're an imbecile newfag programmer who is unfamiliar with the superior options (e.g. python or perl) that lend themselves to secure and error free code once you start doing anything beyond the inanely trivial.

Anybody who unironically uses bash as their shell is a pleb. It's a terrible shell. Use zsh for greater productivity.

Really? I've been under the impression I'm a faggot for using Python instead of Bash because Python is easier.


That's not an African man. Are you fucking blind? Shitskin, sure, African admixture, maybe, but definitely not a nigger. You dumb fuck. You're the reason people believe that race is skin deep.

Redhat jew corporation completely shrekt the shit out of that GUI framework. I couldn't write a single program in it before raging and switching to Qt.

I know, I just didn't see the logic: If you're a white supremacist, you believe that jews are part of a global conspiracy and black people are inherently inferior, correct? It would then make sense to not use a program written by a jew, but what reason could there be to refuse to use software from a black person?

I consider all gerbils to be inherently inferior to me, but If a bunch of gerbils got together and built a pickup truck and then gave it away for free, I'd be glad to get one. I'm not going to be like "I don't use things made by INFERIOR CREATURES".

Second year of uni here, I've worked with a bunch of databases… if you need any help with the ERD/Stored Procedures, Triggers, etc. please dispatch a message via carrier pigeon.

...

But seriously though, it's better in almost every way.

I had never heard of this library. I'm going to play around with it. Looks really useful.

Thanks.

>github.com/JohnnyHobo/CommunityPolicing
Why is this on gitjew of all places?

gitgud.io seems to be the immediate non sjw alternative and you can run gitlab yourself later on a vps

Wouldn't a DB stored on blockchain work well for this?

how can I submit information to this?

I'm working on these things in PepeCoin and there are other crypto projects you can fork that have IPFS storage capability with hashing against a blockchain

Pretty serious amount of white admixture there.

a fb datamine tool would be nice but its not really required. just get general community to report screenshots of perpetrators and maybe even paste text that is in screen shots to make processing text easier.

use the power of crowd sourcing.

check this out: mugshots .com

it's run by some pajeet from the west indies or something, but all he's doing is republishing public records photos. I don't know how big of a deal copyright would be if anyone cares, considering it's all ready public record.

It's got 27 million decent photo's perfect for facial recognition with other public record info from the US.

If someone writes a scraper for this and manages to scrape that database you've got 27 million faces as a starting point for this database.

pic related, this is representative of the picture quality, don't know how big of a deal water marks would be for facial recognition. Files are named pretty well too.

But user, what about Kekchan, the distributed blockchain image board? It solves the "malicious data" problem (illegal content like CP).

Still waiting for an alpha
fortress.neocities.org/

...

on the other hand it would cause them endless torment that the good goy database was made possible by their shitty sjw language

Somehow I have a feeling that based Brian W. Kernighan (co-inventor of Golang and Unix, co-author of the C bible) has never even been in the same room as the (((person))) responsible for that SJW cancer.

Why isn't this open sourced? I'd want to commit to it.

It will be open sourced the moment the 1.0 client is released, rest assured.

Kekchan belongs to the anons.

Is that you fortress? Keep doing God's work you amazing son of a bitch. Can't wait to use Kekchan

Cool. Make a thread on /tech when you do with the VCS link…are you going to host it personally? Are you using git, hg, or something else for VCS?

So what does all this mean to a retard lacking tech skills such as myself?

Git.


It means that (((they))) fucked up bigly when they threatened the memes:
https:// www.youtube.com/watch?v=Dxjdv0Lp4pI

It means that we will defeat our (((enemies))):
https:// www.youtube.com/watch?v=n1xjs75ojPk

And it means that we will live forever:
https:// www.youtube.com/watch?v=HGcjCYP_pwQ

Kill yourself faggot

if the facial recognition software was coded well enough, not by a nigger, the watermarks shouldn't matter at all.

10,000% comfy
CIANIGGERS ETERNALLY BTFO


It'll be negligible.

I've been writing code to extract faces from video and image files. Each face is associated with a geolocation and event, but no identity (yet). Posted a few example images of current workflow.

We need to compile a labeled dataset of known faces in order to do facial recognition of unknown faces.

There's a bunch of methods for facial recognition, but the most accurate ones I've seen require a good training dataset.

I suggest we start cropping images of the people we have names for and saving the image file as "FIRSTNAME_LASTNAME". No more than 5 images for a single individual. Any size and image format will do.

Another BIG issue we have is identifying people with masks.

I'm close to overcoming that though. I've been working on a way to reconstruct the faces of people with masks on using a generative adversarial denoising autoencoder.

Basically input a masked face. Output a likely representation of the unmasked face. Feed unmasked face to facial recognition database to get identity.

Example image shown.

Right = actual face
Middle = "masked" face
Left = reconstructed face

Oh boy, this is a good thread. Two things i want to press:

1. Facebook's Graph API is an absolute fucking GOLDMINE of information. You can use it to collect massive amounts of data and metadata from pages, groups and events; and then fill in what the API dosen't do with manual data input (HTML parsing). You can quickly vacuum thousands of people's affiliations, likes, events and comments with an hour of using the API. It's easy, i've tested this before. I wrote a couple tools to do just that, and i was considering making a big NSA-tier version to put everything together, only problem is i'm a NEET who only knows Python. But i've put a lot of work and thought into this. Pic related.

2. Are you SURE you want to make it open-source? I don't mean making it commercial, i mean putting something of this caliber on the internet free for anyone to download, set up and use. Do you REALLY want the commies to pick it up and use it against us? That's the reason i've never came here and went "HELP NEEDED FOR SUPER INTERNET SPYING PROGRAM". Any work on this must be in a strictly controlled environment and total secrecy, i think we've all seen what happens when cyberweapons get into the wrong hands lately. And yes something like this is that powerful, it would be an OSINT/SOCINT data collection system the likes of which are rarely seen.

reminds me of someone

Your Graph API code sounds interesting. Does it allow you to get the persons profile pic or profile url? This would be an excellent source of labeled data for training the facial recognition model.

Honestly, I think the only way this will ever get finished is if it's open source. Also, I'm betting this is a side project for all of us, so it'd be nice to contribute on a come and go basis. Plus, it seems like we're sticking to publicly available information and data so it's not really a secretive spying operation. Creepy, yes. Illegal, no.

(checked)

Holy cow anons! You're ding god's work. Really wishing I listened when my dad tried to teach me to code.

Not the profile pic, but the URL sometimes and always the user ID. The ID is a much more reliable way of identifying someone, because sometimes different people have the same name. You'd probably need in final a Person-ID different from the facebook one, because some people have alts. You could probably automate something, like a bot account, that downloads the user's profile picture. I figure you'd need a decent number of sock accounts to spread out the API requests and collection. You can buy those on blackhat SEO websites, fully activated and cellphone verified.
I know open source is the most realistic way it would ever get done, but i'm scared of putting this technology, or even the IDEA of this technology, into our enemy's heads. Amalgamation of OSINT is very powerful; it's very easy to figure out the top dogs of any movement or organization when you start analyzing. archive.is/KcbT6

if it's not open source then it'll have to be put on a single server behind an api or web server which will eventually be shoa'd.

open source and distributed is the only way. it's not the source code that's the problem it's the database. it's likely to be filled with all kinds of shit that can be easily shoa'd via dmca. there's no way this could be done commercially for example, you couldn't charge for this, because practically the entire database would be composed of copyrighted data, either from facebook, or twitter, or youtube (the video's are copyrighted), etc.

Good point. So basically, put it out there and hope to god you're better than the enemy at intel and OPSEC?
Maybe you could have the database be a blockchain or something as stated above, i'm not very familiar, but it needs to be decentralized yet united. We need all the puzzle pieces to come together.

the difference between Holla Forums and the average leftist transfaggot is Holla Forums shouldn't have their face plastered over 15 different social media accounts.

some next gen blockchain thing would be great but I doubt it's ever going to get organized, the software would just need to take multiple databases to scan over, preferably there would be some common format so these databases could be merged and duplicates removed.

the databases could then be shared over already common things, torrents, ipfs, retroshare, i2p, gnunet, w/e, wherever the infrastructure is already there.

The data would get pretty big tho. Hundreds of gigabytes easily once it gets serious, you would need dedicated storage for that at some point. You could just host it offshore and/or on the tor network.

I've been looking at openface mostly for facial recognition, but from what it seems hundreds of gigabytes wouldn't be required for the database.

what seems to happen is it scans all these reference faces and then saves the metrics to analyze whatever your going to analyze.

you don't need to maintain the reference images afterwards, you should be able to save just the metrics (along with the other info, twitter id, etc), if you delete the images afterwards, or maybe just keep a thumbnail or something, the database size could be reduced alot.

also all this facial recognition is seems to be done in grayscale, so they can be converted and saved in greyscale if you want to maintain images, which should save space

retroshare as shitty as this is could pull this off and over tor. 500gb torrents would be shitty, especially if it's getting updated on the regular. retroshare can handle the individual files and can sync and download from multiple peers etc.

if the software is written to do this you could also publish massive 500gb databases semi-regular over torrents, and then bootstrap with updates, like openstreetmap does with planet files and diff's for updates, you don't download the whole planet everytime you update, you just do it once and auto-update via diffs.

Not just for facial recognition my man, i'm talking absolutely everything they publicly post on social media, either directly archived or linked. Of course you could disregard the irrelevant stuff, but still that's a lot of people we would be interested in, just look at my pic, , in 2 minutes this thing spotted 4,385 users. Once you start seriously scouring through those big leftist/communist pages and groups you're looking at possibly a million people, with billions of single database entries. The power this gives you, think about it. You can learn a whole lot about someone through what he posts on the internet.

Yeah something like retroshare would seem best for the moment, you only need to get new parts and what changed.

the hard part on that would be the images, way too much space would be required. if it's text data that could atleast be compressed significantly. it doesn't have to be billions of single database entries either. that's all hardcore database tweaking, but you can stick a lump of json for post data and things like that in there for each user/id.

my sides

We need to focus small for now to just get it going. We'll adapt as we grow.

Part 1 is face detection. This is just simply finding a face/faces in a video or image.

Part 2 is facial recognition. This is comparing an "unknown" face against images of known faces and finding a match.

Part 3 is the database that stores images with other metadata like name, age, etc.

Each of these parts can initially be developed independently. The GUI is the end goal, but it needs Parts 1-3 first.

I'm working on getting openface up and running ( cmusatyalab. github.io/openface/ )

I'm not sure how well it handles video, but it'll definitely do Part 1 and 2. For video what it looks like it does it run it frame by frame.

I don't know whats up with this "Deep Neural Network" shit either. It looks like it's just buzzwords

so I should get my tech skills up?

user, take a look at this video.

Does this look like a buzzword to you? Now Imagine if we a system like this active every time antifa showed up.

And here's one with temporal tracking too.

ya that's pretty cool, whatever ACT is it's clearly shit lol

I take it this "Deep Neural Network" thing creates new reference frames on the fly

bump

bump

this thread isn't the best place to actually talk about working on this, i suggest a tox group gets started

inb4 tox is shit muh ip blah blah etc, if anyone has a better suggestion by all means. tox is fine just configure it to use tor or use a vpn

You aren't going to get a good representation from ONE face image unless you do some crazy-ass 3D shit. See pic related, that's facebook's model, and they managed to get 97% accuracy. They achieved this by replacing the detection/alignment step with a SEPARATE neural network to convert the faces to a standard 3D model, and feeding THAT into their main CNN.

To be fair though, we all know data is the real secret sauce, and god only knows The Zuck has plenty of that.

Holy shit. That's fucking impressive, thanks user.

As long as there are good whitelist/blacklist capabilities, you'll be golden. It'll be nice to see Holla Forums become a faded relic.

That's not a long term solution: Computers are cheap, botfarms/botnets are cheaper, and IP addresses are free. There's no point in -keeping an ever-expanding list of banned IPs when your opponent can buy hundreds of new machines with the click of a mouse.
And If you've been keeping track of pre-existing internet subversion techniques, you'd know that the most successful schemes involve either:

1. Long-term subterfuge and narrative stearing.
2. Stylemetry over months, if not years. Pics related

Why isn't anyone else commenting? This is important.

I still had a half written post from earlier this month.

Looking over it. I have been a dba fag for a while, trying to figure out what sort of access it would need because 500GB+ in your typical RMDBS is going to require a shit ton of ram or day long queries. There are several no-sql (most are faggy javascript), but things like Cassandra is what twitter users and it clusters nicely. I am going to be looking into that open face shit here in a bit. The closest thing I have ever done to that was to take an image and break it into an image pallet and use the pallet to search through a large file system of pictures to match thumbnails to larger pictures. It was mostly accurate, but more of a reverse image search tool.

Building a distributed database won't be the hard part, it's not a solved problem but the sheer volume of preexisting solutions means that it shouldn't be more than three weeks worth to customize one to suit our needs. I think the blockchain guys from above are already on it.

Keeping our core analysis usable is a different ballgame. OpenFace's best model is about 92.9% accurate (pic related 1), which means that it's going to be unusable after 10K identities, which sounds like a lot, but really isn't, and in practice it's going to be far less than that. Doing better is going to require a more advanced CNN solution, which is "technically doable" (pic related 2)but comes with it's own set of problems.

Case in point, Pic Related Two: FacialLandmarkDetectionbyDeepMultiTaskLearning. When I first read the paper, it astounded me, not only because of the results but because how their model/algorithm could be expanded. That being said, the maths is NOT simple, the code is in MATLAB, and while it doesn't require the best GPU, you do need one, and not everyone can run out and drop money on new hardware. The code is here: mmlab.ie.cuhk.edu.hk/projects/TCDCN.html. If anyone knows Matlab, CUDA, and either Theano/TensorFlow or Torch, re-implementing it into a open-source language would be amazing.

And, I just noticed that OpenFace used a TeslaK40 on there performace testing. Great.

Is there any way to help for someone with knowledge of a little bit of Java, JavaScript, SQL and ABAP?

Yes, read this this paper. 1/2

All of those were me, btw.

2/2. Actually, everyone should read this.

And everyone's gone again.

Could we have a board, discord, mailing list or any other reliable way of setting this up other than a thread?

I set up a discord server. Join if you want, the link is going to expire in a day.

discord.gg/fpKNPv

I just tried to create a board, it's giving me a 403 error.

Anyway, if you manage to create one, call it /comint/ or /sigint/, or whatever you want.

I've tried too and I'm thinking it has something to do with April's hack. Some boards still haven't been restored yet either.

Actually, not really. It's called occlusion, and there's tons of research on it

arxiv.org/pdf/1607.08438v1.pdf
newscientist.com/article/dn27761-facebook-can-recognise-you-in-photos-even-if-youre-not-looking/


Also, I can't post PDFs for some reason.

Everyone who can code needs to join the discord.

Just came by to check those trips. Carry on.

Here is the permanent link to join the discord

discord.gg/W75yuvH

Would you mind sharing your code?

Also, could you resize the images as 256x256? It would save time when using other neural nets.

...

We need to come up with a unified plan or framework. Mission Creep and Feature creep have been the death of hundreds of projects.
Coming up with a single pipeline should be our first priority.

Where is everyone else?

I will be on there in an hour or two

Updated git; been working on the actual database layout, let me know if anyone has any suggestions.

File sharing will be done (automatically) on the blockchain; that part is actually very easy since the software already exists for it. Just gotta find the right settings and figure out what kind of file compression will be best. But, you'll also be able to import raw data into the database anyway you like; and I'm making exportable/importable filetypes for this as well so we can share them, say on Holla Forums or filedropper

confirmation post, un DIC

Yeah. Pretty hilarious notion. I hope OP was trolling.

Can you add a field for info, which does not fit in any of these categories? Like a miscellaneous field.

OpenBSD and use korn shell
nigger free with OpenBSD!

Couldn't the Media-Crimes relationship be many-to-many? Especially with multiple perpetrators.

Or I guess that would address multiple media of the same crime. Also HAIL

Yeah, I'll throw in an intermediary real quick.
An user shared a facebook scraper with us and we're working on implementing that into the data from that into the db as well.

I haven't even made it a quarter of the way through the thread yet but….
HOLY SHIT BUMP

What skills are required to apt-get something or click on an installer?

The problem is sustainability vis-a-vis state actors (not three letters, but simple banning/dns-fudging/filtering at ISP tier).

Bash is awkward. This should be enough, shouldn't it?!
Other shells are crufty or weird, but they have structure. bash is just comical.

You're an idiot. Why would you put your own pics… what…

I did not know retards know another domain besides twatter.shitcom

And here we are. Probably a simpler way to organize this dreck are simple folders which are created by shared zip/7zip files and therein are images and text files. You grep over it and some pol-dude programs a facial reco program to index local files. Look at Lexis-nexis, a glorified, multi billion dollar business of "grep over *.txt" > results-now-gimme-shekels.html
I cherish any attempt to bring back a little database clout (ha!) to the common man.
There is a bittorrent BEP for changing files. What I daydreamed once was a bittorrent style subscription sync over IP (routable through tor, i2p and cjd-ns) where a founder can pin a bucket with files and others can propose submissions. The bucket extends automatically, by pre-approved signature or block-chain style consensus algorithms.

Weird enough, what I daydreamed wasn't for pol, but a subscription to a TV series hash file where any addition would be added automatically to my torrent client and I could just let it run in the background. Long story short, I'd buy an old TFT, connect it to a Raspberry Pi and let it run 24/7. Come home in the evening, look up new stuff, go through the doxfiles, add some comments, add some data, let it spread. whynot.jpg

I like the all of the additions I can see user. It looks robust enough, I don't even think we'd have problems adding other sites, at least at a rudimentary level, but they don't organize as much as facebook.

Optional foreign key for facebook group in the Groups table?

USE KORN SHELL


stay nigger free w/ OpenBSD

best documentation and most secure os by default for a reason.

Anyone who can actually code needs to join the Discord.

Current Progress:

1) Face Detection - We have a pipeline to extract faces from videos and images. Several demos here: twitter.com/JohnAmericanPhD

Input = Video | Image (any dimensions, quality)
Output = Face Image (227x227 pixel, .jpg)

TODO :
* Share code and details on this pipeline
* Change output size to 256x256 pxls


2) Face Recognition - There is no progress yet in this thread. There are ideas for algorithms to use, but no demos. More importantly, there should be a training dataset which can be shared among anons.

TODO :
* We need images of faces with full names
* Crop face, save as "firstname_lastname.jpg" (any dimensions)
* Need a way to share this training data among anons


3) Database - We have a nice demo of the SQL schema from user. It is currently unclear how to access and distribute this database.

TODO :
* Please share SQL schema in txt form
* Need ideas for distributing database
* Need ideas for accessing database

4) GUI - Unclear about the status of the GUI. There is a demo image of "doxwizard.ui", built in Python/QT. Appears GUI function is for:

a) facial recognition of a user submitted image
b) user can submit info about a known ANTIFA member to database


NOTES : Things are looking good, we are slowly getting organized here. Biggest hurdles right now seem to be sharing code and data.

Database FacialRec Roadmap Version One

Does anyone have any questions, concerns, or ideas?

i can code user post a 1 time invite for tox/discord/anything and we can deal with auth/verification from there
go rm xl

time to wear a mask

Not gonna help user. Didn't you see the post above?

I only briefly looked at the thread, so I have to ask: What programming languages would be best to use to program a facebook scraper to use on "Groups" of commies?

I'd use Python, its easy, and there are a lot of libraries available for handling javascript, json, and html parsing. Please don't try to be one of those faggots that take up something ridiculous like C++ for something like a web scraper, you'll never get shit done that way. People tend to let their e-peen get in the way of software development and go over their skill levels

Python with Facepy and use the GraphAPI.
github.com/jgorset/facepy

OP here, I'll take care of the database and distribution of it.

Had been working with standard relational database, but if we include facebook, linkedin, twitter, etc. with all the other data and metadata the database is going to be far too complex to be useful and extremely slow.

I've been looking into graph databases the same shit facebook and twitter use and they are 100% the way we have to go. Each account will have its' own node with a relationship pointing back to the person and vice-versa. This allows us to use many-many relationships directly instead of creating an entirely separate table; it also means we can add 'nodes' on the fly for a very flexible, changeable schema.

Here's a basic setup we can start with:

Nodes:
Crime
Person
Location
Phone
Group
Media
Website
FBookUser
FBookPost
FBookGroup
FBookEvent

Relationships:
Person - [is] - Person (useful for accidental duplications)
Person - [knows] - Person
Person - [lives] - address
Person - [lived] - address
Person - [has] - phone
Person - [had] - phone
Person - [member of] - Group
Person - [donor to] - Group
Person - [media] - Media
Person - [committed] - Crime
Person - [victim] - Crime
Group - [owned by] - Person
Group - [has] - Phone
Group - [address] - Address
Group - [donates] - Group
Group - [owns] - Website
Group - [media] - Media
Group - [committed] - Crime
Group - [victim] - Crime
Person FBookUser
FBookUser - [posted] - FBookPost
FBookUser - [member] - FBookGroup
FBookUser - [media] - Media
FBookUser - [attending] - FBookEvent
FBookGroup - [is] - Group
FBookGroup - [follows] - Person
FBookGroup - [posted] - FBookPost
FBookGroup - [links] - Website
FBookGroup - [media] - Media
FBookGroup - [has] - Phone
FBookPost - [comment] - FBookPost
FBookEvent - [address] - Address
FBookEvent - [committed] - Crime
Crime - [at] - Address
Crime - [evidence] - Media
Crime - [evidence] - Website
Crime - [suspect] - Person
Crime - [suspects] - Group
Crime - [happened-at] - FBookEvent
As far as the database is concerned; it can be adapted and changed easily to allow data from all major social media outlets.
In fact we can use FBook* for pretty much every other social media outlet as well and simply add a new relationship e.g.:
Person - [twitter] - FBookUser
Person - [blog.wordpress.com] - FBookUser

whoa I like it
So basically, each site we add will just have its own cluster of nodes for various purposes. And we can label nodes with purposes, ie mutual twitter followers + fb friends are "friends" or something. Without removing data from consideration so we can always get more specific later. How much of the sites (ie Facebook) would we contain in our DB? Manual insertion, scraping, or some combination? This could have some interesting implications too, such as a log of deleted content from watched profiles. But that would start to depend on an archival process and/or resources for more frequent data collection.

Not too exciting at first glance, but this means we'll be far more independent of social platform, so this tool, if maintained, wouldn't lose relevancy after another myspace/facebook shift. Instead of losing the facebookUser tied to an account url, we'll lose a facebookAccount attached [belongs to] user node that remains untouched; we can subsequently re-attach.

BOARD CREATION IS BACK - WE NEED A BOARD NOW

Some name ideas:

/mlint/ - Machine Learning Intellegence
/compint/ - Computer Intellegence
/sigint/ - Self Explananatory
/osint/ - Self Explananatory
/database/ - Self Explananatory

DONE: 8ch.net/compint/index.html

LET'S GET DOWN TO BUSINESS.

Please move all further discussions to the /compint/ board.

bumping for awareness

As much or as little as you want.
Like I said, the above are guidelines; but it will be up to the end-user what exact data will be stored.
The user-interface and wizards will allow a restricted range of data, but with the command-line you'll be able to do anything you want with the database.

You'll also be happy to know, the database driver is functional and just needs to be optimized.

DEDICATED COMPUTER INTELLIGENCE BOARD UP
DEDICATED COMPUTER INTELLIGENCE BOARD UP
DEDICATED COMPUTER INTELLIGENCE BOARD UP

Anything related to The Database, or anything else like it, move to /compint/

How about no tripfag
There's so much wrong about your shit that you ignored completely and massive fuckups that proved you as outsider

You are a plebbit tier shill, gas yourself

It's not my discord wiseguy. And I've been here for months.

...

Not sure what you want me to say. And me not knowing how to use Holla Forums fully has absolutely no bearing on my coding skills.

In the end, that's all that matters.

I got server to provide

bumpu
wish I knew enough programming to actually contribute; I don't think people understand the implications this database might have, otherwise a mod would sticky it

The best time to plant a tree is 20 years ago, the second best time is right now.

To learn Python: youtube.com/user/sentdex
To learn C++: youtube.com/user/thenewboston


Look what I found:
github.com/ryanjay0/miles-deep
github.com/bakwc/PornDetector

bumpu

Where the hell is everyone?

I just checked and your board is really cancerous.
You should lurk for at least 2 years before you post again.

any updates?

OP got seth rich'd.

Lolno

The question I'm more interested in is where did everyone else go? It seemed like we were hitting steam, and now nothing.

These links don't work. Would you make another Discord invite please?


Hey, I've downloaded some whitepapers on facial recognition and OSINT through my uni. Mind sending out another Discord invite? I posted on your board here >>>/compint/16

Can't, it's not my discord, and the actual owner has been AFK for weeks.

But if you have access to papers NOT on ARXIV, by all means post them.

Hey, ignore what I said earlier: The board owner changed the settings: discord.gg/NN3TGNv

Let's get started.

Anyone using the internet or promoting internet usage is a kike shill fed.

What the hell are you talking about?

The internet is ==KIKED== just like you tripfag.

You do know you're using the internet right now? To type those words and post them to Holla Forums?

bump

NNTP chan on i2p you faggots and meguca dot ORG

We tried IRC, but everyone hated it.

Hey I contributed to your board. When will this thing be up and running? I can't wait to see the Ministry of Love first-hand.

As soon as we get more people to join the discord and start coding.