-
-
Notifications
You must be signed in to change notification settings - Fork 9
/
artificial-intelligence.bigb
1443 lines (1048 loc) · 48.7 KB
/
artificial-intelligence.bigb
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
= Artificial intelligence
{wiki}
= AI
{c}
{synonym}
{title2}
= AI by capability
{c}
{parent=Artificial intelligence}
= Artificial general intelligence
{parent=AI by capability}
{wiki}
= AGI
{c}
{synonym}
{title2}
Given enough computational power per dollar, AGI is inevitable, but it is not sure certain ever happen given the end of <Moore's law>[end of Moore's Law].
Alternatively, it could also be achieved genetically modified biological brains + <brain in a vat>.
Imagine a brain the size of a building, perfectly engineered to solve certain engineering problems, and giving hints to human operators + taking feedback from cameras and audio attached to the operators.
This likely implies <transhumanism>, and <mind uploading>.
<Ciro Santilli> joined the silicon industry at one point to help increase our computational capacity and reach AGI.
Ciro believes that the easiest route to full AI, if any, could involve <Ciro's 2D reinforcement learning games>.
= Principles of AGI
{parent=Artificial general intelligence}
= The missing link between continuous and discrete AI
{parent=Principles of AGI}
<Ciro Santilli> has felt that perhaps what is missing in 2020's <AGI> research is:
* the interface between:
* the continuous/noisy level (now well developed under <artificial neural network> techniques of the 2010's)
* and <symbolic AI> level AI
The key question is somewhat how to extract symbols out of the space-time continuous experiences.
* more specialized accelerators that somehow interface with more generic <artificial neural networks>. Notably some kind of speialized processing of spacial elements is obviously hardcoded into the brain, see e.g. <grid cell>{full}
Forcing these boundaries to be tested was one of the main design goals of <Ciro's 2D reinforcement learning games>.
In those games, for example:
* when you press a button here, a door opens somewhere far away
* when you touch certain types of objects, a chemical reaction may happen, but not other types of objects
Therefore, those continuous objects would also have "magic" effects that could not be explained by "simple" "what is touching what" ideas.
Bibliography:
* https://mitpress.mit.edu/9780262632683/the-algebraic-mind/
= Intelligence is hierarchical
{parent=Principles of AGI}
This point is beautifully argued in lots of different sources, and is clearly a pillar of <AGI>.
Perhaps one may argue that our <deep learning> layers do form some kind of hierarchy, e.g. this is very clear in certain models such as <convolutional neural network>. But many of those models cannot have arbitrarily deep hierarchies, which appears to be a fundamental aspect of intelligence.
<How to Create a Mind>:
\Q[The lists of steps in my mind are organized in hierarchies. I follow a routine procedure before going to sleep. The first step is to brush my teeth. But this action is in turn broken into a smaller series of steps, the first of which is to put toothpaste on the toothbrush. That step in turn is made up of yet smaller steps, such as finding the toothpaste, removing the cap, and so on. The step of finding the toothpaste also has steps, the first of which is to open the bathroom cabinet. That step in turn requires steps, the first of which is to grab the outside of the cabinet door. This nesting actually continues down to a very fine grain of movements, so that there are literally thousands of little actions constituting my nighttime routine. Although I may have difficulty remembering details of a walk I took just a few hours ago, I have no difficulty recalling all of these many steps in preparing for bed - so much so that I am able to think about other things while I go through these procedures. It is important to point out that this list is not stored as one long list of thousands of steps - rather, each of our routine procedures is remembered as an elaborate hierarchy of nested activities.]
<Human compatible>: TODO get exact quote. It was something along: life goal: save world from hunger. Subgoal: apply for some grant. Sub-sub-goal: eat, sleep, take shower. Sub-sub-sub-goal: move muscles to get me to table and open a can.
= AGI architecture
{c}
{parent=Principles of AGI}
\Video[https://youtu.be/pd0JmT6rYcI?t=3536]
{title=From Machine Learning to Autonomous Intelligence by <Yann LeCun> (2023)}
{description=
After a bunch of B.S., LeCun goes on to describe his AGI architecture. Nothing ground breaking, but not bad either.
* https://youtu.be/pd0JmT6rYcI?t=3705[]: <intelligence is hierarchical>
}
= Elements of AGI
{c}
{parent=AGI architecture}
This section is about ideas that are thought to be part of an AGI system.
= Common sense
{parent=Elements of AGI}
{wiki}
\Video[https://www.youtube.com/watch?v=49t-WWTx0RQ]
{title=My Job is to Open and Close Doors by Mattias Pilhede (2019)}
{description=An interesting humourous short meditation on <common sense>.}
= Instrumental goal
{c}
{parent=Elements of AGI}
= Instrumental convergence
{c}
{parent=Instrumental goal}
{wiki}
= AGI research
{c}
{parent=Artificial general intelligence}
= History of AGI research
{c}
{parent=AGI research}
= AGI blues
{c}
{parent=History of AGI research}
Term invented by <Ciro Santilli>, similar to "<nuclear blues>", and used to describe the feeling that every little shitty job you are doing (that does not considerably help achieving <AGI>) is completely pointless given that we are likely close to <AGI> as of 2023.
= Moravec's paradox
{c}
{parent=History of AGI research}
{title2=1980s}
= AI winter
{c}
{parent=History of AGI research}
{wiki}
= AGI research has become a taboo in the early 21st century
{c}
{parent=History of AGI research}
Due to the failures of earlier generations, which believed that would quickly achieve <AGI>, leading to the <AI winters>, 21st researchers have been very afraid of even trying it, rather going only for smaller subste problems like better neural network designs, at the risk of being considered a <crank (person)>.
While there is fundamental value in such subset problems, the general view to the final goal is also very important, we will likely never reach AI without it.
This is voiced for example in <Superintelligence by Nick Bostrom (2014)> section "Opinions about the future of machine intelligence" which in turn quotes Nils Nilsson:
\Q[
There may, however, be a residual cultural effect on the AI community of its earlier history that makes many mainstream researchers reluctant to align themselves with over-grand ambition. Thus Nils Nilsson, one of the old-timers in the field, complains that his present-day colleagues lack the boldness of spirit that propelled the pioneers of his own generation:
\Q[
Concern for "respectability" has had, I think, a stultifying effect on some AI researchers. I hear them saying things like, "AI used to be criticized for its flossiness. Now that we have made solid progress, let us not risk losing our respectability." One result of this conservatism has been increased concentration on "weak AI" - the variety devoted to providing aids to human
thought - and away from "strong AI" - the variety that attempts to mechanize human-level intelligence
]
Nilsson’s sentiment has been echoed by several others of the founders, including Marvin Minsky, John McCarthy, and Patrick Winston.
]
<Don't be a pussy>, AI researchers!!!
= AGI conference
{c}
{parent=AGI research}
https://www.agi-conference.org/
It is hard to overstate how low the level of this conference seems to be at first sight. <AGI research has become a taboo in the early 21st century>[Truly sad].
= Journal of Artificial General Intelligence
{c}
{parent=AGI research}
https://sciendo.com/journal/JAGI
= AGI research entity
{c}
{parent=AGI research}
{tag=AI research entity}
* https://www.quora.com/What-are-some-good-research-schools-PhD-for-Artificial-General-Intelligence-not-Machine-Learning/answer/Ciro-Santilli What are some good research schools (PhD) for Artificial General Intelligence (not Machine Learning)?
* 2020 https://towardsdatascience.com/four-ai-companies-on-the-bleeding-edge-of-artificial-general-intelligence-b17227a0b64a Top 4 AI companies leading in the race towards Artificial General Intelligence
* Douglas Hofstadter according to https://www.theatlantic.com/magazine/archive/2013/11/the-man-who-would-teach-machines-to-think/309529/ The Man Who Would Teach Machines to Think (2013) by <James Somers>
* Pei Wang from Temple University: https://cis.temple.edu/~wangp/
= Steven Byrnes
{c}
{parent=AGI research entity}
{tag=Astera Institute person}
* https://sjbyrnes.com/
* https://twitter.com/steve47285
* https://www.lesswrong.com/posts/diruo47z32eprenTg/my-computational-framework-for-the-brain
= Astera Institute
{c}
{parent=AGI research entity}
https://astera.org/agi/
By the rich founder of <Mt. Gox> and Ripple, <Jed McCaleb>.
\Q[Obelisk is the Artificial General Intelligence laboratory at Astera. We are focused on the following problems: How does an agent continuously adapt to a changing environment and incorporate new information? In a complicated stochastic environment with sparse rewards, how does an agent associate rewards with the correct set of actions that led to those rewards? How does higher level planning arise?]
= Astera Institute person
{c}
{parent=Astera Institute}
= Michael Nielsen
{c}
{parent=Astera Institute person}
Interesting dude, with some interest overlaps with <Ciro Santilli>, like <Quantum Computing>:
* https://github.com/mnielsen
* https://michaelnielsen.org/
* https://twitter.com/michael_nielsen
* https://www.youtube.com/c/michaelnielsen
= GoodAI
{c}
{parent=AGI research entity}
<Marek Rosa>'s play thing.
= AI People
{c}
{parent=GoodAI}
{tag=AI game with natural language}
{title2=2023}
\Video[https://www.youtube.com/watch?v=xkn0H_iWDEQ]
{title=AI Game - LLM-driven NPCs that can talk by Marek Rosa (2023)}
{description=Not the most amazing demo, but the idea is there. Seems to be a preview for <AI People>. The previous working title seems to have been AI Odyssey.}
= Marek Rosa
{c}
{parent=GoodAI}
{wiki}
= FutureAI
{c}
{parent=AGI research entity}
= Future AI
{c}
{synonym}
{title2}
It is a bit hard to decide if those people are serious or not. Sometimes it feels scammy, but sometimes it feels fun and right!
Particularly concerning is the fact that they are not a <not for profit> entity, and it is hard to understand how they might make money.
<Charles Simon>, the founder, is pretty focused in how natural neurons work vs <artificial neural network> models. He has some good explanations of that, and one major focus of the project is their semi open source spiking neuron simulator <BrainSimII>. While <Ciro Santilli> believes that there might be insight in that, he also has doubts if certain modules of the brain wouldn't be more suitable coded directly in regular <programming languages> with greater ease and performance.
FutureAI appears to be Charles' retirement for fun project, he is likely <independently wealthy>. Well done.
* https://www.aitimejournal.com/interview-with-charles-simon-ceo-and-founder-futureai
* 2022 raised 2 million USD:
* https://www.prnewswire.com/news-releases/ai-futureai-raises-2-million-to-develop-artificial-general-intelligence-301459164.html
\Video[https://www.youtube.com/watch?v=ivbGbSx0K8k]
{title=Creativity and <AGI> by <Charles Simon>'s at AGI-22 (2022)}
{description=
Sounds OK!
* https://youtu.be/ivbGbSx0K8k?t=856 general structure of the <human brain> 86B total, matching <number of neurons in the human brain>, with:
* 14B: brainstem
* 16B: neocortex
* 56B: cerebelum
* https://www.youtube.com/watch?t=1433 some sequencing ideas/conjectures
}
\Video[https://www.youtube.com/watch?v=KQP1gPTk0FI]
{title=Machine Learning Is Not Like Your Brain by <Future AI> (2022)}
{description=Contains some <BrainSimII> demos.}
= BrainSimII
{c}
{parent=FutureAI}
{tag=Neuron simulator}
https://github.com/FutureAIGuru/BrainSimII
The video from https://futureai.guru/technologies/brian-simulator-ii-open-source-agi-toolkit/ shows a demo of the possibly non open source version. They have a <GUI> neuron viewer and editor, which is kind of cool.
\Video[https://www.youtube.com/watch?v=KQP1gPTk0FI]
{title=Machine Learning Is Not Like Your Brain by <Charles Simon> (2022)}
= Sallie
{c}
{disambiguate=FutureAI}
{parent=FutureAI}
{tag=AI training robot}
Not having a manipulator claw is a major issue with this one.
But they also have a co-simulation focus, which is a bit of a win.
= Charles Simon
{c}
{parent=FutureAi}
* https://www.linkedin.com/in/charles-simon-futureai/
* https://futureai.guru/about/the-team/
Basically it looks like the dude got enough money after selling some companies, and now he's doing cooler stuff without much need of money. Not bad.
= AGI software
{c}
{parent=Artificial general intelligence}
= Artificial general intelligence software
{synonym}
{title2}
* https://ai.stackexchange.com/questions/5428/how-can-people-contribute-to-agi-research mentions:
* https://github.com/opennars/opennars
* https://github.com/brohrer/robot-brain-project
= OpenCog
{c}
{parent=AGI software}
{wiki}
= Ben Goertzel
{c}
{parent=OpenCog}
{tag=AGI research entity}
{wiki}
https://www.reddit.com/r/artificial/comments/b38hbk/what_do_my_fellow_ai_researchers_think_of_ben/ What do my fellow AI researchers think of Ben Goertzel and his research?
= SingularityNET
{c}
{parent=Ben Goertzel}
{wiki}
https://singularitynet.io/
<Ben Goertzel>'s <fog computing> project to try and help achieve <AGI>.
= NuNET
{c}
{parent=SingularityNET}
{tag=Fog computing}
= AGI-complete
{c}
{parent=Artificial general intelligence}
{tag=Complexity class}
{wiki}
= AI-complete
{synonym}
Term invented by <Ciro Santilli> to refer to problems that can only be solved once we have <AGI>.
It is somewhat of a flawed analogy to <NP-complete>.
= AGI test
{c}
{parent=Artificial general intelligence}
{wiki=https://en.wikipedia.org/w/index.php?title=Artificial_general_intelligence&oldid=1192191193#Tests_for_human-level_AGI}
= CAPTCHA
{c}
{parent=AGI test}
{wiki}
= reCAPTCHA
{c}
{parent=CAPTCHA}
{wiki}
= Turing test
{c}
{parent=AGI test}
{wiki}
= The Abstraction and Reasoning Challenge
{c}
{parent=AGI test}
* https://lab42.global/arc/
* https://pgpbpadilla.github.io/chollet-arc-challenge
= The Employment Test
{c}
{parent=AGI test}
{wiki}
That's <Ciro Santilli>'s favorite. Of course, there is a huge difference between physical and non physical jobs. But one could start with replacing desk jobs!
= AGI bibliography
{c}
{parent=Artificial General Intelligence}
{tag=AI bibliography}
<GitHub awesome repos>:
* https://github.com/EmbraceAGI/Awesome-AGI
* https://github.com/enricoros/awesome-agi
= Automated theorem proving
{parent=AI by capability}
<AGI-complete> in general? Obviously. But still, a lot can be done. See e.g.:
* <The Busy Beaver Challenge> deciders
= Statistical classification
{parent=AI by capability}
{wiki}
= Classification problem
{synonym}
= Cluster analysis
{parent=AI by capability}
{wiki}
= Clustering
{synonym}
= Generative AI
{parent=AI by capability}
{wiki=Generative_artificial_intelligence}
= GenAI
{c}
{synonym}
{title2}
= Generative adversarial network
{parent=Generative AI}
{title2=GAN}
{wiki}
Original paper: <GAN paper>{full}.
= GAN paper
{parent=Generative adversarial network}
https://proceedings.neurips.cc/paper_files/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf
= GAN MNIST hello world
{parent=Generative adversarial network}
The <GAN paper> itself does a bit of this, cool hello world:
* https://github.com/lyeoni/pytorch-mnist-GAN
= AI brittleness and robustness
{c}
{parent=Generative adversarial network}
= AI robustness
{c}
{parent=AI brittleness and robustness}
= AI brittleness
{c}
{parent=AI brittleness and robustness}
= Brittleness in AI
{synonym}
<Generative adversarial network> illustrates well <AI brittleness>. The input looks obvious for a human, but gets completely misclassified by a <deep learning> agent.
= Adversarial machine learning
{parent=AI brittleness}
{wiki}
= AI generated porn
{c}
{parent=Generative AI}
{wiki=Generative_artificial_intelligence}
This is going to be the most important application of <generative AI>. Especially if we ever achieve good <text-to-video>.
Image generators plus human ranking:
* https://pornpen.ai/ a bit too restrictive. Girl laying down. Girl sitting. Penis or no penis. But realtively good at it
* https://civitai.tv/[]. How to reach it: https://civitai.tv/tag/nun/2/
https://www.pornhub.com/view_video.php?viewkey=ph63c71351edece[]: Heavenly Bodies Part 1: Sister's Mary First Act. Pornhub title: "AI generated Hentai Story: Sexy Nun alternative World(Isekai) Stable Diffusion" Interesting concept, slide-narrated over visual novel. The question is how they managed to keep face consistency across images.
= Generative AI by modality
{parent=Generative AI}
= Image generation
{parent=Generative AI by modality}
{wiki}
= Face generation
{parent=Image generation}
= Face generator
{synonym}
Very useful for idiotic websites that require real photos!
* https://thispersondoesnotexist.com/ holy fuck, the images are so photorealistic, that <uncanny valley>[when there's a slight fail, it is really, really scary]
= Text-to-image generation
{parent=Image generation}
{wiki}
= Text-to-image model
{parent=Text-to-image generation}
{wiki}
= Text-to-image
{synonym}
* https://deepai.org/machine-learning-model/text2img
* https://openai.com/blog/dall-e/
= Open source text-to-image model
{parent=Text-to-image model}
Bibliography:
* https://www.edenai.co/post/top-free-image-generation-tools-apis-and-open-source-models
= ludicrains/deep-gaze
{c}
{parent=Open source text-to-image model}
https://github.com/lucidrains/deep-daze
This just works, but it is also so incredibly slow that it is useless (or at least the quality it reaches in the time we have patience to wait from), at least on any setup we've managed to try, including e.g. on an <Nvidia A10G> on a <g5.xlarge>. Running:
``
time imagine "a house in the forest"
``
would likely take hours to complete.
= runwayml/stable-diffusion
{c}
{parent=Open source text-to-image model}
https://github.com/runwayml/stable-diffusion
<Conda> install is a bit annoying, but gets the job done. The generation quality is very good.
Someone should package this better for end user "just works after Conda install" image generation, it is currently much more of a library setup.
Tested on <Amazon EC2> on a <g5.xlarge> machine, which has an <Nvidia A10G>, using the <AWS Deep Learning Base GPU AMI (Ubuntu 20.04)> image.
First install <Conda> as per <install Conda on Ubuntu>{full}, and then just follow the instructions from the README, notably the https://github.com/runwayml/stable-diffusion/tree/08ab4d326c96854026c4eb3454cd3b02109ee982#reference-sampling-script[Reference sampling script] section.
``
git clone https://github.com/runwayml/stable-diffusion
cd stable-diffusion/
git checkout 08ab4d326c96854026c4eb3454cd3b02109ee982
conda env create -f environment.yaml
conda activate ldm
mkdir -p models/ldm/stable-diffusion-v1/
wget -O models/ldm/stable-diffusion-v1/model.ckpt https://huggingface.co/CompVis/stable-diffusion-v-1-4-original/resolve/main/sd-v1-4.ckpt
python scripts/txt2img.py --prompt "a photograph of an astronaut riding a horse" --plms
``
This took about 2 minutes and generated 6 images under `outputs/txt2img-samples/samples`, includining an image `outputs/txt2img-samples/grid-0000.png` which is a grid montage containing all the six images in one:
\Image[https://raw.githubusercontent.com/cirosantilli/media/master/Runwayml_stable-diffusion_a-photograph-of-an-astronaut-riding-a-horse.png]
TODO how to change the number of images?
A quick attempt at removing their useless safety features (watermark and <NSFW> text filter) is:
``
diff --git a/scripts/txt2img.py b/scripts/txt2img.py
index 59c16a1..0b8ef25 100644
--- a/scripts/txt2img.py
+++ b/scripts/txt2img.py
@@ -87,10 +87,10 @@ def load_replacement(x):
def check_safety(x_image):
safety_checker_input = safety_feature_extractor(numpy_to_pil(x_image), return_tensors="pt")
x_checked_image, has_nsfw_concept = safety_checker(images=x_image, clip_input=safety_checker_input.pixel_values)
- assert x_checked_image.shape[0] == len(has_nsfw_concept)
- for i in range(len(has_nsfw_concept)):
- if has_nsfw_concept[i]:
- x_checked_image[i] = load_replacement(x_checked_image[i])
+ #assert x_checked_image.shape[0] == len(has_nsfw_concept)
+ #for i in range(len(has_nsfw_concept)):
+ # if has_nsfw_concept[i]:
+ # x_checked_image[i] = load_replacement(x_checked_image[i])
return x_checked_image, has_nsfw_concept
@@ -314,7 +314,7 @@ def main():
for x_sample in x_checked_image_torch:
x_sample = 255. * rearrange(x_sample.cpu().numpy(), 'c h w -> h w c')
img = Image.fromarray(x_sample.astype(np.uint8))
- img = put_watermark(img, wm_encoder)
+ # img = put_watermark(img, wm_encoder)
img.save(os.path.join(sample_path, f"{base_count:05}.png"))
base_count += 1
``
but that produced 4 black images and only two unfiltered ones. Also likely the lack of sexual training data makes its porn suck, and not in the good way.
= DeepFloyd IF
{c}
{parent=Open source text-to-image model}
https://github.com/deep-floyd/IF
= AI text generation
{parent=Generative AI by modality}
= Speech recognition
{parent=AI text generation}
{wiki}
<Open source software> reviews by <Ciro Santilli>:
* https://askubuntu.com/questions/24059/automatically-generate-subtitles-close-caption-from-a-video-using-speech-to-text/1522895#1522895
* https://askubuntu.com/questions/161515/speech-recognition-app-to-convert-mp3-voice-to-text/1499768#1499768
* https://unix.stackexchange.com/questions/256138/is-there-any-decent-speech-recognition-software-for-linux/613392#613392
reviewing mostly the following software:
* <OpenAi Whisper>
* <Vosk>
= Speech recognition software
{parent=Speech recognition}
= OpenAi Whisper
{c}
{parent=Speech recognition software}
{tag=OpenAI project}
= Vosk
{c}
{parent=Speech recognition software}
= Text-to-text model
{parent=AI text generation}
= Machine translation
{parent=Text-to-text model}
{wiki}
= Open source machine translation
{parent=Text-to-text model}
{wiki}
https://askubuntu.com/questions/380847/is-it-possible-to-translate-words-via-terminal/1309774#1309774
= OpenNMT
{c}
{parent=Open source machine translation}
= Argos Translate
{c}
{parent=OpenNMT}
{tag=CLI tool}
<OpenNMT> <CLI> front-end.
Hello world: https://askubuntu.com/questions/380847/is-it-possible-to-translate-words-via-terminal/1309774#1309774
= Large language model
{parent=Text-to-text model}
{wiki}
= LLM
{c}
{synonym}
{title2}
= LLM game
{c}
{parent=Large language model}
* 2023 https://vimalabs.github.io/ VIMA: General Robot Manipulation with Multimodal Prompts
= Stanford Smallville
{c}
{parent=LLM game}
{title2=2023}
https://github.com/joonspk-research/generative_agents
Published as: https://arxiv.org/pdf/2304.03442.pdf Generative Agents: Interactive Simulacra of Human Behavior by Park et al.
\Video[https://www.youtube.com/watch?v=nWBEMjAoA14]
{title=AI Agents Behaving Like Humans by Prompt Engineering (2023)}
= Open source LLM
{parent=Large language model}
{tag=Open source software}
= Ollama
{c}
{parent=Open source LLM}
https://github.com/jmorganca/ollama
Highly automated wrapper for various <open source> <LLMs>.
``
curl https://ollama.ai/install.sh | sh
ollama run llama2
``
And bang, a download later, you get a prompt. On <Ciro Santilli's Hardware/p14s> it runs on <CPU> and generates a few tokens at a time, which is quite usable for a quick interactive play.
As mentioned at https://github.com/jmorganca/ollama/blob/0174665d0e7dcdd8c60390ab2dd07155ef84eb3f/docs/faq.md the downloads to under `/usr/share/ollama/.ollama/models/` and <ncdu> tells me:
``
--- /usr/share/ollama ----------------------------------
3.6 GiB [###########################] /.ollama
4.0 KiB [ ] .bashrc
4.0 KiB [ ] .profile
4.0 KiB [ ] .bash_logout
``
We can also do it non-interactively with:
``
/bin/time ollama run llama2 'What is quantum field theory?'
``
which gave me:
``
0.13user 0.17system 2:06.32elapsed 0%CPU (0avgtext+0avgdata 17280maxresident)k
0inputs+0outputs (0major+2203minor)pagefaults 0swaps
``
but note that there is a random seed that affects each run by default.
Some other quick benchmarks from <Amazon EC2 GPU>, on <Nvidia T4>:
``
0.07user 0.05system 0:16.91elapsed 0%CPU (0avgtext+0avgdata 16896maxresident)k
0inputs+0outputs (0major+1960minor)pagefaults 0swaps
``
On <Nvidia A10G>:
``
0.03user 0.05system 0:09.59elapsed 0%CPU (0avgtext+0avgdata 17312maxresident)k
8inputs+0outputs (1major+1934minor)pagefaults 0swaps
``
So it's not too bad, a small article in 10s.
It tends to babble quite a lot by default, but eventually decides to stop.
TODO is it possible to make it deterministic on the CLI? There is a "seed" parameter somewhere: https://github.com/jmorganca/ollama/blob/31f0551dab9a10412ec6af804445e02a70a25fc2/docs/modelfile.md#parameter
= AI sound generation
{c}
{parent=Generative AI by modality}
= AI music generation
{c}
{parent=AI sound generation}
= Soundraw
{c}
{parent=AI music generation}
= Speech synthesis
{parent=AI sound generation}
{wiki}
= Text-to-speech
{parent=Speech synthesis}
{title2=TTS}
= Comparison of text-to-speech software
{parent=Text-to-speech}
By <Ciro Santilli>:
* https://askubuntu.com/questions/53896/natural-sounding-text-to-speech/1524003#1524003
* https://askubuntu.com/questions/501910/how-to-text-to-speech-output-using-command-line/1522885#1522885
Other threads:
* https://www.reddit.com/r/MachineLearning/comments/12kjof5/d_what_is_the_best_open_source_text_to_speech/
* https://www.reddit.com/r/software/comments/176asxr/best_open_source_texttospeech_available/
* https://www.reddit.com/r/opensource/comments/19cguhx/i_am_looking_for_tts_software/
* https://www.reddit.com/r/LocalLLaMA/comments/1dtzfte/best_tts_model_right_now_that_i_can_self_host/
= Text-to-video
{parent=Generative AI by modality}
This was the Holy Grail as of 2023, when <text-to-image> started to really take off, but text-to-video was miles behind.
* 2024-02-15: Sora by <OpenAI>
* https://techcrunch.com/2024/02/15/openais-newest-model-can-generate-videos-and-they-look-decent/
= AI research entity
{c}
{parent=Artificial intelligence}
= AI researcher
{c}
{parent=AI research entity}
{tag=Computer scientist}
= Yann LeCun
{c}
{parent=AI researcher}
{wiki}
= Yohei Nakajima
{c}
{parent=AI researcher}
{wiki}
He does lots of little experiments which is cool.
* https://twitter.com/yoheinakajima
* https://yoheinakajima.com/
No <research papers> but has citations: https://www.yohei.me/publications[] which is cool.
= AI alignment
{c}
{parent=Artificial intelligence}
{wiki}
As highlighted e.g. at <Human compatible by Stuart J. Russell (2019)>, this AI alignment intrinsically linked to the idea of <utility> in <economy>.
= Reward modeling
{parent=AI alignment}
See e.g.: <Human compatible>
* https://deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84
= AI safety
{c}
{parent=AI alignment}
Basically ensuring that good <AI alignment> allows us to survive the singularity.
= Path to AGI
{c}
{parent=Artificial intelligence}
There are two main ways to try and reach AGI:
* <AI training robot>: expensive, slow, but realistic world
* <AI training game>: faster, less expensive, but possibly non-realistic-enough realistic
Which one of them to take is of of the most important technological questions of humanity according to <Ciro Santilli>
There is also an intermediate area of research/engineering where people try to first simulate the robot and its world realistically, use the simulation for training, and then transfer the simulated training to real robots, see e.g.: <realistic robotics simulation>.
= AI training robot
{c}
{parent=Path to AGI}
{tag=Robotics}
It doesn't need to be a bipedal robot. We can let <Boston Dynamics> worry about that walking balance crap.
It could very well instead be on wheels like <arm on tracks>.
Or something more like a factory with arms on rails as per:
* <Transcendence (2014)>
* https://youtu.be/MtVvzJIhTmc?t=112 from <video Rotrics DexArm is available NOW! by Rotrics (2020)> where they have a sliding rail
An arm with a hand and a camera are however indispensable of course!
\Image[https://raw.githubusercontent.com/juniorrojas/algovivo/9e00622f558fd137f8e7b2ab50d085e43e7206cf/media/locomotion.gif]
{title=Algovivo demo}
{description=https://github.com/juniorrojas/algovivo[]: A JavaScript + WebAssembly implementation of an energy-based formulation for soft-bodied virtual creatures.}
= AI training robot in a room
{parent=AI training robot}
{tag=Robotics}
<Ciro Santilli> wonders how far AI could go from a room with a bank account an <Internet> connection.
It would have to understand that it must keep its bank account high to buy power.
And it would start to learn about the world and interact with it to get more money.
Likely it would become a <hacker> and steal a bunch, that's likely the easiest appraoch.
In that scenario, Internet bandwidth would likely be its most precious resources, as that is how it would interact with the world to learn from it and make money.
Compute power and storage would come next as resources.
And of course, once it got to <cloud computing>, which might be immmediately and thus invalidate this experient, things would just go nuts more and more.
= AI training robot dataset
{c}
{parent=AI training robot}
= Open X-Embodiment
{c}
{parent=AI training robot dataset}
Terrible name, but very interesting dataset:
* https://robotics-transformer-x.github.io/
* https://github.com/google-deepmind/open_x_embodiment
TODO: any <AI training robot simulation>[simulation] integration to it?
= AI training robot simulation
{c}
{parent=AI training robot}
= DeepMind RoboCat
{c}
{parent=AI training robot}
{tag=DeepMind project}
{title2=2023}
https://www.deepmind.com/blog/robocat-a-self-improving-robotic-agent
\Video[https://www.youtube.com/watch?v=535W4Pih1C0]
{title=RoboCat by Google DeepMind (2023)}
= Supercomputer controlling a robot
{parent=AI training robot}
Has anybody done this seriously? Given a supercomputer, what amazing human-like robot behaviour we can achieve?
= AI game
{c}
{parent=Path to AGI}
{tag=Serious game}
= AGI via simulation
{synonym}
{title2}
= AI training game
{c}
{synonym}
\Video[https://youtu.be/Y2d1AU7_JvM?t=278]
{title=Our Final Invention - Artificial General Intelligence by Sciencephile the AI (2023)}
{description=AGI via simulation section.}
<Ciro Santilli> defines an "AI game" as:
\Q[a game that is used to train AI, in particular one that was designed with this use case in mind, and usually with the intent of achieving <AGI>, i.e. the game has to somehow represent a digital world with enough analogy to the real world so that the AGI algorithms developed there could also work on the real world]
Most games played by AI historically so far as of 2020 have been AI for games designed for humans: <Human game used for AI training>.
<Ciro Santilli> took a stab at an AI game: <Ciro's 2D reinforcement learning games>, but he didn't sink too much/enough into that project.
A closely related and often overlapping category of simulations are <artificial life> simulations.
Bibliography:
* https://www.youtube.com/@aiwarehouse
* Neural MMO
* https://openai.com/index/neural-mmo/
* https://github.com/openai/neural-mmo
\Video[https://www.youtube.com/watch?v=wwTOFYgtAWg]
{title=Joseph Suarez Thesis Defense - Neural MMO}
= Human game used for AI training
{parent=AI game}
This section is about games initially designed for humans, but which ended up being used in AI development as well, e.g.:
* <board games> such as <chess> and <go (game)>
* <video games> such as <Minecraft> or old <video game console> games
= Using Minecraft for AI training
{c}
{parent=Human game used for AI training}
{tag=Minecraft}
* https://openai.com/blog/openai-acquires-global-illumination
= MineDojo
{c}
{parent=Using Minecraft for AI training}
https://github.com/MineDojo
= Game AI
{parent=AI game}
{wiki=Artificial_intelligence_in_video_games}
Game AI is an <artificial intelligence> that plays a certain game.
It can be either developed for <serious game>[serious] purposes (e.g. <AGI> development in <AI games>), or to make games for interesting for humans.
= Game AI research
{parent=Game AI}
= Game AI research lab
{parent=Game AI research}
The <quora> question: https://www.quora.com/Are-there-any-PhD-programs-in-training-an-AI-system-to-play-computer-games-Like-the-work-DeepMind-do-combining-Reinforcement-Learning-with-Deep-Learning-so-the-AI-can-play-Atari-games
https://gameresearch.leiden.edu/
A good way to find labs is to go down the issues section of projects such as:
* https://github.com/deepmind/lab2d/issues?q=
* https://github.com/deepmind/lab/issues?q=
and then stalk them to see where they are doing their PhDs.
= QMUL Game AI Research Group
{c}
{parent=Game AI research lab}
{tag=QMUL research group}
<Principal investigator>: Simon M. Lucas.
= QMUL GAIG
{c}
{synonym}
{title2}
= Leiden Game Research Lab
{c}
{parent=Game AI research lab}
* https://gameresearch.leiden.edu/
* https://twitter.com/GRL_Liacs
= Game AI by game genre
{parent=Game AI}
= Fighting game AI
{parent=Game AI by game genre}
\Video[https://www.youtube.com/watch?v=qZItwBB0T2Y]
{title=AI in Melee is broken by Melee Moments (2023)}
= Game AI competition
{parent=Game AI}
https://webots.cloud/competition
Lists:
* https://www.gocoder.one/blog/ai-game-competitions-list/ Good list of interest.
* https://codecombat.com/
= Battlecode