Difference between revisions of "CG4 – Questions α"

From arguably.io
Jump to navigation Jump to search
 
(33 intermediate revisions by the same user not shown)
Line 5: Line 5:
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">OpenAI - ChatGPT4.</SPAN>''''' <BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">OpenAI - ChatGPT4.</SPAN>''''' <BR />
In what follows we attempt to address several basic questions about the onrushing progress with the current focus of artificial intelligence. There are several competing actors in this space. These include OpenAI, DeepMind, Anthropic, and Cohere. A number of other competitors are active in the artificial intelligence market place. But for purposes of brevity and because of the overlap we will limit focus on ChatGPT4 (CG4). Further, we focus on several salient questions that that raise questions of safety, risk and prospects. <br />
In what follows we attempt to address several basic questions about the onrushing progress with the current focus of artificial intelligence. There are several competing actors in this space. These include OpenAI, DeepMind, Anthropic, and Cohere. A number of other competitors are active in the artificial intelligence market place. But for purposes of brevity and because of the overlap we will limit focus on ChatGPT4 (CG4). Further, we focus on several salient questions that that raise questions of safety, risk and prospects. <br />
<!--  ********************************************************************************* -->
<!--  2023.10.12@22:00 linking is somewhat simpler than originally thought proceed with -->
<!--  third level (and below) links - require both outbound and inbound (return) data  -->
<!--  structure so as to simplify, facilitate level of abstraction logical consistency  -->
<!--  MUST however verify that third level links facilitate pointing directly to        -->
<!--  specific third level sections/headings                                            -->
<!--  *********************************************************************************  -->
<!--  *********************************************************************************  -->
<!--  2023.10.13 finally moving beyond tipping point - continue including sub pages      -->
<!--  then include subpage references to point to specific premises or arguments        -->
<!--  also - regularize naming convention to correct for inconsistent page naming        -->
<!--  page names will conform to original naming pathway:                                -->
<!--  arguably.io/Darwin2049/                                                            -->
<!--  *********************************************************************************  -->
<!--  *********************************************************************************  -->
<!--  20231024:1330 - Continue using this baseline document but tighten up, include      -->
<!--  links as needed back to higher level pages; also rename/move to make name conform  -->
<!--  to standard usage                                                                  -->
<!--  *********************************************************************************  -->


Specifically, risks that involve or are:
Specifically, risks that involve or are:
* '''''<SPAN STYLE="COLOR:BLUE">Interfacing/Access:</SPAN>''''' <Span Style="COLOR:WHITE; BACKGROUND:TEAL">how will different groups interact with, respond to and be affected by it; might access modalities available to one group have positive or negative implications for other groups; </SPAN> <BR />'''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW">[https://arguably.io/User:Darwin2049/chatgpt4_omega_interface Interfacing - Synthesis.] </SPAN>'''''
* '''''<SPAN STYLE="COLOR:BLUE">Political/Competitive:</SPAN>''''' <Span Style="COLOR:WHITE; BACKGROUND:TEAL">how might different groups or actors gain or lose relative advantage; also, how might it be used as a tool of control;</SPAN>  '''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW"><BR />[https://arguably.io/Darwin2049/chatgpt4_omega_political Political - Synthesis.]</SPAN>'''''


* '''''<SPAN STYLE="COLOR:BLUE">Evolutionary/Stratification:</SPAN>''''' <Span Style="COLOR:WHITE; BACKGROUND:TEAL">might new classifications of social categories emerge; were phenotypical bifurcations to emerge would or how would the manifest themselves;</SPAN><BR /> '''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW">[https://arguably.io/Darwin2049/chatgpt4_omega_evolutionary Evolutionary - Synthesis.]</SPAN>'''''
* '''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW">[https://arguably.io/User:Darwin2049/chatgpt4_omega_interface Interfacing - Synthesis.] </SPAN>'''''<Span Style="COLOR:WHITE; BACKGROUND:TEAL">how will different groups interact with, respond to and be affected by it; might access modalities available to one group have positive or negative implications for other groups;</SPAN>  


* '''''<SPAN STYLE="COLOR:BLUE">Epistemological/Ethical relativism:</SPAN>''''' <Span Style="COLOR:WHITE; BACKGROUND:TEAL">how to reconcile ethical issues within a society, between societies; more specifically, might it provide solutions or results that are acceptable to the one group but unacceptable to the other group;</SPAN><BR />'''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW">[https://arguably.io/Https://arguably.io/User:Darwin2049/chatgpt4_omega_epistemological Epistemological - Synthesis]</SPAN>''''' <BR />
* '''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW">[https://arguably.io/Darwin2049/chatgpt4_omega_political Political/Competitive- Synthesis.]</SPAN>''''' <Span Style="COLOR:WHITE; BACKGROUND:TEAL">how might different groups or actors gain or lose relative advantage; also, how might it be used as a tool of control;</SPAN> 
 
*'''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW">[https://arguably.io/Darwin2049/chatgpt4_omega_evolutionary Evolutionary/Stratification - Synthesis.]</SPAN>''''' <Span Style="COLOR:WHITE; BACKGROUND:TEAL">might new classifications of social categories emerge; were phenotypical bifurcations to emerge would or how would the manifest themselves;</SPAN>
 
*'''''<Span Style="COLOR:BLUE; BACKGROUND:YELLOW">[https://arguably.io/Https://arguably.io/User:Darwin2049/chatgpt4_omega_epistemological Epistemological - Synthesis]</SPAN>'''''<Span Style="COLOR:WHITE; BACKGROUND:TEAL">how to reconcile ethical issues within a society, between societies; more specifically, might it provide solutions or results that are acceptable to the one group but unacceptable to the other group;</SPAN><BR /><BR />


<!--  recent attention has been drawn to the evidence that a LLM such as CG4 may begin to exhibit  '''''<SPAN STYLE="COLOR:BLUE">[https://www.marktechpost.com/2023/08/13/this-ai-research-from-deepmind-aims-at-reducing-sycophancy-in-large-language-models-llms-using-simple-synthetic-data/ sycophancy]</SPAN>''''' in its interactions with a user; even if the value stance of the user can be considered as an equivocation. <BR />
<!--  recent attention has been drawn to the evidence that a LLM such as CG4 may begin to exhibit  '''''<SPAN STYLE="COLOR:BLUE">[https://www.marktechpost.com/2023/08/13/this-ai-research-from-deepmind-aims-at-reducing-sycophancy-in-large-language-models-llms-using-simple-synthetic-data/ sycophancy]</SPAN>''''' in its interactions with a user; even if the value stance of the user can be considered as an equivocation. <BR />
Line 20: Line 41:
-->
-->
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Synthesis.</SPAN>''''' <BR /> Responding to these questions calls for some baseline information and insights about the issues that this new technology entails. We propose to suggest we look <BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Synthesis.</SPAN>''''' <BR /> Responding to these questions calls for some baseline information and insights about the issues that this new technology entails. We propose to suggest we look <BR />
* '''''<SPAN STYLE="COLOR:BLUE">Terms </SPAN>'''''are included to help clarify crucial elements and contextualize CG4;  
* '''''<SPAN STYLE="COLOR:BLUE">Terms</SPAN>''''' are included to help clarify crucial elements and contextualize CG4;  
* '''''<SPAN STYLE="COLOR:BLUE">Sentiment </SPAN>'''''is being expressed about it by knowledgeable observers;
* '''''<SPAN STYLE="COLOR:BLUE">Sentiment</SPAN>''''' is being expressed about it by knowledgeable observers;
* '''''<SPAN STYLE="COLOR:BLUE">Theory of Operation </SPAN>'''''of technology paradigm used to produce its results;
* '''''<SPAN STYLE="COLOR:BLUE">Theory of Operation</SPAN>''''' of technology paradigm used to produce its results;
*'''''<SPAN STYLE="COLOR:BLUE">Risks </SPAN>''''' our approach has been to present a few commonly occurring risks, whether inherent or malicious as well as some theoretical risks that might emerge;
*'''''<SPAN STYLE="COLOR:BLUE">Risks</SPAN>''''' our approach has been to present a few commonly occurring risks, whether inherent or malicious as well as some theoretical risks that might emerge;
*'''''<SPAN STYLE="COLOR:BLUE">Insights</SPAN>''''' are offered to serve as takeoff points for subsequent discussion;
*'''''<SPAN STYLE="COLOR:BLUE">Insights</SPAN>''''' are offered to serve as takeoff points for subsequent discussion;
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Terms and Basic Concepts.</SPAN>'''''<BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Terms and Basic Concepts.</SPAN>'''''<BR />
CG4 has demonstrated capabilities that represent a significant leap forward in overall capability and versatility beyond what has gone before. In order to attempt an assessment prospective risks suggests reviewing recent impressions at a later date as more reporting and insights have come to light. CG4 has already demonstrated that new and unforeseen risks are tangible; in some instances novel and unforeseen capabilities have been reported. It is with this in mind that we attempt here to offer an initial profile or picture of the risks that we should expect to see with its broader use. By way of of addressing this increasingly expanding topic we offer our summary along the following plan of discourse:<BR />
CG4 has demonstrated capabilities that represent a significant leap forward in overall capability and versatility beyond what has gone before. In order to attempt an assessment prospective risks suggests reviewing recent impressions at a later date as more reporting and insights have come to light. CG4 has already demonstrated that new and unforeseen risks are tangible; in some instances novel and unforeseen capabilities have been reported. It is with this in mind that we attempt here to offer an initial profile or picture of the risks that we should expect to see with its broader use. By way of of addressing this increasingly expanding topic we offer our summary along the following plan of discourse:<BR />


'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Overview and Impressions. </SPAN>'''''  
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Overview and Impressions.</SPAN>''''' <BR />
* what has emerged so far; some initial impressions are listed;  
* what has emerged so far; some initial impressions are listed;  
* next are some caveats that have been derived from these impressions;
* next are some caveats that have been derived from these impressions;
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Theory of Operation. </SPAN>''''' for purposes of brevity a thumb nail sketch of how CG4 performs its actions is presented;
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Theory of Operation.</SPAN>''''' <BR /> For purposes of brevity a thumb nail sketch of how CG4 performs its actions is presented;
* included are some high level diagrams
* included are some high level diagrams
** also included are links to several explanatory sources; these sources include articles and video content;
* also links to several explanatory sources; these sources include articles and video content;
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Risks.</SPAN>''''' Our thesis identifies three primary types of risks; these include:  
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Risks.</SPAN>''''' <BR />Our thesis identifies three primary types of risks; these include: <BR />
* '''''<SPAN STYLE="COLOR:RED">systemic </SPAN>''''' these are inherent as a natural process of ongoing technological, sociological advance;
* '''''<SPAN STYLE="COLOR:RED">systemic </SPAN>''''' these are inherent as a natural process of ongoing technological, sociological advance;
* '''''<SPAN STYLE="COLOR:RED">malicious:</SPAN>''''' who known actors categories are; how might they use this new capability;
* '''''<SPAN STYLE="COLOR:RED">malicious:</SPAN>''''' who known actors categories are; how might they use this new capability;
* '''''<SPAN STYLE="COLOR:RED">theoretical:</SPAN>''''' or possible new uses that might heretofore not been possible;
* '''''<SPAN STYLE="COLOR:RED">theoretical:</SPAN>''''' or possible new uses that might heretofore not been possible;
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Notes, References.</SPAN>''''' We list a few notable portrayals of qualitative technological or scientific leaps; <BR /><BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Notes, References.</SPAN>''''' <BR />We list a few notable portrayals of qualitative technological or scientific leaps; <BR /><BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Sentiment and Impressions, An Overview.</SPAN>''''' Some observers early on were variously favorable and voiced possible moderate caution while others were guarded or expressed caution to serious fear.<BR />
 
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Optimism and Interest. </SPAN>'''''  
<!--
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Sentiment and Impressions, An Overview.</SPAN>''''' <BR />Some observers early on were variously favorable and voiced possible moderate caution while others were guarded or expressed caution to serious fear.<BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Optimism and Interest.</SPAN>'''''  
* '''''<SPAN STYLE="COLOR:BLUE">Dr. Jordan Peterson.</SPAN>''''' During his discussion with [https://www.youtube.com/watch?v=S_E4t7tWHUY&t=1726s&ab_channel=JordanBPeterson Brian Roemmele] many salient topics covered. Of note some germane capabilities such as:  
* '''''<SPAN STYLE="COLOR:BLUE">Dr. Jordan Peterson.</SPAN>''''' During his discussion with [https://www.youtube.com/watch?v=S_E4t7tWHUY&t=1726s&ab_channel=JordanBPeterson Brian Roemmele] many salient topics covered. Of note some germane capabilities such as:  
** super prompts; Roemmele revealed that he was responsible for directing CG4 to focus specifically upon the topic that he wished to address and omit the usual disclaimers and preamble information that CG4 is known to issue in response to a query.
** super prompts; Roemmele revealed that he was responsible for directing CG4 to focus specifically upon the topic that he wished to address and omit the usual disclaimers and preamble information that CG4 is known to issue in response to a query.
Line 72: Line 95:
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Caution to Alarm.</SPAN>'''''
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Caution to Alarm.</SPAN>'''''


* '''''<SPAN STYLE="COLOR:BLUE"> Elizer Yudkowski.</SPAN>''''' Yudkowski is a polymath who has been following the development of LLM's for several years. The release of CG4 has resulted in his frequent presence speaking out publicly about the risks that humanity may face extinction should a artificial general intelligence emerge from current research.  
* '''''<SPAN STYLE="COLOR:BLUE">Elizer Yudkowski.</SPAN>''''' Yudkowski is a polymath who has been following the development of LLM's for several years. The release of CG4 has resulted in his frequent presence speaking out publicly about the risks that humanity may face extinction should a artificial general intelligence emerge from current research.  
** During an interview with Lex Fridman expresses fear that the field of artificial intelligence is progressing so rapidly that humans could find themselves facing [https://www.youtube.com/watch?v=jW2ihBRzLxc&ab_channel=LexClips the prospect of an extinction event]. His concern is that it could evolve with such rapidity that we as humans would be unable to assess its actual capacities and as such be unable to make reliable determinations as to whether it was presenting its authentic self or if it was presenting a dissembling version of itself. '''''<SPAN STYLE="COLOR:BLUE"> His position is that artificial intelligence will kill us all. </SPAN>'''''  
** During an interview with Lex Fridman expresses fear that the field of artificial intelligence is progressing so rapidly that humans could find themselves facing [https://www.youtube.com/watch?v=jW2ihBRzLxc&ab_channel=LexClips the prospect of an extinction event]. His concern is that it could evolve with such rapidity that we as humans would be unable to assess its actual capacities and as such be unable to make reliable determinations as to whether it was presenting its authentic self or if it was presenting a dissembling version of itself. '''''<SPAN STYLE="COLOR:BLUE"> His position is that artificial intelligence will kill us all. </SPAN>'''''  
** He has made several [https://www.youtube.com/watch?v=Yd0yQ9yxSYY&ab_channel=TED presentations regarding the potential down sides and risk] that artificial intelligence might cause. His specific concerns are that a form of intelligence could crystallize as a result of normal operation in one or another of the current popular systems. These generally are considered to be the large language models such as Chat GPT4.  
** He has made several [https://www.youtube.com/watch?v=Yd0yQ9yxSYY&ab_channel=TED presentations regarding the potential down sides and risk] that artificial intelligence might cause. His specific concerns are that a form of intelligence could crystallize as a result of normal operation in one or another of the current popular systems. These generally are considered to be the large language models such as Chat GPT4.  
** The DeepMind system is one that is believed to be comparably sophisticated. The Meta variant known as Llama is also of sufficient sophistication that an unexpected shift in capability as a result of some unforeseen emergent capability could escape the control of its human operators.
** The DeepMind system is one that is believed to be comparably sophisticated. The Meta variant known as Llama is also of sufficient sophistication that an unexpected shift in capability as a result of some unforeseen emergent capability could escape the control of its human operators.
**A recent development at the Meta artificial intelligence lab resulted in pausing the activities of an emergent capability. This capability seemed to suggest that two instances of the ai were exchanging messages using a private language that they had just developed.
**A recent development at the Meta artificial intelligence lab resulted in pausing the activities of an emergent capability. This capability seemed to suggest that two instances of the AI were exchanging messages using a private language that they had just developed.
* '''''<SPAN STYLE="COLOR:BLUE"> Elon Musk.</SPAN>''''' Musk was an early investor in OpenAI when it was still as private startup.  
* '''''<SPAN STYLE="COLOR:BLUE">Elon Musk.</SPAN>''''' Musk was an early investor in OpenAI when it was still as private startup.  
** He has tracked development of artificial intelligence in both the US as well as DeepMind in the UK. He has recently enlisted 1100 other prominent personalities and researchers to sign a petition to presented to the US Congress regarding the risks and dangers of artificial intelligence.  
** He has tracked development of artificial intelligence in both the US as well as DeepMind in the UK. He has recently enlisted 1100 other prominent personalities and researchers to sign a petition to presented to the US Congress regarding the risks and dangers of artificial intelligence.  
** His observation is that regulation has always happened by government after there has been a major adverse or tragic development. In the case of artificial intelligence his position is that the pace of development is such that a moment in time is quickly approaching when a singularity moment arrives and passes.  
** His observation is that regulation has always happened by government after there has been a major adverse or tragic development. In the case of artificial intelligence his position is that the pace of development is such that a moment in time is quickly approaching when a singularity moment arrives and passes.  
** It is at that moment that '''''<SPAN STYLE="COLOR:RED">[https://www.youtube.com/watch?v=a2ZBEC16yH4&ab_channel=FoxNews humanity unwittingly unleashing an AGI is a very serous risk]. </SPAN>'''''may discover that it has '''''<SPAN STYLE="COLOR:BLUE"> unleashed a capability that it is unable to control.</SPAN>'''''
** It is at that moment that '''''<SPAN STYLE="COLOR:RED">[https://www.youtube.com/watch?v=a2ZBEC16yH4&ab_channel=FoxNews humanity unwittingly unleashing an AGI is a very serous risk].</SPAN>''''' may discover that it has '''''<SPAN STYLE="COLOR:BLUE"> unleashed a capability that it is unable to control.</SPAN>'''''
* '''''<SPAN STYLE="COLOR:BLUE"> Gordon Chang.</SPAN>''''' Says that the US must forge ahead with all possible speed and effort to maintain its lead against the PRC in the area of artificial intelligence. '''''<SPAN STYLE="COLOR:BLUE"> Chang asserts that the [https://www.youtube.com/watch?v=_bUwiqQFRxc&ab_channel=FoxBusiness PRC culture, values and objectives are adversarial] </SPAN>''''' and that they will use whatever means necessary to achieve a commanding edge and achieve dominance with this capability in all of its geopolitical endeavors. His earlier public statements have been very explicit that '''''<SPAN STYLE="COLOR:BLUE">the West faces an existential crisis from the CCP and that it will use any and all means, including artificial intelligence to gain primacy. </SPAN>'''''
* '''''<SPAN STYLE="COLOR:BLUE">Gordon Chang.</SPAN>''''' Says that the US must forge ahead with all possible speed and effort to maintain its lead against the PRC in the area of artificial intelligence. '''''<SPAN STYLE="COLOR:BLUE"> Chang asserts that the [https://www.youtube.com/watch?v=_bUwiqQFRxc&ab_channel=FoxBusiness PRC culture, values and objectives are adversarial] </SPAN>''''' and that they will use whatever means necessary to achieve a commanding edge and achieve dominance with this capability in all of its geopolitical endeavors. His earlier public statements have been very explicit that '''''<SPAN STYLE="COLOR:BLUE">the West faces an existential crisis from the CCP and that it will use any and all means, including artificial intelligence to gain primacy. </SPAN>'''''
'''''<SPAN STYLE="COLOR:BLUE">Indirect Alarm. </SPAN>'''''
'''''<SPAN STYLE="COLOR:BLUE">Indirect Alarm.</SPAN>'''''


In several cases highly knowledgeable individuals have expressed concern about how the PRC has been positioning itself to become '''''the world leader in artificial intelligence'''''. Their backgrounds range from journalism, diplomacy, the military and Big Tech.<BR />
In several cases highly knowledgeable individuals have expressed concern about how the PRC has been positioning itself to become '''''the world leader in artificial intelligence'''''. Their backgrounds range from journalism, diplomacy, the military and Big Tech.<BR />
Line 92: Line 115:
** Recent pronouncements have suggested that advances in artificial intelligence can be co-operative rather than adversarial and confrontational. Events in the diplomatic sphere suggest otherwise.
** Recent pronouncements have suggested that advances in artificial intelligence can be co-operative rather than adversarial and confrontational. Events in the diplomatic sphere suggest otherwise.
** His efforts provide the gestation environment needed to launch potential billion dollar major concerns.  
** His efforts provide the gestation environment needed to launch potential billion dollar major concerns.  
* '''''<SPAN STYLE="COLOR:BLUE">Bill Gertz.</SPAN>''''' Bill Gertz is an author who has published several books that focus on technology and national security. A recent book titled "Deceiving the Sky" warns of a possible preemptive action on the part of the PRC. '''''<SPAN STYLE="COLOR:BLUE">His work details how the CCP has launched a detailed, highly focused set of actions that will enable it to marginalize and neutralize the current "hegemon", i.e. the USA.
* '''''<SPAN STYLE="COLOR:BLUE">Bill Gertz.</SPAN>''''' [https://www.google.com/search?q=bill+gertz+deceiving+the+sky&sca_esv=572370827&tbm=vid&sxsrf=AM9HkKnnrAQzV_Mf7Nrruz6MkYmk6oJJ9g:1696977855714&source=lnt&tbs=dur:l&sa=X&ved=2ahUKEwip-p-Sx-yBAxUvHjQIHaAtDgAQpwV6BAgUEAk&biw=1483&bih=760&dpr=2.25 Bill Gertz] is an author who has published several books that focus on technology and national security. A recent book titled "Deceiving the Sky" warns of a possible preemptive action on the part of the PRC. '''''<SPAN STYLE="COLOR:BLUE">His work details how the CCP has launched a detailed, highly focused set of actions that will enable it to marginalize and neutralize the current "hegemon", i.e. the USA.</SPAN>'''''  
  </SPAN>'''''  
** He posits that the US could find itself at a serious disadvantage in a contest with the PLA and the PLAN.  
** He posits that the US could find itself at a serious disadvantage in a contest with the PLA and the PLAN.  
** By way of clarification he points out that the leadership of the CCP has been building out an increasingly sophisticated military which it intends to use to modify, by force if necessary the global geopolitical order.
** By way of clarification he points out that the leadership of the CCP has been building out an increasingly sophisticated military which it intends to use to modify, by force if necessary the global geopolitical order.
** Additionally other references such as that explicitly expressed by [https://www.youtube.com/watch?v=c8q6P_HfIBE&ab_channel=GlobalNews President Vlad Putin] of the Russian Federation state clearly that control of artificial intelligence will be a make or break development for whatever nation-state reaches the high ground first.
** Additionally other references such as that explicitly expressed by [https://www.youtube.com/watch?v=c8q6P_HfIBE&ab_channel=GlobalNews President Vlad Putin] of the Russian Federation state clearly that control of artificial intelligence will be a make or break development for whatever nation-state reaches the high ground first.
* '''''<SPAN STYLE="COLOR:BLUE">Dr. Graham Allison.</SPAN>''''' Dr. Allison is a professor at Harvard University. A recent work that he published in 2018 is titled Destined for War. Dr Allison places front and center the reality that only a minority of power transitions have been peaceful. '''''<SPAN STYLE="COLOR:BLUE">Of sixteen power transitions fourteen have been resolved through war.</SPAN>'''''
* '''''<SPAN STYLE="COLOR:BLUE">Dr. Graham Allison.</SPAN>''''' Dr. Allison is a professor at Harvard University. A recent work that he published in 2018 is titled [https://www.youtube.com/watch?v=hDZAVNPbtBg&ab_channel=TalksatGoogle Destined for War]. Dr Allison places front and center the reality that only a minority of power transitions have been peaceful. '''''<SPAN STYLE="COLOR:BLUE">Of sixteen power transitions fourteen have been resolved through war.</SPAN>'''''
** In this volume he develops the position that there have been sixteen historical instances where a new comer power challenged the reigning hegemonic power with the intent of displacing it.   
** In this volume he develops the position that there have been sixteen historical instances where a new comer power challenged the reigning hegemonic power with the intent of displacing it.   
** Of those sixteen only two were transitioned in a peaceful manner. The rest were accomplished by armed conflict.
** Of those sixteen only two were transitioned in a peaceful manner. The rest were accomplished by armed conflict.
Line 105: Line 127:
*** a) the economy of the PRC will not only already be larger than that of the USA but will also  
*** a) the economy of the PRC will not only already be larger than that of the USA but will also  
*** b) be able to wield greater political and geopolitical influence world wide;
*** b) be able to wield greater political and geopolitical influence world wide;
*** c) and that during this period of time the risks of armed conflict will reach a point of maximum risk.
*** c) and that during this period of time the risks of armed conflict will reach a point of maximum risk. -->
<!-- image of prof allison warning of hegemon-challenger dynamics -->
<!-- image of Prof Allison warning of hegemon-challenger dynamics -->
* '''''<SPAN STYLE="COLOR:BLUE"> Amb. (ret.) Dr. Michael Pillsbury.</SPAN>''''' Deep, lucid insight into CCP, PRC doctrine, philosophy, world view (Mandarin fluent).
<!--
* '''''<SPAN STYLE="COLOR:BLUE">Amb. (ret.) Dr. Michael Pillsbury.</SPAN>''''' Deep, lucid insight into CCP, PRC doctrine, philosophy, world view (Mandarin fluent).
** Dr. Pillsbury details how '''''<SPAN STYLE="COLOR:BLUE">the CCP has mobilized the entire population of the PRC to prepare to seize the high ground in science and technology - through any means necessary.  </SPAN>'''''
** Dr. Pillsbury details how '''''<SPAN STYLE="COLOR:BLUE">the CCP has mobilized the entire population of the PRC to prepare to seize the high ground in science and technology - through any means necessary.  </SPAN>'''''
** His publication Hundred Year Marathon goes into great detail how a succession of CCP leadership has been gradually repositioning the PRC to be a major global competitor. He makes very clear that the PRC is a command economy comparable to how the former USSR functioned.  
** [https://www.youtube.com/watch?v=rio_oDWbr4s&ab_channel=TheKevinRobertsShowHis publication Hundred Year Marathon] goes into great detail how a succession of CCP leadership has been gradually repositioning the PRC to be a major global competitor. He makes very clear that the PRC is a command economy comparable to how the former USSR functioned.  
**He clarifies how the CCP leadership has focused its thinking to position itself to displace what it views as the "reigning hegemon - the USA". A study of this work makes very clear that the CCP leadership take very seriously its charter to become the world's global leader in all things.
**He clarifies how the CCP leadership has focused its thinking to position itself to displace what it views as the "reigning hegemon - the USA". A study of this work makes very clear that the CCP leadership take very seriously its charter to become the world's global leader in all things.
** The list of all crucial science and technology is presented that shows that the CCP plans to own and control anything that matters. That the PRC is an authoritarian society is made very clear as well.
** The list of all crucial science and technology is presented that shows that the CCP plans to own and control anything that matters. That the PRC is an authoritarian society is made very clear as well.
** Dr. Pillsbury provides background and context into the values and mindset of the current CCP leader, Xi Jinping. When a fuller recognition of just how implacably focused this leader is then there might be a sea change in how the US and the West refocuses its efforts to meet this growing challenge.
** Dr. Pillsbury provides background and context into the values and mindset of the current CCP leader, Xi Jinping. When a fuller recognition of just how implacably focused this leader is then there might be a sea change in how the US and the West refocuses its efforts to meet this growing challenge.
* '''''<SPAN STYLE="COLOR:BLUE"> Brig. Gen (ret.) Robert Spalding III.</SPAN>''''' Provides a catalog of insightful military risks, concerns.  
* '''''<SPAN STYLE="COLOR:BLUE">Brig. Gen (ret.) Robert Spalding III.</SPAN>''''' Provides a catalog of insightful military risks, concerns.  
**Stealth War: details how the CCP has directed its various agencies to identify and target US vulnerabilities; he clarifies that the CCP will use unrestricted warfare, i.e. '''''<SPAN STYLE="COLOR:BLUE">unrestricted warfare i.e. warfare through any other means necessary to accomplish its goals. </SPAN>'''''
**Stealth War: details how the CCP has directed its various agencies to identify and target US vulnerabilities; he clarifies that the CCP will use unrestricted warfare, i.e. '''''<SPAN STYLE="COLOR:BLUE">unrestricted warfare i.e. warfare through any other means necessary to accomplish its goals.</SPAN>'''''
** The CCP is the main, existential threat: In chapter after chapter he details how the CCP uses intellectual property theft - which is the actual know-how and detailed design information needed to replicate a useful new high tech or scientific breakthrough; thus saving themselves years and billions of dollars;
** The CCP is the main, existential threat: In chapter after chapter he details how the CCP uses intellectual property theft - which is the actual know-how and detailed design information needed to replicate a useful new high tech or scientific breakthrough; thus saving themselves years and billions of dollars;
*** financial leverage: it uses American financial markets to raise money to develop its own military base; yet its companies are not subjected to the same accounting and reporting standards as are US companies;
*** financial leverage: it uses American financial markets to raise money to develop its own military base; yet its companies are not subjected to the same accounting and reporting standards as are US companies;
Line 131: Line 154:
**The following additional notes are included as a means to draw attention to the reality of achieving advances in biotechnology at a rate outpacing what has already happened with the SARS-COV2 virus.
**The following additional notes are included as a means to draw attention to the reality of achieving advances in biotechnology at a rate outpacing what has already happened with the SARS-COV2 virus.
*** '''''<SPAN STYLE="COLOR:RED">Passaging:</SPAN>''''' The technique called "passaging" is a means of increasing the infectivity and lethality of a particular pathogen through targeted rapid evolution. It is use to introduce a pathogen, in this case a virus into a novel environment. This means that the virus has no "experience" of successfully lodging itself into the new organism. Likewise that organism immune system has no experience in combatting it;
*** '''''<SPAN STYLE="COLOR:RED">Passaging:</SPAN>''''' The technique called "passaging" is a means of increasing the infectivity and lethality of a particular pathogen through targeted rapid evolution. It is use to introduce a pathogen, in this case a virus into a novel environment. This means that the virus has no "experience" of successfully lodging itself into the new organism. Likewise that organism immune system has no experience in combatting it;
*** '''''<SPAN STYLE="COLOR:RED">Chimeric Organism are used:</SPAN>''''' Initially a target set of organisms, in the case of SARS-COVID19, chimeric mice with the intended pathogen.  
*** '''''<SPAN STYLE="COLOR:RED">Chimeric Organism are used:</SPAN>''''' Initially a target set of organisms, in the case of SARS-COVID19, chimeric mice with the intended pathogen.  
***  '''''<SPAN STYLE="COLOR:RED">Chimeric Offspring:</SPAN>''''' several generations of chimeric mice were used to incorporate genetic information using human DNA;  
***  '''''<SPAN STYLE="COLOR:RED">Chimeric Offspring:</SPAN>''''' several generations of chimeric mice were used to incorporate genetic information using human DNA;  
*** '''''<SPAN STYLE="COLOR:RED">Specific Target:</SPAN>''''' DNA that was spliced into the virus genome was selected to human airway passage tissue; the pathogen was explicitly intended to attack the organism's respiratory organs;
*** '''''<SPAN STYLE="COLOR:RED">Specific Target:</SPAN>''''' DNA that was spliced into the virus genome was selected to human airway passage tissue; the pathogen was explicitly intended to attack the organism's respiratory organs;
*** '''''<SPAN STYLE="COLOR:RED">Initial Infection: </SPAN>''''' These mice were then inoculated with the SARS-COVID19 virus; initially they showed little reaction to the pathogen.  
*** '''''<SPAN STYLE="COLOR:RED">Initial Infection: </SPAN>''''' These mice were then inoculated with the SARS-COVID19 virus; initially they showed little reaction to the pathogen.  
*** '''''<SPAN STYLE="COLOR:RED">Iterative Infections were performed:</SPAN>''''' These mice produced offspring which were subsequently inoculated with the same pathogen;
*** '''''<SPAN STYLE="COLOR:RED">Iterative Infections were performed:</SPAN>''''' These mice produced offspring which were subsequently inoculated with the same pathogen;
*** '''''<SPAN STYLE="COLOR:RED">Directed Evolution via Passaging:</SPAN>''''' After a number of iterations the result was a SARS-COVID19 virus that had undergone several natural mutation steps and was now extremely infective and lethal; this more dangerous variant mandated highly specialized and sophisticated personal protection equipment and an extremely isolated work environment;
*** '''''<SPAN STYLE="COLOR:RED">Directed Evolution via Passaging:</SPAN>''''' After a number of iterations the result was a SARS-COVID19 virus that had undergone several natural mutation steps and was now extremely infective and lethal; this more dangerous variant mandated highly specialized and sophisticated personal protection equipment and an extremely isolated work environment;
*** '''''<SPAN STYLE="COLOR:RED">Increasing Infectivity:</SPAN>''''' Iterative passaging increased the infectivity and lethality of the virus as expected;  
*** '''''<SPAN STYLE="COLOR:RED">Increasing Infectivity:</SPAN>''''' Iterative passaging increased the infectivity and lethality of the virus as expected;  
***  '''''<SPAN STYLE="COLOR:RED">Highly Contagious and Dangerous:</SPAN>''''' Ultimately the infectivity and lethality of the mutated virus reached a stage where it could be weaponized;
***  '''''<SPAN STYLE="COLOR:RED">Highly Contagious and Dangerous:</SPAN>''''' Ultimately the infectivity and lethality of the mutated virus reached a stage where it could be weaponized;
*** '''''<SPAN STYLE="COLOR:RED">Accidental Handling:</SPAN>''''' Any kind of release, whether intended or unintended could successfully spread the virus via aerosol, or airborne transmission ;
*** '''''<SPAN STYLE="COLOR:RED">Accidental Handling:</SPAN>''''' Any kind of release, whether intended or unintended could successfully spread the virus via aerosol, or airborne transmission ;
*** '''''<SPAN STYLE="COLOR:RED">Only the HIV could do this:</SPAN>''''' Ongoing reporting and investigative work has conclusively demonstrated that the HIV was the source of the COVID-19 pandemic;  
*** '''''<SPAN STYLE="COLOR:RED">Only the HIV could do this:</SPAN>''''' Ongoing reporting and investigative work has conclusively demonstrated that the HIV was the source of the COVID-19 pandemic;  
*** '''''<SPAN STYLE="COLOR:RED">National Lockdown:</SPAN>''''' Of note is the extremely curious fact that the CCP imposed a nationwide lockdown that went into effect 48 hours after being announced.
*** '''''<SPAN STYLE="COLOR:RED">National Lockdown:</SPAN>''''' Of note is the extremely curious fact that the CCP imposed a nationwide lockdown that went into effect 48 hours after being announced.
*** '''''<SPAN STYLE="COLOR:RED">With "Leaks":</SPAN>''''' The CCP imposed extreme lockdown measures which in some cases proved to be extreme measures.  
*** '''''<SPAN STYLE="COLOR:RED">With "Leaks":</SPAN>''''' The CCP imposed extreme lockdown measures which in some cases proved to be extreme measures.  
*** '''''<SPAN STYLE="COLOR:RED">International Spreading was Allowed:</SPAN>''''' Further puzzling behavior on the part of the CCP is the fact that international air travel out of the PRC continued for one month after the lockdown.
*** '''''<SPAN STYLE="COLOR:RED">International Spreading was Allowed:</SPAN>''''' Further puzzling behavior on the part of the CCP is the fact that international air travel out of the PRC continued for one month after the lockdown.
*** '''''<SPAN STYLE="COLOR:RED">Military Applications:</SPAN>''''' Subsequent research by the team and their collaborators have demonstrated that the HIV has been driven by military, bioweapon considerations.</SPAN>'''''  
*** '''''<SPAN STYLE="COLOR:RED">Military Applications:</SPAN>''''' Subsequent research by the team and their collaborators have demonstrated that the HIV has been driven by military, bioweapon considerations.</SPAN>'''''  
*** '''''<SPAN STYLE="COLOR:RED">Unrestricted Warfare:</SPAN>''''' Extensive investigation revealed that the PLA had been studying how to weaponize an engineered virus as recently as 2016 and published a report on how best to use such a weapon.
*** '''''<SPAN STYLE="COLOR:RED">Unrestricted Warfare:</SPAN>''''' Extensive investigation revealed that the PLA had been studying how to weaponize an engineered virus as recently as 2016 and published a report on how best to use such a weapon.
* '''''<SPAN STYLE="COLOR:BLUE"> Dr Eric Li.</SPAN>'''''  
* '''''<SPAN STYLE="COLOR:BLUE">Dr Eric Li.</SPAN>''''' https://www.ted.com/talks/eric_x_li_a_tale_of_two_political_systems?language=en CCP Venture Capitalist. ]; Dr Li is a Chinese national. He was educated in the US at the University of California at Berkeley and later at Stanford University. His PhD is from Fudan University in Shanghai China. He has focused his resources '''''<SPAN STYLE="COLOR:BLUE"> the growth and development of [https://www.wikiwand.com/en/Eric_X._Li#Business_ventures PRC high technology companies.] </SPAN>'''''  
https://www.ted.com/talks/eric_x_li_a_tale_of_two_political_systems?language=en CCP Venture Capitalist. ]; Dr Li is a Chinese national. He was educated in the US at the University of California at Berkeley and later at Stanford University. His PhD is from Fudan University in Shanghai China. He has focused his resources '''''<SPAN STYLE="COLOR:BLUE"> the growth and development of [https://www.wikiwand.com/en/Eric_X._Li#Business_ventures PRC high technology companies.] </SPAN>'''''  
** During a presentation at TED Talks Dr Li made the case that there are notable value differences between the Chinese world view and that typically found in the West.  
** During a presentation at TED Talks Dr Li made the case that there are notable value differences between the Chinese world view and that typically found in the West.  
** In his summary of value comparisons he builds the position that the Chinese model works in ways that Western observers claim that it does not. He champions the Chinese "meritocratic system" as a means of managing public affairs.
** In his summary of value comparisons he builds the position that the Chinese model works in ways that Western observers claim that it does not. He champions the Chinese "meritocratic system" as a means of managing public affairs.
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">CAVEATS.</SPAN>'''''<br />
-->
The above impressions by notable observers in the field of artificial intelligence and geopolitics is small, unscientific sampling of positions. However in each case these individuals are well respected and known in their fields. From these we propose a few caveats and observations as a way forward.<br />
<!--
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">CAVEATS.</SPAN>'''''<BR />
The above impressions by notable observers in the field of artificial intelligence and geopolitics is small, unscientific sampling of positions. However in each case these individuals are well respected and known in their fields. From these we propose a few caveats and observations as a way forward.<BR />
We propose the tentative conclusion that several factors should condition our synthesis going forward. These include the premises that CG4:
We propose the tentative conclusion that several factors should condition our synthesis going forward. These include the premises that CG4:
[[File:Exa_scale_el_capitain.jpg|left|450px]]
[[File:Exa_scale_el_capitain.jpg|left|450px]]
* '''''<SPAN STYLE="COLOR:BLUE">Inherently Dual Use Technology:</SPAN>''''' it should be kept clearly in mind that its further use entails significant risks; the prospect that totally unforeseen and unexpected uses will emerge must be accepted as the new reality; like any new capability it will be applied toward positive and benign outcomes but we should not be surprised when adverse and negative uses emerge. Hopes of only positive outcomes are hopelessly unrealistic.
* '''''<SPAN STYLE="COLOR:BLUE">Inherently Dual Use Technology:</SPAN>''''' it should be kept clearly in mind that its further use entails significant risks; the prospect that totally unforeseen and unexpected uses will emerge must be accepted as the new reality; like any new capability it will be applied toward positive and benign outcomes but we should not be surprised when adverse and negative uses emerge. Hopes of only positive outcomes are hopelessly unrealistic.
* '''''<SPAN STYLE="COLOR:BLUE">Cognitive Prosthetic: (CP):</SPAN>''''' The kinds of tasks that CG4 can perform suggest that it is well beyond the category of an appliance but rather is closer to a heavy industrial tool. Because of its ability to adapt its functionality to user defined task it might be more accurately something akin to a prosthetic. But in this case one that supports and extends human reasoning ability.<br />
* '''''<SPAN STYLE="COLOR:BLUE">Cognitive Prosthetic: (CP):</SPAN>''''' The kinds of tasks that CG4 can perform suggest that it is well beyond the category of an appliance but rather is closer to a heavy industrial tool. Because of its ability to adapt its functionality to user defined task it might be more accurately something akin to a prosthetic. But in this case one that supports and extends human reasoning ability.<BR />
CG4 possesses no sense of self, sentience, intentionality or emotion. Yet it continues to demonstrate itself to be a very powerful tool for expression or problem solving.<br /> Given its power to offer cognitive amplification we suggest further that it can form the basis of a more advanced form of symbolic representation and communication.
CG4 possesses no sense of self, sentience, intentionality or emotion. Yet it continues to demonstrate itself to be a very powerful tool for expression or problem solving.<BR /> Given its power to offer cognitive amplification we suggest further that it can form the basis of a more advanced form of symbolic representation and communication.
* '''''<SPAN STYLE="COLOR:BLUE">Emergence: it has shown novel and unexpected capabilities; </SPAN>''''' capabilities that the designers had not anticipated have spontaneously arisen; however though no one so far is proposing that sentience, intentionality, consciousness or self reference might be present; the lights clearly are on but the doors and windows are still open and there is still nobody home.
* '''''<SPAN STYLE="COLOR:BLUE">Emergence: it has shown novel and unexpected capabilities; </SPAN>''''' capabilities that the designers had not anticipated have spontaneously arisen; however though no one so far is proposing that sentience, intentionality, consciousness or self reference might be present; the lights clearly are on but the doors and windows are still open and there is still nobody home.
[[File:Columbus00.jpg|left|350px|Santa Maria: Columbus Expedition Flag Ship]]
[[File:Columbus00.jpg|left|350px|Santa Maria: Columbus Expedition Flag Ship]]
* '''''<SPAN STYLE="COLOR:BLUE">Hosting: Von Neuman Architecture:</SPAN>''''' to date, CG4 is functioning on an ensemble of von Neuman machines.  As it is currently accessed has been developed and is hosted on classical computing equipment. Despite this fact it has shown a stunning ability to provide context sensitive an meaningful responses to a broad range of queries. However intrinsic to its fundamental modeling approach has shown itself as being prone to actions that can intercept or subvert its intended agreeable functionality. To what extent solving this "alignment" problem is successful going forward remains to be seen. As training data sets grew in size processing power demands grew dramatically. Subsequently the more sophisticated LLM's such as CG4 and its peers have required environments that are massively parallel and operate at state of the art speeds. This has meant that in order to keep training times within acceptable limits some of the most powerful environments have been brought into play. Going to larger training data sets and keeping time to training conclusion may call for hosting on such exa-scale computing environments such as the El Capitan system or one of its near peer processing ensembles.
* '''''<SPAN STYLE="COLOR:BLUE">Hosting: Von Neuman Architecture:</SPAN>''''' to date, CG4 is functioning on an ensemble of von Neuman machines.  As it is currently accessed has been developed and is hosted on classical computing equipment. Despite this fact it has shown a stunning ability to provide context sensitive an meaningful responses to a broad range of queries. However intrinsic to its fundamental modeling approach has shown itself as being prone to actions that can intercept or subvert its intended agreeable functionality. To what extent solving this "alignment" problem is successful going forward remains to be seen. As training data sets grew in size processing power demands grew dramatically. Subsequently the more sophisticated LLM's such as CG4 and its peers have required environments that are massively parallel and operate at state of the art speeds. This has meant that in order to keep training times within acceptable limits some of the most powerful environments have been brought into play. Going to larger training data sets and keeping time to training conclusion may call for hosting on such exa-scale computing environments such as the El Capitan system or one of its near peer processing ensembles.


[[File:Ford00.jpg|right|350px|US Super Carrier]]
[[File:Ford00.jpg|right|350px|US Super Carrier]]
'''''<SPAN STYLE="COLOR:BLUE">Hosting: Quantum Computing.</SPAN>''''' Because of the inherent novelty of what a quantum computing environment might make possible the following discussion limits itself to what is currently known. <BR />
'''''<SPAN STYLE="COLOR:BLUE">Hosting: Quantum Computing.</SPAN>''''' Because of the inherent novelty of what a quantum computing environment might make possible the following discussion limits itself to what is currently known. <BR />


Published reports from giants such as IBM, Intel and Google lay out explicit timelines and project goals. In each case the expected goal of qubit availability is provided. <BR />
Published reports from giants such as IBM, Intel and Google lay out explicit timelines and project goals. In each case the expected goal of qubit availability is provided. <BR />
Line 169: Line 192:
If we take a short step into the near term future then we might be obliged to attempt to assimilate and rationalize developments happening on a daily basis. Any or even all of which can have transformative implications.  <BR />
If we take a short step into the near term future then we might be obliged to attempt to assimilate and rationalize developments happening on a daily basis. Any or even all of which can have transformative implications.  <BR />


The upshot is that as '''''<SPAN STYLE="COLOR:BLUE">quantum computing </SPAN>'''''becomes more prevalent the field of deep learning will take another forward "quantum leap" - literally which will put those who possess it at an incalculable advantage. It will be like trying to go after King Kong with Spad S. - firing with bb pellets. <BR />
The upshot is that as '''''<SPAN STYLE="COLOR:BLUE">quantum computing</SPAN>''''' becomes more prevalent the field of deep learning will take another forward "quantum leap" - literally which will put those who possess it at an incalculable advantage. It will be like trying to go after King Kong with Spad S. - firing with bb pellets.<BR />


The central problem in these most recent developments arises because of the well recognized human inability to process changes that happen in a nonlinear fashion. If change is introduced in a relatively linear fashion at a slow to moderate pace then most people adapt to and accommodate the change. However if change happens geometrically like what we see in the areas of deep learning then it is much more difficult to adapt to change. <BR />
The central problem in these most recent developments arises because of the well recognized human inability to process changes that happen in a nonlinear fashion. If change is introduced in a relatively linear fashion at a slow to moderate pace then most people adapt to and accommodate the change. However if change happens geometrically like what we see in the areas of deep learning then it is much more difficult to adapt to change. <BR />


* '''''<SPAN STYLE="COLOR:BLUE">CG.X-Quantum will completely eclipse current incarnations of CG4:</SPAN>''''' Quantum computing devices are already demonstrating their ability to solve problems in seconds or minutes that classical von Neuman computing machines require decades, centuries or even millennia to solve.  
* '''''<SPAN STYLE="COLOR:BLUE">CG.X-Quantum will completely eclipse current incarnations of CG4:</SPAN>'''''  
Quantum computing devices are already demonstrating their ability to solve problems in seconds or minutes that classical von Neuman computing machines require decades, centuries or even millennia to solve.  
[[File:IBMQ2.jpg|right|500px|IBM Quantum System 2]]
[[File:IBMQ2.jpg|right|500px|IBM Quantum System 2]]
* '''''<SPAN STYLE="COLOR:BLUE">Phase Shift.</SPAN>''''' Classical physics describes how states of matter possess different properties depending upon their energy state or environment. Thus on the surface of the earth we can experience the gas of the atmosphere. In environments that are somewhat above the freezing point of water we are unaware of the fact that we inhale and exhale atmospheric gas. It is odorless, colorless and tasteless. <br />  
* '''''<SPAN STYLE="COLOR:BLUE">Phase Shift.</SPAN>''''' Classical physics describes how states of matter possess different properties depending upon their energy state or environment. Thus on the surface of the earth we can experience the gas of the atmosphere. In environments that are somewhat above the freezing point of water we are unaware of the fact that we inhale and exhale atmospheric gas. It is odorless, colorless and tasteless. <br />  
** '''''<SPAN STYLE="COLOR:BLUE">STATE: Gas. </SPAN>'''''Were we to collect sufficient quantities of atmospheric gas into a sealable container it would be possible to cool the gasses comprising earth's atmosphere into liquids. The idea of wetness, shape conformability and other properties of a liquid would suddenly become evident. Yet it would be difficult to adduce wetness if we never had experience with a liquids.<br />  
** '''''<SPAN STYLE="COLOR:BLUE">STATE: Gas.</SPAN>''''' Were we to collect sufficient quantities of atmospheric gas into a sealable container it would be possible to cool the gasses comprising earth's atmosphere into liquids. The idea of wetness, shape conformability and other properties of a liquid would suddenly become evident. Yet it would be difficult to adduce wetness if we never had experience with a liquids.<br />  
** '''''<SPAN STYLE="COLOR:BLUE">STATE: Liquid. </SPAN>'''''If one never had contact with the liquid state of H2O then properties as buoyancy, wetness, conformability (i.e. to the shape of a container, evaporation, discoloration and similar properties might be very difficult to imagine. It would be difficult if not even impossible to envision something that was otherwise undetectable because it was odorless, colorless and tasteless could somehow cause one's death by drowning. Yet were one to be plunged into a large enough body of water but not possess swimming skills one could very well die by drowning. <br />  
** '''''<SPAN STYLE="COLOR:BLUE">STATE: Liquid.</SPAN>''''' If one never had contact with the liquid state of H2O then properties as buoyancy, wetness, conformability (i.e. to the shape of a container, evaporation, discoloration and similar properties might be very difficult to imagine. It would be difficult if not even impossible to envision something that was otherwise undetectable because it was odorless, colorless and tasteless could somehow cause one's death by drowning. Yet were one to be plunged into a large enough body of water but not possess swimming skills one could very well die by drowning. <br />  
** '''''<SPAN STYLE="COLOR:BLUE">STATE: Solid. </SPAN>''''' The process can be repeated. If we were to use water as a basis then we might discover another state of matter that water can exhibit. This is the state of ice. Many of us use this material (ice) to condition our beverages. That a heretofore wet liquid could become solid might also defy our ability to imagine it taking on solid form.  These all consist of the same substance, i.e. H2O. Yet properties found in one state, or phase bear little or no resemblance to those in the subsequent state. We should expect to see an evolution comparable happening in fast forward motion that is very comparable to "gas to liquid, liquid to solid". When DeepMind or CG4 are re-hosted in a quantum computing environment equally unimaginable capabilities will become the norm.<br />
** '''''<SPAN STYLE="COLOR:BLUE">STATE: Solid.</SPAN>''''' The process can be repeated. If we were to use water as a basis then we might discover another state of matter that water can exhibit. This is the state of ice. Many of us use this material (ice) to condition our beverages. That a heretofore wet liquid could become solid might also defy our ability to imagine it taking on solid form.  These all consist of the same substance, i.e. H2O. Yet properties found in one state, or phase bear little or no resemblance to those in the subsequent state. We should expect to see an evolution comparable happening in fast forward motion that is very comparable to "gas to liquid, liquid to solid". When DeepMind or CG4 are re-hosted in a quantum computing environment equally unimaginable capabilities will become the norm.<br />


[[File:IBMQUANTUM2.jpg|left|550px|Exponential Computational Capabilities]]
[[File:IBMQUANTUM2.jpg|left|550px|Exponential Computational Capabilities]]
Line 187: Line 211:
In some cases they will prove to be fairly technical.<BR />
In some cases they will prove to be fairly technical.<BR />


A quick peek at this scaling chart shows that the IBM-Q-System TWO with 432 QuBits is completely off of the chart. Which if one views the associated video will see, IBM plans to offer joined systems together in trios. The result will be well '''''<SPAN STYLE="COLOR:BLUE">over one thousand qubits. </SPAN>'''''Making a conceptual or imaginative leap to what this kind of capability means requires considerable insight and a mental framework that benefits from value scales either only poorly understood or understood by very few alive today.<BR />
A quick peek at this scaling chart shows that the IBM-Q-System TWO with 432 QuBits is completely off of the chart. Which if one views the associated video will see, IBM plans to offer joined systems together in trios. The result will be well '''''<SPAN STYLE="COLOR:BLUE">over one thousand qubits.</SPAN>''''' Making a conceptual or imaginative leap to what this kind of capability means requires considerable insight and a mental framework that benefits from value scales either only poorly understood or understood by very few alive today.<BR />


'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">RESCALING.</SPAN>'''''<br />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">RESCALING.</SPAN>'''''<BR />
Progress has moved forward apace in the world of quantum computing. By and large most members of the public have little awareness of its reality or what it might mean. What might be worth pondering briefly are the numbers in this comparison table. Note that with relatively small numbers of QuBits the amount of computational capacity becomes extremely large - by contemporary standards. If we look at the advertised progress that IBM is making with its '''''<SPAN STYLE="COLOR:BLUE">Osprey and Quantum System Two [https://newsroom.ibm.com/2022-11-09-IBM-Unveils-400-Qubit-Plus-Quantum-Processor-and-Next-Generation-IBM-Quantum-System-Two 432 QuBits - Roadmap]</SPAN>''''' device then we are seeing computational capacity that requires a redefinition of our understanding of what is possible. <br />
Progress has moved forward apace in the world of quantum computing. By and large most members of the public have little awareness of its reality or what it might mean. What might be worth pondering briefly are the numbers in this comparison table. Note that with relatively small numbers of QuBits the amount of computational capacity becomes extremely large - by contemporary standards. If we look at the advertised progress that IBM is making with its '''''<SPAN STYLE="COLOR:BLUE">Osprey and Quantum System Two [https://newsroom.ibm.com/2022-11-09-IBM-Unveils-400-Qubit-Plus-Quantum-Processor-and-Next-Generation-IBM-Quantum-System-Two 432 QuBits - Roadmap]</SPAN>''''' device then we are seeing computational capacity that requires a redefinition of our understanding of what is possible. <br />


[[File:IBMQ03.jpg|left|650px|IBM Quantum Computing Roadmap]]
[[File:IBMQ03.jpg|left|650px|IBM Quantum Computing Roadmap]]


Taking a very short leap of the imagination one might envision the capability of modeling molecular processes in fast forward time. One can imagine creating a '''''<SPAN STYLE="COLOR:BLUE">Sim City</SPAN>''''' type model with a large to very large number of agents, hundreds, thousands or tens of thousands where each has very detailed knowledge about their own domain. They might have comparable communications channels as those described in the "Sim City" experiment. The processing likewise being sped up by an arbitrary scaling factor. <br />
Taking a very short leap of the imagination one might envision the capability of modeling molecular processes in fast forward time. One can imagine creating a '''''<SPAN STYLE="COLOR:BLUE">Sim City</SPAN>''''' type model with a large to very large number of agents, hundreds, thousands or tens of thousands where each has very detailed knowledge about their own domain. They might have comparable communications channels as those described in the "Sim City" experiment. The processing likewise being sped up by an arbitrary scaling factor. <BR />


'''''<SPAN STYLE="COLOR:BLUE">Molecular modeling at quantum computing speed</SPAN>''''' can mean that any number of parallel pathways can be examined and fine tuned almost instantly. Existing experiments with various deep learning based systems have shown that they are capable of creating their own experimental validation and verification processes and actions. They would only need some human intervention for equipment assembly and setup. With that done such as system could overnight be capable of exploring various experimental pathways forward that optimized a particular molecular structure for a highly specific purpose, or combination of purposes, then recommend the processing steps to create them. <br />
'''''<SPAN STYLE="COLOR:BLUE">Molecular modeling at quantum computing speed</SPAN>''''' can mean that any number of parallel pathways can be examined and fine tuned almost instantly. Existing experiments with various deep learning based systems have shown that they are capable of creating their own experimental validation and verification processes and actions. They would only need some human intervention for equipment assembly and setup. With that done such as system could overnight be capable of exploring various experimental pathways forward that optimized a particular molecular structure for a highly specific purpose, or combination of purposes, then recommend the processing steps to create them. <BR />
This kind of capability suggests that heretofore unforeseen or unimagined psycho-actives might become available in the very near term. Usage of such capabilities could mean a bifurcation of humans from one species to a successor species. This can not reasonably be ruled out. <br />
This kind of capability suggests that heretofore unforeseen or unimagined psycho-actives might become available in the very near term. Usage of such capabilities could mean a bifurcation of humans from one species to a successor species. This can not reasonably be ruled out. <BR />
'''''<SPAN STYLE="COLOR:BLUE">Geopolitical, Geo-Economic modeling</SPAN>''''' at quantum speed can mean the creation of plans that can either cause ones own national strengths and standing could be considerably enhanced or those of a rival equally diminished or worse, damaged. This necessarily inheres from the ability to direct such as system to use known historical facts and rules of human endeavors to be used to play a model forward to see what kinds of outcomes might obtains. If one were to take a case in point one might use such a capability to play forward the events that led up to The Great War, or World War One to explore if there might have been other actions available to the leaders of the time to avert the catastrophe that ensued.<BR />
'''''<SPAN STYLE="COLOR:BLUE">Geopolitical, Geo-Economic modeling</SPAN>''''' at quantum speed can mean the creation of plans that can either cause ones own national strengths and standing could be considerably enhanced or those of a rival equally diminished or worse, damaged. This necessarily inheres from the ability to direct such as system to use known historical facts and rules of human endeavors to be used to play a model forward to see what kinds of outcomes might obtains. If one were to take a case in point one might use such a capability to play forward the events that led up to The Great War, or World War One to explore if there might have been other actions available to the leaders of the time to avert the catastrophe that ensued.<BR />


Line 206: Line 230:
Existing CG4 functionality allows for sentiment analysis. This means that it is capable of analyzing a tract of text and deriving a sense of the sentiment that the author used in its composition. Looking at a far more sophisticated capability it is within the realm of reason to expect that a CG4-class system might iteratively develop theories that enable it to characterize various human states of mind. It might further be able to formulate theories that enable it to crystallize real world hypotheses of significant accuracy. Working with this kind of capability it would not seem to be far fetched to see  CG4 type system teaching itself to interact with a human user in a seemingly highly empathetic and charming manner. The prospect of such a development should be both fascinating but also off-putting since it would enable such a system to be a highly agreeable interlocutor but could also become highly persuasive and manipulative. Further when operating at electronic speeds it could all happen in real time. <BR />
Existing CG4 functionality allows for sentiment analysis. This means that it is capable of analyzing a tract of text and deriving a sense of the sentiment that the author used in its composition. Looking at a far more sophisticated capability it is within the realm of reason to expect that a CG4-class system might iteratively develop theories that enable it to characterize various human states of mind. It might further be able to formulate theories that enable it to crystallize real world hypotheses of significant accuracy. Working with this kind of capability it would not seem to be far fetched to see  CG4 type system teaching itself to interact with a human user in a seemingly highly empathetic and charming manner. The prospect of such a development should be both fascinating but also off-putting since it would enable such a system to be a highly agreeable interlocutor but could also become highly persuasive and manipulative. Further when operating at electronic speeds it could all happen in real time. <BR />
Looking at an only slightly more sophisticated approach even an existing CG4 class system should be able to create a set of agents and apply various interaction scripts to them to observe their behavior patterns and compare them to the behavior patterns of actual human users.
Looking at an only slightly more sophisticated approach even an existing CG4 class system should be able to create a set of agents and apply various interaction scripts to them to observe their behavior patterns and compare them to the behavior patterns of actual human users.
<!--                                                                                                                                                      -->
-->
<!--  2023.09.18                                                                                                                                          -->
<!--  2023.09.18                                                                                                                                          -->
<!-- add more on theory of mind and how it can be used to generate instant by instant hypotheses about the user's state of knowledge, perceptions        -->
<!-- add more on theory of mind and how it can be used to generate instant by instant hypotheses about the user's state of knowledge, perceptions        -->
<!-- and what the users goals and motivations might be                                                                                                    -->
<!-- and what the users goals and motivations might be                                                                                                    -->
<!--                                                                                                                                                     -->
<!--
<!--                                                                                                                                                      -->
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Risks</SPAN>'''''
 
 
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">RISKS</SPAN>'''''<BR />
The following collects these reactions suggesting promise or risk. They seem to partition into questions of promise and Risk. The main risk categories are: '''''<SPAN STYLE="COLOR:RED">systemic, malicious</SPAN>''''' and '''''<SPAN STYLE="COLOR:RED">theoretical.</SPAN>'''''
The following collects these reactions suggesting promise or risk. They seem to partition into questions of promise and Risk. The main risk categories are: '''''<SPAN STYLE="COLOR:RED">systemic, malicious</SPAN>''''' and '''''<SPAN STYLE="COLOR:RED">theoretical.</SPAN>'''''
*'''''<SPAN STYLE="COLOR:RED">Systemic. </SPAN>'''''These risks arise innately from the emergence and adaptation of new technology or scientific insights. During the early years of private automobile usage the risks of traffic accidents was very low. This was because there were very few in private hands. But as they began to proliferate. Traffic accident risks escalated. Ultimately civil authorities were obliged to act to regulate their use and ownership. As some observers have pointed out, regulation usually occurs after there has been an unfortunate or tragic event. The pattern can be seen in civil aviation and later in control and usage of heavy transportation or construction equipment. In each case training became formalized and licensing became obligatory for airplane ownership and usage, trucking or heavy construction equipment.
*'''''<SPAN STYLE="COLOR:RED">Systemic.</SPAN>''''' These risks arise innately from the emergence and adaptation of new technology or scientific insights. During the early years of private automobile usage the risks of traffic accidents was very low. This was because there were very few in private hands. But as they began to proliferate. Traffic accident risks escalated. Ultimately civil authorities were obliged to act to regulate their use and ownership. As some observers have pointed out, regulation usually occurs after there has been an unfortunate or tragic event. The pattern can be seen in civil aviation and later in control and usage of heavy transportation or construction equipment. In each case training became formalized and licensing became obligatory for airplane ownership and usage, trucking or heavy construction equipment.
*'''''<SPAN STYLE="COLOR:RED">Malicious.</SPAN>''''' History if littered with examples of how a new scientific advance or technological advance was applied in ways not intended by the inventor. The Montgolfier hot air balloons were considered an entertaining novelty. Their use during World War One as surveillance and attack platforms cast a new and totally different perception on their capabilities. We should expect the same lines of development with CG4. Its peers and derivatives should be considered as no different.
*'''''<SPAN STYLE="COLOR:RED">Malicious.</SPAN>''''' History if littered with examples of how a new scientific advance or technological advance was applied in ways not intended by the inventor. The Montgolfier hot air balloons were considered an entertaining novelty. Their use during World War One as surveillance and attack platforms cast a new and totally different perception on their capabilities. We should expect the same lines of development with CG4. Its peers and derivatives should be considered as no different.
*'''''<SPAN STYLE="COLOR:RED">Theoretical. </SPAN>'''''CG4 has shown itself to be a powerful cognitive appliance or augmentation tool. Given that it is capable of going right to the core of what makes humans the apex predator we should take very seriously the kinds of unintended and unexpected ways that it can be applied. This way suggests considerable caution.
*'''''<SPAN STYLE="COLOR:RED">Theoretical.</SPAN>''''' CG4 has shown itself to be a powerful cognitive appliance or augmentation tool. Given that it is capable of going right to the core of what makes humans the apex predator we should take very seriously the kinds of unintended and unexpected ways that it can be applied. This way suggests considerable caution.<BR />
 
'''''<SPAN STYLE="COLOR:BLUE">Recent Reactions.</SPAN>''''' Since the most recent artificial intelligence systems have swept over the public awareness, sentiment has begun to crystallize. There have been four general types of sentiment that have crystallized over time. These include: voices of enthusiastic encouragement, cautious action, urgent preemption.
'''''<SPAN STYLE="COLOR:BLUE">Recent Reactions. </SPAN>'''''Since the most recent artificial intelligence systems have swept over the public awareness, sentiment has begun to crystallize. There have been four general types of sentiment that have crystallized over time. These include: voices of enthusiastic encouragement, cautious action, urgent preemption.
*'''''<SPAN STYLE="COLOR:BLUE">Enthusiastic Encouragement.</SPAN>''''' Several industry watchers have expressed positive reactions to the availability of CG4 and its siblings. Their position has been that these are powerful tools for good and that they should be viewed as means that illuminate the pathway forward to higher standards of living and human potential.
*'''''<SPAN STYLE="COLOR:BLUE">Enthusiastic Encouragement.</SPAN>''''' Several industry watchers have expressed positive reactions to the availability of CG4 and its siblings. Their position has been that these are powerful tools for good and that they should be viewed as means that illuminate the pathway forward to higher standards of living and human potential.
*'''''<SPAN STYLE="COLOR:BLUE">Cautious Action. </SPAN>'''''Elon Musk and eleven hundred knowledgeable industry observers or participants signed a petition to the US Congress urging caution and regulation of the rapidly advancing areas of artificial intelligence. They voiced concern that the potential usage of these new capabilities can have severe consequences. They also expressed concern that the rapid pace of the adaption of artificial intelligence tools can very quickly lead to job losses across a broad cross section of currently employed individuals. This topic is showing itself to be dynamic and fast changing. Therefore it merits regular review for the most recent insights and developments. In recent weeks and months there have been sources signaling that the use of this new technology will make substantial beneficial impact on their activities and outcomes. Therefore they wish to see it advance as quickly as possible. Not doing so would place the advances made in the West and the US at risk. Which could mean foreclosing on having the most capable system possible to use when dealing with external threats such as those that can be posed by the CCP.
*'''''<SPAN STYLE="COLOR:BLUE">Cautious Action.</SPAN>''''' Elon Musk and eleven hundred knowledgeable industry observers or participants signed a petition to the US Congress urging caution and regulation of the rapidly advancing areas of artificial intelligence. They voiced concern that the potential usage of these new capabilities can have severe consequences. They also expressed concern that the rapid pace of the adaption of artificial intelligence tools can very quickly lead to job losses across a broad cross section of currently employed individuals. This topic is showing itself to be dynamic and fast changing. Therefore it merits regular review for the most recent insights and developments. In recent weeks and months there have been sources signaling that the use of this new technology will make substantial beneficial impact on their activities and outcomes. Therefore they wish to see it advance as quickly as possible. Not doing so would place the advances made in the West and the US at risk. Which could mean foreclosing on having the most capable system possible to use when dealing with external threats such as those that can be posed by the CCP.


[[File:Kingkong00.jpg|right|250px|King Kong Enraged]]
[[File:Kingkong00.jpg|right|250px|King Kong Enraged]]
* '''''<SPAN STYLE="COLOR:BLUE">Urgent Preemption. </SPAN>'''''Those familiar with geopolitics and national security hold the position that any form of pause would be suicide for the US because known competitors and adversaries will race ahead to advance their artificial intelligence capabilities at warp speed. In the process obviating anything that the leading companies in the West might accomplish. They argue that any kind of pause can not even be contemplated. Some voices go so far as to point out that deep learning systems require large data sets to be trained on. They posit the reality that the PRC has a population of 1.37 billion people. A recent report indicates that in 2023 there were 827 million WeChat users in the PRC. Further that the PRC makes use of data collection systems such as WeChat and TenCent. In each case these are systems that capture the messaging information from hundreds of millions of PRC residents on any given day. They also capture a comparably large amount of financial and related information each and every day. Topping the list of enterprises that gather user data we find:
* '''''<SPAN STYLE="COLOR:BLUE">Urgent Preemption.</SPAN>''''' Those familiar with geopolitics and national security hold the position that any form of pause would be suicide for the US because known competitors and adversaries will race ahead to advance their artificial intelligence capabilities at warp speed. In the process obviating anything that the leading companies in the West might accomplish. They argue that any kind of pause can not even be contemplated. Some voices go so far as to point out that deep learning systems require large data sets to be trained on. They posit the reality that the PRC has a population of 1.37 billion people. A recent report indicates that in 2023 there were 827 million WeChat users in the PRC. Further that the PRC makes use of data collection systems such as WeChat and TenCent. In each case these are systems that capture the messaging information from hundreds of millions of PRC residents on any given day. They also capture a comparably large amount of financial and related information each and every day. Topping the list of enterprises that gather user data we find:
**'''''<SPAN STYLE="COLOR:BLUE">WeChat. </SPAN>'''''According to [https://www.bankmycell.com/blog/number-of-wechat-users/#:~:text=WeChat%20Number%20of%20Users%3A%20WeChat,more%20WeChat%20users%2C%20by%202025.  BankMyCell.COM] the '''''<SPAN STYLE="COLOR:BLUE">PRC boasts of 1.671 billion users. </SPAN>'''''
**'''''<SPAN STYLE="COLOR:BLUE">WeChat.</SPAN>''''' According to [https://www.bankmycell.com/blog/number-of-wechat-users/#:~:text=WeChat%20Number%20of%20Users%3A%20WeChat,more%20WeChat%20users%2C%20by%202025.  BankMyCell.COM] the '''''<SPAN STYLE="COLOR:BLUE">PRC boasts of 1.671 billion users. </SPAN>'''''
**'''''<SPAN STYLE="COLOR:BLUE">TenCent. </SPAN>'''''Recent reporting by [https://webtribunal.net/blog/tencent-stats/] TenCent has '''''<SPAN STYLE="COLOR:BLUE">over one billion users. </SPAN>'''''
**'''''<SPAN STYLE="COLOR:BLUE">TenCent.</SPAN>''''' Recent reporting by [https://webtribunal.net/blog/tencent-stats/] TenCent has'''''<SPAN STYLE="COLOR:BLUE"> over one billion users.</SPAN>'''''
** '''''<SPAN STYLE="COLOR:BLUE">Surveillance. </SPAN>'''''Reporting from '''''<SPAN STYLE="COLOR:BLUE">the PRC has [http://chinascope.org/archives/30749 over 600,000,000 surveillance cameras]  </SPAN>'''''within a country of 1.37 billion people.
**'''''<SPAN STYLE="COLOR:BLUE">Surveillance.</SPAN>'''''Reporting from'''''<SPAN STYLE="COLOR:BLUE"> the PRC has [http://chinascope.org/archives/30749 over 600,000,000 surveillance cameras]  </SPAN>'''''within a country of 1.37 billion people.


The result is that the Peoples Republic of China (PRC) has an almost bottomless sea of data with which to work when they wish to train their deep learning systems with. Furthermore they have no legislation that focuses on privacy. A disadvantage for the PRC AI efforts is that their entire data sets are limited to the Chinese people, their languages and its dialects.  
The result is that the Peoples Republic of China (PRC) has an almost bottomless sea of data with which to work when they wish to train their deep learning systems with. Furthermore they have no legislation that focuses on privacy. A disadvantage for the PRC AI efforts is that their entire data sets are limited to the Chinese people, their languages and its dialects.  
Line 237: Line 257:


A fundamental fact notwithstanding any prior qualifications is the reality that the CCP has launched an all-of-national resources project to seize the high ground in any area of high technology that matters. Where enterprises in the West can mount an effort with say one hundred participants, the PRC can stand up a comparable project with a thousand participants.
A fundamental fact notwithstanding any prior qualifications is the reality that the CCP has launched an all-of-national resources project to seize the high ground in any area of high technology that matters. Where enterprises in the West can mount an effort with say one hundred participants, the PRC can stand up a comparable project with a thousand participants.
 
-->
<!-- 20230907: BITE THE BULLET AND JUST BULLDOZE ON WITH IT; deal with the real stumble stone which is how this thing          -->
<!-- 20230907: BITE THE BULLET AND JUST BULLDOZE ON WITH IT; deal with the real stumble stone which is how this thing          -->
<!-- really works... what it is and what it does; use judgement calls to assess enough, not too little, not too much            -->   
<!-- really works... what it is and what it does; use judgement calls to assess enough, not too little, not too much            -->   
Line 246: Line 266:
<!-- that make this thing what it is and say how it does it all...                                                              -->
<!-- that make this thing what it is and say how it does it all...                                                              -->
<!-- given that that first pass on trying to explain it was almost but not quite there...                                      -->
<!-- given that that first pass on trying to explain it was almost but not quite there...                                      -->
<!--
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">CG4 – Theory of Operation: </SPAN>'''''CG4 is a narrow artificial intelligence system that is a Generative Pre-trained Transformer. <BR />  
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">CG4 – Theory of Operation: </SPAN>'''''CG4 is a narrow artificial intelligence system that is a Generative Pre-trained Transformer. <BR />  
In order to make sense of this one would be well advised to understand several fundamental concepts associated with this technology. Because this is a highly technical subject the following is intended to introduce the core elements. The reader is encouraged to review the literature and body of insight that is currently available as explanatory video content.<BR />
In order to make sense of this one would be well advised to understand several fundamental concepts associated with this technology. Because this is a highly technical subject the following is intended to introduce the core elements. The reader is encouraged to review the literature and body of insight that is currently available as explanatory video content.<BR />
-->
<!-- 20230907: o.k... maintain momentum... moved terms down together and can use them as a basis for describing what and how            -->
<!-- 20230907: o.k... maintain momentum... moved terms down together and can use them as a basis for describing what and how            -->
<!-- but need to keep limited, focused to avoid opening huge "closet" of dreadfully gory technical details; challenge: just enough!      -->
<!-- but need to keep limited, focused to avoid opening huge "closet" of dreadfully gory technical details; challenge: just enough!      -->
Line 253: Line 275:
<!-- move it all over to version05 - put focus on theoretical issues topics;                                                            -->
<!-- move it all over to version05 - put focus on theoretical issues topics;                                                            -->
<!--                                                                                                                                    -->
<!--                                                                                                                                    -->
 
<!--
By way of clarifying the topics of this  work we organize these concepts into two primary groups. The first group offer basic information
By way of clarifying the topics of this  work we organize these concepts into two primary groups. The first group offer basic information
on the fundamental building blocks of Large Language Models of which CG4 is a recent example. The second group introduces or otherwise clarifies
on the fundamental building blocks of Large Language Models of which CG4 is a recent example. The second group introduces or otherwise clarifies
Line 266: Line 288:
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=aircAruvnKk&ab_channel=3Blue1Brown Neural_Network_Basics]</SPAN>''''' Some basic principles in how neural networks structures are mapped to a problem.<BR />
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=aircAruvnKk&ab_channel=3Blue1Brown Neural_Network_Basics]</SPAN>''''' Some basic principles in how neural networks structures are mapped to a problem.<BR />
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=IHZwWFHWa-w&ab_channel=3Blue1Brown Gradient Descent]</SPAN>'''''A somewhat technical topic requiring a careful examination on the part of the reader who is unfamiliar with the mathematical background.<BR />  
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=IHZwWFHWa-w&ab_channel=3Blue1Brown Gradient Descent]</SPAN>'''''A somewhat technical topic requiring a careful examination on the part of the reader who is unfamiliar with the mathematical background.<BR />  
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=Ilg3gGewQ5U&ab_channel=3Blue1Brown Back Propagation], </SPAN>''''' intuitively, what is going on?<BR /> It graphically shows the mathematics behind how a neural network attempts to "home in" on a target; what we can see is the mathematics that underlies how training is iteratively done and the distance difference between the neural network's learning state gradually begins to converge with the actual desired response;
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=Ilg3gGewQ5U&ab_channel=3Blue1Brown Back Propagation],</SPAN>''''' intuitively, what is going on?<BR /> It graphically shows the mathematics behind how a neural network attempts to "home in" on a target; what we can see is the mathematics that underlies how training is iteratively done and the distance difference between the neural network's learning state gradually begins to converge with the actual desired response;
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=tIeHLnjs5U8&ab_channel=3Blue1Brown Back Propagation Theory]</SPAN>'''''How various layers in a deep learning neural network interact with each other.<BR />A much closer mathematical description of how prior and successive neuron values contribute to successive neuron values; then how they ultimately contribute to the output neuron that they connect to and whether it will activate or not; this is a highly mathematical description using advanced partial derivative calculus;
**'''''<SPAN STYLE="BLUE">[https://www.youtube.com/watch?v=tIeHLnjs5U8&ab_channel=3Blue1Brown Back Propagation Theory]</SPAN>'''''How various layers in a deep learning neural network interact with each other.<BR />A much closer mathematical description of how prior and successive neuron values contribute to successive neuron values; then how they ultimately contribute to the output neuron that they connect to and whether it will activate or not; this is a highly mathematical description using advanced partial derivative calculus;


<!--      SHOULD GET A LOT EASIER FROM HERE; ALSO ADD STRUCTURE: TO HAVE A) WHAT... AND THEN B) WATCH A VIDEO                        -->
*'''''<SPAN STYLE="COLOR:BLUE">Deep Learning:</SPAN>''''' the technique of using representing an artificial neural network type structure to solve extremely complex problems; [https://www.youtube.com/watch?v=aircAruvnKk&ab_channel=3Blue1Brown a neural network] it typically will work using a large number of what are called "hidden layers"; they achieve greater performance when they are trained on very large data sets; these data sets can range from millions to billions of examples;
<!--      but this editor is a REAL BEAR!                                                                                            -->
<!--      20230911:1700 STRONGLY SUGGESTS FINDING A WYSIWYG editor since this consumes easily half of the work up time                -->
<!--                                                                                                                                  -->
*'''''<SPAN STYLE="COLOR:BLUE">Deep Learning: </SPAN>'''''the technique of using representing an artificial neural network type structure to solve extremely complex problems; [https://www.youtube.com/watch?v=aircAruvnKk&ab_channel=3Blue1Brown a neural network] it typically will work using a large number of what are called "hidden layers"; they achieve greater performance when they are trained on very large data sets; these data sets can range from millions to billions of examples;


[[File:Hiddenlayer00.jpeg|400px|left|Hidden Layers]]
[[File:Hiddenlayer00.jpeg|400px|left|Hidden Layers]]
*'''''<SPAN STYLE="COLOR:BLUE">Hidden Layer: </SPAN>'''''One or more network layers that stands between an input layer and the final output layer. Each input layer is connected to all of the nodes in the first hidden layer. Subsequent hidden layers are added as a means to provide greater discriminatory resolution to the neural network. Each node in a hidden layer can be connected to each node in the following hidden layer. The more hidden layers the more sophisticated can be the ability of the neural network to perform its tasks.<BR />In neural networks, [https://deepai.org/machine-learning-glossary-and-terms/hidden-layer-machine-learning#:~:text=In%20neural%20networks%2C%20a%20hidden,inputs%20entered%20into%20the%20network. a hidden layer] is located between the input and output of the algorithm, in which the function applies weights to the inputs and directs them through an activation function as the output. In short, the hidden layers perform nonlinear transformations of the inputs entered into the network. Hidden layers vary depending on the function of the neural network, and similarly, the layers may vary depending on their associated weights. Useful insight into what hidden layers are doing can be viewed using [https://www.youtube.com/watch?v=0QczhVg5HaI&ab_channel=EmergentGarden this hidden layers video] For an interactive view of '''''<SPAN STYLE="COLOR:BLUE">how hidden layers enable a neural network to learn</SPAN>''''' a new function one can monitor [https://www.youtube.com/watch?v=joA6fEAbAQc&ab_channel=sentdex this video by sentdex]; it has no audio but provides excellent insight into how the various layers and nodes within a layer develops an approximation of the function that it is attempting to learn.<BFR />
*'''''<SPAN STYLE="COLOR:BLUE">Hidden Layer:</SPAN>''''' One or more network layers that stands between an input layer and the final output layer. Each input layer is connected to all of the nodes in the first hidden layer. Subsequent hidden layers are added as a means to provide greater discriminatory resolution to the neural network. Each node in a hidden layer can be connected to each node in the following hidden layer. The more hidden layers the more sophisticated can be the ability of the neural network to perform its tasks.<BR />In neural networks, [https://deepai.org/machine-learning-glossary-and-terms/hidden-layer-machine-learning#:~:text=In%20neural%20networks%2C%20a%20hidden,inputs%20entered%20into%20the%20network. a hidden layer] is located between the input and output of the algorithm, in which the function applies weights to the inputs and directs them through an activation function as the output. In short, the hidden layers perform nonlinear transformations of the inputs entered into the network. Hidden layers vary depending on the function of the neural network, and similarly, the layers may vary depending on their associated weights. Useful insight into what hidden layers are doing can be viewed using [https://www.youtube.com/watch?v=0QczhVg5HaI&ab_channel=EmergentGarden this hidden layers video] For an interactive view of '''''<SPAN STYLE="COLOR:BLUE">how hidden layers enable a neural network to learn</SPAN>''''' a new function one can monitor [https://www.youtube.com/watch?v=joA6fEAbAQc&ab_channel=sentdex this video by sentdex]; it has no audio but provides excellent insight into how the various layers and nodes within a layer develops an approximation of the function that it is attempting to learn.<BR />
[[FILE:Tokens011.png|500px|right|tokens are vectors of words or word fragment data]]
[[FILE:Tokens011.png|500px|right|tokens are vectors of words or word fragment data]]
*'''''<SPAN STYLE="COLOR:BLUE">Parameters: </SPAN>'''''[https://towardsdatascience.com/neural-networks-parameters-hyperparameters-and-optimization-strategies-3f0842fac0a5#:~:text=Parameters%3A%20these%20are%20the%20coefficients,parameters%20which%20minimize%20the%20error. are the coefficients of the model], and they are chosen by the model itself. It means that the algorithm, while learning, optimizes these coefficients (according to a given optimization strategy) and returns an array of parameters which minimize the error. To give an example, in a linear regression task, you have your model that will look like y=b + ax, where b and a will be your parameter. The only thing you have to do with those parameters is to initialize them.  
*'''''<SPAN STYLE="COLOR:BLUE">Parameters: </SPAN>'''''[https://towardsdatascience.com/neural-networks-parameters-hyperparameters-and-optimization-strategies-3f0842fac0a5#:~:text=Parameters%3A%20these%20are%20the%20coefficients,parameters%20which%20minimize%20the%20error. are the coefficients of the model], and they are chosen by the model itself. It means that the algorithm, while learning, optimizes these coefficients (according to a given optimization strategy) and returns an array of parameters which minimize the error. To give an example, in a linear regression task, you have your model that will look like y=b + ax, where b and a will be your parameter. The only thing you have to do with those parameters is to initialize them.  
Line 282: Line 300:
*'''''<SPAN STYLE="COLOR:BLUE">Tokens: </SPAN>'''''[https://www.youtube.com/watch?v=wVN12smEvqg&ab_channel=HuggingFace Making text into tokens]. These become vector representations. They form the basis of the neuron values for each neuron in a neural network. The narrator provides a short and succinct introduction to the basic concepts of what a token is and how it is represented as a vector.
*'''''<SPAN STYLE="COLOR:BLUE">Tokens: </SPAN>'''''[https://www.youtube.com/watch?v=wVN12smEvqg&ab_channel=HuggingFace Making text into tokens]. These become vector representations. They form the basis of the neuron values for each neuron in a neural network. The narrator provides a short and succinct introduction to the basic concepts of what a token is and how it is represented as a vector.


*'''''<SPAN STYLE="COLOR:BLUE">Generative Artificial Intelligence.</SPAN>'''''[https://www.wikiwand.com/en/Generative_artificial_intelligence Generative artificial intelligence (AI)] is artificial intelligence capable of generating text, images, or other media, using generative models. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics. A detailed oil painting of figures in a [https://www.wikiwand.com/en/Th%C3%A9%C3%A2tre_d'Op%C3%A9ra_Spatial futuristic opera scene Théâtre d'Opéra Spatial], an image generated by Midjourney.  In the early 2020s, advances in transformer-based deep neural networks enabled a number of generative AI systems notable for accepting natural language prompts as input. These include large language model chatbots such as ChatGPT, Bing Chat, Bard, and LLaMA, and text-to-image artificial intelligence art systems such as Stable Diffusion, Midjourney, and DALL-E.<BR />
*'''''<SPAN STYLE="COLOR:BLUE">Generative Artificial Intelligence</SPAN>''''' [https://www.wikiwand.com/en/Generative_artificial_intelligence Generative artificial intelligence (AI)] is artificial intelligence capable of generating text, images, or other media, using generative models. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics. A detailed oil painting of figures in a [https://www.wikiwand.com/en/Th%C3%A9%C3%A2tre_d'Op%C3%A9ra_Spatial futuristic opera scene Théâtre d'Opéra Spatial], an image generated by Midjourney.  In the early 2020s, advances in transformer-based deep neural networks enabled a number of generative AI systems notable for accepting natural language prompts as input. These include large language model chatbots such as ChatGPT, Bing Chat, Bard, and LLaMA, and text-to-image artificial intelligence art systems such as Stable Diffusion, Midjourney, and DALL-E.<BR />


[[File:Dream00.jpg|450px|left|Dream from MidJourney]]
[[File:Dream00.jpg|450px|left|Dream from MidJourney]]
A summary of the key elements of what generative artificial intelligence are explained in [https://www.youtube.com/watch?v=1U83yhGY_pI&ab_channel=EyeonTech  this video] (11 minutes); note that a key and crucial point to keep in mind is that generative ai generates new output based upon a user's request or query; it uses massive amounts of training data to draw from in order to generate its output; note further that the "G" in GPT is derived from generative artificial intelligence;<BR /> Subsequent developments with generative ai systems have been moving forward apace at [https://www.youtube.com/watch?v=hfIUstzHs9A&ab_channel=IBMTechnology companies such as IBM]. Their efforts are targeting a range of salient topic areas. To name a few their teams are addressing topics such as molecular structure, code, vision, earth science.  
A summary of the key elements of what generative artificial intelligence are explained in [https://www.youtube.com/watch?v=1U83yhGY_pI&ab_channel=EyeonTech  this video] (11 minutes); note that a key and crucial point to keep in mind is that generative ai generates new output based upon a user's request or query; it uses massive amounts of training data to draw from in order to generate its output; note further that the "G" in GPT is derived from generative artificial intelligence;<BR /> Subsequent developments with generative ai systems have been moving forward apace at [https://www.youtube.com/watch?v=hfIUstzHs9A&ab_channel=IBMTechnology companies such as IBM]. Their efforts are targeting a range of salient topic areas. To name a few their teams are addressing topics such as molecular structure, code, vision, earth science.  


*'''''<SPAN STYLE="COLOR:BLUE">Encoder-Decoder.</SPAN>'''''[https://vitalflux.com/encoder-decoder-architecture-neural-network/ According to Analytics Yogi]In the field of AI / machine learning, the encoder-decoder architecture is a widely-used framework for developing neural networks that can perform natural language processing (NLP) tasks such as language translation, etc which requires sequence to sequence modeling. This architecture involves a two-stage process where the input data is first encoded into a fixed-length numerical representation, which is then decoded to produce an output that matches the desired format. As a data scientist, understanding the encoder-decoder architecture and its underlying neural network principles is crucial for building sophisticated models that can handle complex data sets. By leveraging encoder-decoder neural network architecture, data scientists can design neural networks that can learn from large amounts of data, accurately classify and generate outputs, and perform tasks that require high-level reasoning and decision-making. An [https://www.youtube.com/watch?v=4Bdc55j80l8&ab_channel=TheA.I.Hacker-MichaelPhi excellent if  somewhat technical video] describes and explains the architecture of what is going on with the encoder and decoder components of the GPT system.  
*'''''<SPAN STYLE="COLOR:BLUE">Encoder-Decoder.</SPAN>''''' [https://vitalflux.com/encoder-decoder-architecture-neural-network/ According to Analytics Yogi]In the field of AI / machine learning, the encoder-decoder architecture is a widely-used framework for developing neural networks that can perform natural language processing (NLP) tasks such as language translation, etc which requires sequence to sequence modeling. This architecture involves a two-stage process where the input data is first encoded into a fixed-length numerical representation, which is then decoded to produce an output that matches the desired format. As a data scientist, understanding the encoder-decoder architecture and its underlying neural network principles is crucial for building sophisticated models that can handle complex data sets. By leveraging encoder-decoder neural network architecture, data scientists can design neural networks that can learn from large amounts of data, accurately classify and generate outputs, and perform tasks that require high-level reasoning and decision-making. An [https://www.youtube.com/watch?v=4Bdc55j80l8&ab_channel=TheA.I.Hacker-MichaelPhi excellent if  somewhat technical video] describes and explains the architecture of what is going on with the encoder and decoder components of the GPT system.  


*'''''<SPAN STYLE="COLOR:BLUE">Bidirectional Encoder Representations from Transformers (BERT) </SPAN>'''''An insightful paper on several key features of what BERT is about and how it functions. [https://arxiv.org/abs/1905.05583 official archive paper on BERT].  
*'''''<SPAN STYLE="COLOR:BLUE">Bidirectional Encoder Representations from Transformers (BERT)</SPAN>''''' An insightful paper on several key features of what BERT is about and how it functions. [https://arxiv.org/abs/1905.05583 official archive paper on BERT].  


*'''''<SPAN STYLE="COLOR:BLUE">BERT - [https://www.youtube.com/watch?v=_BFp4kjSB-I&ab_channel=SebastianRaschka very useful video] </SPAN>''''' a prediction system that uses training data that uses statistical mechanisms to anticipate a subsequent input based upon the most recent input;
*'''''<SPAN STYLE="COLOR:BLUE">BERT - [https://www.youtube.com/watch?v=_BFp4kjSB-I&ab_channel=SebastianRaschka very useful video] </SPAN>''''' a prediction system that uses training data that uses statistical mechanisms to anticipate a subsequent input based upon the most recent input;


*'''''<SPAN STYLE="COLOR:BLUE">Large Language Model. </SPAN>'''''Is a prediction system that uses training data to enable a deep neural network to perform recognition tasks. It uses statistical mechanisms to anticipate subsequent input based upon the prior input. Prediction systems use this training data to iteratively condition  and train the deep learning system for its specific tasks. The following two videos provide focus on  large language models; '''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=lnA9DMvHtfI&ab_channel=Graphicsin5Minutes  this is part one of large language models. ]</SPAN>''''' it describes how words are used to predict subsequent words in an input text; '''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=YDiSFS-yHwk&ab_channel=Graphicsin5Minutes here is part two ]</SPAN>''''' and it is an expansion upon the earlier video but offers more technical detail. Together they provide a fairly concise synopsis of how large language models make predictions of how words are associated with each other in a body of text. They explain further how they require a very large training data set so as to achieve their performance.
*'''''<SPAN STYLE="COLOR:BLUE">Large Language Model.</SPAN>''''' Is a prediction system that uses training data to enable a deep neural network to perform recognition tasks. It uses statistical mechanisms to anticipate subsequent input based upon the prior input. Prediction systems use this training data to iteratively condition  and train the deep learning system for its specific tasks. The following two videos provide focus on  large language models; '''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=lnA9DMvHtfI&ab_channel=Graphicsin5Minutes  this is part one of large language models. ]</SPAN>''''' it describes how words are used to predict subsequent words in an input text; '''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=YDiSFS-yHwk&ab_channel=Graphicsin5Minutes here is part two ]</SPAN>''''' and it is an expansion upon the earlier video but offers more technical detail. Together they provide a fairly concise synopsis of how large language models make predictions of how words are associated with each other in a body of text. They explain further how they require a very large training data set so as to achieve their performance.
[[FILE:Decoder00.png|550px|left|Decoder]]  
[[FILE:Decoder00.png|550px|left|Decoder]]  


*'''''<SPAN STYLE="COLOR:BLUE">Transformer. </SPAN>'''''A transformer is a deep learning system. [https://www.youtube.com/watch?v=4Bdc55j80l8&ab_channel=TheA.I.Hacker-MichaelPhi This discussion presents key concepts of how transformers work from a more conceptual point of view.] Its architecture relies on what is called parallel multi-head attention mechanism.  The modern transformer was proposed in the 2017 paper titled "Attention is all you need".<BR /> There is some overlap with topics described above in the section on encoder-decoder architecture. A significant contribution is because it enables creation of a neural network model that  requires less training time than previous recurrent neural architectures; It addressed such issues as inadequate long short-term memory (LSTM); this had been a shortcoming for earlier models because contextually significant terms might be beyond the positional proximity of semantically meaningful terms. Its later variation have been adopted for training large language models on large (language) datasets, such as the Wikipedia corpus and Common Crawl, by virtue of the parallelized processing of input sequence. Earlier efforts to resolve such difficulties were fundamentally influenced by the '''''Attention Is All You Need''''' paper by Ashish Vaswani et al of the Google Brain team. A breakdown of its major concepts are presented in : '''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=sznZ78HquPc&ab_channel=Lucidate: Attention is all you need. ] </SPAN>''''' <BR />
*'''''<SPAN STYLE="COLOR:BLUE">Transformer.</SPAN>''''' A transformer is a deep learning system. [https://www.youtube.com/watch?v=4Bdc55j80l8&ab_channel=TheA.I.Hacker-MichaelPhi This discussion presents key concepts of how transformers work from a more conceptual point of view.] Its architecture relies on what is called parallel multi-head attention mechanism.  The modern transformer was proposed in the 2017 paper titled "Attention is all you need".<BR /> There is some overlap with topics described above in the section on encoder-decoder architecture. A significant contribution is because it enables creation of a neural network model that  requires less training time than previous recurrent neural architectures; It addressed such issues as inadequate long short-term memory (LSTM); this had been a shortcoming for earlier models because contextually significant terms might be beyond the positional proximity of semantically meaningful terms. Its later variation have been adopted for training large language models on large (language) datasets, such as the Wikipedia corpus and Common Crawl, by virtue of the parallelized processing of input sequence. Earlier efforts to resolve such difficulties were fundamentally influenced by the '''''Attention Is All You Need''''' paper by Ashish Vaswani et al of the Google Brain team. A breakdown of its major concepts are presented in :'''''<SPAN STYLE="COLOR:BLUE"> [https://www.youtube.com/watch?v=sznZ78HquPc&ab_channel=Lucidate: Attention is all you need. ]</SPAN>'''''<BR />


*'''''<SPAN STYLE="COLOR:BLUE">Generative Pre-trained Transformer: </SPAN>'''''[https://www.wikiwand.com/en/Generative_pre-trained_transformer Generative pre-trained transformers (GPT)] are a type of large language model (LLM) and a prominent framework for generative artificial intelligence. The first GPT was introduced in 2018 by OpenAI. GPT models are artificial neural networks that are based on the transformer architecture, pre-trained on large data sets of unlabeled text, and able to generate novel human-like content.  As of 2023, most LLMs have these characteristics and are sometimes referred to broadly as GPTs. An overview of the important features can be viewed in [https://www.youtube.com/watch?v=9ebPNEHRwXU&ab_channel=ConnorShorten the narrator] touches on a number of related topics as well.
*'''''<SPAN STYLE="COLOR:BLUE">Generative Pre-trained Transformer:</SPAN>''''' [https://www.wikiwand.com/en/Generative_pre-trained_transformer Generative pre-trained transformers (GPT)] are a type of large language model (LLM) and a prominent framework for generative artificial intelligence. The first GPT was introduced in 2018 by OpenAI. GPT models are artificial neural networks that are based on the transformer architecture, pre-trained on large data sets of unlabeled text, and able to generate novel human-like content.  As of 2023, most LLMs have these characteristics and are sometimes referred to broadly as GPTs. An overview of the important features can be viewed in [https://www.youtube.com/watch?v=9ebPNEHRwXU&ab_channel=ConnorShorten the narrator] touches on a number of related topics as well.


*'''''<SPAN STYLE="COLOR:BLUE">Generative Adversarial Networks.</SPAN>'''''[https://www.wikiwand.com/en/Generative_adversarial_network a GAN is ]A generative adversarial network (GAN) is a class of machine learning framework and a prominent framework for approaching generative AI. The concept was initially developed by Ian Goodfellow and his colleagues in June 2014. In a GAN, two neural networks contest with each other in the form of a zero-sum game, where one agent's gain is another agent's loss.<BR /> Given a training set, this technique learns to generate new data with the same statistics as the training set. For example, a GAN trained on photographs can generate new photographs that look at least superficially authentic to human observers, having many realistic characteristics. Though originally proposed as a form of generative model for unsupervised learning, GANs have also proved useful for semi-supervised learning, fully supervised learning, and reinforcement learning. This video provides a short synopsis of the [https://www.youtube.com/watch?v=_qB4B6ttXk8&ab_channel=AICoffeeBreakwithLetitia GANS capability.]
*'''''<SPAN STYLE="COLOR:BLUE">Generative Adversarial Networks.</SPAN>''''' [https://www.wikiwand.com/en/Generative_adversarial_network a GAN is ]A generative adversarial network (GAN) is a class of machine learning framework and a prominent framework for approaching generative AI. The concept was initially developed by Ian Goodfellow and his colleagues in June 2014. In a GAN, two neural networks contest with each other in the form of a zero-sum game, where one agent's gain is another agent's loss.<BR /> Given a training set, this technique learns to generate new data with the same statistics as the training set. For example, a GAN trained on photographs can generate new photographs that look at least superficially authentic to human observers, having many realistic characteristics. Though originally proposed as a form of generative model for unsupervised learning, GANs have also proved useful for semi-supervised learning, fully supervised learning, and reinforcement learning. This video provides a short synopsis of the [https://www.youtube.com/watch?v=_qB4B6ttXk8&ab_channel=AICoffeeBreakwithLetitia GANS capability.]


*'''''<SPAN STYLE="COLOR:BLUE">Recurrent Networks. </SPAN>'''''A [https://www.wikiwand.com/en/Recurrent_neural_network#Fully_recurrent recurrent neural network] is a kind of deep neural network created by applying the same set of weights recursively over a structured input, to produce a structured prediction over variable-size input structures, or a scalar prediction on it, by traversing a given structure in topological order. '''''Fully recurrent neural networks (FRNN) connect the outputs of all neurons to the inputs of all neurons. '''''This is the most general neural network topology because all other topologies can be represented by setting some connection weights to zero to simulate the lack of connections between those neurons. The illustration to the right may be misleading to many because practical neural network topologies are frequently organized in "layers" and the drawing gives that appearance. However, what appears to be layers are, in fact, different steps in time of the same fully recurrent neural network. A recurrent neural network is a kind of deep neural network] created by applying the same set of weights recursively over a structured input, to produce a structured prediction over variable-size input structures, or a scalar prediction on it, by traversing a given structure in topological order.  For an intuitive overview of recurrent neural networks [https://www.youtube.com/watch?v=LHXXI4-IEns&ab_channel=TheA.I.Hacker-MichaelPhi have a look here]
*'''''<SPAN STYLE="COLOR:BLUE">Recurrent Networks. </SPAN>''''' A [https://www.wikiwand.com/en/Recurrent_neural_network#Fully_recurrent recurrent neural network] is a kind of deep neural network created by applying the same set of weights recursively over a structured input, to produce a structured prediction over variable-size input structures, or a scalar prediction on it, by traversing a given structure in topological order. '''''Fully recurrent neural networks (FRNN) connect the outputs of all neurons to the inputs of all neurons. '''''This is the most general neural network topology because all other topologies can be represented by setting some connection weights to zero to simulate the lack of connections between those neurons. The illustration to the right may be misleading to many because practical neural network topologies are frequently organized in "layers" and the drawing gives that appearance. However, what appears to be layers are, in fact, different steps in time of the same fully recurrent neural network. A recurrent neural network is a kind of deep neural network] created by applying the same set of weights recursively over a structured input, to produce a structured prediction over variable-size input structures, or a scalar prediction on it, by traversing a given structure in topological order.  For an intuitive overview of recurrent neural networks [https://www.youtube.com/watch?v=LHXXI4-IEns&ab_channel=TheA.I.Hacker-MichaelPhi have a look here]


*'''''<SPAN STYLE="COLOR:BLUE">Recursive Networks. </SPAN>''''' [https://www.wikiwand.com/en/Recursive_neural_network A recursive neural network is a kind of deep neural network] created by applying the same set of weights recursively over a structured input, to produce a structured prediction over variable-size input structures, or a scalar prediction on it, by traversing a given structure in topological order. Recursive neural networks, sometimes abbreviated as RvNNs, have been successful, for instance, in learning sequence and tree structures in natural language processing, mainly phrase and sentence continuous representations based on word embedding. RvNNs have first been introduced to learn distributed representations of structure, such as logical terms. Models and general frameworks have been developed in further works since the 1990s. Addressing the task of attempting to transition from words to phrases in natural language understanding was addressed using recursive neural networks. A [https://www.youtube.com/watch?v=AqEF2HIMjYA&ab_channel=HugoLarochelle short but useful video] presents several core elements of how the problem was solved.
*'''''<SPAN STYLE="COLOR:BLUE">Recursive Networks.</SPAN>''''' [https://www.wikiwand.com/en/Recursive_neural_network A recursive neural network is a kind of deep neural network] created by applying the same set of weights recursively over a structured input, to produce a structured prediction over variable-size input structures, or a scalar prediction on it, by traversing a given structure in topological order. Recursive neural networks, sometimes abbreviated as RvNNs, have been successful, for instance, in learning sequence and tree structures in natural language processing, mainly phrase and sentence continuous representations based on word embedding. RvNNs have first been introduced to learn distributed representations of structure, such as logical terms. Models and general frameworks have been developed in further works since the 1990s. Addressing the task of attempting to transition from words to phrases in natural language understanding was addressed using recursive neural networks. A [https://www.youtube.com/watch?v=AqEF2HIMjYA&ab_channel=HugoLarochelle short but useful video] presents several core elements of how the problem was solved.


*'''''<SPAN STYLE="COLOR:BLUE">Fine Tuning CG4.</SPAN>''''' Use instructions from the [https://platform.openai.com/docs/guides/fine-tuning OpenAI Documentation page] to fine tune CG4. Here is some insight on how to [https://www.youtube.com/watch?v=boHXgQ5eQic&ab_channel=JamesBriggs fine tune a Chat GPT3.5] system. For a somewhat less technical overview on when to use fine tuning and some basic instructions on how to perform fine tuning then have a look at [https://www.youtube.com/watch?v=z_x7j5qv6qo&ab_channel=MichaelBorman this video]. For observers with an interest in more practical uses then this video explains how to [https://www.youtube.com/watch?v=c_nCjlSB1Zk&ab_channel=AIJason perform fine tuning to capture business knowledge.]
*'''''<SPAN STYLE="COLOR:BLUE">Fine Tuning CG4.</SPAN>''''' Use instructions from the [https://platform.openai.com/docs/guides/fine-tuning OpenAI Documentation page] to fine tune CG4. Here is some insight on how to [https://www.youtube.com/watch?v=boHXgQ5eQic&ab_channel=JamesBriggs fine tune a Chat GPT3.5] system. For a somewhat less technical overview on when to use fine tuning and some basic instructions on how to perform fine tuning then have a look at [https://www.youtube.com/watch?v=z_x7j5qv6qo&ab_channel=MichaelBorman this video]. For observers with an interest in more practical uses then this video explains how to [https://www.youtube.com/watch?v=c_nCjlSB1Zk&ab_channel=AIJason perform fine tuning to capture business knowledge.]


*'''''<SPAN STYLE="COLOR:BLUE">Supervised or Unsupervised Learning. </SPAN>'''''[https://www.wikiwand.com/en/Supervised_learning Supervised learning (SL)] is a paradigm in machine learning where input objects (for example, a vector of predictor variables) and a desired output value (also known as human-labeled supervisory signal) train a model. The training data is processed, building a function that maps new data on expected output values. An optimal scenario will allow for the algorithm to correctly determine output values for unseen instances. This requires the learning algorithm to generalize from the training data to unseen situations in a "reasonable" way (see inductive bias). This statistical quality of an algorithm is measured through the so-called generalization error. [https://www.youtube.com/watch?v=W01tIRP_Rqs&ab_channel=IBMTechnology This short video] presents a summarized explanation of the difference. Unsupervised learning is a paradigm in machine learning where, in contrast to supervised learning and semi-supervised learning, '''''<SPAN STYLE="COLOR:BLUE">algorithms learn patterns exclusively from unlabeled data.</SPAN>'''''
*'''''<SPAN STYLE="COLOR:BLUE">Supervised or Unsupervised Learning.</SPAN>''''' [https://www.wikiwand.com/en/Supervised_learning Supervised learning (SL)] is a paradigm in machine learning where input objects (for example, a vector of predictor variables) and a desired output value (also known as human-labeled supervisory signal) train a model. The training data is processed, building a function that maps new data on expected output values. An optimal scenario will allow for the algorithm to correctly determine output values for unseen instances. This requires the learning algorithm to generalize from the training data to unseen situations in a "reasonable" way (see inductive bias). This statistical quality of an algorithm is measured through the so-called generalization error. [https://www.youtube.com/watch?v=W01tIRP_Rqs&ab_channel=IBMTechnology This short video] presents a summarized explanation of the difference. Unsupervised learning is a paradigm in machine learning where, in contrast to supervised learning and semi-supervised learning,'''''<SPAN STYLE="COLOR:BLUE">algorithms learn patterns exclusively from unlabeled data.</SPAN>'''''


*'''''<SPAN STYLE="COLOR:BLUE">Pretrained. </SPAN>'''''A pretrained AI model [https://blogs.nvidia.com/blog/2022/12/08/what-is-a-pretrained-ai-model/ is a deep learning model] — an expression of a brain-like neural algorithm that finds patterns or makes predictions based on data — that’s trained on large datasets to accomplish a specific task. It can be used as is or further fine-tuned to fit an application’s specific needs. Why Are Pretrained AI Models Used? Instead of building an AI model from scratch, developers can use pretrained models and customize them to meet their requirements. To build an AI application, developers first need an AI model that can accomplish a particular task, whether that’s identifying a mythical horse, detecting a safety hazard for an autonomous vehicle or diagnosing a cancer based on medical imaging. That model needs a lot of representative data to learn from.  This learning process entails going through several layers of incoming data and emphasizing goals-relevant characteristics at each layer. To create a model that can recognize a unicorn, for example, one might first feed it images of unicorns, horses, cats, tigers and other animals. This is the incoming data.<BR /><BR />Then, layers of representative data traits are constructed, beginning with the simple — like lines and colors — and advancing to complex structural features. These characteristics are assigned varying degrees of relevance by calculating probabilities. As opposed to a cat or tiger, for example, the more like a horse a creature appears, the greater the likelihood that it is a unicorn. Such probabilistic values are stored at each neural network layer in the AI model, and as layers are added, its understanding of the representation improves. To create such a model from scratch, developers require enormous datasets, often with billions of rows of data. These can be pricey and challenging to obtain, but compromising on data can lead to poor performance of the model.
*'''''<SPAN STYLE="COLOR:BLUE">Pretrained.</SPAN>''''' A pretrained AI model [https://blogs.nvidia.com/blog/2022/12/08/what-is-a-pretrained-ai-model/ is a deep learning model] — an expression of a brain-like neural algorithm that finds patterns or makes predictions based on data — that’s trained on large datasets to accomplish a specific task. It can be used as is or further fine-tuned to fit an application’s specific needs. Why Are Pretrained AI Models Used? Instead of building an AI model from scratch, developers can use pretrained models and customize them to meet their requirements. To build an AI application, developers first need an AI model that can accomplish a particular task, whether that’s identifying a mythical horse, detecting a safety hazard for an autonomous vehicle or diagnosing a cancer based on medical imaging. That model needs a lot of representative data to learn from.  This learning process entails going through several layers of incoming data and emphasizing goals-relevant characteristics at each layer. To create a model that can recognize a unicorn, for example, one might first feed it images of unicorns, horses, cats, tigers and other animals. This is the incoming data.<BR /><BR />Then, layers of representative data traits are constructed, beginning with the simple — like lines and colors — and advancing to complex structural features. These characteristics are assigned varying degrees of relevance by calculating probabilities. As opposed to a cat or tiger, for example, the more like a horse a creature appears, the greater the likelihood that it is a unicorn. Such probabilistic values are stored at each neural network layer in the AI model, and as layers are added, its understanding of the representation improves. To create such a model from scratch, developers require enormous datasets, often with billions of rows of data. These can be pricey and challenging to obtain, but compromising on data can lead to poor performance of the model.


[[File:AgentGPT00.jpeg|left|550px|Sim City Autonomous Agents]]
[[File:AgentGPT00.jpeg|left|550px|Sim City Autonomous Agents]]
*'''''<SPAN STYLE="COLOR:BLUE">Autonomous Agent: </SPAN>'''''an instance of CG4 that is capable of formulating goals and then structuring subtasks that enable the achievement of those subtasks. A recent development has come to light wherein researchers at Stanford University and Google were able to demonstrate autonomous and asynchronous problem solving by having CG4 able to call instances of itself or of CG3. The result was that they were able to create collective of asynchronous problem solvers. They enabled these problem solvers a mechanism to interact and communicate with each other. The result was a very small scale simulation of a village. The "village" consisted of 25 agents. Each agent was assigned private memory as well as goals.<BR/>Of note is that the agent architecture is a bare bones minimal set of behavior controllers. A much more complex and sophisticated set can be envisioned wherein each agent can be developed out to the point that they become far more lifelike. This can mean that they might have goals but such characteristics as beliefs, which allow for correct or incorrect understanding, theory of mind of other agents, or users.<BR />It would be a fairly small step to postulate a substantially larger collection of agents. This larger collection of agents might be put to the use of solving problems involving actual real people in real world situations. For instance one can imagine creating a population consisting of hundred or thousands of agents. These agents might be instantiated to possess positions or values regarding a range of topics. They can further be configured to associate themselves with elements or factors in the world that they operate in. [[FILE:Agent01.jpg|left|550px|Autonomous Agents Simplified Architecture]]<BR />For instance, a subset of agents might be instantiated to exhibit a value to specific factors in the sim-world. A more concrete example might be that they attach considerable value to having the equivalent of "traffic management", i.e. the analog of "traffic lights" in their world vs. having the equivalent of "stop signs"; other agents might possess nearly opposite value; this sets up the possibility that in a larger collective that conflict can arise. With that conflict there might develop agents that lean toward mediation and compromise. Others might be more adamant and less cooperative. The upshot is that very complex models of human behavior can be modeled by adding more traits beyond those of goals and memory.<BR/><BR />
*'''''<SPAN STYLE="COLOR:BLUE">Autonomous Agent:</SPAN>''''' an instance of CG4 that is capable of formulating goals and then structuring subtasks that enable the achievement of those subtasks. A recent development has come to light wherein researchers at Stanford University and Google were able to demonstrate autonomous and asynchronous problem solving by having CG4 able to call instances of itself or of CG3. The result was that they were able to create collective of asynchronous problem solvers. They enabled these problem solvers a mechanism to interact and communicate with each other. The result was a very small scale simulation of a village. The "village" consisted of 25 agents. Each agent was assigned private memory as well as goals.<BR/>Of note is that the agent architecture is a bare bones minimal set of behavior controllers. A much more complex and sophisticated set can be envisioned wherein each agent can be developed out to the point that they become far more lifelike. This can mean that they might have goals but such characteristics as beliefs, which allow for correct or incorrect understanding, theory of mind of other agents, or users.<BR />It would be a fairly small step to postulate a substantially larger collection of agents. This larger collection of agents might be put to the use of solving problems involving actual real people in real world situations. For instance one can imagine creating a population consisting of hundred or thousands of agents. These agents might be instantiated to possess positions or values regarding a range of topics. They can further be configured to associate themselves with elements or factors in the world that they operate in. [[FILE:Agent01.jpg|left|550px|Autonomous Agents Simplified Architecture]]<BR />For instance, a subset of agents might be instantiated to exhibit a value to specific factors in the sim-world. A more concrete example might be that they attach considerable value to having the equivalent of "traffic management", i.e. the analog of "traffic lights" in their world vs. having the equivalent of "stop signs"; other agents might possess nearly opposite value; this sets up the possibility that in a larger collective that conflict can arise. With that conflict there might develop agents that lean toward mediation and compromise. Others might be more adamant and less cooperative. The upshot is that very complex models of human behavior can be modeled by adding more traits beyond those of goals and memory.<BR/><BR />
Recently there appears to have been a shift in landscape of the topic of artificial intelligence agents. What now appears to be coming into focus is the ability to construct specifically targeted tools that make use of multiple autonomous agents to cooperatively solve problems. [https://www.youtube.com/watch?v=zdwgIe4zdsU&ab_channel=WesRoth Knowledgeable observers] have been taking note of this trend and providing insight into what it means and how it might affect the further development of the field.  
Recently there appears to have been a shift in landscape of the topic of artificial intelligence agents. What now appears to be coming into focus is the ability to construct specifically targeted tools that make use of multiple autonomous agents to cooperatively solve problems. [https://www.youtube.com/watch?v=zdwgIe4zdsU&ab_channel=WesRoth Knowledgeable observers] have been taking note of this trend and providing insight into what it means and how it might affect the further development of the field.  
 
-->
<!--
*'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Recent Topics or Issues</SPAN>'''''
*'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Recent Topics or Issues</SPAN>'''''


*'''''<SPAN STYLE="COLOR:BLUE">Consciousness: </SPAN>'''''[https://arxiv.org/abs/2308.08708 Recent consciousness research] has thrown more light on the subject. A number of models have developed in recent years that attempt to address the so-called "hard problem". The basis of this problem focuses on the question of "what it is like to be a bat or a dolphin or a wolf". And identifying the neural correlates that mediate the experience of being. The authors propose approaching the question of if artificial consciousness is possible they suggest the utilization of computational functionalism, empirical neuroscientific evidence then they suggest that a theory-heavy approach be used to assess the viability of the various models that have been proposed to date. They then list the current candidates. These include:  
*'''''<SPAN STYLE="COLOR:BLUE">Consciousness:</SPAN>''''' [https://arxiv.org/abs/2308.08708 Recent consciousness research] has thrown more light on the subject. A number of models have developed in recent years that attempt to address the so-called "hard problem". The basis of this problem focuses on the question of "what it is like to be a bat or a dolphin or a wolf". And identifying the neural correlates that mediate the experience of being. The authors propose approaching the question of if artificial consciousness is possible they suggest the utilization of computational functionalism, empirical neuroscientific evidence then they suggest that a theory-heavy approach be used to assess the viability of the various models that have been proposed to date. They then list the current candidates. These include:  
** recurrent processing theory
** recurrent processing theory
** global workspace theory
** global workspace theory
Line 330: Line 349:


*'''''<SPAN STYLE="COLOR:BLUE">Awareness:</SPAN>''''' This is a characteristic trait of an organism that is capable of sensing the world around itself. It likewise will possess the capability to recognize, interpret and respond to various internal states. A feature of awareness is the ability to record, synopsize, tag and store episodic memories. With awareness there may or may not be self consciousness. However the organism will be able to respond to changes in its environment.  
*'''''<SPAN STYLE="COLOR:BLUE">Awareness:</SPAN>''''' This is a characteristic trait of an organism that is capable of sensing the world around itself. It likewise will possess the capability to recognize, interpret and respond to various internal states. A feature of awareness is the ability to record, synopsize, tag and store episodic memories. With awareness there may or may not be self consciousness. However the organism will be able to respond to changes in its environment.  
*'''''<SPAN STYLE="COLOR:BLUE">Sentience: </SPAN>'''''Derivative from the concept of sense. As in possessing sufficient sensorial apparatus to intercept states and developments in the real world such as temperature, light, chemical odors and acoustic patterns.  
*'''''<SPAN STYLE="COLOR:BLUE">Sentience: </SPAN>''''' Derivative from the concept of sense. As in possessing sufficient sensorial apparatus to intercept states and developments in the real world such as temperature, light, chemical odors and acoustic patterns.  
*'''''<SPAN STYLE="COLOR:BLUE">World Model:</SPAN>''''' The ability to create an abstract representation of an external reality. This model postulates that the sentient agent can further create a self-model that is an actor in this external world model and can interact or otherwise affect state in the world model. But also very importantly that events in the world model can give rise to events that can affect the agent. These events can be either adaptively positive or entail adverse risk.
*'''''<SPAN STYLE="COLOR:BLUE">World Model:</SPAN>''''' The ability to create an abstract representation of an external reality. This model postulates that the sentient agent can further create a self-model that is an actor in this external world model and can interact or otherwise affect state in the world model. But also very importantly that events in the world model can give rise to events that can affect the agent. These events can be either adaptively positive or entail adverse risk.
*'''''<SPAN STYLE="COLOR:BLUE">Emergence: </SPAN>'''''experimenters and developers report observing a working system exhibit properties that had not heretofore been programmed in; they "emerge" from the innate capabilities of the system;  
*'''''<SPAN STYLE="COLOR:BLUE">Emergence:</SPAN>''''' experimenters and developers report observing a working system exhibit properties that had not heretofore been programmed in; they "emerge" from the innate capabilities of the system;  
*'''''<SPAN STYLE="COLOR:BLUE">Alignment: </SPAN>'''''the imperative of imparting "guard rails" or otherwise limitations on what an artificial intelligence system can be allowed to do;
*'''''<SPAN STYLE="COLOR:BLUE">Alignment:</SPAN>''''' the imperative of imparting "guard rails" or otherwise limitations on what an artificial intelligence system can be allowed to do;
*'''''<SPAN STYLE="COLOR:BLUE">Hallucinations. </SPAN>''''' CG4 has produced results to queries wherein it created references that superficially look legitimate but upon closer inspection prove to be nonexistent.<BR /><BR /><BR />
*'''''<SPAN STYLE="COLOR:BLUE">Hallucinations.</SPAN>''''' CG4 has produced results to queries wherein it created references that superficially look legitimate but upon closer inspection prove to be nonexistent.<BR /><BR /><BR />
[[File:TheoryOfMind.jpeg|left|450px|Theory of Mind]]<BR />
[[File:TheoryOfMind.jpeg|left|450px|Theory of Mind]]<BR />
*'''''<SPAN STYLE="COLOR:BLUE">Theory of Mind: </SPAN>'''''In order for a species to build a society, successful socialization processes between members of a species is fundamental. Development of a society requires that family members develop means of communicating needs and wants with each other. When this step is successful then collections of families can aggregate into clans. The basic is that each member develop a means of formulating or otherwise formalizing representations of their own mental and physical state. The key step forward is to be able to attribute comparable representations to others. When this step is successful then a theory of mind can crystallize. Intentions, wants and needs can then be represented. Intentions, wants and needs can then be used to develop plans. The more sophisticated the representation of self-state the more refined the clan's adaptive success will be.<BR />A recent paper on [https://arxiv.org/ftp/arxiv/papers/2302/2302.02083.pdf Theory Of Mind] has illuminated this topic and is worth perusing to see how advances in the user interface experience will develop going forward. It is worth noting that ascribing belief to the user's state of knowledge is a crucial factor. This topic is crucial in '''''<SPAN STYLE="COLOR:BLUE">understanding false beliefs, </SPAN>'''''how they are recognized and responded to. We can imagine that during an interaction session that CG4 or a derivative descendant might have one or more autonomous agents operating to address this exact question, moment by moment.
*'''''<SPAN STYLE="COLOR:BLUE">Theory of Mind:</SPAN>''''' In order for a species to build a society, successful socialization processes between members of a species is fundamental. Development of a society requires that family members develop means of communicating needs and wants with each other. When this step is successful then collections of families can aggregate into clans. The basic is that each member develop a means of formulating or otherwise formalizing representations of their own mental and physical state. The key step forward is to be able to attribute comparable representations to others. When this step is successful then a theory of mind can crystallize. Intentions, wants and needs can then be represented. Intentions, wants and needs can then be used to develop plans. The more sophisticated the representation of self-state the more refined the clan's adaptive success will be.<BR />A recent paper on [https://arxiv.org/ftp/arxiv/papers/2302/2302.02083.pdf Theory Of Mind] has illuminated this topic and is worth perusing to see how advances in the user interface experience will develop going forward. It is worth noting that ascribing belief to the user's state of knowledge is a crucial factor. This topic is crucial in '''''<SPAN STYLE="COLOR:BLUE">understanding false beliefs,</SPAN>''''' how they are recognized and responded to. We can imagine that during an interaction session that CG4 or a derivative descendant might have one or more autonomous agents operating to address this exact question, moment by moment.
 
-->
 
<!-- [[File:GooglePredict.jpg|left|550px|Making Predictions based upon training data]]  
<!-- [[File:GooglePredict.jpg|left|550px|Making Predictions based upon training data]]  
[[File:DeepLearning.jpg|left|500px|Source:IBM. Hidden Layers]]
[[File:DeepLearning.jpg|left|500px|Source:IBM. Hidden Layers]]
[[File:Transformer.png|left|500px|What is a Transformer]]
[[File:Transformer.png|left|500px|What is a Transformer]]
[[File:Tokens00.png|left|500px|What are tokens?]]
[[File:Tokens00.png|left|500px|What are tokens?]]
'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=6tzn5-XlhwU&ab_channel=Lucidate Training and Inferencing a Neural Network] </SPAN>''''' <BR />
'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=6tzn5-XlhwU&ab_channel=Lucidate Training and Inferencing a Neural Network] </SPAN>'''''<BR />
'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=ZvrJaqaK65Y&ab_channel=Lucidate Fine Tuning GPT]Fine Tuning GPT</SPAN>'''''  <BR />
'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=ZvrJaqaK65Y&ab_channel=Lucidate Fine Tuning GPT]Fine Tuning GPT</SPAN>'''''  <BR />
'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=uFiI5fK-7B4&ab_channel=Lucidate General Fine Tuning]General Fine Tuning</SPAN>'''''  <BR />
'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=uFiI5fK-7B4&ab_channel=Lucidate General Fine Tuning]General Fine Tuning</SPAN>'''''  <BR />
Line 352: Line 370:
Illustrative video describing how the prediction process works. Google Search is attempting to predict what a person is about to type. Generative Pre-Trained language models are attempting to do the same thing. But they require a very large corpus of language to work with in order to arrive at a high probability that they have made the right prediction.<BR />
Illustrative video describing how the prediction process works. Google Search is attempting to predict what a person is about to type. Generative Pre-Trained language models are attempting to do the same thing. But they require a very large corpus of language to work with in order to arrive at a high probability that they have made the right prediction.<BR />
-->
-->
<!--  
<!--  
In the case of Google Search, the search terms are the drivers of the process. In the large language model the tokens are the drivers of the process. The degree of success is dependent upon the size of the training data set.  The larger the training data set, the more tokens available the more accurate the response will be that maps to the input. However note again very carefully: there is no sentience, intentionality, self reflective model or consciousness.  <BR />
In the case of Google Search, the search terms are the drivers of the process. In the large language model the tokens are the drivers of the process. The degree of success is dependent upon the size of the training data set.  The larger the training data set, the more tokens available the more accurate the response will be that maps to the input. However note again very carefully: there is no sentience, intentionality, self reflective model or consciousness.  <BR />
Line 359: Line 376:
<!--
<!--
According to Wikipedia: A  '''''<SPAN STYLE="COLOR:BLUE">[https://www.wikiwand.com/en/Large_language_model Large Language Model] </SPAN>'''''(LLM - Wikipedia) is a '''''<SPAN STYLE="COLOR:BLUE">[https://www.wikiwand.com/en/Language_model Language Model] </SPAN>''''' consisting of a '''''<SPAN STYLE="COLOR:BLUE">[https://www.wikiwand.com/en/Artificial_neural_network Neural Network]</SPAN>'''''  with many parameters (typically billions of weights or more), trained on large quantities of unlabeled text using Self-Supervised Learning or Semi-Supervised Learning. LLMs emerged around 2018 and perform well at a wide variety of tasks. This has shifted the focus of Natural Language Processing research away from the previous paradigm of training specialized supervised models for specific tasks.<BR />
According to Wikipedia: A  '''''<SPAN STYLE="COLOR:BLUE">[https://www.wikiwand.com/en/Large_language_model Large Language Model] </SPAN>'''''(LLM - Wikipedia) is a '''''<SPAN STYLE="COLOR:BLUE">[https://www.wikiwand.com/en/Language_model Language Model] </SPAN>''''' consisting of a '''''<SPAN STYLE="COLOR:BLUE">[https://www.wikiwand.com/en/Artificial_neural_network Neural Network]</SPAN>'''''  with many parameters (typically billions of weights or more), trained on large quantities of unlabeled text using Self-Supervised Learning or Semi-Supervised Learning. LLMs emerged around 2018 and perform well at a wide variety of tasks. This has shifted the focus of Natural Language Processing research away from the previous paradigm of training specialized supervised models for specific tasks.<BR />
It uses what is known as the Transformer Model. The Turing site offers useful insight as well into how the transformer model constructs a response from an input. Because the topic is highly technical we leave it to the interested reader to examine the detail processing steps.<BR />
It uses what is known as the Transformer Model. The Turing site offers useful insight as well into how the transformer model constructs a response from an input. Because the topic is highly technical we leave it to the interested reader to examine the detail processing steps.<BR />
The transformer model is a neural network that learns context and understanding as a result of sequential data analysis. The mechanics of how a transformer model works is beyond the technical scope of this summary but a good summary can be found here.<BR />
The transformer model is a neural network that learns context and understanding as a result of sequential data analysis. The mechanics of how a transformer model works is beyond the technical scope of this summary but a good summary can be found here.<BR />
If we use the associated diagram as a reference model then we can see that when we migrate to a deep learning model with a large number of hidden layers then the ability of the deep learning neural network escalates. If we examine closely the facial images at the bottom of the diagram then we can see that there are a number of faces. Included in the diagram is a blow up of a selected feature from one of the faces. In this case it comes from the image of George Washington. If we are using a deep learning system with billions to hundreds of billions of parameters then we should expect that the ability of the deep learning model to possess the most exquisite ability to discern extremely find detail recognition tasks. Which is in fact exactly what happens.
If we use the associated diagram as a reference model then we can see that when we migrate to a deep learning model with a large number of hidden layers then the ability of the deep learning neural network escalates. If we examine closely the facial images at the bottom of the diagram then we can see that there are a number of faces. Included in the diagram is a blow up of a selected feature from one of the faces. In this case it comes from the image of George Washington. If we are using a deep learning system with billions to hundreds of billions of parameters then we should expect that the ability of the deep learning model to possess the most exquisite ability to discern extremely find detail recognition tasks. Which is in fact exactly what happens.


Line 370: Line 385:
Generative Pre-Trained language models are attempting to do the same thing. But they require a very large corpus of langue to work with in order to arrive at a high probability that they have made the right prediction.
Generative Pre-Trained language models are attempting to do the same thing. But they require a very large corpus of langue to work with in order to arrive at a high probability that they have made the right prediction.


'''''<SPAN STYLE="COLOR:BLUE">Fundamentals. </SPAN>'''''Starting with the basics here is a link to a video that explains how a neural network learns. <BR />
'''''<SPAN STYLE="COLOR:BLUE">Fundamentals.</SPAN>''''' Starting with the basics here is a link to a video that explains how a neural network learns. <BR />


'''''<SPAN STYLE="COLOR:BLUE">CG4 – What is CG4 doing?: </SPAN>'''''
'''''<SPAN STYLE="COLOR:BLUE">CG4 – What is CG4 doing?:</SPAN> '''''
Large Language Models perform their function by accepting text as input and assigning several parameters to each token that is created. A token can be a whole word or part of a word. The position of the word or word fragment. The Graphics in Five Minutes channel provides a very concise description of how words are converted to tokens and then how tokens are used to make predictions.<BR />
Large Language Models perform their function by accepting text as input and assigning several parameters to each token that is created. A token can be a whole word or part of a word. The position of the word or word fragment. The Graphics in Five Minutes channel provides a very concise description of how words are converted to tokens and then how tokens are used to make predictions.<BR />
Earlier natural language processing efforts attempted to extract semantically meaningful content by parsing terms and generating complex data structures that could be further manipulated with the goal of attempting to derive understanding of the language text.<BR />
Earlier natural language processing efforts attempted to extract semantically meaningful content by parsing terms and generating complex data structures that could be further manipulated with the goal of attempting to derive understanding of the language text.<BR />
Line 384: Line 399:
<!--
<!--
We can see in this diagram the main processing steps that take place in the transformer. The two main processing cycles include encoding processing and decoding processing. As this is a fairly technical discussion we will defer examination of the internal processing actions for a later iteration. The following four references offer an overview of what basic steps are taken to train and fine tune a GPT system.<BR /><BR /><BR /><BR />
We can see in this diagram the main processing steps that take place in the transformer. The two main processing cycles include encoding processing and decoding processing. As this is a fairly technical discussion we will defer examination of the internal processing actions for a later iteration. The following four references offer an overview of what basic steps are taken to train and fine tune a GPT system.<BR /><BR /><BR /><BR />
 
'''''<SPAN STYLE="COLOR:BLUE">[https://www.google.com/search?q=attention+is+all+you+need+explained&oq=attention+is+all+you+need&aqs=chrome.1.0i512l10.641946273j0j15&sourceid=chrome&ie=UTF-8#kpvalbx=_IhGNZLHII6rMkPIPgPGL-A8_30 Attention is all you need: Transformer model: processing] "From the Attention is all you need paper" Transformer model:</SPAN>''''' Processing <BR />
'''''<SPAN STYLE="COLOR:BLUE">[https://www.google.com/search?q=attention+is+all+you+need+explained&oq=attention+is+all+you+need&aqs=chrome.1.0i512l10.641946273j0j15&sourceid=chrome&ie=UTF-8#kpvalbx=_IhGNZLHII6rMkPIPgPGL-A8_30 Attention is all you need: Transformer model: processing] "From the Attention is all you need paper" Transformer model: </SPAN>''''' Processing <BR />
 
Large Language Models attempt to predict the next token, or word fragment from an input text. In part one the narrator describes how an input is transformed using a neural network to predict an output. In the case of language models the prediction process is attempting to predict what should come next based upon the word or token that has just been processed. However in order to generate accurate predictions very large bodies of text are required to pre-train the model.<BR />
Large Language Models attempt to predict the next token, or word fragment from an input text. In part one the narrator describes how an input is transformed using a neural network to predict an output. In the case of language models the prediction process is attempting to predict what should come next based upon the word or token that has just been processed. However in order to generate accurate predictions very large bodies of text are required to pre-train the model.<BR />


* '''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=lnA9DMvHtfI&ab_channel=Graphicsin5Minutes Part One.]</SPAN>''''' In this video the narrator describes how words are used to predict subsequent words in an input text.
* '''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=lnA9DMvHtfI&ab_channel=Graphicsin5Minutes Part One.]</SPAN>''''' In this video the narrator describes how words are used to predict subsequent words in an input text.
Line 398: Line 410:


'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=2_okfp981gs&ab_channel=LennartSvensson Tokenization]</SPAN>''''' is the process of creating the mapping of words or word fragments to their position in the input text. The training step enables a deep neural network to learn language structures and patterns. The neural network will then be fine tuned for improved performance. In the case of CG4 the size of the corpus of text that was used for training has not been revealed but is rumored to be over one trillion parameters.<BR />
'''''<SPAN STYLE="COLOR:BLUE">[https://www.youtube.com/watch?v=2_okfp981gs&ab_channel=LennartSvensson Tokenization]</SPAN>''''' is the process of creating the mapping of words or word fragments to their position in the input text. The training step enables a deep neural network to learn language structures and patterns. The neural network will then be fine tuned for improved performance. In the case of CG4 the size of the corpus of text that was used for training has not been revealed but is rumored to be over one trillion parameters.<BR />


They perform their processing by accepting text as input and assigning several parameters to each token that is created. A token can be a whole word or part of a word. The position of the word or word fragment. The Graphics in Five Minutes channel provides a very concise description of how words are converted to tokens and then how tokens are used to make predictions.<BR />
They perform their processing by accepting text as input and assigning several parameters to each token that is created. A token can be a whole word or part of a word. The position of the word or word fragment. The Graphics in Five Minutes channel provides a very concise description of how words are converted to tokens and then how tokens are used to make predictions.<BR />
Line 409: Line 420:
'''''image source:<SPAN STYLE="COLOR:BLUE">[https://www.ibm.com/cloud/learn/neural-networks IBM. Hidden Layers]</SPAN>'''''
'''''image source:<SPAN STYLE="COLOR:BLUE">[https://www.ibm.com/cloud/learn/neural-networks IBM. Hidden Layers]</SPAN>'''''
-->
-->
 
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Overview and Summary so far.</SPAN>''''' If we step back for a moment and summarize what some observers have had to say about this new capability then we might tentatively start with that:
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Overview and Summary so far. </SPAN>''''' If we step back for a moment and summarize what some observers have had to say about this new capability then we might tentatively start with that:
* is based upon and is a refinement of its predecessor, the Chat GPT 3.5 system;
* is based upon and is a refinement of its predecessor, the Chat GPT 3.5 system;
* has been developed using the generative predictive transformer (GPT) model;
* has been developed using the generative predictive transformer (GPT) model;
Line 420: Line 430:
* uses deep learning neural networks and very large training data sets;
* uses deep learning neural networks and very large training data sets;
* uses a SAAS model; like Google Search, Youtube or Morningstar Financial;
* uses a SAAS model; like Google Search, Youtube or Morningstar Financial;
 
<!--
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Interim Observations and Conclusions.</SPAN>'''''
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Interim Observations and Conclusions.</SPAN>'''''
* this technology will continue to introduce novel, unpredictable and disruptive risks;
* this technology will continue to introduce novel, unpredictable and disruptive risks;
Line 444: Line 454:
* a world wide web 24/7 accessible SAAS;
* a world wide web 24/7 accessible SAAS;


'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER"> can: </SPAN>'''''<BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER"> can:</SPAN>'''''<BR />
* '''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">converse: </SPAN>'''''
* '''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">converse: </SPAN>'''''
** explain its responses
** explain its responses
** self critique and improve own responses;
** self critique and improve own responses;
** responses are relevant, consistent and topically associated;
** offer responses that are contextually relevant, consistent and topically associated;
** summarize convoluted documents or stories and explain difficult abstract questions
** summarize convoluted documents or stories and explain difficult abstract questions;
** calibrate its response style to resemble known news presenters or narrators;
** calibrate its response style to resemble known news presenters or narrators;
** understand humor
** understand humor;
** convincingly accurate responses to queries suggests the need for a New Turing Test;
** convincingly accurate responses to queries suggests the need for a New Turing Test;
* '''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">reason: </SPAN>'''''
* '''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">reason: </SPAN>'''''
Line 462: Line 472:
** score in the 90% level on the SAT, Bar and Medical Exams
** score in the 90% level on the SAT, Bar and Medical Exams


'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER"> has: </SPAN>'''''<BR />
'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">has:</SPAN>'''''<BR />
* demonstrated competencies will disruptively encroach upon current human professional competencies;
* demonstrated competencies will disruptively encroach upon current human professional competencies;
* knowledge base, training data sets had 2021 cutoff date;
* knowledge base, training data sets had 2021 cutoff date;
Line 483: Line 493:


*'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Risks.</SPAN>''''' Responding to the picture as is in evidence so far strongly suggests that the last major risk category be included, i.e. hypothetical risks. With this full set available it should be possible to offer a preliminary assessment of how risks will manifest. What we can observe is that this is a new technology and it will exhibit applications and consequences comparable to those that have gone before. Specifically this means that there will be groups that benefit from its availability as well as those who will be victimized by it. Only time will tell as more cases are brought to light.
*'''''<Span Style="COLOR:BLUE; BACKGROUND:SILVER">Risks.</SPAN>''''' Responding to the picture as is in evidence so far strongly suggests that the last major risk category be included, i.e. hypothetical risks. With this full set available it should be possible to offer a preliminary assessment of how risks will manifest. What we can observe is that this is a new technology and it will exhibit applications and consequences comparable to those that have gone before. Specifically this means that there will be groups that benefit from its availability as well as those who will be victimized by it. Only time will tell as more cases are brought to light.
-->

Latest revision as of 22:02, 30 October 2023

OPENAI.png

OpenAI - ChatGPT4.
In what follows we attempt to address several basic questions about the onrushing progress with the current focus of artificial intelligence. There are several competing actors in this space. These include OpenAI, DeepMind, Anthropic, and Cohere. A number of other competitors are active in the artificial intelligence market place. But for purposes of brevity and because of the overlap we will limit focus on ChatGPT4 (CG4). Further, we focus on several salient questions that that raise questions of safety, risk and prospects.


Specifically, risks that involve or are:

  • Interfacing - Synthesis. how will different groups interact with, respond to and be affected by it; might access modalities available to one group have positive or negative implications for other groups;
  • Epistemological - Synthesishow to reconcile ethical issues within a society, between societies; more specifically, might it provide solutions or results that are acceptable to the one group but unacceptable to the other group;

Synthesis.
Responding to these questions calls for some baseline information and insights about the issues that this new technology entails. We propose to suggest we look

  • Terms are included to help clarify crucial elements and contextualize CG4;
  • Sentiment is being expressed about it by knowledgeable observers;
  • Theory of Operation of technology paradigm used to produce its results;
  • Risks our approach has been to present a few commonly occurring risks, whether inherent or malicious as well as some theoretical risks that might emerge;
  • Insights are offered to serve as takeoff points for subsequent discussion;

Terms and Basic Concepts.
CG4 has demonstrated capabilities that represent a significant leap forward in overall capability and versatility beyond what has gone before. In order to attempt an assessment prospective risks suggests reviewing recent impressions at a later date as more reporting and insights have come to light. CG4 has already demonstrated that new and unforeseen risks are tangible; in some instances novel and unforeseen capabilities have been reported. It is with this in mind that we attempt here to offer an initial profile or picture of the risks that we should expect to see with its broader use. By way of of addressing this increasingly expanding topic we offer our summary along the following plan of discourse:

Overview and Impressions.

  • what has emerged so far; some initial impressions are listed;
  • next are some caveats that have been derived from these impressions;

Theory of Operation.
For purposes of brevity a thumb nail sketch of how CG4 performs its actions is presented;

  • included are some high level diagrams
  • also links to several explanatory sources; these sources include articles and video content;

Risks.
Our thesis identifies three primary types of risks; these include:

  • systemic these are inherent as a natural process of ongoing technological, sociological advance;
  • malicious: who known actors categories are; how might they use this new capability;
  • theoretical: or possible new uses that might heretofore not been possible;

Notes, References.
We list a few notable portrayals of qualitative technological or scientific leaps;

Overview and Summary so far. If we step back for a moment and summarize what some observers have had to say about this new capability then we might tentatively start with that:

  • is based upon and is a refinement of its predecessor, the Chat GPT 3.5 system;
  • has been developed using the generative predictive transformer (GPT) model;
  • has been trained on a very large data set including textual material that can be found on the internet; unconfirmed rumors suggest that it has been trained on 1 trillion parameters;
  • is capable of sustaining conversational interaction using text based input provided by a user;
  • can provide contextually relevant and consistent responses;
  • can link topics in a chronologically consistent manner and refer back to them in current prompt requests;
  • is a Large Language Models that uses prediction as the basis of its actions;
  • uses deep learning neural networks and very large training data sets;
  • uses a SAAS model; like Google Search, Youtube or Morningstar Financial;