; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000778 (gene) of Chayote v1 genome

Gene IDSed0000778
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase
Genome locationLG11:14964467..14966177
RNA-Seq ExpressionSed0000778
SyntenySed0000778
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039476.1 uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa]4.5e-6852.55Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DP+  + IERLKALGATT++GTT+P D EAW+ L+EKCF V  CPE+RKV L  FLLQ GA DWW++   RR     I W EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--
        AKRNEFL+LTQGSMTVAEY KKY ELSKYA  +IEDE++R K FE+GL EEIRTS TA ++WNDF KLVEAALRV KSL+ERK ++E SKN    + S  
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--

Query:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                   F P V  RG+FK +  G   S S     ++  + S+ P  + G     +    V    K+S C
Subjt:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa]1.9e-7154.01Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DPE  + IERLKALGATT++GTT+PADAEAW+ L+EKCF V  CPE+RKV L  FLLQ GA DWW++   RR     I W+EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNY--------
        AKRNEFL+LTQGSMT+AEY KKY ELS YA  +IEDE++RCK FE+GL EEIRT  TA ++WNDF KLVEAALRVEKSL+ERK ++E SKN         
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNY--------

Query:  ---GGGNKS--FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
            G  +S  F P V  RG+FK +  G S S+S  +  ++  + S+ P  + G     +    V   +K+S C
Subjt:  ---GGGNKS--FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]2.3e-6450.36Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DPE  +  ERLKALGATT++GTT+P D EAW+ L+EKCF V    E+RKV L  FLLQ  A DWW++   RR     + W EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--
        AK NEF++LTQG+MTVAEY KKY ELSKYA  +I DE +RCK FE+GL EEIRT  TA ++WNDF KLVE ALRVEKSL+ERK ++E SKN    + S  
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--

Query:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                   F PRV  RGSFK +  G S S+S     ++  ++S+    + G     +    V    K+S C
Subjt:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

TYK15233.1 uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa]4.5e-6852.55Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DP+  + IERLKALGATT++GTT+P D EAW+ L+EKCF V  CPE+RKV L  FLLQ GA DWW++   RR     I W EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--
        AKRNEFL+LTQGSMTVAEY KKY ELSKYA  +IEDE++R K FE+GL EEIRTS TA ++WNDF KLVEAALRV KSL+ERK ++E SKN    + S  
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--

Query:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                   F P V  RG+FK +  G   S S     ++  + S+ P  + G     +    V    K+S C
Subjt:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

TYK22919.1 reverse transcriptase [Cucumis melo var. makuwa]6.5e-5947.69Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        +++PE  + I+RLKALGATT++GTT+P DAEA + L+EKCF V  CPE+RKV L +FLLQ GA  WW +   RR   + + W EFK AFF+++YP  ++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTASEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKSFAP
        AKRNEFL+L QGSMTV EY KKY ELSKYA  +IEDE++RCK FE+GL E     T+ ++WNDF KLVEAALRV+KSL+ERK ++E SKN          
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTASEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKSFAP

Query:  RVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                 +R    S  R+     ++  + S+ P  + G     +    V    K+S C
Subjt:  RVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

TrEMBL top hitse value%identityAlignment
A0A5A7TBS0 CCHC-type domain-containing protein2.2e-6852.55Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DP+  + IERLKALGATT++GTT+P D EAW+ L+EKCF V  CPE+RKV L  FLLQ GA DWW++   RR     I W EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--
        AKRNEFL+LTQGSMTVAEY KKY ELSKYA  +IEDE++R K FE+GL EEIRTS TA ++WNDF KLVEAALRV KSL+ERK ++E SKN    + S  
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--

Query:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                   F P V  RG+FK +  G   S S     ++  + S+ P  + G     +    V    K+S C
Subjt:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

A0A5A7UZM6 Gag protease polyprotein-like protein9.4e-7254.01Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DPE  + IERLKALGATT++GTT+PADAEAW+ L+EKCF V  CPE+RKV L  FLLQ GA DWW++   RR     I W+EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNY--------
        AKRNEFL+LTQGSMT+AEY KKY ELS YA  +IEDE++RCK FE+GL EEIRT  TA ++WNDF KLVEAALRVEKSL+ERK ++E SKN         
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNY--------

Query:  ---GGGNKS--FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
            G  +S  F P V  RG+FK +  G S S+S  +  ++  + S+ P  + G     +    V   +K+S C
Subjt:  ---GGGNKS--FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

A0A5D3BB91 Reverse transcriptase1.1e-6450.36Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DPE  +  ERLKALGATT++GTT+P D EAW+ L+EKCF V    E+RKV L  FLLQ  A DWW++   RR     + W EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--
        AK NEF++LTQG+MTVAEY KKY ELSKYA  +I DE +RCK FE+GL EEIRT  TA ++WNDF KLVE ALRVEKSL+ERK ++E SKN    + S  
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--

Query:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                   F PRV  RGSFK +  G S S+S     ++  ++S+    + G     +    V    K+S C
Subjt:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

A0A5D3CTK6 CCHC-type domain-containing protein2.2e-6852.55Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        ++DP+  + IERLKALGATT++GTT+P D EAW+ L+EKCF V  CPE+RKV L  FLLQ GA DWW++   RR     I W EFK AFF+++YPRS++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--
        AKRNEFL+LTQGSMTVAEY KKY ELSKYA  +IEDE++R K FE+GL EEIRTS TA ++WNDF KLVEAALRV KSL+ERK ++E SKN    + S  
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTA-SEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKS--

Query:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                   F P V  RG+FK +  G   S S     ++  + S+ P  + G     +    V    K+S C
Subjt:  -----------FAPRVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

A0A5D3DI38 Reverse transcriptase3.2e-5947.69Show/hide
Query:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD
        +++PE  + I+RLKALGATT++GTT+P DAEA + L+EKCF V  CPE+RKV L +FLLQ GA  WW +   RR   + + W EFK AFF+++YP  ++D
Subjt:  RNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKD

Query:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTASEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKSFAP
        AKRNEFL+L QGSMTV EY KKY ELSKYA  +IEDE++RCK FE+GL E     T+ ++WNDF KLVEAALRV+KSL+ERK ++E SKN          
Subjt:  AKRNEFLKLTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTASEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKSFAP

Query:  RVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC
                 +R    S  R+     ++  + S+ P  + G     +    V    K+S C
Subjt:  RVVDRGSFKIRNCGLSSSRSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAGAAATGATCCAGAAATTGTTTTTAGAATTGAACGACTAAAAGCCTTAGGTGCAACTACATATTCTGGGACAACGGACCCAGCTGATGCTGAAGCATGGATGAA
CTTGTTGGAAAAATGTTTCGATGTAATGGGGTGCCCTGAAAATAGAAAAGTGAGACTTGGTACTTTTCTACTCCAAGAAGGAGCTAATGATTGGTGGAAATTATGGTGTG
AAAGACGATCTAAATTCAAAGTTATTGAGTGGAGTGAGTTCAAGATGGCTTTTTTTAATGAATATTATCCACGATCTTACAAAGATGCCAAACGAAATGAGTTCTTGAAG
TTGACTCAAGGGTCAATGACCGTGGCAGAATATCACAAAAAGTATGTCGAGCTTTCAAAGTATGCTGCAAATATCATTGAAGACGAGATTGATAGGTGTAAAGGGTTTGA
AGATGGCTTGGGGGAAGAGATTCGAACTTCCACTACTGCGTCTGAATGGAATGATTTTGGAAAATTAGTGGAAGCAGCCTTGAGAGTTGAGAAAAGTTTATCTGAAAGAA
AAACCCAGAAAGAACCCTCGAAGAATTACGGAGGTGGCAACAAGAGTTTTGCACCAAGGGTGGTAGATCGAGGAAGTTTTAAAATTAGGAATTGTGGATTGTCAAGCTCC
AGATCTGATTTAAATGCATCATCTAGAACGTTAAACAATTCTAACCAACCTTCAAGGACCTTTGGAAATCAACGCACTTGGAAGCAAGGAGAATATGTATTTAATTATAA
TAAGAATTCAACATGTCAAATTAACCAGAGACAGTCTGGCCCTCAAGTGAAGTCTGTGACAAAAAATGGTGAAGGTCAATACAACATTGGTAGCTCTATCGACACCCATG
CACGTAAGCTAAAGCTTGAAGATATACCGGGGCAAGAGAATTTGCCAATTTATTTCCAGAAGAGTTGCTTGGGTTGCCACCTGATAGGGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAGAAATGATCCAGAAATTGTTTTTAGAATTGAACGACTAAAAGCCTTAGGTGCAACTACATATTCTGGGACAACGGACCCAGCTGATGCTGAAGCATGGATGAA
CTTGTTGGAAAAATGTTTCGATGTAATGGGGTGCCCTGAAAATAGAAAAGTGAGACTTGGTACTTTTCTACTCCAAGAAGGAGCTAATGATTGGTGGAAATTATGGTGTG
AAAGACGATCTAAATTCAAAGTTATTGAGTGGAGTGAGTTCAAGATGGCTTTTTTTAATGAATATTATCCACGATCTTACAAAGATGCCAAACGAAATGAGTTCTTGAAG
TTGACTCAAGGGTCAATGACCGTGGCAGAATATCACAAAAAGTATGTCGAGCTTTCAAAGTATGCTGCAAATATCATTGAAGACGAGATTGATAGGTGTAAAGGGTTTGA
AGATGGCTTGGGGGAAGAGATTCGAACTTCCACTACTGCGTCTGAATGGAATGATTTTGGAAAATTAGTGGAAGCAGCCTTGAGAGTTGAGAAAAGTTTATCTGAAAGAA
AAACCCAGAAAGAACCCTCGAAGAATTACGGAGGTGGCAACAAGAGTTTTGCACCAAGGGTGGTAGATCGAGGAAGTTTTAAAATTAGGAATTGTGGATTGTCAAGCTCC
AGATCTGATTTAAATGCATCATCTAGAACGTTAAACAATTCTAACCAACCTTCAAGGACCTTTGGAAATCAACGCACTTGGAAGCAAGGAGAATATGTATTTAATTATAA
TAAGAATTCAACATGTCAAATTAACCAGAGACAGTCTGGCCCTCAAGTGAAGTCTGTGACAAAAAATGGTGAAGGTCAATACAACATTGGTAGCTCTATCGACACCCATG
CACGTAAGCTAAAGCTTGAAGATATACCGGGGCAAGAGAATTTGCCAATTTATTTCCAGAAGAGTTGCTTGGGTTGCCACCTGATAGGGAAGTAA
Protein sequenceShow/hide protein sequence
MFRNDPEIVFRIERLKALGATTYSGTTDPADAEAWMNLLEKCFDVMGCPENRKVRLGTFLLQEGANDWWKLWCERRSKFKVIEWSEFKMAFFNEYYPRSYKDAKRNEFLK
LTQGSMTVAEYHKKYVELSKYAANIIEDEIDRCKGFEDGLGEEIRTSTTASEWNDFGKLVEAALRVEKSLSERKTQKEPSKNYGGGNKSFAPRVVDRGSFKIRNCGLSSS
RSDLNASSRTLNNSNQPSRTFGNQRTWKQGEYVFNYNKNSTCQINQRQSGPQVKSVTKNGEGQYNIGSSIDTHARKLKLEDIPGQENLPIYFQKSCLGCHLIGK