; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016089 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016089
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr12:32977928..32982387
RNA-Seq ExpressionLag0016089
SyntenyLag0016089
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia]7.7e-2545.12Show/hide
Query:  EATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEK
        EA +CR FS TLT  AR W+ +L R SI SFKEL+  F  QF+G R + K    LLT+KQ++ ESL  Y+  F+ E +QVEG  D VAL A +SG++DE+
Subjt:  EATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEK

Query:  LLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTINKGKRKEERSKRPRGEDGD
        L+ S G+  P T++E +SR QKY++A EL+   R + E +R     K +R+ E+  R   E  D
Subjt:  LLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTINKGKRKEERSKRPRGEDGD

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]7.9e-3041.28Show/hide
Query:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS
        +++ R + KE   DLE+L+ QA+  F +EI+R KVP KFKLPT  Q                       SEA +CR FS TL   AR W+ +L R SI S
Subjt:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS

Query:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM
        FK L+  F  QF+G R R +    LLT+KQR+ ESL  Y+  F+ E +QVEG  D V+L A +SG++DE L  S G+  P T+ E +SR Q+Y++A E  
Subjt:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM

Query:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR
         SKR E + +R  T  K +R  ++ +  R E  DR
Subjt:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]1.9e-2840Show/hide
Query:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS
        +++ R   KE   DLE+L+GQA+  F +EI+R KVP KFKLPT                          S+A +CR FS TL   AR W+ +L R SI S
Subjt:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS

Query:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM
        FK L+  F  QF+G R R +    LLT+KQR+ ESL+ Y+  F+ E +Q+EG  D V+L A +SG++DE L  S  +  P T+ E +SR Q+Y++A E  
Subjt:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM

Query:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR
         SKR E + +R  T  K +R  ++ +  R E  DR
Subjt:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]1.8e-2637.38Show/hide
Query:  LEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLG
        L+D+  +  P F  +I+ AK P +F LP                          AS+A  CRAF LTL   AR+W+ +L   SI SF +LS  F   F  
Subjt:  LEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLG

Query:  ARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTT
        AR R K    LLTVKQ+ GE+L  YI  ++NE+ QV+GYDD +AL+ ++ GL+  KL  S+ +  P +Y E ++R +KY NAEE  K++  E+       
Subjt:  ARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTT

Query:  INKGKRKEERSKRP
          K  ++E R  RP
Subjt:  INKGKRKEERSKRP

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]1.6e-2739.17Show/hide
Query:  EDLIGQANPSFVDEIIRAKVPHKFKLPT--------FPQ--------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGA
        +D++ ++ P F  EI+RA+ P  F+LP+        +P                S A  CRAF LTL+R AR+W+  L   SI SF EL   F   F  A
Subjt:  EDLIGQANPSFVDEIIRAKVPHKFKLPT--------FPQ--------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGA

Query:  RDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTI
        R R K    LLTVKQ  GESL  YI  ++ E  QV+GYDD VAL+ ++ GLQ  +L  S+ ++ P TY E +SR +KY NAEE  +SK+   + +  ++ 
Subjt:  RDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTI

Query:  NKGKRKEERSKRPRGED
        NK  +++ R  RP   D
Subjt:  NKGKRKEERSKRPRGED

TrEMBL top hitse value%identityAlignment
A0A6J1D7D2 uncharacterized protein LOC1110183076.3e-2538.64Show/hide
Query:  VDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLL
        ++EI++ KVP KFKLPT  Q                       SEA KCR FS TL+  AR W+ +L R SI SFK L+  F  QF+G R R +    LL
Subjt:  VDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLL

Query:  TVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTINKGKRKEERSK
        T+KQR+ ESL+ Y+  F+ E +QVEG  + V+L A +S ++DE L  S G+  P T+ E +SR QKY++A E   SKR            +GK+ ++  +
Subjt:  TVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTINKGKRKEERSK

Query:  RPRGEDGDRDRHSCSSSRSR
        R     GD+ + S    R R
Subjt:  RPRGEDGDRDRHSCSSSRSR

A0A6J1DWY0 uncharacterized protein LOC1110252933.8e-3041.28Show/hide
Query:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS
        +++ R + KE   DLE+L+ QA+  F +EI+R KVP KFKLPT  Q                       SEA +CR FS TL   AR W+ +L R SI S
Subjt:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS

Query:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM
        FK L+  F  QF+G R R +    LLT+KQR+ ESL  Y+  F+ E +QVEG  D V+L A +SG++DE L  S G+  P T+ E +SR Q+Y++A E  
Subjt:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM

Query:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR
         SKR E + +R  T  K +R  ++ +  R E  DR
Subjt:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR

A0A6J1DZ49 uncharacterized protein LOC1110248513.7e-2545.12Show/hide
Query:  EATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEK
        EA +CR FS TLT  AR W+ +L R SI SFKEL+  F  QF+G R + K    LLT+KQ++ ESL  Y+  F+ E +QVEG  D VAL A +SG++DE+
Subjt:  EATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEK

Query:  LLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTINKGKRKEERSKRPRGEDGD
        L+ S G+  P T++E +SR QKY++A EL+   R + E +R     K +R+ E+  R   E  D
Subjt:  LLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTINKGKRKEERSKRPRGEDGD

A0A6J1E1E7 uncharacterized protein LOC1110255489.4e-2940Show/hide
Query:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS
        +++ R   KE   DLE+L+GQA+  F +EI+R KVP KFKLPT                          S+A +CR FS TL   AR W+ +L R SI S
Subjt:  NTQPRDADKE---DLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQ----------------------ASEATKCRAFSLTLTRIARQWYNKLPRKSIGS

Query:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM
        FK L+  F  QF+G R R +    LLT+KQR+ ESL+ Y+  F+ E +Q+EG  D V+L A +SG++DE L  S  +  P T+ E +SR Q+Y++A E  
Subjt:  FKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELM

Query:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR
         SKR E + +R  T  K +R  ++ +  R E  DR
Subjt:  KSKRAEREAQRVTTINKGKRKEERSKRPRGEDGDR

A0A7J0DWQ5 Ribonuclease H8.5e-2234.88Show/hide
Query:  LEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESL
        ++ L+ Q +P F + ++R ++  KFKLPT     +   C+AFS TL   AR W+ KL   ++ SF ELS +F   F+  R+R+K   +L TV Q+  ESL
Subjt:  LEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAKQFLGARDRRKQQFNLLTVKQRSGESL

Query:  NGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAER-----EAQRVTTINKGKRKEERSKRPRGE
          ++  F+  V+ VE   D+V + A++ GL+   L +S+ ++ P T     S+  KYI AEEL ++KR  R       +   T     R+E R KRP   
Subjt:  NGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAER-----EAQRVTTINKGKRKEERSKRPRGE

Query:  DGDRDRHSCSSSRSR
          DRD    ++ R R
Subjt:  DGDRDRHSCSSSRSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAACCAGAAACTCGATGGAGAGAGGATTCGAGGGCTTCGATGCGAGAGAGAAGAGAGGAGACGGGGTTGAAATTGGTAATGCGATTGAAATTTCTTGGTACGAACA
CTTGGGTATTGAGAACAATGCTGGAATTGAGCAGATTGAAGAATTTGAGAAGCAAGAGCTTGAAGAAGGAGGTCGAAGGGAGACGGAGGATGGTGGGTTTGGAAGCATGG
AGTGGCGAAATCGTCGCCCCAGCCGCAGCAACAGGCGGAAGGGAGATAGGTCCAGTGTAGATGGGGTTGATGAGGATGGTGATGAGAGGCCATTAGAGGCTCAAAGCTTG
AAGCTGAGATTCTTCAAGGGCTTGGGTTTTGGGAAAAAATGGTGGTCTTCAACTTCAACCCACTACATTTTCCTCTCCTTTACAACTTTCCTCCTTGATTTGTCGCATCG
TCTACCATTGTTGAATCATCTTGTCCTCGTCGACGCTACCTGTGCCATCTTGCCATCGTCGTCGCCACTAGCTAGCTATATCAGTGAGTGTGTAGAGAAGAAAATAGAGT
TGGAGAGAAGTAGTGAACTAAAGAAGAAAACATGTGTTTGCTTGGCAATATTGACTTCTTTATTCGACGACAAAGGTACATCGGCTGAAAATTTTGAGGCTGAGGCCGAG
GAGCTGATGCCGAGGCCGACCAGGCGGTTAGGCCCAATGACCACCTGGGTCGTTTGGGGAAGCCTTCTTGGGCCCAGGGAAGGCAGGGTTCGTGAGGCATCGGAGCATGT
GCGGCAAGCATCACATCGTTGTGCAGTGCTTACTTTCTTTTTTGCAGGTCACGTCTTCCCCGGATTCAAACAAATTCACTATTGGTGTCACGTGAAGGCCAGGAGAATGG
AGAAGGAAAATCCAGTGGGCGAGGGAGGTCCTCGGCCTCAGGCTAATGCACAGGGCATGGAGATGGAAGTTCTCAGAGGAAGAGTAAATGAGATGGGGCAGAGTTTGGCC
GAGATTCTGAGTATCCTGAGGCAACCGAATCCTAGCACGAAGTACCAAAAGAGTCTCGTGCGTGATCGAGAGAAAGGAAAAGGGATCTTAGATGAAGAAGAAGGGGAAAC
AAACAGTTCTACTAGCAAATTGCGAAAACCAGAGGGTGGCAAAGAATTTGACTTGAAGGAGCCAGGGTCGAGTAAATGGGTAGAGCGTAAAGGTGCACCTGACATCCCAG
ACGAAACCAGTACGTTAGGTTCGCGCATGAGGACTGATGCCGATGTCGAGGCCAAGATCCGGGCGAAGATTGAACTTGAGATCCAGGCCGAGGTTGAGGCTAAGTTGAGG
GCCAAAGCCGAAGTCGCAGCCATAGCTAAGGTCGAGGCCGAGGTCGAGGTCAGGGCTAGGGCTAGGATACAGGGAAACACACAACCCAGAGATGCAGACAAAGAAGATTT
AGAGGATTTAATAGGTCAGGCCAATCCATCTTTTGTCGACGAGATCATCCGGGCAAAAGTTCCACACAAGTTTAAGTTGCCGACCTTTCCACAGGCTTCTGAGGCAACCA
AGTGTCGTGCATTTTCTCTGACCTTGACCAGAATAGCACGACAATGGTACAACAAGTTGCCACGCAAGTCCATTGGCTCATTCAAGGAGTTGTCTTGCGTTTTTGCCAAA
CAATTCTTGGGGGCAAGGGATCGAAGGAAACAACAATTTAATTTGTTGACTGTCAAACAAAGGTCGGGGGAAAGTTTGAATGGGTATATCACGTGTTTCAGTAATGAGGT
GGTACAGGTGGAAGGATATGATGATGAAGTGGCTCTAACGGCGGTTATCTCAGGGTTGCAAGACGAGAAATTGTTAAACTCCATAGGAGAGGATCAACCACGTACATATG
TCGAGTTTGTCTCCAGGACACAAAAATACATAAACGCAGAGGAATTAATGAAGTCCAAGCGTGCAGAAAGGGAAGCGCAGAGGGTGACCACTATTAACAAGGGCAAAAGG
AAAGAAGAAAGAAGCAAGAGGCCGCGGGGGGAGGATGGAGACCGAGATCGTCACAGTTGCTCCTCTAGCCGAAGTCGTGTAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAACCAGAAACTCGATGGAGAGAGGATTCGAGGGCTTCGATGCGAGAGAGAAGAGAGGAGACGGGGTTGAAATTGGTAATGCGATTGAAATTTCTTGGTACGAACA
CTTGGGTATTGAGAACAATGCTGGAATTGAGCAGATTGAAGAATTTGAGAAGCAAGAGCTTGAAGAAGGAGGTCGAAGGGAGACGGAGGATGGTGGGTTTGGAAGCATGG
AGTGGCGAAATCGTCGCCCCAGCCGCAGCAACAGGCGGAAGGGAGATAGGTCCAGTGTAGATGGGGTTGATGAGGATGGTGATGAGAGGCCATTAGAGGCTCAAAGCTTG
AAGCTGAGATTCTTCAAGGGCTTGGGTTTTGGGAAAAAATGGTGGTCTTCAACTTCAACCCACTACATTTTCCTCTCCTTTACAACTTTCCTCCTTGATTTGTCGCATCG
TCTACCATTGTTGAATCATCTTGTCCTCGTCGACGCTACCTGTGCCATCTTGCCATCGTCGTCGCCACTAGCTAGCTATATCAGTGAGTGTGTAGAGAAGAAAATAGAGT
TGGAGAGAAGTAGTGAACTAAAGAAGAAAACATGTGTTTGCTTGGCAATATTGACTTCTTTATTCGACGACAAAGGTACATCGGCTGAAAATTTTGAGGCTGAGGCCGAG
GAGCTGATGCCGAGGCCGACCAGGCGGTTAGGCCCAATGACCACCTGGGTCGTTTGGGGAAGCCTTCTTGGGCCCAGGGAAGGCAGGGTTCGTGAGGCATCGGAGCATGT
GCGGCAAGCATCACATCGTTGTGCAGTGCTTACTTTCTTTTTTGCAGGTCACGTCTTCCCCGGATTCAAACAAATTCACTATTGGTGTCACGTGAAGGCCAGGAGAATGG
AGAAGGAAAATCCAGTGGGCGAGGGAGGTCCTCGGCCTCAGGCTAATGCACAGGGCATGGAGATGGAAGTTCTCAGAGGAAGAGTAAATGAGATGGGGCAGAGTTTGGCC
GAGATTCTGAGTATCCTGAGGCAACCGAATCCTAGCACGAAGTACCAAAAGAGTCTCGTGCGTGATCGAGAGAAAGGAAAAGGGATCTTAGATGAAGAAGAAGGGGAAAC
AAACAGTTCTACTAGCAAATTGCGAAAACCAGAGGGTGGCAAAGAATTTGACTTGAAGGAGCCAGGGTCGAGTAAATGGGTAGAGCGTAAAGGTGCACCTGACATCCCAG
ACGAAACCAGTACGTTAGGTTCGCGCATGAGGACTGATGCCGATGTCGAGGCCAAGATCCGGGCGAAGATTGAACTTGAGATCCAGGCCGAGGTTGAGGCTAAGTTGAGG
GCCAAAGCCGAAGTCGCAGCCATAGCTAAGGTCGAGGCCGAGGTCGAGGTCAGGGCTAGGGCTAGGATACAGGGAAACACACAACCCAGAGATGCAGACAAAGAAGATTT
AGAGGATTTAATAGGTCAGGCCAATCCATCTTTTGTCGACGAGATCATCCGGGCAAAAGTTCCACACAAGTTTAAGTTGCCGACCTTTCCACAGGCTTCTGAGGCAACCA
AGTGTCGTGCATTTTCTCTGACCTTGACCAGAATAGCACGACAATGGTACAACAAGTTGCCACGCAAGTCCATTGGCTCATTCAAGGAGTTGTCTTGCGTTTTTGCCAAA
CAATTCTTGGGGGCAAGGGATCGAAGGAAACAACAATTTAATTTGTTGACTGTCAAACAAAGGTCGGGGGAAAGTTTGAATGGGTATATCACGTGTTTCAGTAATGAGGT
GGTACAGGTGGAAGGATATGATGATGAAGTGGCTCTAACGGCGGTTATCTCAGGGTTGCAAGACGAGAAATTGTTAAACTCCATAGGAGAGGATCAACCACGTACATATG
TCGAGTTTGTCTCCAGGACACAAAAATACATAAACGCAGAGGAATTAATGAAGTCCAAGCGTGCAGAAAGGGAAGCGCAGAGGGTGACCACTATTAACAAGGGCAAAAGG
AAAGAAGAAAGAAGCAAGAGGCCGCGGGGGGAGGATGGAGACCGAGATCGTCACAGTTGCTCCTCTAGCCGAAGTCGTGTAGACTAG
Protein sequenceShow/hide protein sequence
MRTRNSMERGFEGFDAREKRGDGVEIGNAIEISWYEHLGIENNAGIEQIEEFEKQELEEGGRRETEDGGFGSMEWRNRRPSRSNRRKGDRSSVDGVDEDGDERPLEAQSL
KLRFFKGLGFGKKWWSSTSTHYIFLSFTTFLLDLSHRLPLLNHLVLVDATCAILPSSSPLASYISECVEKKIELERSSELKKKTCVCLAILTSLFDDKGTSAENFEAEAE
ELMPRPTRRLGPMTTWVVWGSLLGPREGRVREASEHVRQASHRCAVLTFFFAGHVFPGFKQIHYWCHVKARRMEKENPVGEGGPRPQANAQGMEMEVLRGRVNEMGQSLA
EILSILRQPNPSTKYQKSLVRDREKGKGILDEEEGETNSSTSKLRKPEGGKEFDLKEPGSSKWVERKGAPDIPDETSTLGSRMRTDADVEAKIRAKIELEIQAEVEAKLR
AKAEVAAIAKVEAEVEVRARARIQGNTQPRDADKEDLEDLIGQANPSFVDEIIRAKVPHKFKLPTFPQASEATKCRAFSLTLTRIARQWYNKLPRKSIGSFKELSCVFAK
QFLGARDRRKQQFNLLTVKQRSGESLNGYITCFSNEVVQVEGYDDEVALTAVISGLQDEKLLNSIGEDQPRTYVEFVSRTQKYINAEELMKSKRAEREAQRVTTINKGKR
KEERSKRPRGEDGDRDRHSCSSSRSRVD