; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G25920 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G25920
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionAutophagy-related protein 18g isoform X1
Genome locationChr1:21282108..21286471
RNA-Seq ExpressionCSPI01G25920
SyntenyCSPI01G25920
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024019.1 hypothetical protein SDJN02_15048, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-9567.18Show/hide
Query:  LYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNT
        +YFTS+NGE SS+ I+SGA+KLD +SKT TLP+GE+TI+VV+KLPALRL+SLL AM+L KDT+I + PPFTNL  I  G +L+PLP  +EE+Q    +N 
Subjt:  LYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNT

Query:  QSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQN
         S++RT  SGPSSSS +E P   FN   ALFLR+GSWQVV  ++ DLVLKFDYRNKK++WE+V EGPSKHKIEI+WSNIIGI+AAIEDHRQGILQLELQ 
Subjt:  QSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQN

Query:  PPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE
        PPRFYKEIE++P K  KW +E DFT GRAS+NR++F+VF+PG+LGT+YKRLMKNK ++E
Subjt:  PPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE

XP_016900954.1 PREDICTED: uncharacterized protein LOC107991120 [Cucumis melo]3.7e-10693.63Show/hide
Query:  MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRN
        MDLRKDTEI MKPPFTNLQ+IFNGRTLVPLPQINEEEQQHEY NTQSNQRT FSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVAN+ESDLVLKFDYRN
Subjt:  MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRN

Query:  KKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNK
        KK+SWEVV EGPSKHKIEI+WSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGR SM+RKHFSVFAPGILGTYYKRLMKNK
Subjt:  KKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNK

Query:  EMIE
        +++E
Subjt:  EMIE

XP_022158816.1 uncharacterized protein LOC111025282 [Momordica charantia]1.0e-9265.45Show/hide
Query:  MEKKKEDS---GKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLP
        M+KKKE+S    K+LY TS+NGE SS+SI+SGAE+LD +S+T TLPQG +TI+VV+KLPALRL+SLL AM+L +DT+I +  PF+NL +I  GR+L+PLP
Subjt:  MEKKKEDS---GKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLP

Query:  QINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPF--NAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQA
          N E+ Q    N+ SN+RT  SGPSSSS   P   P   NAAPALFLRIGSWQVV  +E DLVL+FDYR KK+SWE+V EGPSKHKIEI+WSNIIGI+A
Subjt:  QINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPF--NAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQA

Query:  AIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE
        A EDHRQGILQLELQ PPRFYKEIE +  K  KW +E DFT GRAS+NR++FSVF+PG+LG +YKR+MKNK ++E
Subjt:  AIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE

XP_022961562.1 uncharacterized protein LOC111462107 [Cucurbita moschata]8.3e-9866.79Show/hide
Query:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE
        KK E+S K++YFTS+NGE SS+ I+SGA+KLD +SKT TLP+GE+TI+VV+KLPALRL+SLL AM+L KDT+I + PPFTNL  I  G +L+PLP  +EE
Subjt:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE

Query:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ
        +Q    +N  S++RT  SGPSSSS +E P   FN   ALFLR+GSWQVV  ++ DLVLKFDYRNKK++WE+V EGPSKHKIEI+WSNIIGI+AAIEDHRQ
Subjt:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ

Query:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE
        GILQLELQ PPRFYKEIE++P K  KW +E DFT GRAS+NR++F+VF+PG+LGT+YKRLMKNK ++E
Subjt:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE

XP_022968779.1 uncharacterized protein LOC111467912 [Cucurbita maxima]4.1e-9766.42Show/hide
Query:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE
        KK E+S K++YFTS+NGE SS+ I+SGA+KLD +SKT  LP+GE+TI+VV+KLPALRL+SLL AM+L KDT+I + PPFTNL  I  G +L+PLP  +EE
Subjt:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE

Query:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ
        +Q    +N  S++RT  SGPSSSS +E P   FN   ALFLR+GSWQVV  ++ DLVLKFDYRNKK++WE+V EGPSKHKIEI+WSNIIGI+AAIEDHRQ
Subjt:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ

Query:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE
        GILQLELQ PPRFYKEIE++P K  KW +E DFT GRAS+NR++F+VF+PG+LGT+YKRLMKNK ++E
Subjt:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE

TrEMBL top hitse value%identityAlignment
A0A0A0M1F9 Uncharacterized protein4.4e-11399.51Show/hide
Query:  MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRN
        MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRN
Subjt:  MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRN

Query:  KKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNK
        KKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNK
Subjt:  KKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNK

Query:  EMIE
        EM+E
Subjt:  EMIE

A0A1S4DY92 uncharacterized protein LOC1079911201.8e-10693.63Show/hide
Query:  MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRN
        MDLRKDTEI MKPPFTNLQ+IFNGRTLVPLPQINEEEQQHEY NTQSNQRT FSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVAN+ESDLVLKFDYRN
Subjt:  MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRN

Query:  KKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNK
        KK+SWEVV EGPSKHKIEI+WSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGR SM+RKHFSVFAPGILGTYYKRLMKNK
Subjt:  KKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNK

Query:  EMIE
        +++E
Subjt:  EMIE

A0A6J1E232 uncharacterized protein LOC1110252825.1e-9365.45Show/hide
Query:  MEKKKEDS---GKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLP
        M+KKKE+S    K+LY TS+NGE SS+SI+SGAE+LD +S+T TLPQG +TI+VV+KLPALRL+SLL AM+L +DT+I +  PF+NL +I  GR+L+PLP
Subjt:  MEKKKEDS---GKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLP

Query:  QINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPF--NAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQA
          N E+ Q    N+ SN+RT  SGPSSSS   P   P   NAAPALFLRIGSWQVV  +E DLVL+FDYR KK+SWE+V EGPSKHKIEI+WSNIIGI+A
Subjt:  QINEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPF--NAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQA

Query:  AIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE
        A EDHRQGILQLELQ PPRFYKEIE +  K  KW +E DFT GRAS+NR++FSVF+PG+LG +YKR+MKNK ++E
Subjt:  AIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE

A0A6J1HAI6 uncharacterized protein LOC1114621074.0e-9866.79Show/hide
Query:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE
        KK E+S K++YFTS+NGE SS+ I+SGA+KLD +SKT TLP+GE+TI+VV+KLPALRL+SLL AM+L KDT+I + PPFTNL  I  G +L+PLP  +EE
Subjt:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE

Query:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ
        +Q    +N  S++RT  SGPSSSS +E P   FN   ALFLR+GSWQVV  ++ DLVLKFDYRNKK++WE+V EGPSKHKIEI+WSNIIGI+AAIEDHRQ
Subjt:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ

Query:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE
        GILQLELQ PPRFYKEIE++P K  KW +E DFT GRAS+NR++F+VF+PG+LGT+YKRLMKNK ++E
Subjt:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE

A0A6J1I0N3 uncharacterized protein LOC1114679122.0e-9766.42Show/hide
Query:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE
        KK E+S K++YFTS+NGE SS+ I+SGA+KLD +SKT  LP+GE+TI+VV+KLPALRL+SLL AM+L KDT+I + PPFTNL  I  G +L+PLP  +EE
Subjt:  KKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEE

Query:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ
        +Q    +N  S++RT  SGPSSSS +E P   FN   ALFLR+GSWQVV  ++ DLVLKFDYRNKK++WE+V EGPSKHKIEI+WSNIIGI+AAIEDHRQ
Subjt:  EQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQ

Query:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE
        GILQLELQ PPRFYKEIE++P K  KW +E DFT GRAS+NR++F+VF+PG+LGT+YKRLMKNK ++E
Subjt:  GILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54300.1 unknown protein1.3e-1635.34Show/hide
Query:  PALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPS------KHKIEIEWSNIIGIQAAIEDHRQ-GILQLELQNPPRFYKEIETRPLKLFKWEE
        P   +RIG W VVA +  D+V KF +  KKL WE +   P       K KIEI+W+++   + +I    + GIL++EL+  P F+ E   +  K  +W++
Subjt:  PALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPS------KHKIEIEWSNIIGIQAAIEDHRQ-GILQLELQNPPRFYKEIETRPLKLFKWEE

Query:  -EYDFTQGRASMNRKHFSVFAPGILGTYYKRLM
         ++DFT   AS  R+H   F PG+L    ++L+
Subjt:  -EYDFTQGRASMNRKHFSVFAPGILGTYYKRLM

AT2G24100.1 unknown protein1.2e-2240.48Show/hide
Query:  PALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQG
        PA  LRIG W+  +  E DLV K  +   KL WEV+ +G  K KIEI+WS+I+ ++A + +   G L + L   P F++E   +P K   W+   DFT G
Subjt:  PALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQG

Query:  RASMNRKHFSVFAPGILGTYYKRLMK
        +ASMNR+HF    PGI+  ++++L++
Subjt:  RASMNRKHFSVFAPGILGTYYKRLMK

AT3G05770.1 unknown protein2.9e-1631.79Show/hide
Query:  INEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGP------SKHKIEIEWSNIIG
        IN+ E   + H T  +Q+T  S  +S+    P        P   ++IG    VA +  D+V KF +  KKL WE +   P       K KIEI+W+++  
Subjt:  INEEEQQHEYHNTQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGP------SKHKIEIEWSNIIG

Query:  IQAAIEDHRQ-GILQLELQNPPRFYKEIETRPLKLFKWEE-EYDFTQGRASMNRKHFSVFAPGILGTYYKRLM
         + +I    + GIL++EL+  P F+ E   +  K  +W++ +YDFT  +AS  R+H   F PG+L    ++L+
Subjt:  IQAAIEDHRQ-GILQLELQNPPRFYKEIETRPLKLFKWEE-EYDFTQGRASMNRKHFSVFAPGILGTYYKRLM

AT4G30780.1 unknown protein1.3e-2140.48Show/hide
Query:  PALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQG
        PA  L+IG W+  +  E DLV K  +   KL WEV+ +G  K KIEI+WS+I+ ++A   +   G L L L   P F++E   +P K   W+   DFT G
Subjt:  PALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQG

Query:  RASMNRKHFSVFAPGILGTYYKRLMK
        +ASMNR+HF   A GI+  ++++L++
Subjt:  RASMNRKHFSVFAPGILGTYYKRLMK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGAAGAAAGAAGACAGTGGGAAAAGACTGTATTTTACATCAGACAATGGAGAATGTTCAAGTTATTCAATAGTGTCTGGTGCTGAAAAACTTGATGAATACTC
CAAAACTCGAACCCTTCCTCAGGGTGAAAAGACAATTGATGTAGTGATAAAACTACCAGCACTTCGTCTGACTTCTCTCCTTGAAGCAATGGATTTACGAAAGGATACAG
AAATTCCAATGAAACCTCCATTTACAAATCTTCAAAGGATATTTAATGGACGTACACTAGTCCCTCTGCCTCAAATAAATGAAGAAGAACAACAACATGAATATCATAAT
ACTCAATCAAATCAAAGAACGACCTTTTCTGGACCAAGCAGCTCTTCCTTTGCTGAACCACCCAATACTCCCTTTAATGCTGCTCCTGCCCTTTTCCTTCGTATCGGCTC
TTGGCAGGTTGTGGCCAACAGTGAAAGTGATTTGGTTTTGAAATTTGATTATAGAAACAAGAAGCTATCTTGGGAGGTTGTGCTGGAAGGGCCTTCCAAGCACAAGATTG
AAATTGAATGGTCTAATATCATAGGAATTCAAGCTGCCATTGAAGATCATAGACAAGGAATTCTCCAACTTGAGCTGCAAAATCCACCAAGATTTTACAAGGAGATTGAA
ACCAGACCACTGAAGCTGTTCAAATGGGAAGAAGAATATGATTTCACACAAGGCAGAGCTTCTATGAACAGGAAACACTTTTCAGTGTTTGCACCAGGAATACTTGGAAC
GTATTATAAGAGACTGATGAAAAACAAGGAAATGATTGAAACACATATGATGTGTGAACTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTG
TTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTT
GTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGT
TGTTGTTGTTGTTGTTTGTTTGTTTTCAATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGAAGAAAGAAGACAGTGGGAAAAGACTGTATTTTACATCAGACAATGGAGAATGTTCAAGTTATTCAATAGTGTCTGGTGCTGAAAAACTTGATGAATACTC
CAAAACTCGAACCCTTCCTCAGGGTGAAAAGACAATTGATGTAGTGATAAAACTACCAGCACTTCGTCTGACTTCTCTCCTTGAAGCAATGGATTTACGAAAGGATACAG
AAATTCCAATGAAACCTCCATTTACAAATCTTCAAAGGATATTTAATGGACGTACACTAGTCCCTCTGCCTCAAATAAATGAAGAAGAACAACAACATGAATATCATAAT
ACTCAATCAAATCAAAGAACGACCTTTTCTGGACCAAGCAGCTCTTCCTTTGCTGAACCACCCAATACTCCCTTTAATGCTGCTCCTGCCCTTTTCCTTCGTATCGGCTC
TTGGCAGGTTGTGGCCAACAGTGAAAGTGATTTGGTTTTGAAATTTGATTATAGAAACAAGAAGCTATCTTGGGAGGTTGTGCTGGAAGGGCCTTCCAAGCACAAGATTG
AAATTGAATGGTCTAATATCATAGGAATTCAAGCTGCCATTGAAGATCATAGACAAGGAATTCTCCAACTTGAGCTGCAAAATCCACCAAGATTTTACAAGGAGATTGAA
ACCAGACCACTGAAGCTGTTCAAATGGGAAGAAGAATATGATTTCACACAAGGCAGAGCTTCTATGAACAGGAAACACTTTTCAGTGTTTGCACCAGGAATACTTGGAAC
GTATTATAAGAGACTGATGAAAAACAAGGAAATGATTGAAACACATATGATGTGTGAACTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTG
TTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTT
GTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGT
TGTTGTTGTTGTTGTTTGTTTGTTTTCAATTTAA
Protein sequenceShow/hide protein sequence
MEKKKEDSGKRLYFTSDNGECSSYSIVSGAEKLDEYSKTRTLPQGEKTIDVVIKLPALRLTSLLEAMDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHN
TQSNQRTTFSGPSSSSFAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIEWSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIE
TRPLKLFKWEEEYDFTQGRASMNRKHFSVFAPGILGTYYKRLMKNKEMIETHMMCELVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV
VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVCLFSI