; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039083 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039083
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr2:35561330..35564453
RNA-Seq ExpressionLag0039083
SyntenyLag0039083
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025376.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.9e-3837.6Show/hide
Query:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESM+V+ T  KS SK        K   N     TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----
        LK+LILKLA+E KIELD+DEVAQ+N   I+  +N+                                     KE              R K  QP     
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----

Query:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM
                         K  R+KK   P+ +  KD   L+LR                           H AS      NY S S   N      +R S+
Subjt:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM

Query:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN
        FD                               TR S F+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N
Subjt:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-4348.28Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGR
        LK+LILKLA+E KIELD+DEVAQ+N A       IKGK    ++ RRS  L     RS     P++++        H AS      NY S S   N    
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGR

Query:  DRRRKSMFD---VHLTRPSAFQRLSV-------------------------STSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN
          +R S+FD      TR S FQRLSV                         ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N
Subjt:  DRRRKSMFD---VHLTRPSAFQRLSV-------------------------STSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN

KAA0061611.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-3837.14Show/hide
Query:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK        K   +     TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----
        LK+LILKLA+E KIELD+DEVAQ+N   I+  +N+                                     KE              R K  QP     
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----

Query:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM
                         K +R+KK   P+ +  KD   L+LR                           H AS      NY S S   N      +R S+
Subjt:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM

Query:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG
        FD                               TR S F+RLS+STSKK +PSTSVFDRLK+T DQ +R+M +L+ K F E N       D D     V 
Subjt:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG

Query:  SLKKTLSSVSTN
        S  K   SV  N
Subjt:  SLKKTLSSVSTN

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.3e-4144.72Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGRDRRRKS
        LK+LILKLA+E KI+LD+DE        IKGK    ++ RRS  L     RS     P++++        H  S      NY S+    N     ++R S
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGRDRRRKS

Query:  MFD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN
        +FD                               TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N
Subjt:  MFD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN

TYK04576.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-3837.38Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----
        LK+LILKLA+E KIELD+DEVAQ+N   I+  +N+                                     KE              R K  QP     
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----

Query:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM
                         K +R+KK   P+ +  KD   L+LR                           H AS      NY S S   N      +R S+
Subjt:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM

Query:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG
        FD                               TR S F+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N       D D     V 
Subjt:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG

Query:  SLKKTLSSVSTN
        S  K   SV  N
Subjt:  SLKKTLSSVSTN

TrEMBL top hitse value%identityAlignment
A0A5A7SHQ6 Retrotransposon gag protein9.4e-3937.6Show/hide
Query:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESM+V+ T  KS SK        K   N     TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----
        LK+LILKLA+E KIELD+DEVAQ+N   I+  +N+                                     KE              R K  QP     
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----

Query:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM
                         K  R+KK   P+ +  KD   L+LR                           H AS      NY S S   N      +R S+
Subjt:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM

Query:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN
        FD                               TR S F+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N
Subjt:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN

A0A5A7URH1 Ty3-gypsy retrotransposon protein5.7e-4448.28Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGR
        LK+LILKLA+E KIELD+DEVAQ+N A       IKGK    ++ RRS  L     RS     P++++        H AS      NY S S   N    
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGR

Query:  DRRRKSMFD---VHLTRPSAFQRLSV-------------------------STSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN
          +R S+FD      TR S FQRLSV                         ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N
Subjt:  DRRRKSMFD---VHLTRPSAFQRLSV-------------------------STSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN

A0A5A7V7A0 Retrotransposon gag protein7.2e-3937.14Show/hide
Query:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK        K   +     TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSK-------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----
        LK+LILKLA+E KIELD+DEVAQ+N   I+  +N+                                     KE              R K  QP     
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----

Query:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM
                         K +R+KK   P+ +  KD   L+LR                           H AS      NY S S   N      +R S+
Subjt:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM

Query:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG
        FD                               TR S F+RLS+STSKK +PSTSVFDRLK+T DQ +R+M +L+ K F E N       D D     V 
Subjt:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG

Query:  SLKKTLSSVSTN
        S  K   SV  N
Subjt:  SLKKTLSSVSTN

A0A5A7VFA5 Ty3-gypsy retrotransposon protein4.5e-4144.72Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGRDRRRKS
        LK+LILKLA+E KI+LD+DE        IKGK    ++ RRS  L     RS     P++++        H  S      NY S+    N     ++R S
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTN--IKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQAS------NYSSFSIPKNEYGRDRRRKS

Query:  MFD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN
        +FD                               TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N
Subjt:  MFD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVN

A0A5D3C2C8 Retrotransposon gag protein7.2e-3937.38Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----
        LK+LILKLA+E KIELD+DEVAQ+N   I+  +N+                                     KE              R K  QP     
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKTNI-------------------------------------KE-------------RRSKKLQP-----

Query:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM
                         K +R+KK   P+ +  KD   L+LR                           H AS      NY S S   N      +R S+
Subjt:  -----------------KRKRSKKFSQPQQLVNKD---LRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRRRKSM

Query:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG
        FD                               TR S F+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N       D D     V 
Subjt:  FD----------------------------VHLTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVG

Query:  SLKKTLSSVSTN
        S  K   SV  N
Subjt:  SLKKTLSSVSTN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTA
AAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGGAAAAACAAACATCAAAGAAAGAAGATCTAAGAAACT
TCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGAACAAAGATCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTC
CAAAGAATGAGTATGGCCGCGACAGAAGAAGAAAATCAATGTTCGATGTCCACCTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAA
CCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACACGACAAGAAGCTT
CAAGACTGATCAAGACCATGATGAAGGGTACGTAGGCAGCTTAAAGAAAACTTTAAGTTCAGTCTCTACAAACAAAAAAAAGGAAAGTGCAACAAATATTGAAGCACGAC
GATATAAATTGAGGAAATTCATCACTAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAAT
TCGAAGGTTCTCACGCGCTTCGCTGCATTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGC
TGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTCACTGCAGTTTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAA
GGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCT
CACACGCTTCGCTCTGCATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCC
TCAAAGTTCAAAGGTTCTCACTCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCG
CTTCGCTGCAGTTCCTTTCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAA
GTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCG
AAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTC
ACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCGTGCACTCGTACT
GGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCC
CAACTCAAGGAACACGTCCTTGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCAACACAA
GTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCGCG
TGGCGGCGACACAAGTCCATGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAA
GGAACATGACCGTGCACTCGTGCTGAAAGGCGTGACGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTA
AAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGGAAAAACAAACATCAAAGAAAGAAGATCTAAGAAACT
TCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGAACAAAGATCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTC
CAAAGAATGAGTATGGCCGCGACAGAAGAAGAAAATCAATGTTCGATGTCCACCTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAA
CCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACACGACAAGAAGCTT
CAAGACTGATCAAGACCATGATGAAGGGTACGTAGGCAGCTTAAAGAAAACTTTAAGTTCAGTCTCTACAAACAAAAAAAAGGAAAGTGCAACAAATATTGAAGCACGAC
GATATAAATTGAGGAAATTCATCACTAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAAT
TCGAAGGTTCTCACGCGCTTCGCTGCATTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGC
TGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTCACTGCAGTTTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAA
GGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCT
CACACGCTTCGCTCTGCATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCC
TCAAAGTTCAAAGGTTCTCACTCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCG
CTTCGCTGCAGTTCCTTTCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAA
GTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCG
AAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTC
ACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCGTGCACTCGTACT
GGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCC
CAACTCAAGGAACACGTCCTTGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCAACACAA
GTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCGCG
TGGCGGCGACACAAGTCCATGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAA
GGAACATGACCGTGCACTCGTGCTGAAAGGCGTGACGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGA
Protein sequenceShow/hide protein sequence
MRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLIL
KLAKEGKIELDLDEVAQSNLATIKGKTNIKERRSKKLQPKRKRSKKFSQPQQLVNKDLRLRSHQASNYSSFSIPKNEYGRDRRRKSMFDVHLTRPSAFQRLSVSTSKKSQ
PSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNTTRSFKTDQDHDEGYVGSLKKTLSSVSTNKKKESATNIEARRYKLRKFITSGGNTANGSCFLQVRRFPRASLQFLPPN
SKVLTRFAAFLPHSSKVLTRFAAVPSLQVRRFSRRFAAVPSSKFEGSHAFTAVFPHSSKVLTRFTAVPSPQVRRFSRRFAEFLPPSLKVLTSLRFAHALRCVPSSKFEGS
HTLRSAFLPQVRRFSRASLQFLPQVRRFSRAALQFLPQSSKVLTRFVAVLSSKFEGSHALRCSSFPQVRRFSRASLQFLSPSLKVLTRFVAVPSSQFEGSHALRCSSFPQ
VRRFSRRFAALMCFAAVPSSKFEGSHIASLRSFLQVRRFSRTSLQFLPPSSKVLTRFAALQRYFLKSKDVNCPHAALLPSPSSRVLMLRSATLLRYFLKSKDVNCPCTRT
GRRGGGTSPRNMSQLKEHVRALVLKGVAAAQVQGTCPNSRNTSLRGGGTSPRNMSQLKEHVLATRAERRGGNTSPRNMSQLKEHVRALVLKGVAATQVQGTCPNSRNMTA
WRRHKSMEHVPTQGTRPCTRAERRGGDTSPRNMSQLKEHDRALVLKGVTATQVQGTCPNSRNMSLHSC