; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016192 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016192
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr12:34604549..34610478
RNA-Seq ExpressionLag0016192
SyntenyLag0016192
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.1e-8552.51Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+   R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN
        VE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R         RS     P++   + + H AS      NY S     N
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN

Query:  EYGRDRR------------RKSMF----VST-----------FTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
            ++R            R S+F    V+T           +TR S  +RLS+ST KK +PSTS FD LK+T+DQ +R+M + + K F E N+D K+
Subjt:  EYGRDRR------------RKSMF----VST-----------FTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

KAA0058295.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.1e-7967.37Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+   R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
        VE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.9e-8350.38Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNTKQ  GE V+NYINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDMELSIA+R  +  L+   R +     +T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN
        VE+CFVLK+LILKLA+E KI+LD+DE        IKG       KD   LQP+R         RS     P++   + + H  S      NY S+    N
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN

Query:  EYGRDRRRKSMF----------------------------VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
             ++R S+F                             ST+TR SAF+RLS+STSKK +PSTS FD LK+ +DQ +R+M +L+ K F E N+D K+
Subjt:  EYGRDRRRKSMF----------------------------VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

TYK11948.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.1e-7944.92Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS VEMC QGMHW LLYIL+GI+PRTFEELATRAHDMELSIA+R  +D L+   R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K YPFPD+D+ DMLEQLLE +LI+L +CK+P++  KVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSN---------LATIKGNNK---------------HQRKKDPKKLQ-----------------PKRKRSKK
        +E CFVLK+LILKLA+E KIELD+DEVAQ+N         + TI   NK               HQ+ + P  +Q                  K +R+KK
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSN---------LATIKGNNK---------------HQRKKDPKKLQ-----------------PKRKRSKK

Query:  FSQP-------QQDFRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRR------------RKSMF-----------
           P       +  F+LR                           H AS      NY S     N    ++R            R S+F           
Subjt:  FSQP-------QQDFRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRR------------RKSMF-----------

Query:  ----VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
               +T+ SAF+RLS+STSKK +PST  FD LK+T+DQ +R++ +L+ K F E N+D K+
Subjt:  ----VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

XP_022159413.1 uncharacterized protein LOC111025834 [Momordica charantia]3.0e-7968.26Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEETI
        M ELTN++QRKGE VV YINRWRA+SLDCKDRLTELS+VE+C QGMHWELLYIL+ IKPRTFEELATRAHDMELSIA+R ++D L+L++ KE ++ E+T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEETI

Query:  EESMVVNTTLPKSSSK------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLK
         E   VNT  PK  SK      EK++ N    L+LKERQ+K+YPFP++DIP MLEQLLE +LI LP+C RPEEM KVDDPKYCKYHRVI HPVE+CFVLK
Subjt:  EESMVVNTTLPKSSSK------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLK

Query:  DLILKLAKEGKIELDLDEVAQSNLATIKGN
        + IL LA+EGKIELD +E+AQSN A +  N
Subjt:  DLILKLAKEGKIELDLDEVAQSNLATIKGN

TrEMBL top hitse value%identityAlignment
A0A5A7URH1 Ty3-gypsy retrotransposon protein3.9e-8552.51Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+   R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN
        VE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R         RS     P++   + + H AS      NY S     N
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN

Query:  EYGRDRR------------RKSMF----VST-----------FTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
            ++R            R S+F    V+T           +TR S  +RLS+ST KK +PSTS FD LK+T+DQ +R+M + + K F E N+D K+
Subjt:  EYGRDRR------------RKSMF----VST-----------FTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

A0A5A7UXF0 Ty3-gypsy retrotransposon protein2.5e-7967.37Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+   R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
        VE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

A0A5A7VFA5 Ty3-gypsy retrotransposon protein4.8e-8350.38Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNTKQ  GE V+NYINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDMELSIA+R  +  L+   R +     +T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN
        VE+CFVLK+LILKLA+E KI+LD+DE        IKG       KD   LQP+R         RS     P++   + + H  S      NY S+    N
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQDFRLRS-HQAS------NYSSFSIPKN

Query:  EYGRDRRRKSMF----------------------------VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
             ++R S+F                             ST+TR SAF+RLS+STSKK +PSTS FD LK+ +DQ +R+M +L+ K F E N+D K+
Subjt:  EYGRDRRRKSMF----------------------------VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

A0A5D3CLC5 Retrotransposon gag protein2.5e-7944.92Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-
        M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS VEMC QGMHW LLYIL+GI+PRTFEELATRAHDMELSIA+R  +D L+   R +    ++T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEET-

Query:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP
              I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K YPFPD+D+ DMLEQLLE +LI+L +CK+P++  KVDDP YCKYHRVI HP
Subjt:  ------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHP

Query:  VERCFVLKDLILKLAKEGKIELDLDEVAQSN---------LATIKGNNK---------------HQRKKDPKKLQ-----------------PKRKRSKK
        +E CFVLK+LILKLA+E KIELD+DEVAQ+N         + TI   NK               HQ+ + P  +Q                  K +R+KK
Subjt:  VERCFVLKDLILKLAKEGKIELDLDEVAQSN---------LATIKGNNK---------------HQRKKDPKKLQ-----------------PKRKRSKK

Query:  FSQP-------QQDFRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRR------------RKSMF-----------
           P       +  F+LR                           H AS      NY S     N    ++R            R S+F           
Subjt:  FSQP-------QQDFRLR--------------------------SHQAS------NYSSFSIPKNEYGRDRR------------RKSMF-----------

Query:  ----VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
               +T+ SAF+RLS+STSKK +PST  FD LK+T+DQ +R++ +L+ K F E N+D K+
Subjt:  ----VSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

A0A6J1DYQ8 uncharacterized protein LOC1110258341.4e-7968.26Show/hide
Query:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEETI
        M ELTN++QRKGE VV YINRWRA+SLDCKDRLTELS+VE+C QGMHWELLYIL+ IKPRTFEELATRAHDMELSIA+R ++D L+L++ KE ++ E+T 
Subjt:  MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEETI

Query:  EESMVVNTTLPKSSSK------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLK
         E   VNT  PK  SK      EK++ N    L+LKERQ+K+YPFP++DIP MLEQLLE +LI LP+C RPEEM KVDDPKYCKYHRVI HPVE+CFVLK
Subjt:  EESMVVNTTLPKSSSK------EKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLK

Query:  DLILKLAKEGKIELDLDEVAQSNLATIKGN
        + IL LA+EGKIELD +E+AQSN A +  N
Subjt:  DLILKLAKEGKIELDLDEVAQSNLATIKGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTC
TTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGC
TAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCTTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTT
CCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATAT
GTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATCG
GTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACA
ATCAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAAGACTTCCGTCTTCGATC
GCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAAGAAGAAAATCAATGTTCGTGTCCACCTTCACCCGACCTTCAGCTTTCC
AAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCACCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTG
AAACTTTTCGATGAAGTAAACAACGACAAGAAGCTTCAAGTAGCATCCCGTCACTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTT
CTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTT
TGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACACGCTGCGTTGCAGTTCTTTCTCCCCAA
GTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
CTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAGGTTCTCACGCGCTTCGGTGAAGTTCCTTCCTCCAGTCTGA
AGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGTTCCTTCCTCCAAGTTTGAAGGT
TCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCATTCCTCCAAATTCGAAGGTTTCGAA
GGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCA
GCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCAGG
CGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAG
TTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTT
CGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGAC
GTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGACTGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGAC
CGTAGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAAGCTGATGACGACCGTGGTGGTGAAATCACTG
CAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTG
ATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCAC
CCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACAGAGGAGACCACCATTCATTTTGAGGGGATTCAGATTTGGAGACAGAGTCAGAGAATTCAGAGTCCAGA
GAATTCTGCCAGAGTCCAGAGTCGGCAGAACAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAG
ATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCG
ATCATCCAAGAGATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAA
GTCAACAGACCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGA
AGATCAACAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTC
TTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGC
TAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCTTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTT
CCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATAT
GTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATCG
GTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACA
ATCAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAAGACTTCCGTCTTCGATC
GCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAAGAAGAAAATCAATGTTCGTGTCCACCTTCACCCGACCTTCAGCTTTCC
AAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCACCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTG
AAACTTTTCGATGAAGTAAACAACGACAAGAAGCTTCAAGTAGCATCCCGTCACTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTT
CTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTT
TGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACACGCTGCGTTGCAGTTCTTTCTCCCCAA
GTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
CTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAGGTTCTCACGCGCTTCGGTGAAGTTCCTTCCTCCAGTCTGA
AGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGTTCCTTCCTCCAAGTTTGAAGGT
TCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCATTCCTCCAAATTCGAAGGTTTCGAA
GGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCA
GCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCAGG
CGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAG
TTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTT
CGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGAC
GTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGACTGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGAC
CGTAGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAAGCTGATGACGACCGTGGTGGTGAAATCACTG
CAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTG
ATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCAC
CCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACAGAGGAGACCACCATTCATTTTGAGGGGATTCAGATTTGGAGACAGAGTCAGAGAATTCAGAGTCCAGA
GAATTCTGCCAGAGTCCAGAGTCGGCAGAACAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAG
ATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCG
ATCATCCAAGAGATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAA
GTCAACAGACCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGA
AGATCAACAAGCTAA
Protein sequenceShow/hide protein sequence
MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLLNMRKEGRNDEETIEESMVVNTTL
PKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAT
IKGNNKHQRKKDPKKLQPKRKRSKKFSQPQQDFRLRSHQASNYSSFSIPKNEYGRDRRRKSMFVSTFTRPSAFQRLSVSTSKKSQPSTSVFDHLKVTSDQPKRKMDNLEV
KLFDEVNNDKKLQVASRHFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAALLEFLLPKFEGSHALRCSSFFSRSKVLTRCVAVLSPQ
VRRFTHFAAVPSPKFEGSHALRSAIPSPKFEGSHALRCSSFPLSLKVLTRFAAVPSSKFKVLTRFGEVPSSSLKVLTRFAAVPSPKFEGSTRFAAVPSSQFKVPSSKFEG
SHIASLRAALRCSSFLQVRRFSLASLQFIPPNSKVSKVLKRCDSLQFLPPSSKVLMRFVAPSSKFEGSLTRCCSSFLQVRRFPHALRSLLLQVRRRLSPLFLPPSSKVLR
RFVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTAPSPSSKALLSTAPSPSSKALLSTAHSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAAVD
VLVPLHLQMLAVDGVRFASSSKIDCGEITASEKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKADDDRGGEITASEKLMTTVVTTPAGNYSHQSDWSRQTGGEITASEKL
MTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTIVVTTPAGNYSHQSDWSRQRRPPFILRGFRFGDRVREFRVQRILPESRVGRTGRSSKRSTSQPTDQEDQQVSRPIIQE
INKPTDRSRRSTSQQADHPRDQQANRPIKKINKSAGRSSKRSTSQLTDQEDQQVSRPIIQEINKPTDRSRRSTSQQTDHPRRSTSQPIEQIIKPTGRSKRSSSQQADHPR
RSTS