; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001089 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001089
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:24237332..24241759
RNA-Seq ExpressionLag0001089
SyntenyLag0001089
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.0e-4453.98Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK-----
        MLEQLLE QLI+LP+CKR E+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R      
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK-----

Query:  ---RSKKFSQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSV
           RS     P++++ +    + ++  +   N  +S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+
Subjt:  ---RSKKFSQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSV

Query:  STSKKSQPSTSAFDRLKVTSDQPKRK
        ST KK +PSTS+FDRLK+T+DQ +R+
Subjt:  STSKKSQPSTSAFDRLKVTSDQPKRK

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.8e-4555.05Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF
        MLEQL+E QLI+LP+CKR E+  KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   +       P  L       + F
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF

Query:  SQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQP
         +     +L  +   T      +N   SY    EEVDNS + +QRT VFDRIKP TTR SVFQR+SMA  EEE QC  ST TR S F+RLS+STSKK +P
Subjt:  SQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQP

Query:  STSAFDRLKVTSDQPKRK
        STSAFDRLK+T+DQ +++
Subjt:  STSAFDRLKVTSDQPKRK

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-4454.87Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK-----
        MLEQL+E QLI+L  CKR  +  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KI+LD+DE        IKG       KD   LQP+R      
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK-----

Query:  ---RSKKFSQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSV
           RS     P++++ +    + ++     +N   SY    EEVDNS + +QRTSVFDRIKP TTR  VFQR+SMA  EEENQC  ST TR SAF+RLS+
Subjt:  ---RSKKFSQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSV

Query:  STSKKSQPSTSAFDRLKVTSDQPKRK
        STSKK +PSTSAFDRLK+ +DQ +R+
Subjt:  STSKKSQPSTSAFDRLKVTSDQPKRK

KAA0066166.1 Retrotransposon gag protein [Cucumis melo var. makuwa]3.4e-4451.77Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF
        MLEQLLE QLI+LPKCKR ++  KVDDP YCKYHRVI HPVE+CF+LK +ILKLA+E KIELD+ EVAQ+N   ++                        
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF

Query:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS
        S P     LN    +       H      +  +Y    EEVDNS + +QRT VFDRIKP  TR S+FQR SMA  EE+NQC MSTST+ SAF+RLS+STS
Subjt:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS

Query:  KKSQPSTSAFDRLKVTSDQPKRKWTT
        K+ +P TS FDRLK+T+DQ +R+  T
Subjt:  KKSQPSTSAFDRLKVTSDQPKRKWTT

TYK15207.1 Retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-4451.77Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF
        MLEQLLE QLI+LPKCKR ++  KVDDP YCKYHRVI HPVE+CF+LK++ILKLA+E KIELD+ EVAQ+N   ++                        
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF

Query:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS
        S P     LN    +       H      +  +Y    EEVDNS + +QRT VFDRIKP  TR S+FQR SMA  EE+NQC MSTST+ SAF+RLS+STS
Subjt:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS

Query:  KKSQPSTSAFDRLKVTSDQPKRKWTT
        K+ +P TS FDRLK+T+DQ +R+  T
Subjt:  KKSQPSTSAFDRLKVTSDQPKRKWTT

TrEMBL top hitse value%identityAlignment
A0A5A7U974 Retrotransposon gag protein2.1e-4442.39Show/hide
Query:  EAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-------------------------------
        E QLI+LP+CKR E++EKVDDP YCKYHR+I HPVE+CFVLK+LILKLA+E KI+LD+DEVAQ+N   +                               
Subjt:  EAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-------------------------------

Query:  -------------------------------------------------KGNNKHQRK----KDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKT-LHRKT
                                                         KGN  H++K    K   K +P +++ +KF QP++ + L + F ++ L   +
Subjt:  -------------------------------------------------KGNNKHQRK----KDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKT-LHRKT

Query:  KENLATSYC-----IDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSAFDRLK
        +E L  + C     ++V       EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EEENQC MST TR SAF+RLS+S SKK +PSTSAFDRLK
Subjt:  KENLATSYC-----IDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSAFDRLK

Query:  VTSDQPKRK
        +T+DQ +R+
Subjt:  VTSDQPKRK

A0A5A7URH1 Ty3-gypsy retrotransposon protein9.5e-4553.98Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK-----
        MLEQLLE QLI+LP+CKR E+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R      
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK-----

Query:  ---RSKKFSQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSV
           RS     P++++ +    + ++  +   N  +S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+
Subjt:  ---RSKKFSQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSV

Query:  STSKKSQPSTSAFDRLKVTSDQPKRK
        ST KK +PSTS+FDRLK+T+DQ +R+
Subjt:  STSKKSQPSTSAFDRLKVTSDQPKRK

A0A5A7VII4 Retrotransposon gag protein1.6e-4451.77Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF
        MLEQLLE QLI+LPKCKR ++  KVDDP YCKYHRVI HPVE+CF+LK +ILKLA+E KIELD+ EVAQ+N   ++                        
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF

Query:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS
        S P     LN    +       H      +  +Y    EEVDNS + +QRT VFDRIKP  TR S+FQR SMA  EE+NQC MSTST+ SAF+RLS+STS
Subjt:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS

Query:  KKSQPSTSAFDRLKVTSDQPKRKWTT
        K+ +P TS FDRLK+T+DQ +R+  T
Subjt:  KKSQPSTSAFDRLKVTSDQPKRKWTT

A0A5D3CA53 Retrotransposon gag protein4.3e-4555.05Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF
        MLEQL+E QLI+LP+CKR E+  KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   +       P  L       + F
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF

Query:  SQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQP
         +     +L  +   T      +N   SY    EEVDNS + +QRT VFDRIKP TTR SVFQR+SMA  EEE QC  ST TR S F+RLS+STSKK +P
Subjt:  SQPQQLVMLNKSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQP

Query:  STSAFDRLKVTSDQPKRK
        STSAFDRLK+T+DQ +++
Subjt:  STSAFDRLKVTSDQPKRK

A0A5D3CTF5 Retrotransposon gag protein7.3e-4551.77Show/hide
Query:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF
        MLEQLLE QLI+LPKCKR ++  KVDDP YCKYHRVI HPVE+CF+LK++ILKLA+E KIELD+ EVAQ+N   ++                        
Subjt:  MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKF

Query:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS
        S P     LN    +       H      +  +Y    EEVDNS + +QRT VFDRIKP  TR S+FQR SMA  EE+NQC MSTST+ SAF+RLS+STS
Subjt:  SQPQQLVMLNKSFSK-----TLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS

Query:  KKSQPSTSAFDRLKVTSDQPKRKWTT
        K+ +P TS FDRLK+T+DQ +R+  T
Subjt:  KKSQPSTSAFDRLKVTSDQPKRKWTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGAACAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTAT
TGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTA
CAATCAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAAT
AAATCCTTCTCCAAAACTTTACACAGAAAGACAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTC
TGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCAATGTCCACCTCCACTCGAC
CTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCAACATCTGCTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAATGGACA
ACTTGGAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTACCTTCCCCCCAAATCGAGGTTCTCACGCGCTTCGGTGCAGTTCCTTC
CTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACG
CGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCCTGAGTTTCTTCCTCCA
AGTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGTGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTT
CGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGC
AGTTCCTTCCTAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAGTTC
CTTCCTCACAATTCGAAGGTTCTCACGCTCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTGCGAAGGTTCTCA
CACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTCGCTGCATTTCCTTCCC
CCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTTAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGC
TTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCTCCAAG
TTTGAAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCT
GCGATCCTTCCTCCAAGTTCGATGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTTGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAG
GTTCTCACGCGCTTCGCAGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTTCACGTCGCTTCGCTGCGCT
CATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCT
TCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGTTGCATTTTCTTCCCCCCAAATTCGAAGGTTCTCACG
CGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTTTCACGCGCTTTGTGCAGTTCCTTCCTC
CAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCA
CTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACACTACGCTGCTT
CCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGC
GGCGACACAAGTCCAAGGACATGTCCCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAAC
ATGTCCGTGCACTCGTGCTGAAAGGCGTGGTGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACA
CAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCC
GTGCACTCGTGCGACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCGTGGCGGCGACACAAGTCCAAGGAAC
ATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCAGCGACACAAGTCCAAGGAACAAGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTG
AAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGACACAAGTCCAAGGAACATGTCCC
AACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGTACTCGTGCTGAAAGGCG
TGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGAACAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTAT
TGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTA
CAATCAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAAT
AAATCCTTCTCCAAAACTTTACACAGAAAGACAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTC
TGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCAATGTCCACCTCCACTCGAC
CTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCAACATCTGCTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAATGGACA
ACTTGGAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTACCTTCCCCCCAAATCGAGGTTCTCACGCGCTTCGGTGCAGTTCCTTC
CTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACG
CGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCCTGAGTTTCTTCCTCCA
AGTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGTGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTT
CGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGC
AGTTCCTTCCTAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAGTTC
CTTCCTCACAATTCGAAGGTTCTCACGCTCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTGCGAAGGTTCTCA
CACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTCGCTGCATTTCCTTCCC
CCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTTAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGC
TTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCTCCAAG
TTTGAAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCT
GCGATCCTTCCTCCAAGTTCGATGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTTGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAG
GTTCTCACGCGCTTCGCAGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTTCACGTCGCTTCGCTGCGCT
CATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCT
TCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGTTGCATTTTCTTCCCCCCAAATTCGAAGGTTCTCACG
CGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTTTCACGCGCTTTGTGCAGTTCCTTCCTC
CAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCA
CTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACACTACGCTGCTT
CCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGC
GGCGACACAAGTCCAAGGACATGTCCCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAAC
ATGTCCGTGCACTCGTGCTGAAAGGCGTGGTGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACA
CAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCC
GTGCACTCGTGCGACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCGTGGCGGCGACACAAGTCCAAGGAAC
ATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCAGCGACACAAGTCCAAGGAACAAGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTG
AAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGACACAAGTCCAAGGAACATGTCCC
AACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGTACTCGTGCTGAAAGGCG
TGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
Protein sequenceShow/hide protein sequence
MLEQLLEAQLIELPKCKRTEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLN
KSFSKTLHRKTKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSAFDRLKVTSDQPKRKWT
TWSEWKLLPPSSKVPTRFAAVPSPQIEVLTRFGAVPSSQFEGSHALRCSSFLQVRRFSRRFAAVPSSKFEGSHAFHCSSFLTVRRFSRASLQFLPPSSKVLTSLPEFLPP
SLKVLTSLRFAHVLRCVPSSKFEGSHTLRSAIPSPKFEGSHALRAVPSSKFEGSHAFRCISFPQIRRFSRASLQFLPKVRRFSRAALQFLPQSSKVLTRFVAVLSSKFEF
LPHNSKVLTLFAVVPSPKFEGSHALRCVPSSKCEGSHTLRSAIPSPKFEGSHALRAVPSSKFEGSHAFAAFPSPKFEGSHALRCSSFLKVRRFSRAALQFLPQSSKVLTR
FVAVLSSKFEGSHALRCSSFPQVRRFSRASLQFLSPSLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSSKFDGSHALRSAIPSPSLKVLTRFVQFLPPNSK
VLTRFAAVPSPQVRRFSRASLQFLPPSSKVSRRFAALMRFAAVPSSKFEGSHIASLRSFLQVRRFSRASLCNSFPKFEGSHALRAVPSSKFEGSHAFRCIFFPPNSKVLT
RFAAVPSSKFEGSHALRSAIPSPKFEGFHALCAVPSSKFEGSHALRCSSFPPNSKVLTRFAAVPSPQVRRFSRTSLQFLPPSSKVLTRFAALQRYFLKSKDVNCPHTTLL
PSPSSRVLMLRSATLLRYFLKSKDVNCPCTHAEKGMAATQVQGHVPKRGTCPCTRAERRGGGTSPRNMSQLKEHVRALVLKGVVAAQVQGTCPNSRNTSLQLVLKGVAAT
QVQGTCPNSRNMSVHSCWKARRRHKSKEHVPTQGTCPCTRATRAERRGGGTSPRNMSQLKEHVLAWRRHKSKEHVPTQGTRPCTRAERRGSDTSPRNKSQLKEHVLALVL
KGVAATQVQGTCPNSRNMSVHSCWKARRRHKSKEHVPTQGTCPCTRAERRGGGTSPRNMSQLKEHVLVLVLKGVAATQVQGTCPNSRNTSLHSC