; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032880 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032880
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr11:38470317..38475402
RNA-Seq ExpressionLag0032880
SyntenyLag0032880
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.9e-4339.38Show/hide
Query:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNL-------------------------------------------------
        +EKVDDP YCKYHR+I HPVE+CFVLK+LILKLA E KI+LD+DEVAQ+N                                                  
Subjt:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNL-------------------------------------------------

Query:  -----------------------------------ATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCI
                                             I  K K +R K   K +P + + +KF QP++ + L + F ++F +   E +       T+  +
Subjt:  -----------------------------------ATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCI

Query:  NV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLEL
         V       EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  +EENQC MST T+ SAF+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ 
Subjt:  NV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLEL

Query:  KLFDEVNSDEKLHSSIPSRLIKTMT
        K F E N D+K++S +PSR+ + ++
Subjt:  KLFDEVNSDEKLHSSIPSRLIKTMT

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.9e-4251.09Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + 
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK

Query:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP
        +   +   N  +S     +EV+NS +  QRTSVFDRIKP TTR SVF R+S+A  +EENQC     T+ S  +RLS+ST KK+R STS FDRLK+TNDQ 
Subjt:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP

Query:  KRKMNNLELKLFDEVNSDEKLHSSIPSRL
        +R+M + + K F E N D+K+HS +PSR+
Subjt:  KRKMNNLELKLFDEVNSDEKLHSSIPSRL

KAA0056218.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.3e-4539.72Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KIELD+DEVAQ+N                                                    
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------

Query:  --------------------------------LATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCINV
                                           I  K K +R K   K +P + + + F QP++ + L +   ++F +   E +       T+  + V
Subjt:  --------------------------------LATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCINV

Query:  -------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKL
               EE+DNS + +QRTSVFDRIKP TTR SVF R+SMA  KEEN+C  ST T+ SAF+RLS+STSKK+R ST VFDRLK+TNDQ +R+M  L+ K 
Subjt:  -------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKL

Query:  FDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLS---QEPKLHDAPSPHELKRFFAAVP
        F E N D K+H+ +PSR+          K K S+    +EPKLH APSP ELK    A P
Subjt:  FDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLS---QEPKLHDAPSPHELKRFFAAVP

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.3e-4352.73Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKE
        KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   S       P  L    +  + F +     +L  +   T      +
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKE

Query:  NLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLE
        N   SY    EEVDNS + +QRT VFDRIKP TTR SVF R+SMA  +EE QC  ST T+ S F+RLS+STSKK+R STS FDRLK+TNDQ +++M +L+
Subjt:  NLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLE

Query:  LKLFDEVNSDEKLHSSIPSR
         K F E N D+K+HS +PSR
Subjt:  LKLFDEVNSDEKLHSSIPSR

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-4349.2Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + 
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK

Query:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP
        +  +   +N   SY    EEVDNS + +QRTSVFDRIKP TTR  VF R+SMA  +EENQC  ST T+ SAF+RLS+STSKK+R STS FDRLK+ NDQ 
Subjt:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP

Query:  KRKMNNLELKLFDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLSQEPK
        +R+M +L+ K F E N D+K+HS IPSR +     I     + SL+ +P+
Subjt:  KRKMNNLELKLFDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLSQEPK

TrEMBL top hitse value%identityAlignment
A0A5A7U974 Retrotransposon gag protein3.3e-4339.38Show/hide
Query:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNL-------------------------------------------------
        +EKVDDP YCKYHR+I HPVE+CFVLK+LILKLA E KI+LD+DEVAQ+N                                                  
Subjt:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNL-------------------------------------------------

Query:  -----------------------------------ATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCI
                                             I  K K +R K   K +P + + +KF QP++ + L + F ++F +   E +       T+  +
Subjt:  -----------------------------------ATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCI

Query:  NV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLEL
         V       EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  +EENQC MST T+ SAF+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ 
Subjt:  NV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLEL

Query:  KLFDEVNSDEKLHSSIPSRLIKTMT
        K F E N D+K++S +PSR+ + ++
Subjt:  KLFDEVNSDEKLHSSIPSRLIKTMT

A0A5A7UM99 Ty3-gypsy retrotransposon protein3.5e-4539.72Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KIELD+DEVAQ+N                                                    
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------

Query:  --------------------------------LATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCINV
                                           I  K K +R K   K +P + + + F QP++ + L +   ++F +   E +       T+  + V
Subjt:  --------------------------------LATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLA------TSYCINV

Query:  -------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKL
               EE+DNS + +QRTSVFDRIKP TTR SVF R+SMA  KEEN+C  ST T+ SAF+RLS+STSKK+R ST VFDRLK+TNDQ +R+M  L+ K 
Subjt:  -------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKL

Query:  FDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLS---QEPKLHDAPSPHELKRFFAAVP
        F E N D K+H+ +PSR+          K K S+    +EPKLH APSP ELK    A P
Subjt:  FDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLS---QEPKLHDAPSPHELKRFFAAVP

A0A5A7URH1 Ty3-gypsy retrotransposon protein4.8e-4251.09Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + 
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK

Query:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP
        +   +   N  +S     +EV+NS +  QRTSVFDRIKP TTR SVF R+S+A  +EENQC     T+ S  +RLS+ST KK+R STS FDRLK+TNDQ 
Subjt:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP

Query:  KRKMNNLELKLFDEVNSDEKLHSSIPSRL
        +R+M + + K F E N D+K+HS +PSR+
Subjt:  KRKMNNLELKLFDEVNSDEKLHSSIPSRL

A0A5A7VFA5 Ty3-gypsy retrotransposon protein8.7e-4449.2Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + 
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSK

Query:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP
        +  +   +N   SY    EEVDNS + +QRTSVFDRIKP TTR  VF R+SMA  +EENQC  ST T+ SAF+RLS+STSKK+R STS FDRLK+ NDQ 
Subjt:  TFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQP

Query:  KRKMNNLELKLFDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLSQEPK
        +R+M +L+ K F E N D+K+HS IPSR +     I     + SL+ +P+
Subjt:  KRKMNNLELKLFDEVNSDEKLHSSIPSRLIKTMTKIRAFKCKSSLSQEPK

A0A5D3CA53 Retrotransposon gag protein2.5e-4352.73Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKE
        KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   S       P  L    +  + F +     +L  +   T      +
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKE

Query:  NLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLE
        N   SY    EEVDNS + +QRT VFDRIKP TTR SVF R+SMA  +EE QC  ST T+ S F+RLS+STSKK+R STS FDRLK+TNDQ +++M +L+
Subjt:  NLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLE

Query:  LKLFDEVNSDEKLHSSIPSR
         K F E N D+K+HS +PSR
Subjt:  LKLFDEVNSDEKLHSSIPSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAG
AAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAA
TGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGACAAAAGAGAACCTTGCGACTTCCTACTGCATC
AACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGC
CGCGACAAAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAATCGATCTTCAACATCTGTCT
TTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTAGCATCCCG
TCACGATTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGTT
CTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCC
TTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTC
GATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGT
TGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCG
AAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGTTCCTTCCCCCCAAGTTCGA
AGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGC
AGTTCCTTCCTCCAAGTTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCA
TTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTC
GAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCT
TCCTCCAAGTTCGAAGGTTCTCAGGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCAC
GCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTTCGAAGGTGCTT
CTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGTGGTTGACGTCCTCGTTCC
GCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGACTGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACAACCGTGGTGACCA
CCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGAAAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAAC
TACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAAGCTGATGACGACCGTGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGT
GGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTACAAGTGAAAAGCTGATGACGACCATGGTGACCACCCCTG
CAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAA
AGTGATTGGTCTAGACAGATTTGGAGACAGAGTCAGAGAACTCAGAGTCCAGAGCATTCTGCCAAGAGTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCT
AACAAGCCGATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGAT
CAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAG
AAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAA
TGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGACAAAAGAGAACCTTGCGACTTCCTACTGCATC
AACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGC
CGCGACAAAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAATCGATCTTCAACATCTGTCT
TTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTAGCATCCCG
TCACGATTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGTT
CTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCC
TTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTC
GATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGT
TGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCG
AAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGTTCCTTCCCCCCAAGTTCGA
AGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGC
AGTTCCTTCCTCCAAGTTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCA
TTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTC
GAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCT
TCCTCCAAGTTCGAAGGTTCTCAGGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCAC
GCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTTCGAAGGTGCTT
CTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGTGGTTGACGTCCTCGTTCC
GCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGACTGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACAACCGTGGTGACCA
CCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGAAAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAAC
TACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAAGCTGATGACGACCGTGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGT
GGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTACAAGTGAAAAGCTGATGACGACCATGGTGACCACCCCTG
CAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAA
AGTGATTGGTCTAGACAGATTTGGAGACAGAGTCAGAGAACTCAGAGTCCAGAGCATTCTGCCAAGAGTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCT
AACAAGCCGATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGAT
CAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGA
Protein sequenceShow/hide protein sequence
MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCI
NVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATKEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIP
SRLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKRFFAAVPSLQVRGSLHCTLLRCSFSKFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKF
DGSHAALLEFLLPKFEGSHALRCSSFFSRSKVLTRCVAVLSPQVRRFTHFAAVPSPKFEGSHALRSAIPSPKFEGSHALRCSSFPLSLKVLTRFAAVPSSKFKVPSPQVR
RFLRALLQFLPHSSKVLTRFAAVPSPQVRRFSRRFAAVPSSKFEGSHIASLRAALRCSSFLQVRRFSLASLQFIPPNSKVSKVLKRCDSLQFLPPSSKVLMRFVAPSSKF
EGSLTRCCSSFLQVRRFPHALRSLLLQVRRRLSPLFLPPSSKVLRRFVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTAPSPSSKALLSTAHSPSSKVL
LSTPLFEGSPLRFSFSKFEGSPLLLFKCLAVVDVLVPLHLQMLAVDGVRFASSSKIDCGEITASEKLMTTVVTTPAGNYSHQSDWSRKTGGEITASEKLMTTVVTTPAGN
YSHQSDWSRQVVKSLQVKADDDRGGEITASEKLMTTVVTTPAGNYSHQSDWSRQTGGEITTSEKLMTTMVTTPAGNYSHQSDWSRQVVKSLQVKLMTIVVTTPAGNYSHQ
SDWSRQIWRQSQRTQSPEHSAKSPESAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRPIIQEDQQANKPIQQIIKPTG