CuGenDBv2

Lag0024240 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024240
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr10:1507056..1508799
RNA-Seq Expression: Lag0024240
Synteny: Lag0024240
Gene Ontology terms: NA
InterPro domains: NA
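
The genome location field can be split into chromosome, start, and end for downstream use. The snippet below is a minimal sketch, not part of the database: `GenomeLocation` and `parse_location` are hypothetical names, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class GenomeLocation:
    """A parsed 'chrom:start..end' string (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'chr10:1507056..1508799'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return GenomeLocation(chrom, start, end)


loc = parse_location("chr10:1507056..1508799")
print(loc.chrom, loc.start, loc.end, loc.span())
```

Under the inclusive-coordinate assumption, the gene spans 1,744 positions on chr10.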


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.2e-64 | % identity: 47.98
Query:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQ K+Y FPD D+ +MLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQLATAK------KLFSKTFHKKEKENF
        LK+LI KLA E KIELD DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++  +        +   ++F +   E  
Subjt:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQLATAK------KLFSKTFHKKEKENF

Query:  -------------ATSYYINVEEVDNSKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTN
                       + Y + EE+DNS + +QRTSVFD IKP TTR SVFQR+SM   +EENQC   T+ + SAF+R S+S SKK+R ST  FDRLK+TN
Subjt:  -------------ATSYYINVEEVDNSKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTN

Query:  DQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
        DQ +R+M  L+ K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  DQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 7.3e-67 | % identity: 43.77
Query:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K + +  +H        TL+ERQ K+Y FPD D+ +MLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------------------------------------------
        LK+LILKL  E KIELD DEVAQ+N                                                                           
Subjt:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------------------------------------------

Query:  ---------LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLATAKKLFSKTF---HKKEKENFATSY----------YINVEEVDNSKKGEQRTSVF
                    I  K K +R K   K +P + + + F QP+Q  T  + F ++F   H KE       +          Y + EEVDNS + +QRTSVF
Subjt:  ---------LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLATAKKLFSKTF---HKKEKENFATSY----------YINVEEVDNSKKGEQRTSVF

Query:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS
        DRIKP TTR SVFQR+S+   EEENQC  ST TR SAF+  S+STSKK+R STS FDRLK+ NDQ +R+M +L++K F E N D+K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS

Query:  VLINTEGSL
        V INTEGSL
Subjt:  VLINTEGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.3e-68 | % identity: 54.06
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--MHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ K+Y FPD D+ +MLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--MHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDN
        LK+LILKLA E+KIELD DEVAQ+N A I+  S   + KD   LQ +R         RS     P+++       + +  + +  N+ +S     +EV+N
Subjt:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDN

Query:  SKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHS
        S +  QRTSVFDRIKP TTR SVFQR+S+   EEENQC    +TR S  +R S+ST KK+R STS FDRLK+TNDQ +R+M + + K F E N D+K+HS
Subjt:  SKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHS

Query:  SIPSRMKRKFSVLINTEGSL
         +PSRMKRK  V INTEGSL
Subjt:  SIPSRMKRKFSVLINTEGSL

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 8.9e-65 | % identity: 53.14
Query:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQ K+Y FPD D+ +MLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRS------KKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDNSK
        LK+LILKLA E KI+LD DE        IKG       KD   LQP+R  +      + F +       +     T    E +N   SY    EEVDNS 
Subjt:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRS------KKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDNSK

Query:  KGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSI
        + +QRTSVFDRIKP TTR  VFQR+SM   EEENQC  ST+TR SAF+R S+STSKK+R STS FDRLK+ NDQ +R+M +L+ K F E N D+K+HS I
Subjt:  KGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSI

Query:  PSRMKRKFSVLINTEGSL
        PS   RK SV IN EGSL
Subjt:  PSRMKRKFSVLINTEGSL

TYK04576.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 1.5e-64 | % identity: 43.77
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--MHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ K+Y FPD D+ +MLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--MHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------LATIKGKSKHQRKKD---------------PKKLQ-
        LK+LILKLA E+KIELD DEVAQ+N                                       + TI  ++K    KD               P  +Q 
Subjt:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------LATIKGKSKHQRKKD---------------PKKLQ-

Query:  ----------------PKRKRSKK-------------FSQPQQLATAKKLFSKTFHKKEKENF-------------ATSYYINVEEVDNSKKGEQRTSVF
                         K +R+KK             F Q ++  T  +   ++F +   E                 + Y + +EV+NS +  QRTSVF
Subjt:  ----------------PKRKRSKK-------------FSQPQQLATAKKLFSKTFHKKEKENF-------------ATSYYINVEEVDNSKKGEQRTSVF

Query:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS
        DRIKP TTR SVFQR+SM   EEENQC    +TR S F+R S+STSKK+R STS FDRLK+ NDQ +R+M +L+ K F E N D+K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS

Query:  VLINTEGSL
        V INTEG++
Subjt:  VLINTEGSL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SRE2 Ty3-gypsy retrotransposon protein | e value: 5.6e-65 | % identity: 47.98
Query:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQ K+Y FPD D+ +MLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQLATAK------KLFSKTFHKKEKENF
        LK+LI KLA E KIELD DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++  +        +   ++F +   E  
Subjt:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQLATAK------KLFSKTFHKKEKENF

Query:  -------------ATSYYINVEEVDNSKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTN
                       + Y + EE+DNS + +QRTSVFD IKP TTR SVFQR+SM   +EENQC   T+ + SAF+R S+S SKK+R ST  FDRLK+TN
Subjt:  -------------ATSYYINVEEVDNSKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTN

Query:  DQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
        DQ +R+M  L+ K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  DQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7TGM1 Retrotransposon gag protein | e value: 3.5e-67 | % identity: 43.77
Query:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K + +  +H        TL+ERQ K+Y FPD D+ +MLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------------------------------------------
        LK+LILKL  E KIELD DEVAQ+N                                                                           
Subjt:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------------------------------------------

Query:  ---------LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLATAKKLFSKTF---HKKEKENFATSY----------YINVEEVDNSKKGEQRTSVF
                    I  K K +R K   K +P + + + F QP+Q  T  + F ++F   H KE       +          Y + EEVDNS + +QRTSVF
Subjt:  ---------LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLATAKKLFSKTF---HKKEKENFATSY----------YINVEEVDNSKKGEQRTSVF

Query:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS
        DRIKP TTR SVFQR+S+   EEENQC  ST TR SAF+  S+STSKK+R STS FDRLK+ NDQ +R+M +L++K F E N D+K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS

Query:  VLINTEGSL
        V INTEGSL
Subjt:  VLINTEGSL

A0A5A7URH1 Ty3-gypsy retrotransposon protein | e value: 6.4e-69 | % identity: 54.06
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--MHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ K+Y FPD D+ +MLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--MHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDN
        LK+LILKLA E+KIELD DEVAQ+N A I+  S   + KD   LQ +R         RS     P+++       + +  + +  N+ +S     +EV+N
Subjt:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDN

Query:  SKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHS
        S +  QRTSVFDRIKP TTR SVFQR+S+   EEENQC    +TR S  +R S+ST KK+R STS FDRLK+TNDQ +R+M + + K F E N D+K+HS
Subjt:  SKKGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHS

Query:  SIPSRMKRKFSVLINTEGSL
         +PSRMKRK  V INTEGSL
Subjt:  SIPSRMKRKFSVLINTEGSL

A0A5A7V7A0 Retrotransposon gag protein | e value: 7.4e-65 | % identity: 43.28
Query:  IEESMVVNTTLPKSSSK-------EKRQTNGMHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK        K   +     TLKERQ K+Y FPD D+ +MLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSK-------EKRQTNGMHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------LATIKGKSKHQRKKD---------------PKKLQ-
        LK+LILKLA E+KIELD DEVAQ+N                                       + TI  ++K    KD               P  +Q 
Subjt:  LKDLILKLAMERKIELDFDEVAQSN---------------------------------------LATIKGKSKHQRKKD---------------PKKLQ-

Query:  ----------------PKRKRSKKFSQPQQL-------------ATAKKLFSKTFHKKEKENF-------------ATSYYINVEEVDNSKKGEQRTSVF
                         K +R+KK   P+ +              T  +   ++F +   E                 + Y + +EV+NS +  QRTSVF
Subjt:  ----------------PKRKRSKKFSQPQQL-------------ATAKKLFSKTFHKKEKENF-------------ATSYYINVEEVDNSKKGEQRTSVF

Query:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS
        DRIKP TTR SVFQR+SM   EEENQC    +TR S F+R S+STSKK+R STSVFDRLK+T+DQ +R+M +L+ K F E N D+K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFS

Query:  VLINTEGSL
        V INTEG++
Subjt:  VLINTEGSL

A0A5A7VFA5 Ty3-gypsy retrotransposon protein | e value: 4.3e-65 | % identity: 53.14
Query:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQ K+Y FPD D+ +MLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGMHH-------LTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRS------KKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDNSK
        LK+LILKLA E KI+LD DE        IKG       KD   LQP+R  +      + F +       +     T    E +N   SY    EEVDNS 
Subjt:  LKDLILKLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRS------KKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDNSK

Query:  KGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSI
        + +QRTSVFDRIKP TTR  VFQR+SM   EEENQC  ST+TR SAF+R S+STSKK+R STS FDRLK+ NDQ +R+M +L+ K F E N D+K+HS I
Subjt:  KGEQRTSVFDRIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSI

Query:  PSRMKRKFSVLINTEGSL
        PS   RK SV IN EGSL
Subjt:  PSRMKRKFSVLINTEGSL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAACGACAAACGAATGGAATGCA
TCACTTAACTTTAAAGGAAAGACAGAATAAAATCTATCATTTCCCTGATGTCGACATCCCTAATATGTTGGAGCAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAAGACTTAATTCTA
AAGCTGGCTATGGAAAGAAAAATTGAGCTCGACTTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAAAAGGATCCTAAGAA
ACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGCGACTGCGAAGAAACTCTTCTCTAAAACTTTCCACAAAAAGGAAAAAGAGAACTTTG
CAACTTCCTACTACATCAACGTAGAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTC
CAAAGAATGAGTATGGTCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCCTAGTGTCTCCACATCGAAGAAAAATCG
ATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGC
TTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGT
TCGAAGGTTCCCACGTGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAGGGTTCTCACGTGCTTCGGTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTTACGCGCTTCGC
TGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCTTTCCTCACAGTTC
GAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCCCTGAGTTTCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGC
TTCACTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAG
mRNA sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAACGACAAACGAATGGAATGCA
TCACTTAACTTTAAAGGAAAGACAGAATAAAATCTATCATTTCCCTGATGTCGACATCCCTAATATGTTGGAGCAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAAGACTTAATTCTA
AAGCTGGCTATGGAAAGAAAAATTGAGCTCGACTTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAAAAGGATCCTAAGAA
ACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGCGACTGCGAAGAAACTCTTCTCTAAAACTTTCCACAAAAAGGAAAAAGAGAACTTTG
CAACTTCCTACTACATCAACGTAGAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTC
CAAAGAATGAGTATGGTCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCCTAGTGTCTCCACATCGAAGAAAAATCG
ATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGC
TTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGT
TCGAAGGTTCCCACGTGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAGGGTTCTCACGTGCTTCGGTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTTACGCGCTTCGC
TGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCTTTCCTCACAGTTC
GAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCCCTGAGTTTCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGC
TTCACTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAG
Protein sequence
MRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGMHHLTLKERQNKIYHFPDVDIPNMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLIL
KLAMERKIELDFDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLATAKKLFSKTFHKKEKENFATSYYINVEEVDNSKKGEQRTSVFDRIKPPTTRPSVF
QRMSMVATEEENQCSVSTFTRPSAFQRPSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSLKWGQHSEWKLLPPS
SKVPTCFAAVPSPQIRGFSRASVQFLPHSSKVLTRFAAVPSLQVRRFSRRFAAVLSSKFEGSHAFHCSSFLTVRRFSRASLQFLPPQVRRFSRRFPEFLPPSLKVLTSLR
FTHALRCVPSSKFEGSHTLRCSSFLTIRRFSRASL
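
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the function name `translate` is a hypothetical helper, not something provided by the database. It is exercised only on the first 33 nucleotides of the CDS, copied from the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; trailing bases that do not
    form a complete codon are dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS shown above.
cds_prefix = "ATGAGAAAAGAAGGAAGGAACGATGAAGAGACT"
print(translate(cds_prefix))  # MRKEGRNDEET
```

The output matches the first eleven residues of the protein sequence (`MRKEGRNDEET`). Passing the full CDS, with its line breaks left in, should reproduce the whole protein if the record is internally consistent.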