; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017947 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017947
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr5:12140870..12143330
RNA-Seq ExpressionLag0017947
SyntenyLag0017947
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-6649.71Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKT
        LK+LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++ 
Subjt:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKT

Query:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTN
         E  A  T+  + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SM   +EENQC   T  Q SAF+RLS+S SKK R ST  FDRLK+TN
Subjt:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTN

Query:  DEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFLP
        D+ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL   P
Subjt:  DEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFLP

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.1e-6743.83Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K + +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------------------------------------------
        LK+LILKL  E KIELD+DEVAQ+N                                                                           
Subjt:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------------------------------------------

Query:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKTKENLATSYC------------INVEEVDNSKKSEQRTSVF
                    I  K K +R K   K +P + + + F QP+Q + L + F ++F     KE L  + C             + EEVDNS + +QRTSVF
Subjt:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKTKENLATSYC------------INVEEVDNSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+S+   EEENQC  ST T+ SAF+ LS+STSKK R STS FDRLK+ ND+ +R+M +L++K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTEGSLKFLP
        V INTEGSL   P
Subjt:  VLINTEGSLKFLP

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.7e-6853.7Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN
        LK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+N
Subjt:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS
        S +  QRTSVFDRIKP TTR SVF R+S+   EEENQC     T+ S  +RLS+ST KK R STS FDRLK+TND+ +R+M + + K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTEGSLKFLP
         +PSRMKRK  V INTEGSL   P
Subjt:  SIPSRMKRKFSVLINTEGSLKFLP

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.0e-6551.85Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN
        LK+LILKLA E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + +  +   +N   SY    EEVDN
Subjt:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS
        S + +QRTSVFDRIKP TTR  VF R+SM   EEENQC  ST T+ SAF+RLS+STSKK R STS FDRLK+ ND+ +R+M +L+ K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTEGSLKFLP
         IPS   RK SV IN EGSL   P
Subjt:  SIPSRMKRKFSVLINTEGSLKFLP

TYK04576.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.2e-6444.34Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-
        LK+LILKLA E+KIELD+DEVAQ+N                                       + TI  ++K    KD               P  +Q 
Subjt:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-

Query:  ----------------PKRKRSKKFSQPQQL-----------------VMLNKSFSKTFHKKTKENLATSYCINVE---------EVDNSKKSEQRTSVF
                         K +R+KK   P+ +                   L +SF +   ++  E  A      VE         EV+NS +  QRTSVF
Subjt:  ----------------PKRKRSKKFSQPQQL-----------------VMLNKSFSKTFHKKTKENLATSYCINVE---------EVDNSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+SM   EEENQC     T+ S F+RLS+STSKK R STS FDRLK+ ND+ +R+M +L+ K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTEGSLKFLPPN
        V INTEG++  L  N
Subjt:  VLINTEGSLKFLPPN

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein6.1e-6749.71Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKT
        LK+LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++ 
Subjt:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKT

Query:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTN
         E  A  T+  + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SM   +EENQC   T  Q SAF+RLS+S SKK R ST  FDRLK+TN
Subjt:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTN

Query:  DEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFLP
        D+ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL   P
Subjt:  DEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFLP

A0A5A7TGM1 Retrotransposon gag protein5.5e-6843.83Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K + +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------------------------------------------
        LK+LILKL  E KIELD+DEVAQ+N                                                                           
Subjt:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------------------------------------------

Query:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKTKENLATSYC------------INVEEVDNSKKSEQRTSVF
                    I  K K +R K   K +P + + + F QP+Q + L + F ++F     KE L  + C             + EEVDNS + +QRTSVF
Subjt:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKTKENLATSYC------------INVEEVDNSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+S+   EEENQC  ST T+ SAF+ LS+STSKK R STS FDRLK+ ND+ +R+M +L++K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTEGSLKFLP
        V INTEGSL   P
Subjt:  VLINTEGSLKFLP

A0A5A7URH1 Ty3-gypsy retrotransposon protein8.5e-6953.7Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN
        LK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+N
Subjt:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS
        S +  QRTSVFDRIKP TTR SVF R+S+   EEENQC     T+ S  +RLS+ST KK R STS FDRLK+TND+ +R+M + + K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTEGSLKFLP
         +PSRMKRK  V INTEGSL   P
Subjt:  SIPSRMKRKFSVLINTEGSLKFLP

A0A5A7VFA5 Ty3-gypsy retrotransposon protein4.3e-6551.85Show/hide
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN
        LK+LILKLA E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + +  +   +N   SY    EEVDN
Subjt:  LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS
        S + +QRTSVFDRIKP TTR  VF R+SM   EEENQC  ST T+ SAF+RLS+STSKK R STS FDRLK+ ND+ +R+M +L+ K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTEGSLKFLP
         IPS   RK SV IN EGSL   P
Subjt:  SIPSRMKRKFSVLINTEGSLKFLP

A0A5D3C2C8 Retrotransposon gag protein5.7e-6544.34Show/hide
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-
        LK+LILKLA E+KIELD+DEVAQ+N                                       + TI  ++K    KD               P  +Q 
Subjt:  LKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-

Query:  ----------------PKRKRSKKFSQPQQL-----------------VMLNKSFSKTFHKKTKENLATSYCINVE---------EVDNSKKSEQRTSVF
                         K +R+KK   P+ +                   L +SF +   ++  E  A      VE         EV+NS +  QRTSVF
Subjt:  ----------------PKRKRSKKFSQPQQL-----------------VMLNKSFSKTFHKKTKENLATSYCINVE---------EVDNSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+SM   EEENQC     T+ S F+RLS+STSKK R STS FDRLK+ ND+ +R+M +L+ K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTEGSLKFLPPN
        V INTEG++  L  N
Subjt:  VLINTEGSLKFLPPN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCATCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCTTAAAGGACTTAATTCTA
AAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAA
ACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAGACTTTCCACAAAAAGACAAAAGAGAACCTTG
CGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTC
CATAGAATGAGTATGGTCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCG
ATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATGAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGC
TTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCCTTCCCCCAAATTCGAAGATTCTCACGCGCTTCGCT
ACAGTTCCTTCCCCCCAAGTTCGAAGATTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAGTTCGA
AGGTTCTCACGTCGCTTCGCTGCATTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGCT
CTCACACGCTTCGCTACAGTTTTTTCTCCCTAACTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGGTTCTCACGTTGCTTCGCTGCAGT
TCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGGTTCTCATGCATTTCGCTACAGTTCATTTCCTCCAAGTTCAAAGGT
TCTCACGCGCTTCGGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGTGATTCGGAGAAGTTCCTTCCTCCCAAGTTCGAAGGTTCTTACGCGCTTCGCTGCAGTTC
CTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTT
CTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTTCTTCTCCCTAAGTTC
GAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCACG
CGCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCATCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCTTAAAGGACTTAATTCTA
AAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAA
ACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAGACTTTCCACAAAAAGACAAAAGAGAACCTTG
CGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTC
CATAGAATGAGTATGGTCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCG
ATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATGAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGC
TTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCCTTCCCCCAAATTCGAAGATTCTCACGCGCTTCGCT
ACAGTTCCTTCCCCCCAAGTTCGAAGATTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAGTTCGA
AGGTTCTCACGTCGCTTCGCTGCATTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGCT
CTCACACGCTTCGCTACAGTTTTTTCTCCCTAACTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGGTTCTCACGTTGCTTCGCTGCAGT
TCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGGTTCTCATGCATTTCGCTACAGTTCATTTCCTCCAAGTTCAAAGGT
TCTCACGCGCTTCGGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGTGATTCGGAGAAGTTCCTTCCTCCCAAGTTCGAAGGTTCTTACGCGCTTCGCTGCAGTTC
CTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTT
CTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTTCTTCTCCCTAAGTTC
GAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCACG
CGCTGTGA
Protein sequenceShow/hide protein sequence
MRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLIL
KLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVF
HRMSMVATEEENQCSMSTSTQPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDEPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFLPPNSKILTRFA
TVPSPQVRRFSRASLQFLPHSSKVLTRFVAVPSPKFEGSHVASLHFLPPSLKVLTSLRFALRFVAVPSSKFEGSHTLRYSFFSLTSKVLTRFAAIPSSLSLKVLTLLRCS
SFLQVRRFSCALLQFLLSTFKGSHAFRYSSFPPSSKVLTRFGEVPSSKSEGSHVIRRSSFLPSSKVLTRFAAVPSSKFEGSHAFRCSFFSLSSKVLTRFDAVPSSLSSKV
LTRFAAVPSSKFEGFEVPSSKFEGSHAFRCSFFSLSSKVLTRFDAVPSSLSSKVLTRFAAVPSSKFEGFEGSHAL