CuGenDBv2

Lag0036246 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036246
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr3:42398659..42402879
RNA-Seq Expression: Lag0036246
Synteny: Lag0036246
Gene Ontology terms: NA
InterPro domains: NA
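
The genome location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:42398659..42402879")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```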


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 3.1e-64; % identity: 49.42)
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKK
        LK+LI KL  E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++ 
Subjt:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKK

Query:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTN
         E  A  T+  + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  + S F+RLS+S SKK R ST  FDRLK+TN
Subjt:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTN

Query:  DQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTKGSL
        DQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INT+GSL
Subjt:  DQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTKGSL

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa] (e value: 4.3e-66; % identity: 43.77)
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K + +  +H        TL+ERQKK+YPFPD D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------------------------------------------
        LK+LILKL  E KIELD+DEVAQ+N                                                                           
Subjt:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------------------------------------------

Query:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVF
                    I  K K +R K   K +P + + + F QP+Q + L + F ++F     KE L  + C             + EEVDNS + +QRTSVF
Subjt:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+S+   EEENQC  ST  R S F+ LS+STSKK R STS FDRLK+ NDQ +R+M +L++K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTKGSL
        V INT+GSL
Subjt:  VLINTKGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 6.6e-67; % identity: 54.06)
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN
        LK+LILKL  E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+N
Subjt:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
        S +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC      R S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTKGSL
         +PSRMKRK  V INT+GSL
Subjt:  SIPSRMKRKFSVLINTKGSL

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 3.4e-63; % identity: 51.88)
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN
        LK+LILKL  E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + +    + +N   SY    EEVDN
Subjt:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
        S + +QRTSVFDRIKP TTR  VF R+SMA  EEENQC  ST  R S F+RLS+STSKK R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTKGSL
         IPS   RK SV IN +GSL
Subjt:  SIPSRMKRKFSVLINTKGSL

TYK04576.1 retrotransposon gag protein [Cucumis melo var. makuwa] (e value: 5.2e-64; % identity: 43.65)
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-
        LK+LILKL  E+KIELD+DEVAQ+N                                       + TI  ++K    KD               P  +Q 
Subjt:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-

Query:  ----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVF
                         K +R+KK             F Q ++ + L +   ++F   H ++   +   +  ++ EVD          NS +  QRTSVF
Subjt:  ----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+SMA  EEENQC      R S F+RLS+STSKK R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTKGSLKFLLSKFE
        V INT+G++  LL   +
Subjt:  VLINTKGSLKFLLSKFE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SRE2 Ty3-gypsy retrotransposon protein (e value: 1.5e-64; % identity: 49.42)
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKK
        LK+LI KL  E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++ 
Subjt:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKK

Query:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTN
         E  A  T+  + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  + S F+RLS+S SKK R ST  FDRLK+TN
Subjt:  KENLA--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTN

Query:  DQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTKGSL
        DQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INT+GSL
Subjt:  DQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTKGSL

A0A5A7TGM1 Retrotransposon gag protein (e value: 2.1e-66; % identity: 43.77)
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K + +  +H        TL+ERQKK+YPFPD D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------------------------------------------
        LK+LILKL  E KIELD+DEVAQ+N                                                                           
Subjt:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------------------------------------------

Query:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVF
                    I  K K +R K   K +P + + + F QP+Q + L + F ++F     KE L  + C             + EEVDNS + +QRTSVF
Subjt:  ---------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+S+   EEENQC  ST  R S F+ LS+STSKK R STS FDRLK+ NDQ +R+M +L++K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTKGSL
        V INT+GSL
Subjt:  VLINTKGSL

A0A5A7URH1 Ty3-gypsy retrotransposon protein (e value: 3.2e-67; % identity: 54.06)
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN
        LK+LILKL  E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+N
Subjt:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
        S +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC      R S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTKGSL
         +PSRMKRK  V INT+GSL
Subjt:  SIPSRMKRKFSVLINTKGSL

A0A5A7VFA5 Ty3-gypsy retrotransposon protein (e value: 1.6e-63; % identity: 51.88)
Query:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESMVV  T  KS SK K   +  +H        TL+ERQKK+YPFPD D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN
        LK+LILKL  E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + +    + +N   SY    EEVDN
Subjt:  LKDLILKLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDN

Query:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
        S + +QRTSVFDRIKP TTR  VF R+SMA  EEENQC  ST  R S F+RLS+STSKK R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HS
Subjt:  SKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS

Query:  SIPSRMKRKFSVLINTKGSL
         IPS   RK SV IN +GSL
Subjt:  SIPSRMKRKFSVLINTKGSL

A0A5D3C2C8 Retrotransposon gag protein (e value: 2.5e-64; % identity: 43.65)
Query:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-
        LK+LILKL  E+KIELD+DEVAQ+N                                       + TI  ++K    KD               P  +Q 
Subjt:  LKDLILKLVMERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRKKD---------------PKKLQ-

Query:  ----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVF
                         K +R+KK             F Q ++ + L +   ++F   H ++   +   +  ++ EVD          NS +  QRTSVF
Subjt:  ----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVF

Query:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS
        DRIKP TTR SVF R+SMA  EEENQC      R S F+RLS+STSKK R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK S
Subjt:  DRIKPPTTRPSVFHRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFS

Query:  VLINTKGSLKFLLSKFE
        V INT+G++  LL   +
Subjt:  VLINTKGSLKFLLSKFE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGTCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAATATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTA
AAGTTGGTTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAA
ACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTG
CGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTC
CATAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCATTCGACCTTCACCTTTCCAAAGGCTAAGTGTTTCCACATCGAAGAAAAGTCG
ATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGC
TTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGAAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGC
TATTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTACGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCT
CTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGTAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACACGCTCCATTGCAGTTCCTTCTTTCCAAGGTCGAAGGTT
CAAACTCGCTGCGTTGTAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTT
CCTCCAAAGTGGAAGGTTCTCCGCTGCTGCAGTTCATTCTTCCAGGTTCGAAGGTTCTCACGTCGCTTCGCTGTAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCT
TCGCTGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCTAAAT
TCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCCCTTCGC
TGCAGTTCCTTCCTCCAATTCCTTCCCCCCAAGTTCAAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTAAAGGTTCTCAC
ATCGCTTCGCTGCGATCCTTCCTCCAAGTTTGAAGGTTCTCACATTGCCTCGCTGCGATCCTTCCTCCAAGTTCGAAGTTCCTCCCTCCAAATTCGAAGGTTCTCACGCG
CTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCC
AAGTTCGAAGCTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCACCAAATTCGAAGGTTCTCACGCGCTT
CGCTGCAGTTCCTGCCCCCCAAGTTCGAAGGTTCTCACGCGCTTTGATGCAGTTCTTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAATTCCTTCCTCCAAGT
TCGAAGGTTCTCATGCACTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTG
CAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTCCCTCCAAATTAGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGC
AATTCCTTCCCCCAAGTTCGAAGCTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCACCAAATTCGAAGG
TTCTCACGCGCTTCGCTGCAGTTCCTGCCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCTTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAATT
CCTTCCTCCAAGTTCGAAGGTTCTCATGCACTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCT
CACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTT
CCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATACGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCAC
GCGCTTCGCTCTCCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCC
CCCAAATACGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTCCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACG
CGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATACGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCC
AAGTTCGAAGGTTCTCACGCGCTTCGCTCTCCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCT
TCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCCCCCAAGTTCGAAGGTTCTCACGCGGTTCGCTGCAGTTCCTTTCCCCCA
AGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTTCTTCCTCCAAGTTCAAAGGTTCTCACGCGCCTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACAGCTTCG
TTGCAGTTCCTTCCTCCAAGTTCGAAGGTCCTCATGCTACGCTCGGCTACACTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAGTTGTCCCTACACTCATGCTGAAAG
GGCGTGA
mRNA sequence
ATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGTCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAATATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTA
AAGTTGGTTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAA
ACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTG
CGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTC
CATAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCATTCGACCTTCACCTTTCCAAAGGCTAAGTGTTTCCACATCGAAGAAAAGTCG
ATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGC
TTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGAAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGC
TATTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTACGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCT
CTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGTAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACACGCTCCATTGCAGTTCCTTCTTTCCAAGGTCGAAGGTT
CAAACTCGCTGCGTTGTAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTT
CCTCCAAAGTGGAAGGTTCTCCGCTGCTGCAGTTCATTCTTCCAGGTTCGAAGGTTCTCACGTCGCTTCGCTGTAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCT
TCGCTGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCTAAAT
TCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCCCTTCGC
TGCAGTTCCTTCCTCCAATTCCTTCCCCCCAAGTTCAAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTAAAGGTTCTCAC
ATCGCTTCGCTGCGATCCTTCCTCCAAGTTTGAAGGTTCTCACATTGCCTCGCTGCGATCCTTCCTCCAAGTTCGAAGTTCCTCCCTCCAAATTCGAAGGTTCTCACGCG
CTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCC
AAGTTCGAAGCTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCACCAAATTCGAAGGTTCTCACGCGCTT
CGCTGCAGTTCCTGCCCCCCAAGTTCGAAGGTTCTCACGCGCTTTGATGCAGTTCTTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAATTCCTTCCTCCAAGT
TCGAAGGTTCTCATGCACTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTG
CAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTCCCTCCAAATTAGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGC
AATTCCTTCCCCCAAGTTCGAAGCTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCACCAAATTCGAAGG
TTCTCACGCGCTTCGCTGCAGTTCCTGCCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCTTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAATT
CCTTCCTCCAAGTTCGAAGGTTCTCATGCACTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCT
CACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTT
CCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATACGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCAC
GCGCTTCGCTCTCCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCC
CCCAAATACGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTCCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACG
CGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATACGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCC
AAGTTCGAAGGTTCTCACGCGCTTCGCTCTCCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCT
TCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCCCCCAAGTTCGAAGGTTCTCACGCGGTTCGCTGCAGTTCCTTTCCCCCA
AGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTTCTTCCTCCAAGTTCAAAGGTTCTCACGCGCCTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACAGCTTCG
TTGCAGTTCCTTCCTCCAAGTTCGAAGGTCCTCATGCTACGCTCGGCTACACTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAGTTGTCCCTACACTCATGCTGAAAG
GGCGTGA
Protein sequence
MRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDVDIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLIL
KLVMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVF
HRMSMAATEEENQCSMSTSIRPSPFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTKGSLKFLLSKFEGPYTVR
YCVVPSPSSRVLSCTTATLFLLQVRRILCGALLHCSLFSQVRWFSRSFAVVSSPQVRRFSHAPLQFLLSKVEGSNSLRCSSFSPSSKVHALRSSSFSQIRRFSRASLQFL
PPKWKVLRCCSSFFQVRRFSRRFAVVPSSKFEGSHVLRCSSFSPRSKVLTRCVAVLSPQVRRFTHFAAVPSPKFEGSHALRSAIPSPKFEGSHALRAVPSSKFEGSHALR
CSSFLQFLPPKFKGSHVASLRSCTSLQFLPPSLKVLTSLRCDPSSKFEGSHIASLRSFLQVRSSSLQIRRFSRASLHFLPPNSKVLTRFAAVPSSKFEGSHALRSAIPSP
KFEASHALRAVPSSKFEGSHALRCISFPPNSKVLTRFAAVPAPQVRRFSRALMQFFPPSSKVLTRFVAIPSSKFEGSHALRAVPSSKFEGSHALRCISFPQIRRFSRASL
QFLPPKFEGSHALRAVPPSKLEVPSSKFEGSHALRSAIPSPKFEASHALRAVPSSKFEGSHALRCISFPPNSKVLTRFAAVPAPQVRRFSRASMQFFPPSSKVLTRFVAI
PSSKFEGSHALRAVPSSKFEGSHALRCISFPPNSKVLTRFAAVPSPQVRRFSPASLQFLPPSSKVLTRFAAVPSSKFEGSHALRCISFPPNTKVLTRFAAVPSSKFEGSH
ALRSPIPSPKFEGSHALRAVPSSKFEGSHALRCISFPPNTKVLTRFAAVPSSKFEGSHALRSPIPSPKFEGSHALRAVPSSKFEGSHALRCISFPPNTKVLTRFAAVPSS
KFEGSHALRSPIPSPKFEGSHALRAVPSSKFEGSHALRCISFPPNSKVLTRFAAVPFPQVRRFSRGSLQFLSPKFEGSHALRCSFFLQVQRFSRASLQFLPPSSKVLTAS
LQFLPPSSKVLMLRSATLLRYFLKSKDVSCPYTHAERA
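
The protein sequence can be cross-checked against the CDS by translating the CDS codon by codon. The sketch below is illustrative rather than the database's own method: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 33 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS sequence listed above.
cds_prefix = "ATGAGAAAAGAAGGAAGGAACGATGAAGAGACT"
print(translate(cds_prefix))  # MRKEGRNDEET
```

Passing the full CDS text (line breaks included) to `translate` should reproduce the listed protein sequence if the standard-code assumption holds.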