CuGenDBv2

Lag0026157 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026157
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr10:30998757..31002927
RNA-Seq Expression: Lag0026157
Synteny: Lag0026157
Gene Ontology terms: NA
InterPro domains: NA
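
The genome location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database page: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr10:30998757..31002927")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 4171 bp on chr10.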


Homology
GenBank top hits | e value | % identity | Alignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 2.6e-63 | 49.71
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K   +  +H        TL+ER+KK YPFPD D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSNLATIKEKSK--------HQRNK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL
        I KLA+E KIELD+DEVAQ+N   +   S          QR         +P  ++ ++K     SQ ++          +  L +SF +   ++  E  
Subjt:  ILKLAKERKIELDLDEVAQSNLATIKEKSK--------HQRNK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL

Query:  A--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPK
        A  T+  + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EE QC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +
Subjt:  A--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPK

Query:  RKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLINTEGSL
        R+M  LK K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  RKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLINTEGSL

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa] | 1.5e-66 | 44.44
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K + +  +H        TL+ER+KK YPFPD D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSN-------------------------------------------------------------------------------
        ILKL +E KIELD+DEVAQ+N                                                                               
Subjt:  ILKLAKERKIELDLDEVAQSN-------------------------------------------------------------------------------

Query:  -----LATIKEKSKHQRNKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVFDRIK
                I  K K +RNK   K +P + + + F QP+Q + L + F ++F     KE L  + C             + EEVDNS + +QRTSVFDRIK
Subjt:  -----LATIKEKSKHQRNKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN
        P TTR SVF R+S+   EEE QC  ST TR SAF+ LS+STSKK R STS FDRLK+ NDQ +R+M +LK+K F E N D+K+HS +PSRMKRK SV IN
Subjt:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN

Query:  TEGSL
        TEGSL
Subjt:  TEGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 1.5e-66 | 54.11
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K       H        TLKER++K YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS
        ILKLA+E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+NS + 
Subjt:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS

Query:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS
         QRTSVFDRIKP TTR SVF R+S+A  EEE QC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + K K F E N D+K+HS +PS
Subjt:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS

Query:  RMKRKFSVLINTEGSL
        RMKRK  V INTEGSL
Subjt:  RMKRKFSVLINTEGSL

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 3.4e-63 | 52.85
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV  T  KS SK K   +  +H        TL+ER+KK YPFPD D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS
        ILKLA+E KI+LD+DE                + KD   LQP+R         RS     P++++ +    + +    + +N   SY    EEVDNS + 
Subjt:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS

Query:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS
        +QRTSVFDRIKP TTR  VF R+SMA  EEE QC  ST TR SAF+RLS+STSKK R STS FDRLK+ NDQ +R+M +LK K F E N D+K+HS IPS
Subjt:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS

Query:  RMKRKFSVLINTEGSL
           RK SV IN EGSL
Subjt:  RMKRKFSVLINTEGSL

TYK04576.1 retrotransposon gag protein [Cucumis melo var. makuwa] | 1.2e-63 | 43.58
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K       H        TLKER+KK YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRNKD---------------PKKLQ-----
        ILKLA+E+KIELD+DEVAQ+N                                       + TI  ++K    KD               P  +Q     
Subjt:  ILKLAKERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRNKD---------------PKKLQ-----

Query:  ------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVFDRIK
                     K +R+KK             F Q ++ + L +   ++F   H ++   +   +  ++ EVD          NS +  QRTSVFDRIK
Subjt:  ------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVFDRIK

Query:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN
        P TTR SVF R+SMA  EEE QC     TR S F+RLS+STSKK R STS FDRLK+ NDQ +R+M +LK K F E N D+K+HS +PSRMKRK SV IN
Subjt:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN

Query:  TEGSLKFLLSKFE
        TEG++  LL   +
Subjt:  TEGSLKFLLSKFE

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein | 1.3e-63 | 49.71
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K   +  +H        TL+ER+KK YPFPD D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSNLATIKEKSK--------HQRNK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL
        I KLA+E KIELD+DEVAQ+N   +   S          QR         +P  ++ ++K     SQ ++          +  L +SF +   ++  E  
Subjt:  ILKLAKERKIELDLDEVAQSNLATIKEKSK--------HQRNK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL

Query:  A--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPK
        A  T+  + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EE QC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +
Subjt:  A--TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPK

Query:  RKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLINTEGSL
        R+M  LK K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  RKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7TGM1 Retrotransposon gag protein | 7.2e-67 | 44.44
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K + +  +H        TL+ER+KK YPFPD D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSN-------------------------------------------------------------------------------
        ILKL +E KIELD+DEVAQ+N                                                                               
Subjt:  ILKLAKERKIELDLDEVAQSN-------------------------------------------------------------------------------

Query:  -----LATIKEKSKHQRNKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVFDRIK
                I  K K +RNK   K +P + + + F QP+Q + L + F ++F     KE L  + C             + EEVDNS + +QRTSVFDRIK
Subjt:  -----LATIKEKSKHQRNKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------INVEEVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN
        P TTR SVF R+S+   EEE QC  ST TR SAF+ LS+STSKK R STS FDRLK+ NDQ +R+M +LK+K F E N D+K+HS +PSRMKRK SV IN
Subjt:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN

Query:  TEGSL
        TEGSL
Subjt:  TEGSL

A0A5A7URH1 Ty3-gypsy retrotransposon protein | 7.2e-67 | 54.11
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K       H        TLKER++K YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS
        ILKLA+E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+NS + 
Subjt:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS

Query:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS
         QRTSVFDRIKP TTR SVF R+S+A  EEE QC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + K K F E N D+K+HS +PS
Subjt:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS

Query:  RMKRKFSVLINTEGSL
        RMKRK  V INTEGSL
Subjt:  RMKRKFSVLINTEGSL

A0A5A7VFA5 Ty3-gypsy retrotransposon protein | 1.7e-63 | 52.85
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV  T  KS SK K   +  +H        TL+ER+KK YPFPD D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS
        ILKLA+E KI+LD+DE                + KD   LQP+R         RS     P++++ +    + +    + +N   SY    EEVDNS + 
Subjt:  ILKLAKERKIELDLDEVAQSNLATIKEKSKHQRNKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKS

Query:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS
        +QRTSVFDRIKP TTR  VF R+SMA  EEE QC  ST TR SAF+RLS+STSKK R STS FDRLK+ NDQ +R+M +LK K F E N D+K+HS IPS
Subjt:  EQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPS

Query:  RMKRKFSVLINTEGSL
           RK SV IN EGSL
Subjt:  RMKRKFSVLINTEGSL

A0A5D3C2C8 Retrotransposon gag protein | 5.7e-64 | 43.58
Query:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K       H        TLKER+KK YPFPD D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLHKSSSKGKRQTNVTHH-------LTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRNKD---------------PKKLQ-----
        ILKLA+E+KIELD+DEVAQ+N                                       + TI  ++K    KD               P  +Q     
Subjt:  ILKLAKERKIELDLDEVAQSN---------------------------------------LATIKEKSKHQRNKD---------------PKKLQ-----

Query:  ------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVFDRIK
                     K +R+KK             F Q ++ + L +   ++F   H ++   +   +  ++ EVD          NS +  QRTSVFDRIK
Subjt:  ------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYCINVEEVD----------NSKKSEQRTSVFDRIK

Query:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN
        P TTR SVF R+SMA  EEE QC     TR S F+RLS+STSKK R STS FDRLK+ NDQ +R+M +LK K F E N D+K+HS +PSRMKRK SV IN
Subjt:  PPTTRPSVFHRMSMAATEEETQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLIN

Query:  TEGSLKFLLSKFE
        TEG++  LL   +
Subjt:  TEGSLKFLLSKFE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found
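
In the BLAST-style alignments above, the middle line of each three-line block marks an identical residue with its letter, a conservative substitution with `+`, and a mismatch or gap with a space. As a rough illustration of how a % identity figure relates to that midline, the sketch below counts those symbols for one short slice. The function name `midline_stats` is hypothetical, and the slice is only the first ten columns of the first GenBank hit, so the result is not expected to match the 49.71 reported for the full alignment.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style midline.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gap columns.
    """
    length = len(midline)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# First ten alignment columns of the KAA0032121.1 hit (query: MVVNTTLHKS).
print(midline_stats("M+V  T  KS"))
```

Here the percentage is taken over all alignment columns, gaps included; a tool that uses a different denominator would report a different value.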

Sequences
CDS sequence
ATGGTTGTAAACACAACCCTTCATAAGTCGTCTTCGAAAGGAAAGCGACAAACGAATGTAACACATCACTTAACTTTAAAGGAAAGACGGAAGAAAAGCTATCCTTTCCC
TGATGACGACATCCCTGATATGTTGGAACAACTACTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATT
GCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTAAGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTA
GCTCAATCAAATCTTGCTACAATTAAAGAAAAGAGCAAACATCAAAGAAATAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCA
ACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGA
AGAGTGAACAAAGGACTTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGCCGCGACAGAGGAAGAAACTCAATGTTCG
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGTCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGAAGTTGAAACTTTTCGATGAAGTAAACAGTGACAATAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGTTTCTTCGTTGTATC
CTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGTGGTGCGTTGTTGCATTGTTCTC
CTTCCCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCGA
AGGTTCACACGCGCTTCGCGACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCC
ACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCACCACAGTTCCTTCGTCCAAGTTCCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGA
AGGTTCTCACGCGCATCGCCACAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAG
TTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCG
AAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCA
CATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCATCGCCACATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTTGGAGGTTCTCACGCGCATC
GCCACATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCG
CATCGCCACAGGTTCTCACGTGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCCACGCGGCATCGCCACGGTTCCTTCCTCCAAGTTCGAAGGTTCCCACGGGC
ATCGCCACGGTTTCTTCCTCCAAGTTCGAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTCCAAGTTCGAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTC
CAAGTTCGAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTCCAAGTTCAAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACG
CGTATCGTCACGGTTCCTTCCTCCAAGTTCGAAGGTCTTCCTCCAAGTTCGAAGGTTCTCACGCGTATCGTCACGGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCA
TCGCCACGGTTCCTTCCTCCAAGTTCGAAGGCATCGCCACAATTCTCCTTCCCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCACGACCCAAGACCAATAAAGCCCAAG
CCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGTTCTCCTCCATTTCGGGGGTTCAGAAATTCTACACTCTCACAAAGAC
AAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCGGAAGACCGAAGACTCTCTGCAATCCATAAGTCCAAGTGTTGAACA
CTTCTTGAAGACCAAACACTCCTCAAGACTTCAACACTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACT
TCTTGAAGATTGAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGAATCCATTGA
mRNA sequence
ATGGTTGTAAACACAACCCTTCATAAGTCGTCTTCGAAAGGAAAGCGACAAACGAATGTAACACATCACTTAACTTTAAAGGAAAGACGGAAGAAAAGCTATCCTTTCCC
TGATGACGACATCCCTGATATGTTGGAACAACTACTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATT
GCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTAAGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTA
GCTCAATCAAATCTTGCTACAATTAAAGAAAAGAGCAAACATCAAAGAAATAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCA
ACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGA
AGAGTGAACAAAGGACTTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGCCGCGACAGAGGAAGAAACTCAATGTTCG
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGTCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGAAGTTGAAACTTTTCGATGAAGTAAACAGTGACAATAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGTTTCTTCGTTGTATC
CTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGTGGTGCGTTGTTGCATTGTTCTC
CTTCCCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCGA
AGGTTCACACGCGCTTCGCGACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCC
ACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCACCACAGTTCCTTCGTCCAAGTTCCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGA
AGGTTCTCACGCGCATCGCCACAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAG
TTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCG
AAGGTTCACACGCGCTTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCA
CATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCATCGCCACATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTTGGAGGTTCTCACGCGCATC
GCCACATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACATTTCTCGCCACAGTTCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCG
CATCGCCACAGGTTCTCACGTGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCCACGCGGCATCGCCACGGTTCCTTCCTCCAAGTTCGAAGGTTCCCACGGGC
ATCGCCACGGTTTCTTCCTCCAAGTTCGAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTCCAAGTTCGAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTC
CAAGTTCGAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTCCAAGTTCAAAGGTTCCCACGCGGATCGCCACGGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACG
CGTATCGTCACGGTTCCTTCCTCCAAGTTCGAAGGTCTTCCTCCAAGTTCGAAGGTTCTCACGCGTATCGTCACGGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCA
TCGCCACGGTTCCTTCCTCCAAGTTCGAAGGCATCGCCACAATTCTCCTTCCCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCACGACCCAAGACCAATAAAGCCCAAG
CCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGTTCTCCTCCATTTCGGGGGTTCAGAAATTCTACACTCTCACAAAGAC
AAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCGGAAGACCGAAGACTCTCTGCAATCCATAAGTCCAAGTGTTGAACA
CTTCTTGAAGACCAAACACTCCTCAAGACTTCAACACTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACT
TCTTGAAGATTGAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGAATCCATTGA
Protein sequence
MVVNTTLHKSSSKGKRQTNVTHHLTLKERRKKSYPFPDDDIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKERKIELDLDEV
AQSNLATIKEKSKHQRNKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEETQCS
MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLKLKLFDEVNSDNKLHSSIPSRMKRKFSVLINTEGSLKFLLSKFEGPYTVRYCVVPSPSSKFLRCI
LLRCSFSKFEGSQLYNCYVVPPPSANDLMWCVVALFSFPSKFEGSHALRHSSPSPPSSKVHTRFATVLLPPSSKVHTRFATVLLPPSSKVHTRFATVLLPPSSKVLTRFA
TVPSSKFEGSHALHHSSFVQVPKVLTRIATVPSSKFEGSHAHRHSSFFQVRRFSRASPQFLPPSSKVLTRIATVPSSKFEGSHALRHSSPSPPSSKVHTRFATVLLPPSS
KVHTRFATVLLPPSSKVHTRFATVPSSKFEGSHAHRHISRHSSPSSKFEGSHAHRHISRHSSPSSKFGGSHAHRHISRHSSPSSKFEGSHAHRHISRHSSPSSKFEGSHA
HRHRFSRASPQFLPPSSKVPTRHRHGSFLQVRRFPRASPRFLPPSSKVPTRIATVPLPPSSKVPTRIATVPLPPSSKVPTRIATVPLPPSSKVPTRIATVPLPPSSKVLT
RIVTVPSSKFEGLPPSSKVLTRIVTVPSSKFEGSHAHRHGSFLQVRRHRHNSPSPSSKVLSRASPRPKTNKAQAQVVRPKSHQGPSSENSINRGVLLHFGGSEILHSHKD
KSSEFSKLSSRTREFRETPPSRKTEDSLQSISPSVEHFLKTKHSSRLQHFLKTQHSSRLQHSLKIKDSSGHQHFLKIEDSFKTGRLQAPRIH
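
The relationship between the CDS and protein sequences can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (the page does not say which translation table was used) and uses a hypothetical helper named `translate`; it is applied only to the first 27 and last 9 nucleotides of the CDS shown above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGTTGTAAACACAACCCTTCATAAG"))  # first 27 nt of the CDS
print(translate("ATCCATTGA"))                    # last 9 nt of the CDS
```

The two fragments translate to MVVNTTLHK and IH, which agree with the start and end of the protein sequence listed above.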