CuGenDBv2

Lag0026085 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026085
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr10:28905453..28908751
RNA-Seq Expression: Lag0026085
Synteny: Lag0026085
Gene Ontology terms: NA
InterPro domains: NA
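
The genome location in this record follows a `chrom:start..end` pattern. The sketch below shows one way to split such a string into its parts and compute the span length. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption, since the record does not state which convention it uses.

```python
import re

# Hypothetical helper names; the "chrom:start..end" pattern is taken
# from the Genome location field of this record.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match.group("chrom"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end = parse_location("chr10:28905453..28908751")
    print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based half-open instead, the length would be `end - start` without the `+ 1`.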


Homology
GenBank top hits (e value; % identity; alignment)
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 2.4e-64; % identity: 49.71)
Query:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQHKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL
        I KLA+E KIELD+DEVAQ+N   +   +          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++  E  
Subjt:  ILKLAKEGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQHKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL

Query:  A--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK
        A  T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVFQR SMA  +EENQC   T  + SAF+RLS+S SKK +PST  FDRLK+T+DQ +
Subjt:  A--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK

Query:  RKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLITTEG
        R+M  L+ K F E N D K+HS VPSRMKRK SV I TEG
Subjt:  RKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLITTEG

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa] (e value: 2.6e-66; % identity: 44.67)
Query:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K + +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKGNNKH
        ILKL +E KIELD+DEVAQ+N                                                                      ++++ N+  
Subjt:  ILKLAKEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKGNNKH

Query:  QRKKDPKKLQHKRK--RSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------IDVEEVDNSKKSEQRTSVFDRIK
         +K     + HK+K  R+KK             F QP+Q + L + F ++F     KE L  + C               EEVDNS + +QRTSVFDRIK
Subjt:  QRKKDPKKLQHKRK--RSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------IDVEEVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT
        P TTR SVFQR S+   EEENQC  ST TR SAF+ LS+STSKK +PSTS FDRLK+ +DQ +R+M +L+VK F E N D K+HS VPSRMKRK SV I 
Subjt:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT

Query:  TEG
        TEG
Subjt:  TEG

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 9.9e-66; % identity: 54.14)
Query:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQHKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCIDVEEVDNSKKS
        ILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+NS + 
Subjt:  ILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQHKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCIDVEEVDNSKKS

Query:  EQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPS
         QRTSVFDRIKP TTR SVFQR S+A  EEENQC     TR S  +RLS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HS VPS
Subjt:  EQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPS

Query:  RMKRKFSVLITTEG
        RMKRK  V I TEG
Subjt:  RMKRKFSVLITTEG

KAA0061611.1 retrotransposon gag protein [Cucumis melo var. makuwa] (e value: 1.9e-64; % identity: 45.1)
Query:  MVVNTTLPKSSSK-------EKRQTNGTHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK        K   +     TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLPKSSSK-------EKRQTNGTHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----
        ILKLA+E KIELD+DEVAQ+N                                       + TI   NK    KD               P  +Q     
Subjt:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----

Query:  ----------HKRK--RSKKFSQPQQL-------------VMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK
                  HK+K  R+KK   P+ +             + L +   ++F     +E L  + C     ++V+       EV+NS +  QRTSVFDRIK
Subjt:  ----------HKRK--RSKKFSQPQQL-------------VMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT
        P TTR SVFQR SM   EEENQC     TR S F+RLS+STSKK +PSTSVFDRLK+T DQ +R+M +L+ K F E N D K+HS VPSRMKRK SV I 
Subjt:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT

Query:  TEGYVGSL
        TEG V  L
Subjt:  TEGYVGSL

TYK04576.1 retrotransposon gag protein [Cucumis melo var. makuwa] (e value: 2.4e-64; % identity: 45.34)
Query:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----
        ILKLA+E KIELD+DEVAQ+N                                       + TI   NK    KD               P  +Q     
Subjt:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----

Query:  ------------HKRKRSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK
                     K +R+KK             F Q ++ + L +   ++F     +E L  + C     ++V+       EV+NS +  QRTSVFDRIK
Subjt:  ------------HKRKRSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT
        P TTR SVFQR SMA  EEENQC     TR S F+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N D K+HS VPSRMKRK SV I 
Subjt:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT

Query:  TEGYVGSL
        TEG V  L
Subjt:  TEGYVGSL

TrEMBL top hits (e value; % identity; alignment)
A0A5A7SRE2 Ty3-gypsy retrotransposon protein (e value: 1.2e-64; % identity: 49.71)
Query:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQHKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL
        I KLA+E KIELD+DEVAQ+N   +   +          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++  E  
Subjt:  ILKLAKEGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQHKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKKKENL

Query:  A--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK
        A  T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVFQR SMA  +EENQC   T  + SAF+RLS+S SKK +PST  FDRLK+T+DQ +
Subjt:  A--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK

Query:  RKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLITTEG
        R+M  L+ K F E N D K+HS VPSRMKRK SV I TEG
Subjt:  RKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLITTEG

A0A5A7TGM1 Retrotransposon gag protein (e value: 1.3e-66; % identity: 44.67)
Query:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        M+V  T  KS SK K + +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKGNNKH
        ILKL +E KIELD+DEVAQ+N                                                                      ++++ N+  
Subjt:  ILKLAKEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKGNNKH

Query:  QRKKDPKKLQHKRK--RSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------IDVEEVDNSKKSEQRTSVFDRIK
         +K     + HK+K  R+KK             F QP+Q + L + F ++F     KE L  + C               EEVDNS + +QRTSVFDRIK
Subjt:  QRKKDPKKLQHKRK--RSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC------------IDVEEVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT
        P TTR SVFQR S+   EEENQC  ST TR SAF+ LS+STSKK +PSTS FDRLK+ +DQ +R+M +L+VK F E N D K+HS VPSRMKRK SV I 
Subjt:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT

Query:  TEG
        TEG
Subjt:  TEG

A0A5A7URH1 Ty3-gypsy retrotransposon protein (e value: 4.8e-66; % identity: 54.14)
Query:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQHKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCIDVEEVDNSKKS
        ILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R         RS     P++++ +    + +   +   N  +S     +EV+NS + 
Subjt:  ILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQHKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCIDVEEVDNSKKS

Query:  EQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPS
         QRTSVFDRIKP TTR SVFQR S+A  EEENQC     TR S  +RLS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HS VPS
Subjt:  EQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPS

Query:  RMKRKFSVLITTEG
        RMKRK  V I TEG
Subjt:  RMKRKFSVLITTEG

A0A5A7V7A0 Retrotransposon gag protein (e value: 9.0e-65; % identity: 45.1)
Query:  MVVNTTLPKSSSK-------EKRQTNGTHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK        K   +     TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLPKSSSK-------EKRQTNGTHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----
        ILKLA+E KIELD+DEVAQ+N                                       + TI   NK    KD               P  +Q     
Subjt:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----

Query:  ----------HKRK--RSKKFSQPQQL-------------VMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK
                  HK+K  R+KK   P+ +             + L +   ++F     +E L  + C     ++V+       EV+NS +  QRTSVFDRIK
Subjt:  ----------HKRK--RSKKFSQPQQL-------------VMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT
        P TTR SVFQR SM   EEENQC     TR S F+RLS+STSKK +PSTSVFDRLK+T DQ +R+M +L+ K F E N D K+HS VPSRMKRK SV I 
Subjt:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT

Query:  TEGYVGSL
        TEG V  L
Subjt:  TEGYVGSL

A0A5D3C2C8 Retrotransposon gag protein (e value: 1.2e-64; % identity: 45.34)
Query:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL
        MVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+L
Subjt:  MVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDL

Query:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----
        ILKLA+E KIELD+DEVAQ+N                                       + TI   NK    KD               P  +Q     
Subjt:  ILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----

Query:  ------------HKRKRSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK
                     K +R+KK             F Q ++ + L +   ++F     +E L  + C     ++V+       EV+NS +  QRTSVFDRIK
Subjt:  ------------HKRKRSKK-------------FSQPQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EVDNSKKSEQRTSVFDRIK

Query:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT
        P TTR SVFQR SMA  EEENQC     TR S F+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N D K+HS VPSRMKRK SV I 
Subjt:  PPTTRPSVFQRTSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLIT

Query:  TEGYVGSL
        TEG V  L
Subjt:  TEGYVGSL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACGAATGGAACACATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCC
TGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATT
GCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTA
GCCCAATCAAATCTTGCTACAATCAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACACAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCA
ACAACTGGTGATGTTGAATAAATCCTTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGA
AAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAACGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCG
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCA
ACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACCGCGACAAGAAGCTTCATAGTAGCGTCCCGTCACGTATGAAGAGGAAATTCTCTGTTC
TCATAACTACAGAAGGGTACGTAGGCAGCTTAAAGAAAACTTTAAGTTCAGTCTCTGCAAACAAAAAAAAGGAAAGTGCAACAAATATTGAAGCACGACGATATAAATTG
AGGAAATTCATCACTAGTGGGGGCAACACAGCGAATGGAAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTT
CTCACGCGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTTC
TTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCT
CACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
TCAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCG
CTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCTCAAAGTTTGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCG
CTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTTGCTGCGATCCTTCCTCCAAGTTCGATGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTC
GAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCA
GTTCCTTCCTCCAAGTTCAAAGGTTATCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTC
CTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCAC
GCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGGAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCT
CCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCTAAATTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGC
ACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCGTGCTGA
AAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCTGCGACACAAGTCCAAGGAACATGCCCCA
ACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGT
GGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAG
GAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACGAGTCGAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGTG
GCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACA
TCCTTGCACTCGTGCTGA
mRNA sequence
ATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACGAATGGAACACATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCC
TGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATT
GCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTA
GCCCAATCAAATCTTGCTACAATCAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACACAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCA
ACAACTGGTGATGTTGAATAAATCCTTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGA
AAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAACGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCG
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCA
ACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACCGCGACAAGAAGCTTCATAGTAGCGTCCCGTCACGTATGAAGAGGAAATTCTCTGTTC
TCATAACTACAGAAGGGTACGTAGGCAGCTTAAAGAAAACTTTAAGTTCAGTCTCTGCAAACAAAAAAAAGGAAAGTGCAACAAATATTGAAGCACGACGATATAAATTG
AGGAAATTCATCACTAGTGGGGGCAACACAGCGAATGGAAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTT
CTCACGCGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTTC
TTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCT
CACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
TCAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCG
CTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCTCAAAGTTTGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCG
CTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTTGCTGCGATCCTTCCTCCAAGTTCGATGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTC
GAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCA
GTTCCTTCCTCCAAGTTCAAAGGTTATCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTC
CTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCAC
GCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGGAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCT
CCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCTAAATTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGC
ACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCGTGCTGA
AAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCTGCGACACAAGTCCAAGGAACATGCCCCA
ACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGT
GGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAG
GAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACGAGTCGAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGTG
GCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACA
TCCTTGCACTCGTGCTGA
Protein sequence
MVVNTTLPKSSSKEKRQTNGTHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEV
AQSNLATIKGNNKHQRKKDPKKLQHKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRTSMAATEEENQCS
MSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKFSVLITTEGYVGSLKKTLSSVSANKKKESATNIEARRYKL
RKFITSGGNTANGSSFPPSSKVLTSLRCSSFLQVRRFSRVSLQFFPHSSKVLTRFTAVPSPQVRRFSRRFAEFLPPSLKVLTSLRCVPSSKFEGSHTLRSAIPSPKFEGS
HALRAVPSSKFEGSHAFRCISFPQIRRFSRASLQFLPQSSKVLTRCAAVPSSKFKGSHALRCSSFLQVRRFSRASLQFLPPSSKVLTRFAAVPFSKFEGSHVASLRSCAS
LQFLPPSLKVLTSLCCDPSSKFDGSHALRSAIPSPSSKVLTRFVQFLPPNSKVLTRFAAVPSPKFEGSHALRCSSFLQVQRLSRRFAALMRFAAILPPSSKVLTRFALQF
LPQVRRFSCASCSSFLQIRRFSRASLHFLPPNSKVLTRFAAVPSSKFEGSHALRSGIPSPKFEGSHALRAVPSSKFEGSHALRCSSFPLNSKVLTRFTAVPSPQVRRFSR
TSLQFLPPSSKVLTRFAALQRYFLKSKDVNCPCTRAERRGGDTSPRNMSQLKEHVLALVLKGVAATQVQGTCPNSRNMSVHSCWKARRRHKSKEHVPTQGTCPCTRAERR
GGDTSPRNMSQLKEHDRALVLKGVAATQVQGTCPNSRNMSVHSCWKARRRHESKEHVPTQGTCPCTRAERRGGGTSPRNMSQLKEHVLALVLKGVAATQVQGTCPNSRNT
SLHSC