; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025591 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025591
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr10:15772102..15776101
RNA-Seq ExpressionLag0025591
SyntenyLag0025591
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035697.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.8e-4351.54Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKKDPKKL--------------QPKRKKVKSFL
        KVDD  YCKYH VI HPVE+CFVLK+LILKLA+E KI+LD+DE+AQ+N   ++  S          QR+K+  ++                  K+V +  
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKKDPKKL--------------QPKRKKVKSFL

Query:  NLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRI
         +   TSVF+RIKP TTR S+FQR+SMA  +EENQ    TST+ SAF+RL +S SKK R STS FDRLK+TNDQ +++M  L+ K F E N+D K+HS +
Subjt:  NLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRI

Query:  LSRMKRKFSVLINTEGASLPSLPSLLL
        LSRMKRK SV INTEG SL   P L++
Subjt:  LSRMKRKFSVLINTEGASLPSLPSLLL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]9.9e-4138.74Show/hide
Query:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----------------------------------------------
        +EKVDDP YCKYHR+I HPVE+CFVLK+LILKLA+E KI+LD+DEVAQ+N   +                                              
Subjt:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----------------------------------------------

Query:  ----------------------------------KGKSKHQRK---------------KDPKKLQPKRK-------------------------KVKSFL
                                          KG   H++K               KD K LQP+R                             S +
Subjt:  ----------------------------------KGKSKHQRK---------------KDPKKLQPKRK-------------------------KVKSFL

Query:  NLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLEL
         +DN                RTSVFDRIKP TTR SVFQR+SMA  +EENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ 
Subjt:  NLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLEL

Query:  KLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA
        K F E N D K++SR+ SR+KRK S+ INTEG+
Subjt:  KLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.8e-4350.64Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKS-----------------KHQRKKDPKKL----------
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A       IKGK                  +   + DP+++          
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKS-----------------KHQRKKDPKKL----------

Query:  -------QPKRKKVKSFLNLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL
                   K+V +   ++ RTSVFDRIKP TTR SVFQR+S+A  +EENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + 
Subjt:  -------QPKRKKVKSFLNLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL

Query:  ELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA
        + K F E N D K+HS + SRMKRK  V INTEG+
Subjt:  ELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.5e-4453.15Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRK---KDPKKLQPKRKKV--------KSFLNLDN-------
        KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   S         D   L+   +++         S + +DN       
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRK---KDPKKLQPKRKKV--------KSFLNLDN-------

Query:  ---------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK
                 RT VFDRIKP TTR SVFQR+SMA  +EE QC  ST TR S F+RLS+STSKK R STS FDRLK+TNDQ +++M +L+ K F E N D K
Subjt:  ---------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK

Query:  LHSRILSRMKRKFSVLINTEGA
        +HSR+ SR KRK SV INTEG+
Subjt:  LHSRILSRMKRKFSVLINTEGA

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.5e-4150.85Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK-------------------------KVK
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KI+LD+DE        IKG       KD   LQP+R                             
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK-------------------------KVK

Query:  SFLNLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNN
        S + +DN                RTSVFDRIKP TTR  VFQR+SMA  +EENQC  ST TR SAF+RLS+STSKK R STS FDRLK+ NDQ +R+M +
Subjt:  SFLNLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA
        L+ K F E N D K+HS I S   RK SV IN EG+
Subjt:  LELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA

TrEMBL top hitse value%identityAlignment
A0A5A7U974 Retrotransposon gag protein4.8e-4138.74Show/hide
Query:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----------------------------------------------
        +EKVDDP YCKYHR+I HPVE+CFVLK+LILKLA+E KI+LD+DEVAQ+N   +                                              
Subjt:  MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----------------------------------------------

Query:  ----------------------------------KGKSKHQRK---------------KDPKKLQPKRK-------------------------KVKSFL
                                          KG   H++K               KD K LQP+R                             S +
Subjt:  ----------------------------------KGKSKHQRK---------------KDPKKLQPKRK-------------------------KVKSFL

Query:  NLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLEL
         +DN                RTSVFDRIKP TTR SVFQR+SMA  +EENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ 
Subjt:  NLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLEL

Query:  KLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA
        K F E N D K++SR+ SR+KRK S+ INTEG+
Subjt:  KLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA

A0A5A7URH1 Ty3-gypsy retrotransposon protein2.3e-4350.64Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKS-----------------KHQRKKDPKKL----------
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A       IKGK                  +   + DP+++          
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA------TIKGKS-----------------KHQRKKDPKKL----------

Query:  -------QPKRKKVKSFLNLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL
                   K+V +   ++ RTSVFDRIKP TTR SVFQR+S+A  +EENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + 
Subjt:  -------QPKRKKVKSFLNLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL

Query:  ELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA
        + K F E N D K+HS + SRMKRK  V INTEG+
Subjt:  ELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA

A0A5A7VFA5 Ty3-gypsy retrotransposon protein2.2e-4150.85Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK-------------------------KVK
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KI+LD+DE        IKG       KD   LQP+R                             
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK-------------------------KVK

Query:  SFLNLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNN
        S + +DN                RTSVFDRIKP TTR  VFQR+SMA  +EENQC  ST TR SAF+RLS+STSKK R STS FDRLK+ NDQ +R+M +
Subjt:  SFLNLDN----------------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA
        L+ K F E N D K+HS I S   RK SV IN EG+
Subjt:  LELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGA

A0A5D3CA53 Retrotransposon gag protein1.2e-4453.15Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRK---KDPKKLQPKRKKV--------KSFLNLDN-------
        KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   S         D   L+   +++         S + +DN       
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRK---KDPKKLQPKRKKV--------KSFLNLDN-------

Query:  ---------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK
                 RT VFDRIKP TTR SVFQR+SMA  +EE QC  ST TR S F+RLS+STSKK R STS FDRLK+TNDQ +++M +L+ K F E N D K
Subjt:  ---------RTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK

Query:  LHSRILSRMKRKFSVLINTEGA
        +HSR+ SR KRK SV INTEG+
Subjt:  LHSRILSRMKRKFSVLINTEGA

A0A5D3E4T1 Retrotransposon gag protein1.4e-4351.54Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKKDPKKL--------------QPKRKKVKSFL
        KVDD  YCKYH VI HPVE+CFVLK+LILKLA+E KI+LD+DE+AQ+N   ++  S          QR+K+  ++                  K+V +  
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKKDPKKL--------------QPKRKKVKSFL

Query:  NLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRI
         +   TSVF+RIKP TTR S+FQR+SMA  +EENQ    TST+ SAF+RL +S SKK R STS FDRLK+TNDQ +++M  L+ K F E N+D K+HS +
Subjt:  NLDNRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRI

Query:  LSRMKRKFSVLINTEGASLPSLPSLLL
        LSRMKRK SV INTEG SL   P L++
Subjt:  LSRMKRKFSVLINTEGASLPSLPSLLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAG
GAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAGCATCAAAGAAAGAAGGATCCTAAGAAACTTCAA
CCCAAGAGGAAGAAAGTAAAAAGTTTTCTCAACCTCGACAACCGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATG
AGTATGGCCGCGACAAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCT
TCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAG
CTTCATAGTAGAATCCTGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGCGCTTCTCTCCCTTCTCTTCCCTCGCTCCTTCTCCAAGTTCGA
AGGCGCTTCTCTCCCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTTCGTTGCTACCTTCCTCCAAGTTCGAATCCTTCCTCCAAGTTCGAAGG
TTCACACGCGCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGT
TCCTTCCTACAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTTCAAGTTCGAAGGTTCTCACGTGCATCGCCACAATTCCTTCCTTCAAGTTCGAA
GGTTCTCACGCGCATCGCCACAATTCCTTCCTCCAAGTTCCAAGGTTCTCACGCGCATTGCCACAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCATCGCC
ACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGTTCG
AAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGGTTCTCACGC
GCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCA
CAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTT
CGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAG
TTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCC
ACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCTACAGTTCCT
TCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGA
AGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTTCGCCACAATTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTC
GAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCAAAGGCGCTTCTCTTCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTTCGTTGCTACCTTCCTCCAA
GTTCGAATCCTTCCTCCAAGTTCGAAGGTTCACACGCGCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTT
CGAAGGTTCACACGACGCGTGCCACAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGT
TCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCACGTGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGG
TTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGT
TCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAA
GGTTCTCACGCGCATCACCACAGTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCTCAAGTTCTCACGC
GCATCGCCACAGTTCCTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACA
GTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCGCAT
CGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCG
CATCGCCATCGCCACAGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTT
CCCCTTCCTCCAAGTTCGAAGGTTTCTCCTAGGCTTCAAAGGCTCTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAG
GAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAGCATCAAAGAAAGAAGGATCCTAAGAAACTTCAA
CCCAAGAGGAAGAAAGTAAAAAGTTTTCTCAACCTCGACAACCGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATG
AGTATGGCCGCGACAAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCT
TCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAG
CTTCATAGTAGAATCCTGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGCGCTTCTCTCCCTTCTCTTCCCTCGCTCCTTCTCCAAGTTCGA
AGGCGCTTCTCTCCCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTTCGTTGCTACCTTCCTCCAAGTTCGAATCCTTCCTCCAAGTTCGAAGG
TTCACACGCGCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGT
TCCTTCCTACAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTTCAAGTTCGAAGGTTCTCACGTGCATCGCCACAATTCCTTCCTTCAAGTTCGAA
GGTTCTCACGCGCATCGCCACAATTCCTTCCTCCAAGTTCCAAGGTTCTCACGCGCATTGCCACAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCATCGCC
ACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGTTCG
AAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGGTTCTCACGC
GCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCA
CAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTT
CGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAG
TTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCC
ACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCTACAGTTCCT
TCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGA
AGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTTCGCCACAATTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTC
GAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCAAAGGCGCTTCTCTTCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTTCGTTGCTACCTTCCTCCAA
GTTCGAATCCTTCCTCCAAGTTCGAAGGTTCACACGCGCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTT
CGAAGGTTCACACGACGCGTGCCACAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGT
TCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCACGTGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGGTTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGG
TTCTCACGCGCTTCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGT
TCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAA
GGTTCTCACGCGCATCACCACAGTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCTCAAGTTCTCACGC
GCATCGCCACAGTTCCTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACA
GTTCCTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCGCAT
CGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCG
CATCGCCATCGCCACAGTTCCTCCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTT
CCCCTTCCTCCAAGTTCGAAGGTTTCTCCTAGGCTTCAAAGGCTCTCATGA
Protein sequenceShow/hide protein sequence
MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKKVKSFLNLDNRTSVFDRIKPPTTRPSVFQRM
SMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRILSRMKRKFSVLINTEGASLPSLPSLLLQVR
RRFSPFSPLLLLQVRRRFSSLLPSSKFESFLQVRRFTRAPQFLPPSSKVHTRIATVPSSKFEGSHAHRHSSFLQVRRFTRASPQFLPSSSKVLTCIATIPSFKFE
GSHAHRHNSFLQVPRFSRALPQFLPPSSKVLTRIATVPSSKFESFPPSSKVLTRIATVPSSKFESFPPSSKVLTRFATVPSSKFEGSHVLRFATVPSSKFEGFSR
ASLRHSSFLQVRRFSRASLRHSSFLQVRRFSRASPQFLPSSKFEDRHSSFLPPSSKVLTRIATVPSFLQVRRFSRASPQFLPPSSKVLTRIATVPSFLQVRRSPQ
FLPSSKFEGSHAHRHSSFLQVRILPPSSKVLTRFATVPSSKFEGSHALRFATVPSSKFEGSHALRFATVPSSKFEGSHALRFATVPSSKFEGSHAHRHSSFLQVR
RFSRASPQFLPSSKFEDRHSSFLPPSSKIATVPSFLQVRRFSRASPQFLPPSSNPSSKFEGSHALRHSSFLQVRRFSRVSLRHNSFLQVRRFSRASPQFLPSSKF
EDRHSSFLPPSSKALLFVATSPSSKALLFVATFLQVRILPPSSKVHTRATVPSSKFEGSHAHRHSSFLQVRRFTRRVPQFLPPSSKVLTRIATVPSSKFESFPPS
SKVLTRIATVPSSKFESFPPSSKVLTRFATVPSSKFEGSHVLRFATVPSSKFEGFSRASLRHSSFLQVRRFSRASLRHSSFLQVRRFSRASPQFLPSSKFEDRHS
SFLPPSSKVLTRIATVPSFLQVRRSPQFLPSSKFEGSHAHHHSSFLPPSSKVLTRIATVPSFLQVLKFSRASPQFLPSSKFEGSHAHRHSSFLPPSSKVLTRIAT
VPSFLQVRRFSRASPQFLPPSSKVPPSKFEGSHAHRHSSFLQVRRFLPPSSKVLTRIATVPPSKFEGSHAHRHRHSSSLQVRRFSRASPQFLPPSSKVLTRIATV
PLPPSSKVSPRLQRLS