CuGenDBv2

CSPI02G06120 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G06120
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Transposon Tf2-1 polyprotein isoform X1
Genome location: Chr2:4655913..4663693
RNA-Seq Expression: CSPI02G06120
Synteny: CSPI02G06120
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR000953 - Chromo/chromo shadow domain
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR016197 - Chromo-like domain superfamily
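
As a small illustration of how the genome location field can be read, the sketch below parses a `Chr:start..end` string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("Chr2:4655913..4663693")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 7781 bp, which is longer than the CDS listed further down.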


Homology
GenBank top hits (e value, % identity, alignment)
KAA0042929.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]; e value 3.3e-43; identity 46.67%
Query:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNW-------------------YRA
        MK K+ ELI+E   NE++ +EKN DQ+KFKK+EMPVFN ED D+WLFR DRYFQIH+LT+ +K+IVATISFEG ALNW                   YR 
Subjt:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNW-------------------YRA

Query:  QEER--------------EKFTD----WLN--------------------LKERLLVRFRSML-----GEHQIVQSEMMPYLTKNHEWQATLEEMYGYLK
        Q ++              E F +    W+                      K R ++R  + L     GEHQ  +S   PYL +NHEW+A  EE+YG +K
Subjt:  QEER--------------EKFTD----WLN--------------------LKERLLVRFRSML-----GEHQIVQSEMMPYLTKNHEWQATLEEMYGYLK

Query:  NKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFPEFHLEDK
        NK  GWDVLIKW+GL RNEATWE Y EIQ  FP+ HLEDK
Subjt:  NKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFPEFHLEDK

KAA0050168.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]; e value 1.8e-41; identity 42.8%
Query:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLV
        MK ++ ELI+E   NE++ +EKN+DQ+KFKKVEMPVFN ED D WLFR DRYFQIHKLT+ +K+ +ATISF+G  LNWYRAQEER+KF +WL+LKERLL+
Subjt:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLV

Query:  RFRSML------------------------------------------------------------------------------------------GEHQ
        RFRS+                                                                                           GEHQ
Subjt:  RFRSML------------------------------------------------------------------------------------------GEHQ

Query:  IVQSEMMPYLTKNHEWQATLEEMYGYLKNKEGGWDVLIKWQGL
          +S  MPYLT+NH+W+A  EE+YGY+KNK  GWDVLIKW+GL
Subjt:  IVQSEMMPYLTKNHEWQATLEEMYGYLKNKEGGWDVLIKWQGL

TYJ98817.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]; e value 6.6e-36; identity 75.24%
Query:  KMKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLL
        K K KV ELI+E   NE+  +EKN+D++KFKKVEMPVFN ED D WLFRADRYFQIH+LT+S+K+ VATISFEG ALNWYRAQEER+KF DW NLKERLL
Subjt:  KMKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLL

Query:  VRFRS
        VRFRS
Subjt:  VRFRS

TYJ99899.1 putative UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X3 [Cucumis melo var. makuwa]; e value 1.2e-42; identity 46.06%
Query:  MAKKVEERFEVVEQEIGSIREELH---------------------------------------------------KMKAKVIELIEEPKSNEREDDEKNH
        +A+KVEERFE VEQEI +I+ EL                                                    KMK KV ELI+EPK +E+ED EK+ 
Subjt:  MAKKVEERFEVVEQEIGSIREELH---------------------------------------------------KMKAKVIELIEEPKSNEREDDEKNH

Query:  DQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSMLGEHQIVQSEMMPYLTKNH
        D NKFKK+EMP+FN  + D WLFRAD YFQIHKLT+ +K IVATISFEG  LNWYRAQEEREKFT+W NLKE+LL   + M  E   +Q  M        
Subjt:  DQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSMLGEHQIVQSEMMPYLTKNH

Query:  EWQATLEEMYGYLKNKEGGWDVLIKWQGLPRNEATWEKYE-IQQLFPEFHLEDK
                               IKW+GLPRNEATWE YE IQ+ FPEF+ EDK
Subjt:  EWQATLEEMYGYLKNKEGGWDVLIKWQGLPRNEATWEKYE-IQQLFPEFHLEDK

TYK10830.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]; e value 5.6e-51; identity 40.24%
Query:  MAKKVEERFEVVEQEIGSIREELH----------------------------------------------------KMKAKVIELIEEPKSNEREDDEKN
        MAKK EERFEV+EQEI +I+ EL                                                     KMK K+ ELI+E   NE++ +EKN
Subjt:  MAKKVEERFEVVEQEIGSIREELH----------------------------------------------------KMKAKVIELIEEPKSNEREDDEKN

Query:  HDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSML-----------------
        +D++KFKK+EMPVFN ED D WLFRADRYFQIH+LT+ +K+IVATISFEG ALNWYR QEER+ F +WL+LKERLL+RFRS+                  
Subjt:  HDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSML-----------------

Query:  -------------------------------------------------------------------------GEHQIVQSEMMPYLTKNHEWQATLEEM
                                                                                 GEHQ  +S   PYL +NHEW+A  EE+
Subjt:  -------------------------------------------------------------------------GEHQIVQSEMMPYLTKNHEWQATLEEM

Query:  YGYLKNKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFP
        YGY+KNK  GWDVLIKW+GL RNEATWE Y EIQQ FP
Subjt:  YGYLKNKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U4I6 Transposon Tf2-1 polyprotein isoform X1; e value 8.7e-42; identity 42.8%
Query:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLV
        MK ++ ELI+E   NE++ +EKN+DQ+KFKKVEMPVFN ED D WLFR DRYFQIHKLT+ +K+ +ATISF+G  LNWYRAQEER+KF +WL+LKERLL+
Subjt:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLV

Query:  RFRSML------------------------------------------------------------------------------------------GEHQ
        RFRS+                                                                                           GEHQ
Subjt:  RFRSML------------------------------------------------------------------------------------------GEHQ

Query:  IVQSEMMPYLTKNHEWQATLEEMYGYLKNKEGGWDVLIKWQGL
          +S  MPYLT+NH+W+A  EE+YGY+KNK  GWDVLIKW+GL
Subjt:  IVQSEMMPYLTKNHEWQATLEEMYGYLKNKEGGWDVLIKWQGL

A0A5D3BI70 Transposon Tf2-1 polyprotein isoform X1; e value 3.2e-36; identity 75.24%
Query:  KMKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLL
        K K KV ELI+E   NE+  +EKN+D++KFKKVEMPVFN ED D WLFRADRYFQIH+LT+S+K+ VATISFEG ALNWYRAQEER+KF DW NLKERLL
Subjt:  KMKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLL

Query:  VRFRS
        VRFRS
Subjt:  VRFRS

A0A5D3BLM9 Putative UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X3; e value 6.0e-43; identity 46.06%
Query:  MAKKVEERFEVVEQEIGSIREELH---------------------------------------------------KMKAKVIELIEEPKSNEREDDEKNH
        +A+KVEERFE VEQEI +I+ EL                                                    KMK KV ELI+EPK +E+ED EK+ 
Subjt:  MAKKVEERFEVVEQEIGSIREELH---------------------------------------------------KMKAKVIELIEEPKSNEREDDEKNH

Query:  DQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSMLGEHQIVQSEMMPYLTKNH
        D NKFKK+EMP+FN  + D WLFRAD YFQIHKLT+ +K IVATISFEG  LNWYRAQEEREKFT+W NLKE+LL   + M  E   +Q  M        
Subjt:  DQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSMLGEHQIVQSEMMPYLTKNH

Query:  EWQATLEEMYGYLKNKEGGWDVLIKWQGLPRNEATWEKYE-IQQLFPEFHLEDK
                               IKW+GLPRNEATWE YE IQ+ FPEF+ EDK
Subjt:  EWQATLEEMYGYLKNKEGGWDVLIKWQGLPRNEATWEKYE-IQQLFPEFHLEDK

A0A5D3C2A4 Transposon Tf2-1 polyprotein isoform X1; e value 1.6e-43; identity 46.67%
Query:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNW-------------------YRA
        MK K+ ELI+E   NE++ +EKN DQ+KFKK+EMPVFN ED D+WLFR DRYFQIH+LT+ +K+IVATISFEG ALNW                   YR 
Subjt:  MKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNW-------------------YRA

Query:  QEER--------------EKFTD----WLN--------------------LKERLLVRFRSML-----GEHQIVQSEMMPYLTKNHEWQATLEEMYGYLK
        Q ++              E F +    W+                      K R ++R  + L     GEHQ  +S   PYL +NHEW+A  EE+YG +K
Subjt:  QEER--------------EKFTD----WLN--------------------LKERLLVRFRSML-----GEHQIVQSEMMPYLTKNHEWQATLEEMYGYLK

Query:  NKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFPEFHLEDK
        NK  GWDVLIKW+GL RNEATWE Y EIQ  FP+ HLEDK
Subjt:  NKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFPEFHLEDK

A0A5D3CHI3 Transposon Tf2-1 polyprotein isoform X1; e value 2.7e-51; identity 40.24%
Query:  MAKKVEERFEVVEQEIGSIREELH----------------------------------------------------KMKAKVIELIEEPKSNEREDDEKN
        MAKK EERFEV+EQEI +I+ EL                                                     KMK K+ ELI+E   NE++ +EKN
Subjt:  MAKKVEERFEVVEQEIGSIREELH----------------------------------------------------KMKAKVIELIEEPKSNEREDDEKN

Query:  HDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSML-----------------
        +D++KFKK+EMPVFN ED D WLFRADRYFQIH+LT+ +K+IVATISFEG ALNWYR QEER+ F +WL+LKERLL+RFRS+                  
Subjt:  HDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRFRSML-----------------

Query:  -------------------------------------------------------------------------GEHQIVQSEMMPYLTKNHEWQATLEEM
                                                                                 GEHQ  +S   PYL +NHEW+A  EE+
Subjt:  -------------------------------------------------------------------------GEHQIVQSEMMPYLTKNHEWQATLEEM

Query:  YGYLKNKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFP
        YGY+KNK  GWDVLIKW+GL RNEATWE Y EIQQ FP
Subjt:  YGYLKNKEGGWDVLIKWQGLPRNEATWEKY-EIQQLFP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value 5.1e-07; identity 44.83%
Query:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV
        V  +WVF +KY       R+K RLVA+GF Q Y I   ETF+PVA++++ R +L +V+
Subjt:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 4.5e-11; identity 33.11%
Query:  EFQRGHKTVGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV-LDCEDGAKGSATMRLNRDNDDIVVAALEQKRRD
        E  +G + + CKWVFKLK   D  + R+K RLV KGF Q  GI   E FSPV K+ +IR +L +   LD E       T  L+ D ++ +     +    
Subjt:  EFQRGHKTVGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV-LDCEDGAKGSATMRLNRDNDDIVVAALEQKRRD

Query:  VGIAAVWQYCLCKRNIHLY------------------ASGYLRTRSDP
         G     ++ +CK N  LY                  +  YL+T SDP
Subjt:  VGIAAVWQYCLCKRNIHLY------------------ASGYLRTRSDP

P92520 Uncharacterized mitochondrial protein AtMg00820; e value 7.6e-11; identity 58.93%
Query:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFV
        +GCKWVFK K  +D T+DR K RLVAKGF Q  GIY  ET+SPV +   IR +L V
Subjt:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 1.3e-10; identity 51.72%
Query:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV
        VGC+W+F  KY +D +++R+K RLVAKG+ Q  G+  +ETFSPV K  +IR++L V V
Subjt:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 3.8e-10; identity 50%
Query:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV
        VGC+W+F  K+ +D +++R+K RLVAKG+ Q  G+  +ETFSPV K  +IR++L V V
Subjt:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVV

Arabidopsis top hits (e value, % identity, alignment)
AT1G67020.1 unknown protein; e value 3.0e-10; identity 35.62%
Query:  KKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRF
        +++EMPVF+   +  W  + +R+F++ +  +SDKL +  +S EG AL W+  +    +F DW + ++RLL RF
Subjt:  KKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEEREKFTDWLNLKERLLVRF

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value 4.4e-14; identity 55.17%
Query:  KTVGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFV
        K +GCKWV+K+KY +D TI+R+K RLVAKG+ Q  GI   ETFSPV KL +++++L +
Subjt:  KTVGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value 5.4e-12; identity 58.93%
Query:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFV
        +GCKWVFK K  +D T+DR K RLVAKGF Q  GIY  ET+SPV +   IR +L V
Subjt:  VGCKWVFKLKYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFV


Sequences
CDS sequence
ATGGCGAAGAAAGTAGAGGAAAGATTCGAAGTTGTTGAGCAGGAGATCGGAAGCATCAGGGAAGAGTTGCACAAGATGAAAGCGAAGGTAATAGAACTGATCGAAGAACC
GAAATCAAATGAAAGGGAAGACGATGAAAAGAATCATGATCAAAATAAATTCAAGAAGGTTGAAATGCCAGTATTCAACAACGAGGATCTGGATTTGTGGCTTTTTCGCG
CAGATAGATATTTTCAAATTCATAAGTTGACCAATTCCGATAAGTTGATTGTTGCGACAATTAGTTTCGAAGGATCAGCATTGAATTGGTATAGAGCACAAGAAGAACGT
GAGAAATTTACGGATTGGCTGAATCTCAAAGAAAGATTACTAGTACGTTTTCGGTCGATGTTGGGTGAGCATCAGATAGTGCAGTCGGAAATGATGCCTTACTTGACTAA
GAATCATGAATGGCAGGCTACACTGGAGGAAATGTATGGGTATTTGAAAAATAAAGAGGGAGGTTGGGATGTGCTCATCAAGTGGCAGGGATTACCAAGAAACGAAGCTA
CGTGGGAAAAATACGAGATTCAACAATTGTTTCCAGAATTTCACCTCGAGGACAAGGGAGAGTTCCAACGGGGTCACAAGACTGTTGGATGCAAATGGGTGTTTAAACTC
AAGTACAAAGCAGATAGTACAATTGACAGACATAAAGTTAGGCTAGTTGCAAAAGGGTTCGCTCAAACCTATGGGATTTACAATTCTGAGACTTTTTCTCCTGTTGCAAA
GTTGAACAATATTAGAGTTTTGTTGTTTGTTGTAGTGTTAGATTGCGAGGACGGTGCCAAAGGTTCAGCGACAATGCGATTGAACAGAGACAACGACGACATCGTCGTCG
CAGCGCTTGAGCAAAAGAGACGGGACGTGGGTATCGCAGCCGTGTGGCAGTACTGCTTGTGCAAAAGAAACATTCATTTATATGCGTCCGGTTATTTACGAACGAGATCC
GACCCACCCAACCCAATTTTGATTCGGATCAGCCGTGTGGCAGTACTGCTTGTGCAAAAGAAACATTCATTTATATGCGTCCGGTTATTTACGAACGAGATCCGACCCAC
CCAACCCAATTTTGATTCGGATCAGGGAATTGTGCTTCTTCTTCTTTTTGAGATGAGAGAGGCTAAAGAGAAAGGGGTTGTAACCGAGAAAAAAATGGAAGATTCTTCTT
CTGATGAGAGGTGGAGAGGGGCTCTGACGGAAAAAAAGAAAAAGAAGAAAAAGGGAGAACCAAATATTCCATATTTGATTCTGCATTCCACCGCCCTTACCACCCTCCGA
GTGGTAGTTTCTTGTGTTTGGAAGTACTTACTATATGATACATTCCAGAATGTGATGATAACAGAAGACGAAAGAGATATTGGAATGGAAGATTAG
mRNA sequence
ATGGCGAAGAAAGTAGAGGAAAGATTCGAAGTTGTTGAGCAGGAGATCGGAAGCATCAGGGAAGAGTTGCACAAGATGAAAGCGAAGGTAATAGAACTGATCGAAGAACC
GAAATCAAATGAAAGGGAAGACGATGAAAAGAATCATGATCAAAATAAATTCAAGAAGGTTGAAATGCCAGTATTCAACAACGAGGATCTGGATTTGTGGCTTTTTCGCG
CAGATAGATATTTTCAAATTCATAAGTTGACCAATTCCGATAAGTTGATTGTTGCGACAATTAGTTTCGAAGGATCAGCATTGAATTGGTATAGAGCACAAGAAGAACGT
GAGAAATTTACGGATTGGCTGAATCTCAAAGAAAGATTACTAGTACGTTTTCGGTCGATGTTGGGTGAGCATCAGATAGTGCAGTCGGAAATGATGCCTTACTTGACTAA
GAATCATGAATGGCAGGCTACACTGGAGGAAATGTATGGGTATTTGAAAAATAAAGAGGGAGGTTGGGATGTGCTCATCAAGTGGCAGGGATTACCAAGAAACGAAGCTA
CGTGGGAAAAATACGAGATTCAACAATTGTTTCCAGAATTTCACCTCGAGGACAAGGGAGAGTTCCAACGGGGTCACAAGACTGTTGGATGCAAATGGGTGTTTAAACTC
AAGTACAAAGCAGATAGTACAATTGACAGACATAAAGTTAGGCTAGTTGCAAAAGGGTTCGCTCAAACCTATGGGATTTACAATTCTGAGACTTTTTCTCCTGTTGCAAA
GTTGAACAATATTAGAGTTTTGTTGTTTGTTGTAGTGTTAGATTGCGAGGACGGTGCCAAAGGTTCAGCGACAATGCGATTGAACAGAGACAACGACGACATCGTCGTCG
CAGCGCTTGAGCAAAAGAGACGGGACGTGGGTATCGCAGCCGTGTGGCAGTACTGCTTGTGCAAAAGAAACATTCATTTATATGCGTCCGGTTATTTACGAACGAGATCC
GACCCACCCAACCCAATTTTGATTCGGATCAGCCGTGTGGCAGTACTGCTTGTGCAAAAGAAACATTCATTTATATGCGTCCGGTTATTTACGAACGAGATCCGACCCAC
CCAACCCAATTTTGATTCGGATCAGGGAATTGTGCTTCTTCTTCTTTTTGAGATGAGAGAGGCTAAAGAGAAAGGGGTTGTAACCGAGAAAAAAATGGAAGATTCTTCTT
CTGATGAGAGGTGGAGAGGGGCTCTGACGGAAAAAAAGAAAAAGAAGAAAAAGGGAGAACCAAATATTCCATATTTGATTCTGCATTCCACCGCCCTTACCACCCTCCGA
GTGGTAGTTTCTTGTGTTTGGAAGTACTTACTATATGATACATTCCAGAATGTGATGATAACAGAAGACGAAAGAGATATTGGAATGGAAGATTAG
Protein sequence
MAKKVEERFEVVEQEIGSIREELHKMKAKVIELIEEPKSNEREDDEKNHDQNKFKKVEMPVFNNEDLDLWLFRADRYFQIHKLTNSDKLIVATISFEGSALNWYRAQEER
EKFTDWLNLKERLLVRFRSMLGEHQIVQSEMMPYLTKNHEWQATLEEMYGYLKNKEGGWDVLIKWQGLPRNEATWEKYEIQQLFPEFHLEDKGEFQRGHKTVGCKWVFKL
KYKADSTIDRHKVRLVAKGFAQTYGIYNSETFSPVAKLNNIRVLLFVVVLDCEDGAKGSATMRLNRDNDDIVVAALEQKRRDVGIAAVWQYCLCKRNIHLYASGYLRTRS
DPPNPILIRISRVAVLLVQKKHSFICVRLFTNEIRPTQPNFDSDQGIVLLLLFEMREAKEKGVVTEKKMEDSSSDERWRGALTEKKKKKKKGEPNIPYLILHSTALTTLR
VVVSCVWKYLLYDTFQNVMITEDERDIGMED
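
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard nuclear genetic code and translation in frame from the first base; the helper name `translate` and the short test fragments are illustrative choices, not part of the database record.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 45 bases of the CDS listed above.
cds_start = "ATGGCGAAGAAAGTAGAGGAAAGATTCGAAGTTGTTGAGCAGGAG"
print(translate(cds_start))  # MAKKVEERFEVVEQE
```

Passing the full CDS (with its line breaks) to `translate` should, under the same assumptions, reproduce the protein sequence listed above.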