CuGenDBv2

CsGy2G018050 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy2G018050
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: UPF0587 protein C1orf123 homolog
Genome location: Gy14Chr2:27864249..27867004
RNA-Seq Expression: CsGy2G018050
Synteny: CsGy2G018050
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR008584 - CXXC motif containing zinc binding protein, eukaryotic
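
The genome location string can be split into its sequence name and coordinates to get the length of the gene span. The following is a minimal sketch; the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Gy14Chr2:27864249..27867004")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers exons, introns and untranslated regions, so it is expected to be longer than the mRNA sequence listed further down.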


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.17e-111; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_004142917.1 CXXC motif containing zinc binding protein [Cucumis sativus] (e-value: 6.98e-124; identity: 100%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo] (e-value: 8.81e-118; identity: 94.58%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata] (e-value: 6.69e-113; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida] (e-value: 5.29e-110; identity: 89.76%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRG+PLTQE SE G  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+PM F+FGPGWK ESIEGTKFEDIDL+ GE+AEYDEKGECPVMIS LEA FEL+K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LN37 Uncharacterized protein (e-value: 3.38e-124; identity: 100%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A1S3BAF1 UPF0587 protein C1orf123 homolog (e-value: 4.27e-118; identity: 94.58%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A5A7V299 UPF0587 protein C1orf123-like protein (e-value: 4.27e-118; identity: 94.58%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog (e-value: 3.24e-113; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog (e-value: 3.24e-113; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

SwissProt top hits (e-value, % identity, alignment)
Q32P66 CXXC motif containing zinc binding protein (e-value: 1.1e-25; identity: 37.34%)
Query:  LKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGFSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGFSPLML

Query:  FDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
        F+CRG +P+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  FDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q3B8G0 CXXC motif containing zinc binding protein (e-value: 6.4e-29; identity: 39.88%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MV F L+ KA LENLT L+P       +F +  K+KCG CGEVS K   +TL +++PL+ G+G+ ++VQ+CK C RE +I ++     P   E SE+  F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  E  E GT F +I+L   ++ +YDEK +  V I  +E +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q498R7 CXXC motif containing zinc binding protein (e-value: 1.5e-25; identity: 36.81%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q8BHG2 CXXC motif containing zinc binding protein (e-value: 1.9e-25; identity: 36.2%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  + +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q9NWV4 CXXC motif containing zinc binding protein (e-value: 7.8e-27; identity: 37.42%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    KP   E +E+  F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Arabidopsis top hits (e-value, % identity, alignment)
AT4G32930.1 unknown protein (e-value: 6.7e-66; identity: 66.47%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVN++LKI A+LENLTNLQP  GCDD NFPYLFK+KC RCGEV+ KETCVTL ET     G+GT +LVQKCKFCGREG +TMIPG+G+PLT E SE+G  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK
        +PLM+FDCRGY+P+ F FG  WK ++  GTKF++IDL+ G E+ EYDEKGECPVMIS+  A F + K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK

AT4G32930.2 unknown protein (e-value: 1.1e-63; identity: 63.43%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQK--------CKFCGREGTITMIPGRGKPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NFPYLFK+KC RCGEV+ KETCVTL ET     G+GT +LVQK        CKFCGREG +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQK--------CKFCGREGTITMIPGRGKPLTQ

Query:  EISESGGFSPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK
        E SE+G  +PLM+FDCRGY+P+ F FG  WK ++  GTKF++IDL+ G E+ EYDEKGECPVMIS+  A F + K
Subjt:  EISESGGFSPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK
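
The % identity values listed for the hits above can be recomputed from the alignment blocks. This sketch assumes the usual BLAST-style convention for the unlabeled middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    The alignment length is taken from the query line (gap characters
    included), because trailing spaces in the midline are often lost
    when a page is copied as plain text.
    """
    if not query:
        raise ValueError("empty alignment")
    # Restore trailing spaces that may have been stripped from the midline.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded[: len(query)] if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Toy example: 8 identical columns out of 10.
print(percent_identity("MVNFLLKIKA", "MVN+LL IKA"))
```

For a hit split over several blocks, concatenate the `Query:` strings and the corresponding middle lines (each padded to its block width) before calling the function.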


Sequences
CDS sequence
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCCCTTACCTTTTCAAAGTGAAATG
CGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGGGTGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAGTGCAAGTTCT
GTGGGAGGGAAGGAACTATCACAATGATTCCGGGGCGAGGTAAACCATTGACTCAGGAAATAAGTGAATCAGGGGGTTTCTCTCCGTTGATGTTATTTGACTGCAGAGGT
TATGATCCTATGGGCTTTATATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTAAATGGAGGTGAGTATGCAGAGTATGATGA
GAAGGGAGAATGCCCTGTCATGATTTCCAGTCTAGAAGCCAAATTTGAGTTATTAAAGTAG
mRNA sequence
GGTCATCCGTTTTAGAAAGAAGACCCGAGCCCATAGTCCGTCTTCCCAAAGACTAAAATCGAAAATTCCACGCACGGGAAGTGTGATTATGCAAAACCGCTAAAAAAAAA
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCCCTTACCTTTTCAAAGTGAAATG
CGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGGGTGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAGTGCAAGTTCT
GTGGGAGGGAAGGAACTATCACAATGATTCCGGGGCGAGGTAAACCATTGACTCAGGAAATAAGTGAATCAGGGGGTTTCTCTCCGTTGATGTTATTTGACTGCAGAGGT
TATGATCCTATGGGCTTTATATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTAAATGGAGGTGAGTATGCAGAGTATGATGA
GAAGGGAGAATGCCCTGTCATGATTTCCAGTCTAGAAGCCAAATTTGAGTTATTAAAGTAGGAGATAATTATATGCAAATTATTATATCTCCCACTGTAAAAGAAAATGA
TCATATTTGAATGCCCCTCATCTATCTGTTCTCTTTTTCTTTTTAAATAAGAATGAAGTTTGAGGAGTTGGCATGAACGGTCTTTATTTGTTTCCTTTTCTTCCCAGTAC
CTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCTTTCTTTTCAACCTATGCATTGCTCAAAACTAGTTGAATATATTTATATATATAGTTCTCGAAATTCCAAAAA
GATAAATAGTCCAAACTTCACCGCT
Protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGFSPLMLFDCRG
YDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
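
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is self-contained and uses only the first and last few codons of the CDS as a demonstration; the function name `translate` is hypothetical, and use of the standard nuclear code is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGTGAACTTCTTGCTTAAGATCAAAGCG"))
print(translate("TTATTAAAGTAG"))
```

Passing the full CDS (with its line breaks left in) and comparing the result against the protein sequence, joined into one string, gives a quick consistency check of the record.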