CuGenDBv2

CSPI02G18140 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G18140
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: UPF0587 protein C1orf123 homolog
Genome location: Chr2:16861520..16864231
RNA-Seq Expression: CSPI02G18140
Synteny: CSPI02G18140
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR008584 - CXXC motif containing zinc binding protein, eukaryotic
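
The genome location of this record (Chr2:16861520..16864231) follows a chromosome:start..end notation. As a minimal sketch, the helper below (a hypothetical name, not part of CuGenDB) splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split 'Chr:start..end' into (chromosome, start, end, length).

    Assumption: start and end are 1-based and inclusive, so the
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chromosome = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chromosome, start, end, end - start + 1

chromosome, start, end, length = parse_location("Chr2:16861520..16864231")
print(chromosome, start, end, length)
```

Under that assumption the gene spans 2712 bp on Chr2, which covers the transcribed region including any introns and untranslated regions.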


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.4e-86; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_004142917.1 CXXC motif containing zinc binding protein [Cucumis sativus] (e value: 6.5e-95; identity: 100%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo] (e value: 2.8e-90; identity: 94.58%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata] (e value: 1.4e-86; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida] (e value: 2.3e-84; identity: 89.76%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRG+PLTQE SE G  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+PM F+FGPGWK ESIEGTKFEDIDL+ GE+AEYDEKGECPVMIS LEA FEL+K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LN37 Uncharacterized protein (e value: 3.1e-95; identity: 100%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A1S3BAF1 UPF0587 protein C1orf123 homolog (e value: 1.4e-90; identity: 94.58%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A5A7V299 UPF0587 protein C1orf123-like protein (e value: 1.4e-90; identity: 94.58%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog (e value: 7.0e-87; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog (e value: 7.0e-87; identity: 90.96%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK

SwissProt top hits (e value, % identity, alignment)
Q32P66 CXXC motif containing zinc binding protein (e value: 1.1e-25; identity: 37.34%)
Query:  LKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGFSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGFSPLML

Query:  FDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
        F+CRG +P+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  FDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q3B8G0 CXXC motif containing zinc binding protein (e value: 6.4e-29; identity: 39.88%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MV F L+ KA LENLT L+P       +F +  K+KCG CGEVS K   +TL +++PL+ G+G+ ++VQ+CK C RE +I ++     P   E SE+  F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  E  E GT F +I+L   ++ +YDEK +  V I  +E +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q498R7 CXXC motif containing zinc binding protein (e value: 1.5e-25; identity: 36.81%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q8BHG2 CXXC motif containing zinc binding protein (e value: 1.9e-25; identity: 36.2%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  + +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Q9NWV4 CXXC motif containing zinc binding protein (e value: 7.8e-27; identity: 37.42%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    KP   E +E+  F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF
          ++ F+CRG +P+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIE-GTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKF

Arabidopsis top hits (e value, % identity, alignment)
AT4G32930.1 unknown protein (e value: 6.7e-66; identity: 66.47%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF
        MVN++LKI A+LENLTNLQP  GCDD NFPYLFK+KC RCGEV+ KETCVTL ET     G+GT +LVQKCKFCGREG +TMIPG+G+PLT E SE+G  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGF

Query:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK
        +PLM+FDCRGY+P+ F FG  WK ++  GTKF++IDL+ G E+ EYDEKGECPVMIS+  A F + K
Subjt:  SPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK

AT4G32930.2 unknown protein (e value: 1.1e-63; identity: 63.43%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQK--------CKFCGREGTITMIPGRGKPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NFPYLFK+KC RCGEV+ KETCVTL ET     G+GT +LVQK        CKFCGREG +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQK--------CKFCGREGTITMIPGRGKPLTQ

Query:  EISESGGFSPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK
        E SE+G  +PLM+FDCRGY+P+ F FG  WK ++  GTKF++IDL+ G E+ EYDEKGECPVMIS+  A F + K
Subjt:  EISESGGFSPLMLFDCRGYDPMGFIFGPGWKVESIEGTKFEDIDLNGG-EYAEYDEKGECPVMISSLEAKFELLK


Sequences
CDS sequence
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCCCTTACCTTTTCAAAGTGAAATG
CGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGGGTGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAGTGCAAGTTCT
GTGGGAGGGAAGGAACTATCACAATGATTCCGGGGCGAGGTAAACCATTGACTCAGGAAATAAGTGAATCAGGGGGTTTCTCTCCGTTGATGTTATTTGACTGCAGAGGT
TATGATCCTATGGGCTTTATATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTAAATGGAGGTGAGTATGCAGAGTATGATGA
GAAGGGAGAATGCCCTGTCATGATTTCCAGTCTAGAAGCCAAATTTGAGTTATTAAAGTAG
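
The CDS above should translate to the protein sequence given in this record. The sketch below is one way to check that, assuming the standard genetic code (NCBI translation table 1) applies. The function name `translate` and the two short fragments are illustrative: the first fragment is the opening 60 nt of the CDS and the second is its final 15 nt.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence can be pasted in as-is. Codons that are not in the table
    (for example, ones containing N) are reported as 'X'.
    """
    sequence = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)

# First 60 nt of the CDS sequence shown above.
cds_start = "ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCT"
print(translate(cds_start))  # MVNFLLKIKAELENLTNLQP

# Last 15 nt of the CDS sequence, ending in the TAG stop codon.
cds_end = "GAGTTATTAAAGTAG"
print(translate(cds_end))  # ELLK
```

Both fragments agree with the start (MVNFLLKIKAELENLTNLQP) and end (ELLK) of the protein sequence in this record. Passing the full CDS to `translate` would extend the same check to the whole sequence.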
mRNA sequence
AAATCATTTAGGAAAACTCCTTTCGGGTCATCCGTTTTAGAAAGAAGACCCGAGCCCATAGTCCGTCTTCCCAAAGACTAAAATCGAGAATTCCACGCACTGGAAGTGTG
ATTATGCAAAACCGCTAAAAAAAAAATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACT
TCCCTTACCTTTTCAAAGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGGGTGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACT
AATCTCGTTCAAAAGTGCAAGTTCTGTGGGAGGGAAGGAACTATCACAATGATTCCGGGGCGAGGTAAACCATTGACTCAGGAAATAAGTGAATCAGGGGGTTTCTCTCC
GTTGATGTTATTTGACTGCAGAGGTTATGATCCTATGGGCTTTATATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTAAATG
GAGGTGAGTATGCAGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTTCCAGTCTAGAAGCCAAATTTGAGTTATTAAAGTAGGAGATAATTATATGCAAATTATTA
TATCTCCCACTGTAAAAGAAAATGATCATATTTGAATGCCCCTCATCTATCTGTTCTCTTTTTCTTTTTAAATAAGAATGAAGTTTGAGGAGTTGGCATGAACGGTCTTT
ATTTGTTTCCTTTTCTTCCCAGTACCTTTTTCTCTCTCTCTCTTTCTTTTTTTTCAACCTATGTATTGCTCAAAACTAGTTGAATATATTTATATATATAGTTCTCGAAA
TTCCAAAAAGATAAA
Protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGEVSQKETCVTLGETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGGFSPLMLFDCRG
YDPMGFIFGPGWKVESIEGTKFEDIDLNGGEYAEYDEKGECPVMISSLEAKFELLK