; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0085451 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0085451
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionUPF0587 protein C1orf123 homolog
Genome locationCMiso1.1chr03:29227561..29230163
RNA-Seq ExpressionCmc03g0085451
SyntenyCmc03g0085451
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR008584 - CXXC motif containing zinc binding protein, eukaryotic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia]7.1e-8689.76Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEP+GF+FGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMISNL+A FE +K
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

XP_004142917.1 CXXC motif containing zinc binding protein [Cucumis sativus]4.8e-9094.58Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo]3.2e-94100Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata]7.1e-8689.76Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEP+GF+FGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMISNL+A FE +K
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida]4.6e-8590.96Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRG+PLTQE SE G  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEP+ FVFGPGWK ESIEGTKFEDIDL+ GEFAEYDEKGECPVMIS L+A FEL+K
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

TrEMBL top hitse value%identityAlignment
A0A0A0LN37 Uncharacterized protein2.3e-9094.58Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESG F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFEDIDLNGGE+AEYDEKGECPVMIS+L+AKFELLK
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

A0A1S3BAF1 UPF0587 protein C1orf123 homolog1.5e-94100Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

A0A5A7V299 UPF0587 protein C1orf123-like protein1.5e-94100Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog3.5e-8689.76Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEP+GF+FGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMISNL+A FE +K
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog3.5e-8689.76Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK
        SPLMLFDCRGYEP+GF+FGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMISNL+A FE +K
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK

SwissProt top hitse value%identityAlignment
Q32P66 CXXC motif containing zinc binding protein2.3e-2637.97Show/hide
Query:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDFSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDFSPLML

Query:  FDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF
        F+CRG EP+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  FDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF

Q3B8G0 CXXC motif containing zinc binding protein2.2e-2940.49Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MV F L+ KA LENLT L+P       +F +  K+KCG CGEVS K   +TL +++PL+ G+G+ ++VQ+CK C RE +I ++     P   E SE+  F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF
          ++ F+CRG EPI F    G+  E  E GT F +I+L   ++ +YDEK +  V I  ++ +F
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF

Q498R7 CXXC motif containing zinc binding protein3.0e-2637.42Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF
          ++ F+CRG EP+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF

Q8BHG2 CXXC motif containing zinc binding protein3.9e-2636.81Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    K    E +E   F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF
          ++ F+CRG EP+ F    G+  + +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF

Q9NWV4 CXXC motif containing zinc binding protein1.6e-2738.04Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ G+G+ ++VQKCK C RE +I ++    KP   E +E  +F
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF
          ++ F+CRG EP+ F    G+  E +E GT F DI+L   ++ +YDEK +  V I  +  +F
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIE-GTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKF

Arabidopsis top hitse value%identityAlignment
AT4G32930.1 unknown protein7.9e-6768.26Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTL+ET     G+GT +LVQKCKFCGREG +TMIPG+G+PLT E SE+G+ 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDF

Query:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGG-EFAEYDEKGECPVMISNLDAKFELLK
        +PLM+FDCRGYEPI F FG  WK ++  GTKF++IDL+ G EF EYDEKGECPVMISN  A F + K
Subjt:  SPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGG-EFAEYDEKGECPVMISNLDAKFELLK

AT4G32930.2 unknown protein1.3e-6465.14Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQK--------CKFCGREGTITMIPGRGKPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTL+ET     G+GT +LVQK        CKFCGREG +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQK--------CKFCGREGTITMIPGRGKPLTQ

Query:  EISESGDFSPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGG-EFAEYDEKGECPVMISNLDAKFELLK
        E SE+G+ +PLM+FDCRGYEPI F FG  WK ++  GTKF++IDL+ G EF EYDEKGECPVMISN  A F + K
Subjt:  EISESGDFSPLMLFDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGG-EFAEYDEKGECPVMISNLDAKFELLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTG
AAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAGTGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAG
TGCAAGTTCTGTGGTAGGGAAGGAACTATCACAATGATTCCCGGGCGAGGTAAACCATTGACTCAGGAAATAAGTGAGTCAGGGGATTTCTCTCCGTTGATGTTA
TTTGACTGCAGAGGTTATGAGCCTATCGGTTTTGTATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTAAATGGAGGT
GAGTTTGCAGAGTATGATGAGAAGGGAGAATGCCCTGTAATGATTTCCAATCTAGATGCCAAATTTGAGTTGTTAAAGTAG
mRNA sequenceShow/hide mRNA sequence
TCTTTTCAAAGACTAAAATCGAAAATTCCACGCACCGGAAATGTGATTATGCAAAACCGCTCAAAAAAATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCG
AGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAA
CGTGTGTGACCTTGAGTGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAGTGCAAGTTCTGTGGTAGGGAAGGAACTATCACAATGA
TTCCCGGGCGAGGTAAACCATTGACTCAGGAAATAAGTGAGTCAGGGGATTTCTCTCCGTTGATGTTATTTGACTGCAGAGGTTATGAGCCTATCGGTTTTGTAT
TTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTAAATGGAGGTGAGTTTGCAGAGTATGATGAGAAGGGAGAATGCCCTG
TAATGATTTCCAATCTAGATGCCAAATTTGAGTTGTTAAAGTAGGAGATAATTATACGCAAATTATTATTATCTCCCCACTGTAAAAGAAAAATTATCATATTTG
AATGGCCCTCATCTATCTGTACTCTTCATCTTTCTGTTTCTTTTTTTAAAATAAGAATGAAGTTTGAGGAGTTGGCATGAATGGTCATT
Protein sequenceShow/hide protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQAGKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDFSPLML
FDCRGYEPIGFVFGPGWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK