; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007787 (gene) of Chayote v1 genome

Gene IDSed0007787
OrganismSechium edule (Chayote v1)
DescriptionUPF0587 protein C1orf123 homolog
Genome locationLG08:32408036..32410890
RNA-Seq ExpressionSed0007787
SyntenySed0007787
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR008584 - CXXC motif containing zinc binding protein, eukaryotic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia]6.2e-9095.78Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMIS LEATFE VK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo]1.0e-8489.16Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTL+ET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEP+ FVFGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMIS L+A FEL+K
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

XP_022132352.1 UPF0587 protein C1orf123 [Momordica charantia]6.0e-8589.16Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGE+SQKETC+TLNETV L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+PLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEP+DF+FGPGWKAESIEGTKFEDIDLS GE+AEYDEKGECPVMIS L+ATF+LVK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata]6.2e-9095.78Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMIS LEATFE VK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida]6.9e-8994.58Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTLNET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRGQPLTQETSE GKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEP+DFVFGPGWKAESIEGTKFEDIDLS GE+AEYDEKGECPVMIS LEATFELVK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

TrEMBL top hitse value%identityAlignment
A0A1S3BAF1 UPF0587 protein C1orf123 homolog5.0e-8589.16Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTL+ET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEP+ FVFGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMIS L+A FEL+K
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

A0A5A7V299 UPF0587 protein C1orf123-like protein5.0e-8589.16Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTL+ET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEP+ FVFGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMIS L+A FEL+K
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

A0A6J1BTL8 UPF0587 protein C1orf1232.9e-8589.16Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGE+SQKETC+TLNETV L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+PLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEP+DF+FGPGWKAESIEGTKFEDIDLS GE+AEYDEKGECPVMIS L+ATF+LVK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog3.0e-9095.78Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMIS LEATFE VK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog3.0e-9095.78Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
        SPLML DCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMIS LEATFE VK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

SwissProt top hitse value%identityAlignment
Q32P66 CXXC motif containing zinc binding protein3.0e-2639.51Show/hide
Query:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML

Query:  LDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
         +CRG EPVDF    G+ AE +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  LDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

Q3B8G0 CXXC motif containing zinc binding protein4.9e-2940.49Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MV F L+ KA LENLT L+P       +F +  K+KCG CGE+S K   +TL ++VPL+ G+G+ ++VQ+CK C R+ +I ++     P   E SE+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATF
          ++  +CRG EP+DF    G+ AE  E GT F +I+L   ++ +YDEK +  V I  +E  F
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATF

Q498R7 CXXC motif containing zinc binding protein3.0e-2638.92Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
          ++  +CRG EPVDF    G+ AE +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

Q8BHG2 CXXC motif containing zinc binding protein3.9e-2638.32Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
          ++  +CRG EPVDF    G+ A+ +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

Q9NWV4 CXXC motif containing zinc binding protein2.0e-2739.52Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +P   E +E+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK
          ++  +CRG EPVDF    G+ AE +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK

Arabidopsis top hitse value%identityAlignment
AT4G32930.1 unknown protein1.8e-6667.07Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGE++ KETCVTLNET     G+GT +LVQKCKFCGR+G +TMIPG+G+PLT E SE+G+ 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISTLEATFELVK
        +PLM+ DCRGYEP+DF FG  WKA++  GTKF++IDLS G E+ EYDEKGECPVMIS   A+F + K
Subjt:  SPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISTLEATFELVK

AT4G32930.2 unknown protein2.8e-6464Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGE++ KETCVTLNET     G+GT +LVQK        CKFCGR+G +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ

Query:  ETSESGKSSPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISTLEATFELVK
        E SE+G+ +PLM+ DCRGYEP+DF FG  WKA++  GTKF++IDLS G E+ EYDEKGECPVMIS   A+F + K
Subjt:  ETSESGKSSPLMLLDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISTLEATFELVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAATTTCTTGCTTAAGATCAAAGCGGAGCTTGAGAATCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGATCCCAACTTCACTTACCTCTTCAAAGTGAAATG
CGGGAGATGCGGGGAGTTGAGCCAGAAAGAAACTTGTGTGACCTTGAATGAAACTGTGCCTCTCCAAGCGGGTAAAGGGACTACTAATCTTGTTCAAAAGTGCAAGTTCT
GTGGGAGGGATGGAACTATTACAATGATTCCGGGGCGAGGTCAACCACTGACTCAGGAAACAAGTGAATCAGGCAAGTCATCTCCCTTAATGTTATTGGACTGCAGAGGA
TATGAGCCGGTGGACTTTGTATTTGGTCCTGGATGGAAAGCAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTGTCTGGAGGTGAGTATGCAGAGTATGATGA
GAAGGGAGAATGCCCTGTCATGATTTCCACTCTAGAGGCCACATTTGAATTGGTAAAGTAA
mRNA sequenceShow/hide mRNA sequence
CAAAGGTTTAATCCAAAATTTCGACGCATTGTTGAAGTGGAAAGTGCGATCGAGCAAAAACCGCAGAAGAAATGGTGAATTTCTTGCTTAAGATCAAAGCGGAGCTTGAG
AATCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGATCCCAACTTCACTTACCTCTTCAAAGTGAAATGCGGGAGATGCGGGGAGTTGAGCCAGAAAGAAACTTGTGT
GACCTTGAATGAAACTGTGCCTCTCCAAGCGGGTAAAGGGACTACTAATCTTGTTCAAAAGTGCAAGTTCTGTGGGAGGGATGGAACTATTACAATGATTCCGGGGCGAG
GTCAACCACTGACTCAGGAAACAAGTGAATCAGGCAAGTCATCTCCCTTAATGTTATTGGACTGCAGAGGATATGAGCCGGTGGACTTTGTATTTGGTCCTGGATGGAAA
GCAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTGTCTGGAGGTGAGTATGCAGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTTCCACTCTAGAGGC
CACATTTGAATTGGTAAAGTAAGAGAGCAATCCCCACCCTAGATCTTTCACTGTAAAACAATTTACCATATTGAATGGCCCTGACATTATTGGTGTCACCCACTCTATAC
TATCTTTCTGCTTTATTGCTTCATGAAAAGTAAGTGTTGCAAGTCAGAAATTATAACTCGTGATCTCTTATCTCTTAGTTATAGGCACACTTATGTTGGTTGAACTATTT
TTGTTGATTGATGTGGAAAATAAGTATAAAGTTCGAGGTGTTGAATGAATGGTGGTCAGTTTTTTTTTTGTTTGAAATAGTGG
Protein sequenceShow/hide protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLLDCRG
YEPVDFVFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISTLEATFELVK