; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg23974 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg23974
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUPF0587 protein C1orf123 homolog
Genome locationCarg_Chr12:3843432..3846595
RNA-Seq ExpressionCarg23974
SyntenyCarg23974
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR008584 - CXXC motif containing zinc binding protein, eukaryotic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia]3.2e-94100Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

XP_004142917.1 CXXC motif containing zinc binding protein [Cucumis sativus]3.2e-8690.96Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo]9.3e-8689.76Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEP+GF+FGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMISNL+A FE +K
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata]3.2e-94100Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida]9.9e-8893.37Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRGQPLTQETSE GKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEP+ F+FGPGWKAESIEGTKFEDIDLS GE+AEYDEKGECPVMIS LEATFE VK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

TrEMBL top hitse value%identityAlignment
A0A0A0LN37 Uncharacterized protein1.5e-8690.96Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGE+SQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGY+P+GFIFGPGWK ESIEGTKFEDIDL+GGEYAEYDEKGECPVMIS+LEA FE +K
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

A0A1S3BAF1 UPF0587 protein C1orf123 homolog4.5e-8689.76Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEP+GF+FGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMISNL+A FE +K
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

A0A5A7V299 UPF0587 protein C1orf123-like protein4.5e-8689.76Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEP+GF+FGPGWK ESIEGTKFEDIDL+GGE+AEYDEKGECPVMISNL+A FE +K
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog1.5e-94100Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog1.5e-94100Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

SwissProt top hitse value%identityAlignment
Q32P66 CXXC motif containing zinc binding protein8.6e-2638.89Show/hide
Query:  LKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML

Query:  FDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
        F+CRG EPV F    G+ AE +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  FDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

Q3B8G0 CXXC motif containing zinc binding protein8.3e-2939.88Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MV F L+ KA LENLT L+P       +F +  K+KCG CGE+S K   +TL +++PL+ G+G+ ++VQ+CK C R+ +I ++     P   E SE+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATF
          ++ F+CRG EP+ F    G+ AE  E GT F +I+L   ++ +YDEK +  V I  +E  F
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATF

Q498R7 CXXC motif containing zinc binding protein1.1e-2538.32Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
          ++ F+CRG EPV F    G+ AE +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

Q8BHG2 CXXC motif containing zinc binding protein1.5e-2537.72Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
          ++ F+CRG EPV F    G+ A+ +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

Q9NWV4 CXXC motif containing zinc binding protein7.8e-2738.92Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +P   E +E+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK
          ++ F+CRG EPV F    G+ AE +E GT F DI+L   ++ +YDEK +  V I   E T + VK
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIE-GTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK

Arabidopsis top hitse value%identityAlignment
AT4G32930.1 unknown protein7.1e-6868.26Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVN++LKI A+LENLTNLQP  GCDD NFPYLFK+KC RCGE++ KETCVTLNET     G+GT +LVQKCKFCGR+G +TMIPG+G+PLT E SE+G+ 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISNLEATFESVK
        +PLM+FDCRGYEP+ F FG  WKA++  GTKF++IDLS G E+ EYDEKGECPVMISN  A+F   K
Subjt:  SPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISNLEATFESVK

AT4G32930.2 unknown protein1.1e-6565.14Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NFPYLFK+KC RCGE++ KETCVTLNET     G+GT +LVQK        CKFCGR+G +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ

Query:  ETSESGKSSPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISNLEATFESVK
        E SE+G+ +PLM+FDCRGYEP+ F FG  WKA++  GTKF++IDLS G E+ EYDEKGECPVMISN  A+F   K
Subjt:  ETSESGKSSPLMLFDCRGYEPVGFIFGPGWKAESIEGTKFEDIDLSGG-EYAEYDEKGECPVMISNLEATFESVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAATTTTTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCCCAAGATGGTTGCGATGATCCCAATTTCCCTTACCTTTTCAAAGTCAAATG
CGGGAGATGCGGAGAGTTGAGCCAGAAAGAAACTTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAGTGCAAGTTCT
GTGGGAGGGATGGAACTATTACAATGATTCCGGGGCGAGGTCAACCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCTTTGATGTTATTTGACTGCAGAGGT
TACGAGCCTGTGGGCTTCATATTTGGACCTGGATGGAAAGCAGAATCTATAGAAGGGACTAAGTTCGAGGACATTGACTTGTCTGGAGGTGAGTATGCAGAGTACGATGA
GAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGAGTCGGTAAAGTAG
mRNA sequenceShow/hide mRNA sequence
AAGTGAAAGTGTGATTAAGCAAAGCCGCTCAAGAATGGTGAATTTTTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCCCAAGATGGTTGCGATG
ATCCCAATTTCCCTTACCTTTTCAAAGTCAAATGCGGGAGATGCGGAGAGTTGAGCCAGAAAGAAACTTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAA
GGGACTACTAATCTCGTTCAAAAGTGCAAGTTCTGTGGGAGGGATGGAACTATTACAATGATTCCGGGGCGAGGTCAACCATTGACTCAGGAAACAAGTGAATCAGGGAA
GTCATCTCCTTTGATGTTATTTGACTGCAGAGGTTACGAGCCTGTGGGCTTCATATTTGGACCTGGATGGAAAGCAGAATCTATAGAAGGGACTAAGTTCGAGGACATTG
ACTTGTCTGGAGGTGAGTATGCAGAGTACGATGAGAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGAGTCGGTAAAGTAGGAGGTAATCCTCACC
TTAGATCTCTGGATGTAAAAAATTGAGCATTTTGATGCTGCTATTGATATCACCCACTCTATACTTTGATAATGAATGATATCATTATGCAAGATTGTTCATCATGTTTT
AAACACAAATCTATGGTGTGATCTAAGTGATGGATATTTTCATAACACTATATGTTTTCAGTTATTATTTAGTTCTTTTTAATTAAATTATTGTCGGTCAAATACAATAT
ACAAGTGGGACCTGCACTATAAGTAAGCTGTGCGAAAAACTGCTGATTTTTTTCGGTTTTGCAACAACTTGGGCGACATGGGGTTCTATCGGGAAACCAGGAATTTTTTA
GTTCCATTACTAAAATTTATATTTGAT
Protein sequenceShow/hide protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRG
YEPVGFIFGPGWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK