CuGenDBv2

Clc08G17460 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G17460
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: UPF0587 protein C1orf123 homolog
Genome location: ClcChr08:27783855..27787489
RNA-Seq Expression: Clc08G17460
Synteny: Clc08G17460
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR008584 - CXXC motif containing zinc binding protein, eukaryotic
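The genome location above follows a chromosome:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the field can be parsed and the span length computed as below. `parse_location` is a hypothetical helper name used only for illustration; it is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("ClcChr08:27783855..27787489")

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention is worth confirming before relying on the number.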


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.6e-89; identity 93.98%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP  F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo]; e value 8.3e-86; identity 90.96%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL+ET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP  FVFGPGWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_022132352.1 UPF0587 protein C1orf123 [Momordica charantia]; e value 1.3e-86; identity 91.57%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNFAYLFK+KCGRCGEVSQKETC+TLNETV L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+PLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP DF+FGPGWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS L+ATFDLVK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata]; e value 3.6e-89; identity 93.98%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP  F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida]; e value 1.1e-88; identity 94.58%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTLNET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRGQPLTQETSE GKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP DFVFGPGWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS LEATF+LVK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BAF1 UPF0587 protein C1orf123 homolog; e value 4.0e-86; identity 90.96%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL+ET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP  FVFGPGWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A5A7V299 UPF0587 protein C1orf123-like protein; e value 4.0e-86; identity 90.96%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL+ET+PLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP  FVFGPGWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A6J1BTL8 UPF0587 protein C1orf123; e value 6.2e-87; identity 91.57%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNFAYLFK+KCGRCGEVSQKETC+TLNETV L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+PLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP DF+FGPGWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS L+ATFDLVK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog; e value 1.7e-89; identity 93.98%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP  F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog; e value 1.7e-89; identity 93.98%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNET+PLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP  F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

SwissProt top hits (e value, % identity, alignment)
Q32P66 CXXC motif containing zinc binding protein; e value 1.3e-25; identity 38.89%
Query:  LKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML

Query:  FDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        F+CRG EP DF    G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  FDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Q3B8G0 CXXC motif containing zinc binding protein; e value 5.7e-29; identity 41.1%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MV F L+ KA LENLT L+P       +F +  K+KCG CGEVS K   +TL ++VPL+ G+G+ ++VQ+CK C R+ +I ++     P   E SE+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATF
          ++ F+CRG EP DF    G+  E  E GT F +I+L   ++ +YDEK +  V I  +E  F
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATF

Q498R7 CXXC motif containing zinc binding protein; e value 1.0e-25; identity 38.32%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
          ++ F+CRG EP DF    G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Q8BHG2 CXXC motif containing zinc binding protein; e value 1.3e-25; identity 37.72%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
          ++ F+CRG EP DF    G+  + +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Q9NWV4 CXXC motif containing zinc binding protein; e value 7.0e-27; identity 38.92%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L ++V L+ G+G+ ++VQKCK C R+ +I ++    +P   E +E+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
          ++ F+CRG EP DF    G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Arabidopsis top hits (e value, % identity, alignment)
AT4G32930.1 unknown protein; e value 1.4e-67; identity 68.86%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     G+GT +LVQKCKFCGR+G +TMIPG+G+PLT E SE+G+ 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK
        +PLM+FDCRGYEP DF FG  WK ++  GTKF++IDLS G EF EYDEKGECPVMISN  A+F + K
Subjt:  SPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK

AT4G32930.2 unknown protein; e value 2.3e-65; identity 65.71%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     G+GT +LVQK        CKFCGR+G +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ

Query:  ETSESGKSSPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK
        E SE+G+ +PLM+FDCRGYEP DF FG  WK ++  GTKF++IDLS G EF EYDEKGECPVMISN  A+F + K
Subjt:  ETSESGKSSPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK


Sequences
CDS sequence
ATGGTCCGTCTTCCCAAAGTCTATAATCCAAAATCCCACGCACAGTTGAAGAGGAAGTGTGATAATGCAAAACCACTCAAACAAATGGTGAACTTCTTGCTTAAGATCAA
AGCGGAGCTCGAGAACCTCACCAATCTTCAGCCCCAAGATGGTTGCGACGACCCCAACTTCGCTTACCTTTTCAAAGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGA
AAGAAACGTGTGTGACCTTGAATGAAACTGTTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTAGTTCAAAAGTGCAAGTTCTGTGGGAGGGATGGAACTATCACAATG
ATTCCAGGGCGAGGTCAACCATTGACTCAGGAAACAAGTGAATCAGGAAAGTCATCTCCCTTGATGTTATTTGACTGCAGAGGTTACGAGCCTACGGACTTCGTATTTGG
ACCTGGATGGAAAGTGGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTGAGTGGAGGTGAGTTTGCTGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTT
CCAATCTAGAGGCCACATTTGACTTGGTAAAGTAG
mRNA sequence
AGATTATCTAGCAGACATTTTGGGTCATCCGTTAAAGAAAAAGAAAACCCGAGCCCATGGTCCGTCTTCCCAAAGTCTATAATCCAAAATCCCACGCACAGTTGAAGAGG
AAGTGTGATAATGCAAAACCACTCAAACAAATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACCAATCTTCAGCCCCAAGATGGTTGCGACGACCC
CAACTTCGCTTACCTTTTCAAAGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTGTTCCTCTCCAAGCGGGTAAAGGGA
CTACTAATCTAGTTCAAAAGTGCAAGTTCTGTGGGAGGGATGGAACTATCACAATGATTCCAGGGCGAGGTCAACCATTGACTCAGGAAACAAGTGAATCAGGAAAGTCA
TCTCCCTTGATGTTATTTGACTGCAGAGGTTACGAGCCTACGGACTTCGTATTTGGACCTGGATGGAAAGTGGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTT
GAGTGGAGGTGAGTTTGCTGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAGGAGAGAATAATACCCACAT
TATATCTCACTGTTAAAAAAAAAAAAATTGACCATATTGAATGTGGCCCTGATCCATCAGTTCTCTTCATCTTTCTGTTTTTTGAAATAAGAATGAAGTTTGAGGAGTTG
AGTGAATGGTCATTATATGTTTCCTTTTGTTGGCAGTTCCTTTTTCTCTTTCTCTTTTTCTTTTTCTTTTTCTT
Protein sequence
MVRLPKVYNPKSHAQLKRKCDNAKPLKQMVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITM
IPGRGQPLTQETSESGKSSPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
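
The InterPro annotation for this gene names a CXXC motif containing zinc binding protein. As a rough illustration only, the protein sequence above can be scanned for the literal pattern C-x-x-C (cysteine, any two residues, cysteine). This is a plain string search, not the InterPro domain model, and `find_cxxc` is a hypothetical helper name introduced for this sketch.

```python
import re

# Protein sequence copied from the record above (its two lines joined).
PROTEIN = (
    "MVRLPKVYNPKSHAQLKRKCDNAKPLKQMVNFLLKIKAELENLTNLQPQDGCDDPNFAYLFKVKCGRCGEVSQKETCVTLNETVPLQAGKGTTNLVQKCKFCGRDGTITM"
    "IPGRGQPLTQETSESGKSSPLMLFDCRGYEPTDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK"
)


def find_cxxc(sequence):
    """Return (1-based start, motif) for every C-x-x-C match, overlaps included."""
    # A lookahead is used so that overlapping matches are also reported.
    return [
        (match.start() + 1, match.group(1))
        for match in re.finditer(r"(?=(C..C))", sequence)
    ]


hits = find_cxxc(PROTEIN)
for position, motif in hits:
    print(position, motif)
```

A match here only shows that the residue pattern occurs in the sequence; whether those cysteines actually coordinate zinc is not something this record establishes.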