CuGenDBv2

Lsi08G015680 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi08G015680
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: UPF0587 protein C1orf123 homolog
Genome location: chr08:23725463..23728594
RNA-Seq Expression: Lsi08G015680
Synteny: Lsi08G015680
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR008584 - CXXC motif containing zinc binding protein, eukaryotic
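
The genome location field follows a `chromosome:start..end` layout. As a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the field can be split and the gene span derived; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr08:23725463..23728594")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 3132 bp on chr08, introns and untranslated regions included.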


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia] (e value 1.2e-88, identity 93.98%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo] (e value 5.5e-86, identity 91.57%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+ LTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ FVFGPGWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_022132352.1 UPF0587 protein C1orf123 [Momordica charantia] (e value 1.2e-85, identity 89.76%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGEVSQKETC+TLNET+ L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+ LTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+DF+FGPGWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS L+ATFDLVK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata] (e value 1.2e-88, identity 93.98%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida] (e value 2.4e-89, identity 95.78%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRGQ LTQETSE GKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEPMDFVFGPGWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS LEATF+LVK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BAF1 UPF0587 protein C1orf123 homolog (e value 2.6e-86, identity 91.57%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+ LTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ FVFGPGWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A5A7V299 UPF0587 protein C1orf123-like protein (e value 2.6e-86, identity 91.57%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+ LTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ FVFGPGWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A6J1BTL8 UPF0587 protein C1orf123 (e value 5.9e-86, identity 89.76%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGEVSQKETC+TLNET+ L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+ LTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+DF+FGPGWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS L+ATFDLVK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog (e value 5.7e-89, identity 93.98%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog (e value 5.7e-89, identity 93.98%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

SwissProt top hits (e value, % identity, alignment)
Q32P66 CXXC motif containing zinc binding protein (e value 3.9e-26, identity 38.27%)
Query:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    ++   E +E  K+  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLML

Query:  FDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
        F+CRG EP+DF    G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  FDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Q3B8G0 CXXC motif containing zinc binding protein (e value 2.4e-28, identity 39.88%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MV F L+ KA LENLT L+P       +F +  K+KCG CGEVS K   +TL +++PL+ G+G+ ++VQ+CK C R+ +I ++         E SE+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATF
          ++ F+CRG EP+DF    G+  E  E GT F +I+L   ++ +YDEK +  V I  +E  F
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATF

Q498R7 CXXC motif containing zinc binding protein (e value 3.9e-26, identity 37.72%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    ++   E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
          ++ F+CRG EP+DF    G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Q8BHG2 CXXC motif containing zinc binding protein (e value 2.3e-26, identity 37.72%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +A   E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
          ++ F+CRG EP+DF    G+  + +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Q9NWV4 CXXC motif containing zinc binding protein (e value 3.9e-26, identity 37.72%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
          ++ F+CRG EP+DF    G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK

Arabidopsis top hits (e value, % identity, alignment)
AT4G32930.1 unknown protein (e value 3.5e-67, identity 68.26%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     G+GT +LVQKCKFCGR+G +TMIPG+G+ LT E SE+G+ 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKS

Query:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK
        +PLM+FDCRGYEP+DF FG  WK ++  GTKF++IDLS G EF EYDEKGECPVMISN  A+F + K
Subjt:  SPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK

AT4G32930.2 unknown protein (e value 5.7e-65, identity 65.14%)
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQALTQ
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     G+GT +LVQK        CKFCGR+G +TMIPG+G+ LT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQALTQ

Query:  ETSESGKSSPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK
        E SE+G+ +PLM+FDCRGYEP+DF FG  WK ++  GTKF++IDLS G EF EYDEKGECPVMISN  A+F + K
Subjt:  ETSESGKSSPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK
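
The % identity values in the hit tables can be cross-checked against the alignment midlines. The Python sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a similar one, a space a mismatch) and an ungapped alignment. The two strings are the midline rows of the KAG6585732.1 hit, and `count_identities` is a hypothetical helper name.

```python
def count_identities(midline):
    """Count identical positions: the midline shows a letter only where residues match."""
    return sum(1 for ch in midline if ch.isalpha())

# Midline rows copied from the KAG6585732.1 alignment
# (assumed convention: letter = identical, '+' = similar, ' ' = mismatch).
midline_blocks = [
    "MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKS",
    "SPLMLFDCRGYEP+ F+FGPGWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK",
]

aligned_length = sum(len(block) for block in midline_blocks)
identities = sum(count_identities(block) for block in midline_blocks)
percent_identity = round(100 * identities / aligned_length, 2)
print(identities, aligned_length, percent_identity)
```

For this hit the count is 156 identical positions out of 166 aligned, which rounds to the listed 93.98%. Alignments that contain gaps (`-`), such as the AT4G32930.2 hit, would need the gap columns included in the aligned length.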


Sequences
CDS sequence
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTGAAATG
CGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTAGTTCAAAAGTGCAAGTTCT
GTGGGAGGGATGGAACGATTACAATGATTCCAGGGCGAGGTCAAGCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCCTTGATGTTATTTGATTGCAGAGGT
TATGAGCCTATGGACTTTGTATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTGAGTGGAGGTGAGTTTGCAGAGTATGATGA
GAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAG
mRNA sequence
ATTTTGGGTCATCCGTTGAAGAAAAAGAAAACCCGAGCCCATGGTCCGTCTTCAGAAAGTCTAAAATCCAAAATTCGACGCACTGTCGAAGAGGAAGTGTGATTATTAAA
AATCGCTCAAAAAATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTT
TCAAAGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTAGTTCAA
AAGTGCAAGTTCTGTGGGAGGGATGGAACGATTACAATGATTCCAGGGCGAGGTCAAGCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCCTTGATGTTATT
TGATTGCAGAGGTTATGAGCCTATGGACTTTGTATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTGAGTGGAGGTGAGTTTG
CAGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAGGAGATAGTAATACCCACATCATATCTCCCACTGTAA
AAAAAAAAGACCATATTGAATGTGGCCCTGATCTATCAATTCTCTTCATCTTTCTGTTTTCTGTAATAAGAATGAAGTTTGAGGAGTTGAATGAATGGTCATTATTTGTT
TCCTTTTGTTGACAGTTCTTTTTTCTCTTTTTCTTTCCCCATATATATATATATAGCTGTGAGAGTGAGACCCCCCCTTATGGGAGAAGACAACATCCTAAAAGAGGTCC
AAATTTACGCATTTAATCTTATTGTTGAAAAATGGTTATTTTTTTAA
Protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRG
YEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
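
The CDS and protein sequences above can be checked against each other by translation. The Python sketch below assumes the standard genetic code and, to stay short, translates only the first 30 and last 15 nucleotides of the CDS rather than the whole sequence; `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGTGAACTTCTTGCTTAAGATCAAAGCG"  # first 30 nt of the CDS
cds_end = "GACTTGGTAAAGTAG"                   # last 15 nt of the CDS, including the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MVNFLLKIKA` and `DLVK`, matching the first ten and last four residues of the protein sequence. Passing the full CDS, with its lines joined, to the same function would extend the check to the whole protein.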