; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC04G074330 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC04G074330
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionValine-tRNA ligase
Genome locationCiama_Chr04:23436179..23438559
RNA-Seq ExpressionCaUC04G074330
SyntenyCaUC04G074330
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142096.1 uncharacterized protein LOC101220441 [Cucumis sativus]8.1e-10986.46Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR TLNF+VS  FL+VSTFLSC S G ES VVTLDSIVIYKTHEWLA+KPTVYF CQGGN+T LPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFN KEFNATFLCL+CTAYSNVTST S +      GEKGM SA++IVIS+VAS VLIIGMVVGYKYWQKK
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK

Query:  RREQDQARFLKLFEDGDDIEDELGLSDVI
        RR+QDQARFLKLFEDGDDIEDELGLSDVI
Subjt:  RREQDQARFLKLFEDGDDIEDELGLSDVI

XP_008447337.1 PREDICTED: uncharacterized protein LOC103489806 [Cucumis melo]5.8e-10785.15Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR TL F+V   FL+  TFLSC SGG ES VVTLDSIVIYKTHEWLA+KPTVYF C GGNKT LPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK
        YEEDSIKSDDVFEEWEFCPSDFT+PAGKYVRFN KEFNATFLCLQCTAYSNVTST S +      GEKGMH+AV+IVIS+VAS VLI+GMVVGYKYWQKK
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK

Query:  RREQDQARFLKLFEDGDDIEDELGLSDVI
        RR+QDQARFLKLFEDGDDIEDELGLSDVI
Subjt:  RREQDQARFLKLFEDGDDIEDELGLSDVI

XP_022143802.1 uncharacterized protein LOC111013628 [Momordica charantia]3.9e-11186.67Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR TLNF+VS++FLIVS FL+C +GG ES VVTLDS++IYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFN KEFNATF+CLQCTAYSNV+S+ +P+H+ E+GMH A +IVIS + STVLIIG+VVGYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLSDVI
        DQARFLKLFEDGDDIEDELGL DVI
Subjt:  DQARFLKLFEDGDDIEDELGLSDVI

XP_023518189.1 uncharacterized protein LOC111781731 [Cucurbita pepo subsp. pepo]3.4e-10785.33Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRR+ +FKV T+FL+VS  LSC SGG ES VVTLDSIVIYKTHEWLASKPTVYF+CQGGNKTKLPDVQK HVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWE CPSDFTAP G+YVR+N KEFNATFLCL+CTAYSNVTS+ S S D EKGMH+A +I+ISV+ STVLIIGMV+GYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLSDVI
        DQARFLKLFEDGDDIEDELGL+DVI
Subjt:  DQARFLKLFEDGDDIEDELGLSDVI

XP_038883476.1 uncharacterized protein LOC120074430 isoform X1 [Benincasa hispida]2.8e-11792.89Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR TLNF+VST+FL+VSTFLSC SGGVES VVTLDSIVIYKTHEWLAS+PTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNA+EFNATFLCLQCTAYSNVTS+  PS+DGEKGMHSAV+IVISVVASTVLI+GMVVGYKYWQ+KRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLSDVI
        DQARFLKLFEDGDDIEDELGLS+VI
Subjt:  DQARFLKLFEDGDDIEDELGLSDVI

TrEMBL top hitse value%identityAlignment
A0A0A0KXD0 Uncharacterized protein3.9e-10986.46Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR TLNF+VS  FL+VSTFLSC S G ES VVTLDSIVIYKTHEWLA+KPTVYF CQGGN+T LPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFN KEFNATFLCL+CTAYSNVTST S +      GEKGM SA++IVIS+VAS VLIIGMVVGYKYWQKK
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK

Query:  RREQDQARFLKLFEDGDDIEDELGLSDVI
        RR+QDQARFLKLFEDGDDIEDELGLSDVI
Subjt:  RREQDQARFLKLFEDGDDIEDELGLSDVI

A0A1S3BH72 uncharacterized protein LOC1034898062.8e-10785.15Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR TL F+V   FL+  TFLSC SGG ES VVTLDSIVIYKTHEWLA+KPTVYF C GGNKT LPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK
        YEEDSIKSDDVFEEWEFCPSDFT+PAGKYVRFN KEFNATFLCLQCTAYSNVTST S +      GEKGMH+AV+IVIS+VAS VLI+GMVVGYKYWQKK
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHD----GEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKK

Query:  RREQDQARFLKLFEDGDDIEDELGLSDVI
        RR+QDQARFLKLFEDGDDIEDELGLSDVI
Subjt:  RREQDQARFLKLFEDGDDIEDELGLSDVI

A0A6J1CRX4 uncharacterized protein LOC1110136281.9e-11186.67Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR TLNF+VS++FLIVS FL+C +GG ES VVTLDS++IYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFN KEFNATF+CLQCTAYSNV+S+ +P+H+ E+GMH A +IVIS + STVLIIG+VVGYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLSDVI
        DQARFLKLFEDGDDIEDELGL DVI
Subjt:  DQARFLKLFEDGDDIEDELGLSDVI

A0A6J1HE82 uncharacterized protein LOC111463378 isoform X18.2e-10784.44Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIR + +F+V T+FL+VS  LSC S G ES VVTLDSIVIYKTHEWLASKPTVYF+CQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWE CPSDFTAP G+YVR+N KEFNATFLCL+CTAYSNVTS+ S SHD EKGMH+A +I+ISV+ STVLI+GMV+GYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLSDVI
        DQARFLKLFEDGDDIEDELGL+DVI
Subjt:  DQARFLKLFEDGDDIEDELGLSDVI

A0A6J1ICG3 uncharacterized protein LOC111471262 isoform X14.8e-10784.89Show/hide
Query:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF
        MPIRR+ +F+V T+FL+VS  LSC S G ES VVTLDSIVIYKTHEWLASKPTVYF+CQGGNKTKLPDVQK HVLYSFNGEESWQPLTEF+SKKCKRCGF
Subjt:  MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGF

Query:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ
        YEEDSIKSDDVFEEWE CPSDFTAP G+YVR+N KEFNATFLCL+CTAYSNVTS+ S SHD EKGMH+A +I+ISV+ STVLIIGMV+GYKYWQKKRREQ
Subjt:  YEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQ

Query:  DQARFLKLFEDGDDIEDELGLSDVI
        DQARFLKLFEDGDDIEDELGL+DVI
Subjt:  DQARFLKLFEDGDDIEDELGLSDVI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09645.1 unknown protein2.3e-0534.94Show/hide
Query:  FLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLSD
        FL L   + +      + SH       + V++V+  V    + +     YK WQKK+R++  AR LKLFE+ D++E ELGL D
Subjt:  FLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLSD

AT1G57765.1 unknown protein8.1e-0636.36Show/hide
Query:  FLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLSD
        FL +     S+ T   + SH       + V+ ++       L  G V G     YK WQKK+R++  AR LKLFE+ D++E ELGL D
Subjt:  FLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLSD

AT1G57765.2 unknown protein2.1e-0634.07Show/hide
Query:  NATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLSD
        ++ + C+     S+ T   + SH       + V+ ++       L  G V G     YK WQKK+R++  AR LKLFE+ D++E ELGL D
Subjt:  NATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVG-----YKYWQKKRREQDQARFLKLFEDGDDIEDELGLSD

AT3G53490.1 unknown protein2.6e-6556.34Show/hide
Query:  FLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGFYEEDSIKSDDVFEE
        FL+   F S   G + +  VTLDS+ I+ TH+W ++KPTV+FQC+G NKT LPDV++ +V YSFNGEESWQPLTE +  KCKRCG YE+D +K  D F+E
Subjt:  FLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGFYEEDSIKSDDVFEE

Query:  WEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEK--GMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQDQARFLKLFEDG
        WE CPSDFTA  G Y RF  KEFNATFLC  C+     ++  S +   E+  GMH  +V++I V+   V+ +G++VGYKYW+KK+R+Q+QARFLKLFEDG
Subjt:  WEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEK--GMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQDQARFLKLFEDG

Query:  DDIEDELGLSDVI
        DDIEDELGL + +
Subjt:  DDIEDELGLSDVI

AT5G02720.1 unknown protein4.0e-2941.73Show/hide
Query:  LTEFESKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGM
        +T    +KCKRCG YE+ S+ SD  F+ WE CP+DF+A +  Y+ F  KE NATF+C  C  + +  +  SP  +G  G+   + I+  V+ +T++++G 
Subjt:  LTEFESKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGM

Query:  VVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLSDVI
        V  +K+ Q+ ++++DQARF++LFE+ D+ EDELGL  VI
Subjt:  VVGYKYWQKKRREQDQARFLKLFEDGDDIEDELGLSDVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATTCGGCGTACTCTGAATTTCAAAGTTTCAACGCTTTTTCTTATCGTTTCGACATTCCTGAGCTGCCAGTCAGGGGGCGTAGAATCGGGGGTTGTTACG
CTTGATTCCATCGTAATATACAAGACGCACGAATGGTTAGCATCGAAACCAACAGTTTATTTTCAATGTCAAGGGGGAAATAAGACGAAATTGCCTGATGTACAG
AAAGAGCATGTTTTATACAGCTTCAATGGTGAAGAATCATGGCAGCCATTGACTGAATTTGAAAGCAAAAAGTGCAAGCGATGTGGGTTCTATGAAGAGGACAGC
ATTAAATCTGACGATGTATTTGAAGAATGGGAGTTTTGTCCATCTGATTTCACAGCTCCTGCTGGAAAATATGTAAGATTCAATGCAAAAGAGTTCAATGCCACT
TTTCTATGCTTGCAGTGCACAGCATATTCCAATGTTACCAGTACAATTTCACCTTCGCATGACGGAGAAAAGGGAATGCATTCTGCTGTAGTCATAGTTATCAGT
GTTGTGGCTTCAACTGTGTTAATTATTGGTATGGTGGTTGGTTACAAATATTGGCAAAAAAAGAGAAGAGAGCAAGATCAGGCCAGGTTTCTGAAGCTGTTTGAA
GATGGGGATGACATTGAGGATGAATTGGGCCTTAGCGATGTAATTTGA
mRNA sequenceShow/hide mRNA sequence
GTTGGTGTAAGCGTGTCACAGACACAAAATTCCCCAGTTCTTCCATTACCCCACAGGTCCATTCGCACGCAAAGGAGCCAAATCGCCATAACCTCGAAGCTTTCT
TTAATTGAATCGGAGTTAAATCTTCCGCCGTCTGCCACCGCCCTATCTCACCGGCTGTTTCATCGATCCCATCGTTTCAATTTTCAACATCTCTCCGATAATCGC
TATTAGGGTTTTCAGAAAATTTGACACGCGATATGCCGATTCGGCGTACTCTGAATTTCAAAGTTTCAACGCTTTTTCTTATCGTTTCGACATTCCTGAGCTGCC
AGTCAGGGGGCGTAGAATCGGGGGTTGTTACGCTTGATTCCATCGTAATATACAAGACGCACGAATGGTTAGCATCGAAACCAACAGTTTATTTTCAATGTCAAG
GGGGAAATAAGACGAAATTGCCTGATGTACAGAAAGAGCATGTTTTATACAGCTTCAATGGTGAAGAATCATGGCAGCCATTGACTGAATTTGAAAGCAAAAAGT
GCAAGCGATGTGGGTTCTATGAAGAGGACAGCATTAAATCTGACGATGTATTTGAAGAATGGGAGTTTTGTCCATCTGATTTCACAGCTCCTGCTGGAAAATATG
TAAGATTCAATGCAAAAGAGTTCAATGCCACTTTTCTATGCTTGCAGTGCACAGCATATTCCAATGTTACCAGTACAATTTCACCTTCGCATGACGGAGAAAAGG
GAATGCATTCTGCTGTAGTCATAGTTATCAGTGTTGTGGCTTCAACTGTGTTAATTATTGGTATGGTGGTTGGTTACAAATATTGGCAAAAAAAGAGAAGAGAGC
AAGATCAGGCCAGGTTTCTGAAGCTGTTTGAAGATGGGGATGACATTGAGGATGAATTGGGCCTTAGCGATGTAATTTGA
Protein sequenceShow/hide protein sequence
MPIRRTLNFKVSTLFLIVSTFLSCQSGGVESGVVTLDSIVIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTEFESKKCKRCGFYEEDS
IKSDDVFEEWEFCPSDFTAPAGKYVRFNAKEFNATFLCLQCTAYSNVTSTISPSHDGEKGMHSAVVIVISVVASTVLIIGMVVGYKYWQKKRREQDQARFLKLFE
DGDDIEDELGLSDVI