; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy1G028075 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy1G028075
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionCupin_5 domain-containing protein
Genome locationGy14Chr1:26303132..26307263
RNA-Seq ExpressionCsGy1G028075
SyntenyCsGy1G028075
Gene Ontology termsNA
InterPro domainsIPR009327 - Cupin domain of unknown function DUF985
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR039935 - Uncharacterized protein YML079W-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10013.1 Cupin_5 domain-containing protein [Cucumis melo var. makuwa]2.18e-13697.35Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LPPEYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_004144040.1 uncharacterized protein LOC101210771 isoform X1 [Cucumis sativus]1.66e-140100Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_008450945.1 PREDICTED: uncharacterized protein LOC103492382 [Cucumis melo]3.62e-13596.83Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LP EYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_022147566.1 uncharacterized protein LOC111016461 [Momordica charantia]2.17e-12789.95Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIV KL+LKPHPEGGFYSETFRD SVHLSK+HLPP+YKV REVSTCIYFL+PSGCVS+LHRIPCAETWHFYLGEPLT+LELN+ DGRVKLTCLG 
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFG+FPTKDF ISADGT+TKAAPRDS+NHYSLVGC+CAPAFQFEDFELAKRSDLVSRFPDSEA VSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

XP_038878514.1 uncharacterized protein LOC120070727 [Benincasa hispida]1.95e-13294.15Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIV KLNLKPHPEGGFY+ETFRD SVHLSKSHLPPEYKVDREVSTCIYFL+PSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPD
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGT+TKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDL SRFP+SEA +SLLTP+
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPD

TrEMBL top hitse value%identityAlignment
A0A0A0M280 Cupin_5 domain-containing protein8.04e-141100Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A1S3BPT5 uncharacterized protein LOC1034923821.75e-13596.83Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LP EYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A5A7UIT0 Cupin_5 domain-containing protein1.75e-13596.83Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LP EYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A5D3CDS0 Cupin_5 domain-containing protein1.05e-13697.35Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATAS+IV KLNLKPHPEGGFYSETFRDYSVHLSKS LPPEYKVDREVSTCIYFLMPSGCVS+LHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

A0A6J1D2R9 uncharacterized protein LOC1110164611.05e-12789.95Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        MATASEIV KL+LKPHPEGGFYSETFRD SVHLSK+HLPP+YKV REVSTCIYFL+PSGCVS+LHRIPCAETWHFYLGEPLT+LELN+ DGRVKLTCLG 
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA
        DLIGDNQLPQYTVPPNVWFG+FPTKDF ISADGT+TKAAPRDS+NHYSLVGC+CAPAFQFEDFELAKRSDLVSRFPDSEA VSLLTP A
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19130.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF985 (InterPro:IPR009327), RmlC-like jelly roll fold (InterPro:IPR014710); Has 1465 Blast hits to 1465 proteins in 584 species: Archae - 10; Bacteria - 1038; Metazoa - 19; Fungi - 43; Plants - 51; Viruses - 0; Other Eukaryotes - 304 (source: NCBI BLink).1.1e-7771.51Show/hide
Query:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS
        M  +SEIVGKLNL+ H EGGF++ETFRD SV LS S LPP +KVDR VST IYFL+PSG VS LHRIP AETWHFYLGEPLTV+EL + DG++K TCLG 
Subjt:  MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGS

Query:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLT
        DL   +Q PQYTVPPNVWFG+FPTKD + S DG L KA  RDSENH+SLVGC+CAPAFQFEDFELAKRSDL+SRFP  E+ +++L+
Subjt:  DLIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACAGCATCAGAAATTGTAGGGAAATTGAATCTGAAGCCACATCCCGAAGGCGGTTTTTACTCTGAAACCTTCAGGGACTACTCCGTGCATCTCTCCAAATCTCA
CCTCCCACCCGAGTACAAGGTTGATCGAGAGGTCAGTACGTGTATATACTTTTTGATGCCGTCTGGATGTGTTTCTTCTCTTCATCGTATACCATGTGCTGAGACTTGGC
ATTTTTACTTGGGAGAACCACTTACGGTATTAGAGTTGAATGAAAAGGACGGTCGAGTCAAATTGACTTGTCTTGGGTCTGATCTCATTGGAGATAATCAACTACCTCAG
TATACAGTGCCTCCTAATGTTTGGTTTGGTGCTTTTCCAACCAAAGATTTCAATATTTCTGCTGATGGAACTCTGACCAAAGCTGCTCCAAGGGACTCTGAGAATCACTA
CTCGCTTGTGGGCTGCAGCTGTGCACCTGCTTTCCAGTTTGAGGACTTTGAGTTGGCAAAACGCTCTGATCTTGTTTCACGTTTTCCCGATAGTGAAGCATTCGTCTCGT
TGCTGACACCAGATGCCTGA
mRNA sequenceShow/hide mRNA sequence
TAAGAACAAATTTAAAATCGATTTATATAACAAAATATCACATTTGATTAATCAACAAAATTTTTGGGTCCTTCACAAAAGAAACAGTCTATGTTGTCTATCAAAATATT
TTGTTATATTAATATGAAATTATCAGTAATGAGTGATTGCTTTAGGTTGAAGGAGGTTTGGTGAAGAACAAAAGAAGCAAAAAACAATGGCTACAGCATCAGAAATTGTA
GGGAAATTGAATCTGAAGCCACATCCCGAAGGCGGTTTTTACTCTGAAACCTTCAGGGACTACTCCGTGCATCTCTCCAAATCTCACCTCCCACCCGAGTACAAGGTTGA
TCGAGAGGTCAGTACGTGTATATACTTTTTGATGCCGTCTGGATGTGTTTCTTCTCTTCATCGTATACCATGTGCTGAGACTTGGCATTTTTACTTGGGAGAACCACTTA
CGGTATTAGAGTTGAATGAAAAGGACGGTCGAGTCAAATTGACTTGTCTTGGGTCTGATCTCATTGGAGATAATCAACTACCTCAGTATACAGTGCCTCCTAATGTTTGG
TTTGGTGCTTTTCCAACCAAAGATTTCAATATTTCTGCTGATGGAACTCTGACCAAAGCTGCTCCAAGGGACTCTGAGAATCACTACTCGCTTGTGGGCTGCAGCTGTGC
ACCTGCTTTCCAGTTTGAGGACTTTGAGTTGGCAAAACGCTCTGATCTTGTTTCACGTTTTCCCGATAGTGAAGCATTCGTCTCGTTGCTGACACCAGATGCCTGATTCA
ACTGAAGAAGACTGACCCTTTTTCTGGAAATCATAGCTGAGATTCTGATTATAAGGAGATCTGGATTGTATGATGTGATTTTGATCTCTGTTCATTTTCAAATCACGGTG
TAGTTAAAGCTTTTAGTCAGTGGATTGTGCTCTAGTGGAACCTCCTATGGCTTACTGAAGATAATGATAATGAATAAGGCTCTCTTTTACGACGAATCTTGTAAATCCAA
TATGTTTTCAGGTTCTGTGGATGCTGCCCCTCACATGAAAAGAGAAATGTACCCATCACAAAAGGAAAAAAATCAGTGAGAGAGACCTTACCTCTGCCTCTCTCTCTCTC
TCTCTCTCTCCATGAGTTTAAGTGTAAAGAATGTGGGCCATCTGTAAAGTTCTTGATTTCATTCTATGATTTAGACATATCCCTAATTTATTTTCTTCCATTCTAGAATA
TTGTCAGAACAATAATTTTGAGAATTTCTGGTGAAAGACATGTGATAAGCAAATTGTGATTTCAACATCTTTTACTTATAGGAAGGAAAGATTACGAAAGAATTATTCAT
CACGGCAGAGTCCTTTTGCTTTTCTGTTTGATTTTATATAATAGACACTGAATTGTGAGTAACATCCCCAGTTGTTAAATTGTATTTCCAAAGGATATTGTAGGAGTCTC
TCAAAAGATAAAATTGAGCTCAGTATCAATTATTCCAAACTGTTAGATAAGTACAATAGCTAAATATCCAAATAAACAATCAAGAAACCAAGCGAGAAAATAATGTCTTG
TGAACTGTGATATTTTCTATGGCAGCGTCACAAATGGCAGGTAGCTTCTCAGATTTCTAAAACGATATCTGCACAAGAAAATGGAAACAAACAATCAGGTTAGTCTATAA
AAGTAAAACAGCTAAAAGAGTTTTGTGTATTATGGGAAAAACATATTAGAATTTTGTGTATATGATGGCAAGAAACGTGCATGTTTGATGGAACTTGAAAGTTGGTTTAT
AGGTACTGATTATGTTGTTAATACTGACATTCAGAATCAGTCTGAACCCATTTACTTCCATCACTTGTCTTGTCGTTCTTTTCGTCTTCATTTTCATCTCTTTGCTTGTT
TTTCTTCCTCAGGTACTTTTTCTTTGATGATGTTCTGACATTTGAGCTTCGGGGAGCTGGTCTCAGAGGAGGAGTAACCACAGCAGGTGGTGCGGTACCTCCTCCACAGA
CAAAAATCTTCTTGAGAAGGAGGGATAAAGTTCTTTTGCCAATAATTGCGGTGTCATTAGCGTCCACACCTACATCTTTGCCTTTACTGAGTATGACATGCTGGAAAATA
TTGCTCTTCTTAATCAGCTCTTCATAACATAGTTCTTTCTTTGTTGTTTCAGATCCAAAGGTTAATTGTCTGTTAAGAAATATATTTGATGGACAATGCTTGCTGACTTC
AAATTCCAAGGTCGGGCCCGATTGCTTAAGGGAAGATGATGAATTCTCTTGTGACCTTTTTGGCCGTGCTTCATTCAGATTCTCGTCTCCAAACGTTCCAATCGCCAGCA
ATACATGAGACCAGTTGCTGAATTCTTCTGGTCGAGTTTTGTGCACTATCTCGTCTGTTTGAGAAAGAACTTTTAAGTTACTTGCACTCAAATCATCTAAAAGATCATCA
TACCATTTGAAACGAAACTGAATAATAGAGTCTTTGCTCACCGTTGATTGAATTTGATGTTGTTCTTTTGCTCCCCTTTCTGCCACTAGGCTTCTTCCCTTGCATCCAAT
TCAATATCTGCACACAAAAACGAGTTATAACCAATCACGAAGATTACAAGACAAAGTCTAGATAAAACACTCTTGCATTAGCTTACCCTCATTTGTCAAAAGACGGATTG
GCAACAGGGATGTACCCTATTCTTCGTCCTCTCTGTGGCCTCTAGTTAGAGTTTGAATTCTTGTTGATAAGGCCAGATAGAACCAGGTCCAGAAAGATATAGCTATAATT
TTGGGCTATGGATCCCACAACAGGCACAAGGATTGCATACACCAAGGCAAGATGAGTGCAAAACACTTTATATAGTGAAGAGAAATGTGAGAAATTCCACAACCAAACTA
CTAGCTGCCACTGTTTCAAGTTCCAAAAC
Protein sequenceShow/hide protein sequence
MATASEIVGKLNLKPHPEGGFYSETFRDYSVHLSKSHLPPEYKVDREVSTCIYFLMPSGCVSSLHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGSDLIGDNQLPQ
YTVPPNVWFGAFPTKDFNISADGTLTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSEAFVSLLTPDA