; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023063 (gene) of Chayote v1 genome

Gene IDSed0023063
OrganismSechium edule (Chayote v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationLG13:1172189..1180679
RNA-Seq ExpressionSed0023063
SyntenySed0023063
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]1.1e-11182.79Show/hide
Query:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA
        MAE+QGKKRKMAGR+G+G          WP IK KQNLQ+  LK +DLFTVP FFT VES+AFIK AES+GF+HQGSLGPT+GEAYRDNDRISV+DPDLA
Subjt:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA

Query:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG
        D IWRSGLN LFADIKIR K+AVGLNPNIR YRY VGQRFGRHIDESVDLG GK TYYTLLIYLSGGSK KTKNDTN+SKDPSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHGDKCLLHE RNV KGVKYVFRSDVIFS
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]2.6e-11384.77Show/hide
Query:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA
        MAE+QGKKRKMAGR+G+G          WPAIK KQNLQI RLK++DLFTVP F TSVES+ FI IAESMGFVHQGSLGPT+GEA+RDNDRISV+DPDLA
Subjt:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA

Query:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGL+KLFADIKIR KVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GK TYYTLLIYLSGGSK KTKNDTN SK PSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIF
        +VAEVAPTEGMALLHLHGDKCLLHE RNVTKG+KY+FRSDV F
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIF

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]5.7e-11383.87Show/hide
Query:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD
        MAE+QG+KRKMAGR+ +G              WPAIK KQ+LQI RLK++DLFTVP FFTSVES+AFIK AESMGFVHQGSLGP++GEAYRDNDRISV+D
Subjt:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD

Query:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG
        PDLADTIWRSGL+K FADIKIR KVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGK TYYTLLIYLSGGSK KTKNDTN+SKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHE RNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

XP_023541473.1 uncharacterized protein LOC111801646 [Cucurbita pepo subsp. pepo]5.7e-11383.87Show/hide
Query:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD
        MAE+QG+KRKMAGR+ +G              WPAIK KQ+LQI RLK++DLFTVP FFTSVES+AFIK AESMGFVHQGSLGP++GEAYRDNDRISV+D
Subjt:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD

Query:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG
        PDLADTIWRSGL+K FADIKIR KVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGK TYYTLLIYLSGGSK KTKNDTN+SKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHE RNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]9.7e-11383.61Show/hide
Query:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA
        MAE+QGKKRKMAGR+ +G          WPAIK KQNLQI+RLKD+DLFTVP FF+ VES+ FIK AES+GFVHQGSLGPT+GEAYRDNDRISV+DPDLA
Subjt:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA

Query:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG
        D IWRSGL+ LF+DIKIR KVAVGLNPNIR YRY VGQRFGRHIDESVDLGEGK TYYTLLIYLSGGSK KTKNDTN+SKDPSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
        +VAEV+PTEGMALLHLHGDKCLLHE RNVT+GVKYVFRSDVIFS
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein2.9e-11081.97Show/hide
Query:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA
        MAE+ GKKRKMAGR+G+G          WP IK KQNLQ+  LKD+DLFTVP FFT VES+AFIK AES+GF+HQGSLGPT+GEAYRDNDRISV+DPDLA
Subjt:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA

Query:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG
        D IWRSGL+ LFADIKIR KVAVGLNPNIR YRY VGQRFGRHIDESVDLG GK TYYTLLIYLSGGSK KTKNDTN+SKDPSS+ LVGGETVFYGSRNG
Subjt:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHGDKCLLHE RNV KGVKYVFRSDVIFS
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840335.2e-11282.79Show/hide
Query:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA
        MAE+QGKKRKMAGR+G+G          WP IK KQNLQ+  LK +DLFTVP FFT VES+AFIK AES+GF+HQGSLGPT+GEAYRDNDRISV+DPDLA
Subjt:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA

Query:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG
        D IWRSGLN LFADIKIR K+AVGLNPNIR YRY VGQRFGRHIDESVDLG GK TYYTLLIYLSGGSK KTKNDTN+SKDPSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHGDKCLLHE RNV KGVKYVFRSDVIFS
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC1110113961.2e-11384.77Show/hide
Query:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA
        MAE+QGKKRKMAGR+G+G          WPAIK KQNLQI RLK++DLFTVP F TSVES+ FI IAESMGFVHQGSLGPT+GEA+RDNDRISV+DPDLA
Subjt:  MAESQGKKRKMAGRKGDGGL--------WPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLA

Query:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGL+KLFADIKIR KVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GK TYYTLLIYLSGGSK KTKNDTN SK PSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIF
        +VAEVAPTEGMALLHLHGDKCLLHE RNVTKG+KY+FRSDV F
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC1114494132.8e-11383.87Show/hide
Query:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD
        MAE+QG+KRKMAGR+ +G              WPAIK KQ+LQI RLK++DLFTVP FFTSVES+AFIK AESMGFVHQGSLGP++GEAYRDNDRISV+D
Subjt:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD

Query:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG
        PDLADTIWRSGL+K FADIKIR KVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGK TYYTLLIYLSGGSK KTKNDTN+SKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHE RNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676976.8e-11283.06Show/hide
Query:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD
        MAE+QG+KRKMAGR+ +G              WPAIK KQ+LQI RLK++DLFTVP FF SVES+AFIK AESMGFVHQGSLGP++GEAYRDNDRISV+D
Subjt:  MAESQGKKRKMAGRKGDG------------GLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDD

Query:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG
        PDLADTIWRSGL+K FADIKIR KVAVGLNPNIRFYRY VGQRFGRHIDESVDLGEGK TYYTLLIYLSGGSK KTKNDTN+SKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLNKLFADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYG

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHE RNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.5e-9069.23Show/hide
Query:  QGKKRKMAGRKGDGGLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLADTIWRSGLNKLF
        Q  +    G  G+   WP IK K NL ++ LK+ DLFTV    TS ES+AF+KIAES+GF HQGS GP  GEAYRDN RISV+DP LADT+W+SGL+ LF
Subjt:  QGKKRKMAGRKGDGGLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLADTIWRSGLNKLF

Query:  ADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGS-KRKTKNDTNSSKDPSS-EPLVGGETVFYGSRNGLVAEVAPTEG
         DIKIRRKVAVGLNPNIRFYRY+ GQ FGRHIDES DL +G  TYYTLLIYLSG S K K+K+ ++ + D SS EPLVGGETVFYGSRN +VAEVAP EG
Subjt:  ADIKIRRKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGS-KRKTKNDTNSSKDPSS-EPLVGGETVFYGSRNGLVAEVAPTEG

Query:  MALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS
        MAL H+HGDKC+LHEGRNV+KGVKYVFRSDV+F+
Subjt:  MALLHLHGDKCLLHEGRNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAATCACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAAAGGAGATGGGGGTTTATGGCCAGCCATTAAATCCAAGCAGAATCTTCAGATCACTCGCCTGAAAGA
TGATGATCTTTTCACTGTGCCTGGTTTTTTCACAAGTGTTGAGTCACAAGCATTCATCAAGATTGCTGAGTCGATGGGGTTTGTTCATCAGGGGAGCCTTGGTCCTACCA
GAGGAGAAGCTTACAGAGATAATGATCGAATCTCGGTGGATGATCCGGATTTAGCAGACACCATTTGGCGCTCGGGACTTAATAAGCTATTTGCCGATATTAAAATTCGG
AGAAAAGTAGCTGTTGGGTTGAATCCCAATATCAGATTTTACAGATACAATGTCGGTCAGCGCTTTGGACGCCATATTGATGAAAGTGTTGATCTTGGAGAGGGCAAGTG
CACATATTATACTCTGTTAATATATTTAAGCGGAGGTTCGAAAAGAAAAACAAAAAATGATACCAACAGCTCCAAAGATCCTTCTTCTGAGCCTCTGGTTGGAGGAGAAA
CTGTATTCTATGGTTCTAGGAATGGCCTTGTGGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCATCTTCATGGGGACAAGTGTTTGCTGCATGAAGGTCGAAAC
GTTACGAAGGGTGTCAAATATGTTTTCCGTTCGGACGTCATATTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
AGAATTTCATCGAAGAAACAGAGCAGCTTTGTTGATTTCCAATGGCGGAATCACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAAAGGAGATGGGGGTTTATGGCCAGCC
ATTAAATCCAAGCAGAATCTTCAGATCACTCGCCTGAAAGATGATGATCTTTTCACTGTGCCTGGTTTTTTCACAAGTGTTGAGTCACAAGCATTCATCAAGATTGCTGA
GTCGATGGGGTTTGTTCATCAGGGGAGCCTTGGTCCTACCAGAGGAGAAGCTTACAGAGATAATGATCGAATCTCGGTGGATGATCCGGATTTAGCAGACACCATTTGGC
GCTCGGGACTTAATAAGCTATTTGCCGATATTAAAATTCGGAGAAAAGTAGCTGTTGGGTTGAATCCCAATATCAGATTTTACAGATACAATGTCGGTCAGCGCTTTGGA
CGCCATATTGATGAAAGTGTTGATCTTGGAGAGGGCAAGTGCACATATTATACTCTGTTAATATATTTAAGCGGAGGTTCGAAAAGAAAAACAAAAAATGATACCAACAG
CTCCAAAGATCCTTCTTCTGAGCCTCTGGTTGGAGGAGAAACTGTATTCTATGGTTCTAGGAATGGCCTTGTGGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGC
ATCTTCATGGGGACAAGTGTTTGCTGCATGAAGGTCGAAACGTTACGAAGGGTGTCAAATATGTTTTCCGTTCGGACGTCATATTTTCTTGACGATAAGTTTAGCCAGGT
CATCACCATGGAGTTTTCGGCATCCCATGTAGCAAAAGATGAATCATCATTGGAAGGTGCTACCTTGTCTCCAGTAAGGTTGTTGGTCGCCTGGGCGAAAATTACATGGA
ATTACCTACACAAGATTAAAATTGTGCTATTGGCTTGGTTATCTATAGGTCCTGTAATGTTGTAAAAACTGCAAACAGGTCACCGAGGTAGCCACCTCATTTTATTCAAT
ATCTAAACACCCGATTTTATCTCGTAGATGAGCTCGTCCAATGTCGATTTTCTTACGTCAAGGTTGGTCCCTAGAATTTCTTAAGAATTCAGTGTATCACTAGTTGAAAT
GGAACTGGCCATATTCTTAGAAGCTGTATTACAATTACTTGGTGGCTTGTTGAAATTTATGACATTCTACTTCATATGCCACTCTGATATGTGTTATTTGTCGTATTTAC
CTTCCTTTTTATTATGTCAAAGGAATTTACTCTTCTAAATTTGATGTTTCCTTTCTGATGTGGCTTGTAATTCAAAGTTTCCATCTTTATGTGGC
Protein sequenceShow/hide protein sequence
MAESQGKKRKMAGRKGDGGLWPAIKSKQNLQITRLKDDDLFTVPGFFTSVESQAFIKIAESMGFVHQGSLGPTRGEAYRDNDRISVDDPDLADTIWRSGLNKLFADIKIR
RKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKCTYYTLLIYLSGGSKRKTKNDTNSSKDPSSEPLVGGETVFYGSRNGLVAEVAPTEGMALLHLHGDKCLLHEGRN
VTKGVKYVFRSDVIFS