; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019967 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019967
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationchr5:47084150..47091093
RNA-Seq ExpressionLag0019967
SyntenyLag0019967
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]1.9e-12290.79Show/hide
Query:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW
        +GKKRKMAGRRGEGDSKKT P++ WPAIKPKQNL I RLKENDLFTVP+F T+VESK FI  AESMGF+HQGSLGPTKGEA+RDNDRISVNDPDLADTIW
Subjt:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW

Query:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE
        RSGLDKLF DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNGVVAE
Subjt:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE

Query:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        VAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]5.6e-12291.39Show/hide
Query:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA
        +G+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLKENDLFTVP+FFT+VESKAFIKTAESMGF+HQGSLGP+KGEAYRDNDRISVNDPDLA
Subjt:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG RN 
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022968459.1 uncharacterized protein LOC111467697 [Cucurbita maxima]1.4e-12090.57Show/hide
Query:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA
        +G+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLKENDLFTVP+FF +VESKAFIKTAESMGF+HQGSLGP+KGEAYRDNDRISVNDPDLA
Subjt:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG RN 
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_023541473.1 uncharacterized protein LOC111801646 [Cucurbita pepo subsp. pepo]5.6e-12291.39Show/hide
Query:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA
        +G+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLKENDLFTVP+FFT+VESKAFIKTAESMGF+HQGSLGP+KGEAYRDNDRISVNDPDLA
Subjt:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG RN 
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]4.7e-12190.42Show/hide
Query:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW
        +GKKRKMAGRR EGD KKT PSSKWPAIKPKQNL I+RLK+NDLFTVP+FF+ VESK FIK AES+GF+HQGSLGPTKGEAYRDNDRISVNDPDLAD IW
Subjt:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW

Query:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE
        RSGLD LF+DIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE
Subjt:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE

Query:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFS
Subjt:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein3.3e-12089.96Show/hide
Query:  GKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIWR
        GKKRKMAGRRGEGD KKT PSS WP IKPKQNL +  LK+NDLFTVP+FFT VESKAFIK AES+GF+HQGSLGPTKGEAYRDNDRISVNDPDLAD IWR
Subjt:  GKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIWR

Query:  SGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAEV
        SGLD LF DIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSS+ LVGGETVFYGSRNGV+AEV
Subjt:  SGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAEV

Query:  APTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        APTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  APTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840337.3e-12089.17Show/hide
Query:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW
        +GKKRKMAGRRGEGD  KT PSS WP IKPKQNL +  LK NDLFTVP+FFT VESKAFIK AES+GF+HQGSLGPTKGEAYRDNDRISVNDPDLAD IW
Subjt:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW

Query:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE
        RSGL+ LF DIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYGSRNGV+AE
Subjt:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE

Query:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC1110113969.3e-12390.79Show/hide
Query:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW
        +GKKRKMAGRRGEGDSKKT P++ WPAIKPKQNL I RLKENDLFTVP+F T+VESK FI  AESMGF+HQGSLGPTKGEA+RDNDRISVNDPDLADTIW
Subjt:  KGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIW

Query:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE
        RSGLDKLF DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNGVVAE
Subjt:  RSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAE

Query:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        VAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  VAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC1114494132.7e-12291.39Show/hide
Query:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA
        +G+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLKENDLFTVP+FFT+VESKAFIKTAESMGF+HQGSLGP+KGEAYRDNDRISVNDPDLA
Subjt:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG RN 
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676976.6e-12190.57Show/hide
Query:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA
        +G+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLKENDLFTVP+FF +VESKAFIKTAESMGF+HQGSLGP+KGEAYRDNDRISVNDPDLA
Subjt:  KGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG RN 
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-8973.3Show/hide
Query:  SSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDKLFTDIKIRGKVAVGL
        S KWP IK K NL ++ LK +DLFTV    T+ ESKAF+K AES+GF HQGS GP  GEAYRDN RISVNDP LADT+W+SGL  LFTDIKIR KVAVGL
Subjt:  SSKWPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDKLFTDIKIRGKVAVGL

Query:  NPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KSKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDKCLL
        NPNIRFYRY+ GQ FGRHIDES DL  G RTYYTLLIYLSG S KSK+K+ ++ + D SS EPLVGGETVFYGSRN +VAEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KSKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGGGGAAGACGTGGCCTACAAGACAGTAAACCTGCACACCGGTGAGGTGCTCGCCACACCGGCTCTGATGCTTAAGTCAGCAAACAGAACGGTCAAACCAGGCGA
AACCGGGGCATCCAGAGGCGGTGGGGACCAGACGGGACCGAACAGGCTCGGCCCGCGCGAGCGGGCCGAGCAGGGGGTCGGGCCTAAAACCCGACCCCTTCGGTCTTGGC
CCGTCTCACTTGCCGGTTTTGCCTCCTTGGTCCATCTTTCAGCCCGATTCTTCCCGGTTGTCCTCGACCAAAAGACTGACCCAGAGGAAGACCAGGCCAAAAGGTCGGGC
CACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATTTGGCCCGCTTGCACGGGCCGAGCCCGATTACCCCTTTTCGGTCCTTGATGCCCCGGATCGCCCCGAA
TCGCCCCGGTTCCGTTGCTTCTCCTCGATTTGCTGACTTAGGCATCGGAGGCGGTGTGGCCTACACCACACGGGTGTGCAACGGTTTTTGTTGGTCTTGCAGGTCACGTC
TTCCCCAGCTTCTACAAATTCACTATTGGTGTCACGTGAAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTCGAAGAAAACAGCCCCTTCTTCAAAG
TGGCCAGCCATTAAACCCAAGCAGAATCTTCTGATCACCCGCCTCAAAGAAAATGATCTTTTCACCGTGCCAACTTTTTTCACAAATGTTGAGTCAAAAGCATTCATCAA
GACTGCAGAGTCGATGGGTTTTATTCATCAGGGGAGCCTTGGTCCTACAAAAGGAGAAGCCTATAGAGATAATGATCGAATTTCGGTGAATGATCCTGATTTAGCAGATA
CCATTTGGCGTTCGGGACTTGATAAACTGTTTACTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCGAATATTAGATTTTACAGATACAACGTTGGTCAG
CGCTTTGGACGCCATATTGATGAAAGTGTTGATCTTGGAGGAGGGAAGCGCACATATTATACTTTATTAATATATTTAAGCGGAGGTTCCAAAAGCAAAACAAAAAATGA
TACCAATAATTCCAAAGATCCTTCTTCCGAGCCTCTAGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATGGCGTTGTGGCTGAGGTGGCTCCTACTGAAGGGATGG
CTCTCCTGCATCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAATGTTACGAAGGGTGTCAAATACGTTTTCCGTTCTGACGTCATATTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGGGGAAGACGTGGCCTACAAGACAGTAAACCTGCACACCGGTGAGGTGCTCGCCACACCGGCTCTGATGCTTAAGTCAGCAAACAGAACGGTCAAACCAGGCGA
AACCGGGGCATCCAGAGGCGGTGGGGACCAGACGGGACCGAACAGGCTCGGCCCGCGCGAGCGGGCCGAGCAGGGGGTCGGGCCTAAAACCCGACCCCTTCGGTCTTGGC
CCGTCTCACTTGCCGGTTTTGCCTCCTTGGTCCATCTTTCAGCCCGATTCTTCCCGGTTGTCCTCGACCAAAAGACTGACCCAGAGGAAGACCAGGCCAAAAGGTCGGGC
CACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATTTGGCCCGCTTGCACGGGCCGAGCCCGATTACCCCTTTTCGGTCCTTGATGCCCCGGATCGCCCCGAA
TCGCCCCGGTTCCGTTGCTTCTCCTCGATTTGCTGACTTAGGCATCGGAGGCGGTGTGGCCTACACCACACGGGTGTGCAACGGTTTTTGTTGGTCTTGCAGGTCACGTC
TTCCCCAGCTTCTACAAATTCACTATTGGTGTCACGTGAAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTCGAAGAAAACAGCCCCTTCTTCAAAG
TGGCCAGCCATTAAACCCAAGCAGAATCTTCTGATCACCCGCCTCAAAGAAAATGATCTTTTCACCGTGCCAACTTTTTTCACAAATGTTGAGTCAAAAGCATTCATCAA
GACTGCAGAGTCGATGGGTTTTATTCATCAGGGGAGCCTTGGTCCTACAAAAGGAGAAGCCTATAGAGATAATGATCGAATTTCGGTGAATGATCCTGATTTAGCAGATA
CCATTTGGCGTTCGGGACTTGATAAACTGTTTACTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCGAATATTAGATTTTACAGATACAACGTTGGTCAG
CGCTTTGGACGCCATATTGATGAAAGTGTTGATCTTGGAGGAGGGAAGCGCACATATTATACTTTATTAATATATTTAAGCGGAGGTTCCAAAAGCAAAACAAAAAATGA
TACCAATAATTCCAAAGATCCTTCTTCCGAGCCTCTAGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATGGCGTTGTGGCTGAGGTGGCTCCTACTGAAGGGATGG
CTCTCCTGCATCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAATGTTACGAAGGGTGTCAAATACGTTTTCCGTTCTGACGTCATATTTTCTTGA
Protein sequenceShow/hide protein sequence
MRGEDVAYKTVNLHTGEVLATPALMLKSANRTVKPGETGASRGGGDQTGPNRLGPRERAEQGVGPKTRPLRSWPVSLAGFASLVHLSARFFPVVLDQKTDPEEDQAKRSG
HPYGRPRQKAEADHLARLHGPSPITPFRSLMPRIAPNRPGSVASPRFADLGIGGGVAYTTRVCNGFCWSCRSRLPQLLQIHYWCHVKGKKRKMAGRRGEGDSKKTAPSSK
WPAIKPKQNLLITRLKENDLFTVPTFFTNVESKAFIKTAESMGFIHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQ
RFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS