; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012361 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012361
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationChr01:20381864..20383910
RNA-Seq ExpressionHG10012361
SyntenyHG10012361
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]1.9e-12190.6Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]2.5e-12190.6Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LK+NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSS+ LVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]7.4e-12190.17Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGL+N
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]5.1e-11486.55Show/hide
Query:  MAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRS
        MAGRR EGD KKT     PSS+WP IKPKQ+LQI RLKENDLFTVPSFFT VES+AFIK AES+GFVHQGSLGP+KGEAYRDNDRISVNDPDLAD IWRS
Subjt:  MAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRS

Query:  GLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVA
        GLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGEGK TYYTLL+YLSGGSK K KNDTN+ KDPSSEPLVGGET+FYG RN +VAEVA
Subjt:  GLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVA

Query:  PTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        PTEGMALLHLHG+KCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  PTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]7.1e-12491.91Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRR EGDFKKTDPSSKWP IKPKQNLQI+RLK+NDLFTVPSFF+CVES+ FIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LF+DIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN +VAEV+PTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFSS
        MALLHLHG+KCLLHEARNVT+GVKYVFRSDVIFSS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFSS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein1.2e-12190.6Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LK+NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSS+ LVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840333.6e-12190.17Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGL+N
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 19.4e-12290.6Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1FZS7 uncharacterized protein LOC1114494132.5e-11486.55Show/hide
Query:  MAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRS
        MAGRR EGD KKT     PSS+WP IKPKQ+LQI RLKENDLFTVPSFFT VES+AFIK AES+GFVHQGSLGP+KGEAYRDNDRISVNDPDLAD IWRS
Subjt:  MAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRS

Query:  GLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVA
        GLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGEGK TYYTLL+YLSGGSK K KNDTN+ KDPSSEPLVGGET+FYG RN +VAEVA
Subjt:  GLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVA

Query:  PTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        PTEGMALLHLHG+KCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  PTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676973.2e-11486.55Show/hide
Query:  MAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRS
        MAGRR EGD KKT     PSS+WP IKPKQ+LQI RLKENDLFTVPSFF  VES+AFIK AES+GFVHQGSLGP+KGEAYRDNDRISVNDPDLAD IWRS
Subjt:  MAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRS

Query:  GLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVA
        GLD  FADIKIRGKVAVGLNPNIR YRYKVGQRFGRHIDESVDLGEGK TYYTLL+YLSGGSK K KNDTN+ KDPSSEPLVGGET+FYG RN +VAEVA
Subjt:  GLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVA

Query:  PTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        PTEGMALLHLHG+KCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  PTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.7e-8871.04Show/hide
Query:  SSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL
        S KWP IK K NL ++ LK +DLFTV +  T  ES+AF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGS-KNKMKNDTNDFKDPSS-EPLVGGETIFYGSRNSIVAEVAPTEGMALLHLHGNKCLL
        NPNIR YRY  GQ FGRHIDES DL +G  TYYTLL+YLSG S K+K K+ ++   D SS EPLVGGET+FYGSRNSIVAEVAP EGMAL H+HG+KC+L
Subjt:  NPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGS-KNKMKNDTNDFKDPSS-EPLVGGETIFYGSRNSIVAEVAPTEGMALLHLHGNKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGCAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAACCATTAAACCCAAGCAGAATCTTCAAATCACTCGCCTGAAAGAAAATGA
TCTTTTCACTGTGCCAAGTTTTTTCACTTGTGTTGAGTCAAGAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGAG
AAGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCTGGGCTTGATAACCTATTTGCTGATATTAAAATTCGGGGAAAA
GTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGAGAAGGCAAGTGCACATA
TTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAATGAAAAATGATACCAACGACTTCAAAGATCCTTCTTCTGAGCCTCTGGTTGGAGGGGAAACTATTT
TCTATGGTTCAAGGAATAGTATTGTTGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGAACAAGTGTTTGTTGCATGAAGCTCGCAACGTTACA
AAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGCAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAACCATTAAACCCAAGCAGAATCTTCAAATCACTCGCCTGAAAGAAAATGA
TCTTTTCACTGTGCCAAGTTTTTTCACTTGTGTTGAGTCAAGAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGAG
AAGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCTGGGCTTGATAACCTATTTGCTGATATTAAAATTCGGGGAAAA
GTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGAGAAGGCAAGTGCACATA
TTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAATGAAAAATGATACCAACGACTTCAAAGATCCTTCTTCTGAGCCTCTGGTTGGAGGGGAAACTATTT
TCTATGGTTCAAGGAATAGTATTGTTGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGAACAAGTGTTTGTTGCATGAAGCTCGCAACGTTACA
AAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCTTCATGA
Protein sequenceShow/hide protein sequence
MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGK
VAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEGMALLHLHGNKCLLHEARNVT
KGVKYVFRSDVIFSS