; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G01370 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G01370
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationClcChr11:1421920..1426876
RNA-Seq ExpressionClc11G01370
SyntenyClc11G01370
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]7.9e-20887.91Show/hide
Query:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI
        M D AESRQRRRLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWT+GASI
Subjt:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSYGVEFGGGLFHFQDGEP+TISPF GDCVMY ADSDN+HSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFK GFDICWARLHALGYDIYFP DH FSEYPDLFSQDVQLV G+++F+Q+FENILHLLQVVQFLCWKGK+LD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS

Query:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        TN   DS YAEYLSP+RNV V YFKSEFSK++ LAESVFSSATS GK NQHWLGW KL   AAAAWEDYA+ILRRELLGS SHWRN QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]3.0e-20787.63Show/hide
Query:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI
        M D AESRQRRRLIL NFL+ EECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWT+GASI
Subjt:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSYGVEFGGGLFHFQDGEP+TISPF GDCVMYTAD+DN+HSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS
        HDRFPDSCLPQPPSCNMYWFSPEDDPNFK GFDICWARL ALGYD+YFP DH FSEYPDLF QDVQLV G+++F+Q+FENILHLLQVVQFLCWKGK+LDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS

Query:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        TN   DSSYAEYLSP+RNV V YFKSEFSK++ LAESVFSSA SDGK NQ WLGW KL  AAAAWE YA+ILRRELLGS SHWRN QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]3.9e-20787.66Show/hide
Query:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI
        M D AESRQRRRLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWT+GASI
Subjt:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSYGVEFGGGLFHFQDGEP+TISPF GDCVMYTADSDN+HSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFK GFDICWARLHALGYDIYFP DH FSEYPDLFSQDVQLV G+++F+Q+FENILHLLQVVQFLCWKGK+LD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS

Query:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        TN   DS YAEYLSP+RNV V YFKSEFSK++ LAESVFSSATS GK NQHWLGW KL   AAAAWEDYA+ILRRELLGS SHWRN QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

XP_022155191.1 uncharacterized protein LOC111022327 [Momordica charantia]2.7e-20083.92Show/hide
Query:  MGDEAESRQ--RRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGA
        MGDE E+RQ  RRRLIL NFLT EECRELEFIHKSCCTVGYRP+VFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGC+YELFVEFTGLISWT+GA
Subjt:  MGDEAESRQ--RRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFTAVCYLNSYGV+F GGLFHFQDGEPK+ISPFCGDCVMYTADS N+HSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQL
        HLHDRFP+SC+P PPSCNMYWFSPE+DPNFK G D+CWARLHALGYDIYFPRD+  S+YP LFS  VQLV+  ++F+QEF NILH LQVVQF+CWKGK+L
Subjt:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQL

Query:  DSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        DSTNFKG+SSYA YLSP+ N  V YFKSEFSK+ VLA+SVFSSA+SD K  Q WLGWAKLATA AAWEDYA+ LR ELL SL HWR +QS+Y VSLGS
Subjt:  DSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]4.6e-21691.21Show/hide
Query:  MGDEAES--RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGA
        MGDE ES  R+RRRLIL NFLT EECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGC YELFVEFTGLISWT+GA
Subjt:  MGDEAES--RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+F+AVCYLNSYGVEFGGGLFHFQDGEP+TISPFCGDCVMYTADSDN+HSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQL
        HLHDR PDS LPQPPSCNMYWFS EDDPNFKSGFDICWARLHALGYDIYF  DHSFSEYPDLFS+DVQLVQGN+LF+QEFENILHLLQVVQFLCWKGK+L
Subjt:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQL

Query:  DSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        DSTN K DSSYAEYLSP+RNV V YFKSEFSKD+VLAESVFSSATSDGK NQHWLGW KLA AAAAWEDYA+ILRRELLGSLS+WRNSQSIYSVSL S
Subjt:  DSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase1.5e-20484.43Show/hide
Query:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLIS-------
        M D AESRQRRRLIL NFL+ EECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLIS       
Subjt:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLIS-------

Query:  --------WTKGASIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSH
                WT+GASIGWHSDDNRPYLKQREF+AVCYLNSYGVEFGGGLFHFQDGEP+TISPF GDCVMYTAD+DN+HSVDEITNGERLTLTLWFTRDSSH
Subjt:  --------WTKGASIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSH

Query:  DEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLL
        DEDAKLLSLLSQS LHDRFPDSCLPQPPSCNMYWFSPEDDPNFK GFDICWARL ALGYD+YFP DH FSEYPDLF QDVQLV G+++F+Q+FENILHLL
Subjt:  DEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLL

Query:  QVVQFLCWKGKQLDSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRN
        QVVQFLCWKGK+LDSTN   DSSYAEYLSP+RNV V YFKSEFSK++ LAESVFSSA SDGK NQ WLGW KL  AAAAWE YA+ILRRELLGS SHWRN
Subjt:  QVVQFLCWKGKQLDSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRN

Query:  SQSIYSVSLGS
         QSIYSVSL S
Subjt:  SQSIYSVSLGS

A0A1S3C486 Procollagen-proline 3-dioxygenase1.9e-20787.66Show/hide
Query:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI
        M D AESRQRRRLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWT+GASI
Subjt:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSYGVEFGGGLFHFQDGEP+TISPF GDCVMYTADSDN+HSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFK GFDICWARLHALGYDIYFP DH FSEYPDLFSQDVQLV G+++F+Q+FENILHLLQVVQFLCWKGK+LD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS

Query:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        TN   DS YAEYLSP+RNV V YFKSEFSK++ LAESVFSSATS GK NQHWLGW KL   AAAAWEDYA+ILRRELLGS SHWRN QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase1.9e-20787.66Show/hide
Query:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI
        M D AESRQRRRLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWT+GASI
Subjt:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSYGVEFGGGLFHFQDGEP+TISPF GDCVMYTADSDN+HSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFK GFDICWARLHALGYDIYFP DH FSEYPDLFSQDVQLV G+++F+Q+FENILHLLQVVQFLCWKGK+LD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS

Query:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        TN   DS YAEYLSP+RNV V YFKSEFSK++ LAESVFSSATS GK NQHWLGW KL   AAAAWEDYA+ILRRELLGS SHWRN QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase3.8e-20887.91Show/hide
Query:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI
        M D AESRQRRRLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWT+GASI
Subjt:  MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSYGVEFGGGLFHFQDGEP+TISPF GDCVMY ADSDN+HSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFK GFDICWARLHALGYDIYFP DH FSEYPDLFSQDVQLV G+++F+Q+FENILHLLQVVQFLCWKGK+LD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDS

Query:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        TN   DS YAEYLSP+RNV V YFKSEFSK++ LAESVFSSATS GK NQHWLGW KL   AAAAWEDYA+ILRRELLGS SHWRN QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKL-ATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase1.3e-20083.92Show/hide
Query:  MGDEAESRQ--RRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGA
        MGDE E+RQ  RRRLIL NFLT EECRELEFIHKSCCTVGYRP+VFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGC+YELFVEFTGLISWT+GA
Subjt:  MGDEAESRQ--RRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFTAVCYLNSYGV+F GGLFHFQDGEPK+ISPFCGDCVMYTADS N+HSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQL
        HLHDRFP+SC+P PPSCNMYWFSPE+DPNFK G D+CWARLHALGYDIYFPRD+  S+YP LFS  VQLV+  ++F+QEF NILH LQVVQF+CWKGK+L
Subjt:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQL

Query:  DSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS
        DSTNFKG+SSYA YLSP+ N  V YFKSEFSK+ VLA+SVFSSA+SD K  Q WLGWAKLATA AAWEDYA+ LR ELL SL HWR +QS+Y VSLGS
Subjt:  DSTNFKGDSSYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS

SwissProt top hitse value%identityAlignment
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 31.0e-0832.94Show/hide
Query:  WHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQD-GEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      ++T++ YL+ Y  +FGGG F F D G  +T+ P  G    +T+ S+N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFTAVCYLNSYGVEFGGGLFHFQD-GEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSH

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.3e-11956.04Show/hide
Query:  RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASIGWHSDDN
        ++  RLIL NFL+P EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGC+YELF+EFTGLISW KGASIGWHSDDN
Subjt:  RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASIGWHSDDN

Query:  RPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDS
        R YLKQR+F AVCYLNSY  +F GGLF FQ GEP T++P  GD +MYTAD  NIHSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKSGFDICWARLHALGYDIYF--PRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFK
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++     DHS      L    +QL +G +L  ++F NILH LQVVQF  WK  +L ++N +
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKSGFDICWARLHALGYDIYF--PRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFK

Query:  GDS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV
         D+    + +S  +   +   KS F  D  L  + F  + S G+  +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+ V
Subjt:  GDS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-9448.45Show/hide
Query:  RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASIGWHSDDN
        ++  RLIL NFL+P EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGC+YELF+EFTGLISW KGASIGWHSDDN
Subjt:  RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASIGWHSDDN

Query:  RPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDS
        R YLKQR+F +                    GEP T++P  GD +MYTAD  NIHSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ          
Subjt:  RPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDS

Query:  CLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYF--PRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFKG
                                FD+C ARLH LG+D++     DHS      L    +QL +G +L  ++F NILH LQVVQF  WK  +L ++N + 
Subjt:  CLPQPPSCNMYWFSPEDDPNFKSGFDICWARLHALGYDIYF--PRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFKG

Query:  DS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV
        D+    + +S  +   +   KS F  D  L  + F  + S G+  +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+ V
Subjt:  DS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.4e-10652.19Show/hide
Query:  RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASIGWHSDDN
        ++  RLIL NFL+P EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGC+YELF+EFTGLISW KGASIGWHSDDN
Subjt:  RQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASIGWHSDDN

Query:  RPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDS
        R YLKQR+F +                    GEP T++P  GD +MYTAD  NIHSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKSGFDICWARLHALGYDIYF--PRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFK
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++     DHS      L    +QL +G +L  ++F NILH LQVVQF  WK  +L ++N +
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKSGFDICWARLHALGYDIYF--PRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFK

Query:  GDS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV
         D+    + +S  +   +   KS F  D  L  + F  + S G+  +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+ V
Subjt:  GDS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.1e-4945.49Show/hide
Query:  MYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKSGFDICWARLHALGYDIYF--P
        MYTAD  NIHSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++    
Subjt:  MYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKSGFDICWARLHALGYDIYF--P

Query:  RDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFKGDS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKG
         DHS      L    +QL +G +L  ++F NILH LQVVQF  WK  +L ++N + D+    + +S  +   +   KS F  D  L  + F  + S G+ 
Subjt:  RDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFKGDS-SYAEYLSPERNVVVGYFKSEFSKDNVLAESVFSSATSDGKG

Query:  NQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV
         +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+ V
Subjt:  NQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACGAAGCGGAGAGCAGGCAGCGGCGGCGTCTGATTCTGGGAAATTTCTTAACCCCTGAAGAATGCAGGGAACTGGAGTTCATTCATAAGAGCTGCTGTACGGT
GGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATCTGATCATGCCTTTTGTTCCGATTAGAGAGAGGTTGAAGGAGAAAG
CGGAGGAATTCTTTGGCTGTGATTATGAACTCTTTGTCGAGTTTACTGGCTTGATCAGCTGGACCAAGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTAC
CTAAAACAACGTGAATTTACAGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTCGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCAAAAACTATCTCGCCTTT
TTGTGGAGATTGTGTGATGTACACGGCCGACAGTGACAATATTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACGTTATGGTTCACCCGGGATAGTT
CCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCTGACTCATGCCTACCTCAGCCTCCGTCCTGTAATATGTATTGGTTT
TCACCAGAAGACGATCCAAATTTCAAATCTGGTTTTGATATATGCTGGGCGAGACTGCATGCACTTGGATACGACATCTATTTTCCTAGGGACCATAGCTTTTCAGAGTA
TCCAGATTTATTCTCACAGGACGTGCAATTAGTACAGGGAAACAGGTTATTCTATCAGGAATTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGA
AAGGCAAACAACTGGATTCTACCAACTTCAAGGGGGATTCAAGCTATGCAGAATATTTATCCCCAGAGAGGAATGTGGTAGTCGGTTACTTTAAATCTGAGTTTTCGAAG
GACAATGTATTGGCCGAGTCGGTCTTTTCATCTGCTACTTCTGATGGGAAGGGGAACCAACACTGGTTGGGATGGGCCAAGCTTGCTACTGCTGCCGCAGCTTGGGAAGA
TTATGCTACCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCCATTGGAGAAACAGTCAATCCATATATAGTGTTTCACTTGGTAGCTGA
mRNA sequenceShow/hide mRNA sequence
TGGATTTTGGGTTAGTCTTGGCAATTTTCCCTACCAAAGAGCTTGATCAATGGCGCGGGTATTGGAGCTCCTACGCGCTACATCGATATAGGATATTTCGTGTGCTTGTA
GCTGGGATATCGCTTTAGGAATTATTCTATCCGAATTCTCTTTTTTTGATTCCCATTTGTTACTTCTTAGTTCTAGGGATCCTTAGAATCGTTGCTCAAACTCCATAGTG
GAACAGTATTTTCTATATTTCACACAAATTTGTTTTATTTCTGACTTTTTGTATATATAGACACTATAATTTGGATAGTCAAGGGCATTTTGTTCGATTACTGCCATTGA
AACGGGAGAGACGGAGAATTGGACGAAAATGGGAGACGAAGCGGAGAGCAGGCAGCGGCGGCGTCTGATTCTGGGAAATTTCTTAACCCCTGAAGAATGCAGGGAACTGG
AGTTCATTCATAAGAGCTGCTGTACGGTGGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATCTGATCATGCCTTTTGTT
CCGATTAGAGAGAGGTTGAAGGAGAAAGCGGAGGAATTCTTTGGCTGTGATTATGAACTCTTTGTCGAGTTTACTGGCTTGATCAGCTGGACCAAGGGAGCAAGCATTGG
ATGGCATAGTGACGATAACCGGCCCTACCTAAAACAACGTGAATTTACAGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTCGGAGGTGGGCTGTTTCACTTTCAGG
ATGGGGAACCAAAAACTATCTCGCCTTTTTGTGGAGATTGTGTGATGTACACGGCCGACAGTGACAATATTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACA
CTGACGTTATGGTTCACCCGGGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCTGACTCATGCCTACCTCA
GCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAATCTGGTTTTGATATATGCTGGGCGAGACTGCATGCACTTGGATACGACATCTATT
TTCCTAGGGACCATAGCTTTTCAGAGTATCCAGATTTATTCTCACAGGACGTGCAATTAGTACAGGGAAACAGGTTATTCTATCAGGAATTTGAGAACATCTTGCATTTG
CTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAACAACTGGATTCTACCAACTTCAAGGGGGATTCAAGCTATGCAGAATATTTATCCCCAGAGAGGAATGTGGTAGT
CGGTTACTTTAAATCTGAGTTTTCGAAGGACAATGTATTGGCCGAGTCGGTCTTTTCATCTGCTACTTCTGATGGGAAGGGGAACCAACACTGGTTGGGATGGGCCAAGC
TTGCTACTGCTGCCGCAGCTTGGGAAGATTATGCTACCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCCATTGGAGAAACAGTCAATCCATATATAGTGTTTCACTT
GGTAGCTGAATCTCCCACTTGTGAGAAAGTGGCATCCCAAATGCTAAAAGTATCTGAGCATAAAGGTTAGTTTTGAGCCTTTTATAGCCTAACTAGATTATTATAGTTGA
GCTTCAAGGTTAGTTATGGGCCTTTGTAAAGCTAAACCAGATTTAACTAGAACATTCAACATTATTTGTTTTGAAACTTATGCTTGTTTTATCTTAAATTTTTGCTTTAT
AGTTTTCATCTTTATTAAAAAAGAGTCGGAAATTAATTTTTGAATACTAATTTTTTTAATCTTGAAAATTTGACTTGGTGTTTTATTGTTAATTGGATCCCTTGAACAAT
AATTTAAAACAAAATAAAAATTTCAAAATGAATCATTTGACTCATTCCAAGGCTAGTCACTCTTAAATAGTTTAACAACAAAAACTACAATGCGAAAACTCATGACCAAA
ATAGTTTGCATAACGACTGTTTGTCACTTGCCTCACACGCCCTTGCCTTTACCGAAGGAAAAAAAAGGAGAATGTAAGTATTATTAAAATACTCAGTAACCCTTAAAGAG
ACTCACCATACACGTTTGTGTCACATAGATGTTCAAATATATATGGCACTCACACAAGAAGCAAAATGTAAACTTGGGTGTTGAGACACTCGTGCCTTGTCACATACTAT
CAGTGTACTCAACACTTTTGTAACATACACCCCATGAATTGTCACACAAACGTATCACAAAGGTAAAGTGTATATATCTCATAACTCACACTGTTTGTGTCATGCACATT
CACGTGACATAGACTAGTCAAT
Protein sequenceShow/hide protein sequence
MGDEAESRQRRRLILGNFLTPEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCDYELFVEFTGLISWTKGASIGWHSDDNRPY
LKQREFTAVCYLNSYGVEFGGGLFHFQDGEPKTISPFCGDCVMYTADSDNIHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWF
SPEDDPNFKSGFDICWARLHALGYDIYFPRDHSFSEYPDLFSQDVQLVQGNRLFYQEFENILHLLQVVQFLCWKGKQLDSTNFKGDSSYAEYLSPERNVVVGYFKSEFSK
DNVLAESVFSSATSDGKGNQHWLGWAKLATAAAAWEDYATILRRELLGSLSHWRNSQSIYSVSLGS