; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0005810 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0005810
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionEpimerase family protein SDR39U1-like protein
Genome locationchr12:19089365..19095752
RNA-Seq ExpressionIVF0005810
SyntenyIVF0005810
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001509 - NAD-dependent epimerase/dehydratase
IPR010099 - Epimerase family protein SDR39U1
IPR013549 - Domain of unknown function DUF1731
IPR036291 - NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148540.1 epimerase family protein SDR39U1 homolog, chloroplastic isoform X2 [Cucumis sativus]1.94e-25098.56Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        MDLPPAISFSWSRTVSHSLRIPQHLAICGNR RVFC IDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWK+CIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLA+VCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLGEGASVVLEGQ+VVPTRAKELGFSYKYPSVKDALKSILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

XP_008444867.1 PREDICTED: epimerase family protein SDR39U1 homolog, chloroplastic-like [Cucumis melo]2.11e-254100Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

XP_023530645.1 epimerase family protein SDR39U1 homolog, chloroplastic-like [Cucurbita pepo subsp. pepo]1.47e-22990.52Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        M+L  A S SWSRTVSHS+RIPQHLAICG R +VFC ID T+MKNQLTVSITGATGFIGRRLV+RL+AD HNIRVLTRSKSKA+LIFPAREFPGI+IAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LINDAPDAARPTVLVS+TAVGYYGTSETA FDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMM+AGGP+GSGKQWFSWIHLDDIV+LIYEALINPSY+GVINGTAPNPV L ELC+ LGA MGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLG+GASVVLEGQRVVP RAKELGFS+KYPSVKDALK+ILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

XP_023554067.1 epimerase family protein SDR39U1 homolog, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo]5.14e-23090.52Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        M++P AISFSWSRTVSHS+RIPQH+AICG R RVFC IDAT+MKNQLTVSITGATGFIGRRLVQRLH D HNIRVLTRSKSKAELIFPAREFPGI+IAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAG+PISTRWS EIKKEIKQSRIRVTSKVVSLINDAPD ARPTVLVSATAVGYYGTSETA FDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
          NKNVRVALIRIGVVLGKEGGALAKMIPLF +FAGGPLGSG+QWFSWIHLDDIVNLIYEAL NPSY GVINGTAPNPV L ELC+ LGA MGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLGEGASVVLEGQRVVPTRAKELGF++KYP+VK+ALK+ILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

XP_038887742.1 epimerase family protein SDR39U1 homolog, chloroplastic isoform X2 [Benincasa hispida]3.69e-23994.54Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        MDLPP ISFSWSRTVSH LRIPQHLAICGNR RVFC IDA +MK+QLTVSITGATGFIGRRLVQRLHADKH IRVLTRSKSKAELIFPA +FPGI+IAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPV L ELC+ LGA MGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        P+FALKAVLGEGASVVLEGQRV+PTRAKELGFSYKYPSVKDALK+ILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

TrEMBL top hitse value%identityAlignment
A0A0A0K2D2 Uncharacterized protein4.2e-19598.56Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        MDLPPAISFSWSRTVSHSLRIPQHLAICGNR RVFC IDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWK+CIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLA+VCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLGEGASVVLEGQ+VVPTRAKELGFSYKYPSVKDALKSILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

A0A1S3BBD1 epimerase family protein SDR39U1 homolog, chloroplastic-like4.1e-198100Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

A0A6J1EGF9 epimerase family protein SDR39U1 homolog, chloroplastic-like1.6e-17890.23Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        M+L  A S SWSRTVSHS+RIPQHLAICG R +VFC ID T+MKNQLTVSITGATGFIGRRLV+RL+AD HNIRVLTRSKSKA+LIFPAREFPGI+IAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LIN+APDAARPTVLVS+TAVGYYGTSETA FDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMM+AGGP+GSGKQWFSWIHLDDIV+LIYEALINPSY+GVINGTAPNPV L ELC+ LGA MGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLG+GASVVLEGQRVVP RAKELGFS+KYPSVKDALK+ILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

A0A6J1GNA8 epimerase family protein SDR39U1 homolog, chloroplastic-like isoform X23.3e-17990.23Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        M  P AISFSW+RTVSHS+RIPQH+AICG R R FC IDAT+MKNQLTVSITGATGFIGRRLVQRLH D HNIRVLTRSKSKAELIFPAREFPGI+IAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAG+PISTRWS EIKKEIKQSRIRVTSKVVSLINDAPD ARPTVLVSATAVGYYGTSETA FDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        G NKNVRVALIRIGVVLGKEGGALAKMIPLF +FAGGPLGSG+QWFSWIHLDDIVNLIYEAL NPSY GVINGTAPNPV L ELC+ LGA MGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLGEGASVVLEGQRVVPTRAKELGF++KYP+VK+ALK+ILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

A0A6J1I7C1 epimerase family protein SDR39U1 homolog, chloroplastic-like2.0e-17690.23Show/hide
Query:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE
        M L  A S SWSRTVSHS+RIPQHLAICG R +VFC ID T+MKNQLTVSITGATGFIGRRLV+RL+AD HNIRVLTRSKSKAELIFPAREFPGI+IAEE
Subjt:  MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LIN+APDAARPTVLVS+TAVGYYGTSETA FDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV
        GVNKNVRVALIRIGVVLGK GGALAKMIPLFMM+AGGPLGSGKQWFSWIHLDDIV+LIYEAL+NPSY+GVINGTAPNPV L ELC+ LGA MGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS
        PDFALKAVLG+GASVVLEGQRVVP RAKELGFS+KYPSVKDALK+ILS
Subjt:  PDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSILS

SwissProt top hitse value%identityAlignment
P73467 Epimerase family protein slr12232.0e-7749.51Show/hide
Query:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGI-MIAEEP----GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVT
        + + +TGATGF+G  LV  LH   H + +L RS SKA+ +F    FP +  IA E      W+  + G D V+NLAG PIS RW+   K EI  SR   T
Subjt:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGI-MIAEEP----GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVT

Query:  SKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNK-NVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLG
         K+V  I  A    +P V++S +A+GYYGTSETATF E S  G+D+LAEVC+ WE  A  V +  VR+ + RIG+VLG +GGALAKM+P F +FAGGPLG
Subjt:  SKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNK-NVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLG

Query:  SGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVK
        SG+QWFSWI   D++ LI +AL + + +G  N TAPNPV + E C  LG  + RPSWLPVPD AL+ +LGEGA +VLEGQ V+P    +  F ++ P ++
Subjt:  SGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVK

Query:  DALKSIL
         +L+ IL
Subjt:  DALKSIL

P77775 Epimerase family protein YfcH3.0e-5239.02Show/hide
Query:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPIS-TRWSSEIKKEIKQSRIRVTSKVV
        + + ITG TG IGR L+ RL    H I V+TR+  KA  +   R      +A++      + G D V+NLAG PI+  RW+ E K+ + QSR  +T K+V
Subjt:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPIS-TRWSSEIKKEIKQSRIRVTSKVV

Query:  SLIN--DAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNKN-VRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG
         LIN  D P    P+VL+S +A GYYG        E  P  N++  ++C  WE  A     +  RV L+R GVVL  +GG L KM+P F +  GGP+GSG
Subjt:  SLIN--DAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNKN-VRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG

Query:  KQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDA
        +Q+ +WIH+DD+VN I   L N   +G  N  +P PV   +    LG  + RP+ L VP  A++ ++GE + +VL GQR +P R +E GF++++  +++A
Subjt:  KQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDA

Query:  LKSIL
        L  ++
Subjt:  LKSIL

Q17QH8 Epimerase family protein SDR39U12.9e-4736.04Show/hide
Query:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK
        + V + G TGFIG  L Q L A  H + +++R      + +      G            +   D  VNLAG  I     RW++  +KE+  SR+  T  
Subjt:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK

Query:  VVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG
        +   I  AP   +  VLV  T V YY  S TA +DE SP G+ D+ + +  +WEA A     + R  ++R GVVLG+ GGA+  M+  F +  GGP+GSG
Subjt:  VVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG

Query:  KQWFSWIHLDDIVNLIYEALINPSYQGVINGTAP-NPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLG-EGASVVLEGQRVVPTRAKELGFSYKYPSVK
         Q+F WIH+ D+  ++  AL     QG++NG AP +  T  E  + LG  +GRP+++P+P   ++AV G E A ++LEGQ+VVP R    G+ Y +P + 
Subjt:  KQWFSWIHLDDIVNLIYEALINPSYQGVINGTAP-NPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLG-EGASVVLEGQRVVPTRAKELGFSYKYPSVK

Query:  DALKSILS
         ALK +++
Subjt:  DALKSILS

Q5M8N4 Epimerase family protein SDR39U16.8e-4935.5Show/hide
Query:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK
        + V + G TGFIG  + Q L    H +++++R      + +      G+ +             D V+NLAG  I     RW+   +KE+  SR+  T  
Subjt:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK

Query:  VVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG
        +   I +   A  P   +  T V YY  S T  +DE SP GN D+ + +  +WEA A    ++ R  ++R GVVLG+ GGA++ M+  F +  GGP+GSG
Subjt:  VVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG

Query:  KQWFSWIHLDDIVNLIYEALINPSYQGVINGTAP-NPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKD
        +Q+F WIH+ D+  ++  AL     QGV+NG AP +  T  E  + LGA +GRP+++PVP   ++AV GE A ++LEGQ+VVP R    G+ Y +P ++ 
Subjt:  KQWFSWIHLDDIVNLIYEALINPSYQGVINGTAP-NPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKD

Query:  ALKSILS
        ALK +++
Subjt:  ALKSILS

Q9SJU9 Epimerase family protein SDR39U1 homolog, chloroplastic3.6e-14374.12Show/hide
Query:  SFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCI
        S S S  +S +L +P+  ++ G   R F V+ +++ ++Q+TVS+TGATGFIGRRLVQRL AD H IRVLTRSKSKAE IFPA++FPGI+IAEE  WK+C+
Subjt:  SFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCI

Query:  QGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNKNVR
        QGS  VVNLAG+PISTRWS EIKKEIK SRIRVTSKVV LIN++P  ARPTVLVSATAVGYYGTSET  FDE SPSG DYLAEVCREWE TAL  NK+VR
Subjt:  QGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNKNVR

Query:  VALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKA
        VALIRIGVVLGK+GGALA MIP F MFAGGPLGSG+QWFSWIH+DD+VNLIYEAL NPSY+GVINGTAPNPV LGE+C+ LG+ + RPSWLPVPDFALKA
Subjt:  VALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKA

Query:  VLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSIL
        +LGEGA+VVLEGQ+V+P RAKELGF +KY  VKDAL++I+
Subjt:  VLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSIL

Arabidopsis top hitse value%identityAlignment
AT2G21280.1 NAD(P)-binding Rossmann-fold superfamily protein2.6e-14474.12Show/hide
Query:  SFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCI
        S S S  +S +L +P+  ++ G   R F V+ +++ ++Q+TVS+TGATGFIGRRLVQRL AD H IRVLTRSKSKAE IFPA++FPGI+IAEE  WK+C+
Subjt:  SFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCI

Query:  QGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNKNVR
        QGS  VVNLAG+PISTRWS EIKKEIK SRIRVTSKVV LIN++P  ARPTVLVSATAVGYYGTSET  FDE SPSG DYLAEVCREWE TAL  NK+VR
Subjt:  QGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNKNVR

Query:  VALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKA
        VALIRIGVVLGK+GGALA MIP F MFAGGPLGSG+QWFSWIH+DD+VNLIYEAL NPSY+GVINGTAPNPV LGE+C+ LG+ + RPSWLPVPDFALKA
Subjt:  VALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKA

Query:  VLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSIL
        +LGEGA+VVLEGQ+V+P RAKELGF +KY  VKDAL++I+
Subjt:  VLGEGASVVLEGQRVVPTRAKELGFSYKYPSVKDALKSIL

AT4G33360.1 NAD(P)-binding Rossmann-fold superfamily protein5.2e-0422.07Show/hide
Query:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS
        + + +TG+TG++G RL   L    H++R L R  S    + P  E     + +     D   G D V + A   +   W  +  + I  + +     V+ 
Subjt:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS

Query:  LINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG
         + +     +   ++  ++    G+++ +  +E       +    C E+E     A  + +N   + V + L+  GV+ G      A M+   L   F G
Subjt:  LINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG

Query:  ---GPLGSGKQWFSWIHLDDIV
           G +GSG   +S+ H+DD+V
Subjt:  ---GPLGSGKQWFSWIHLDDIV

AT4G33360.2 NAD(P)-binding Rossmann-fold superfamily protein5.2e-0422.07Show/hide
Query:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS
        + + +TG+TG++G RL   L    H++R L R  S    + P  E     + +     D   G D V + A   +   W  +  + I  + +     V+ 
Subjt:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS

Query:  LINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG
         + +     +   ++  ++    G+++ +  +E       +    C E+E     A  + +N   + V + L+  GV+ G      A M+   L   F G
Subjt:  LINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG

Query:  ---GPLGSGKQWFSWIHLDDIV
           G +GSG   +S+ H+DD+V
Subjt:  ---GPLGSGKQWFSWIHLDDIV

AT4G33360.3 NAD(P)-binding Rossmann-fold superfamily protein5.2e-0422.07Show/hide
Query:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS
        + + +TG+TG++G RL   L    H++R L R  S    + P  E     + +     D   G D V + A   +   W  +  + I  + +     V+ 
Subjt:  LTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS

Query:  LINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG
         + +     +   ++  ++    G+++ +  +E       +    C E+E     A  + +N   + V + L+  GV+ G      A M+   L   F G
Subjt:  LINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG

Query:  ---GPLGSGKQWFSWIHLDDIV
           G +GSG   +S+ H+DD+V
Subjt:  ---GPLGSGKQWFSWIHLDDIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTCCCTCCCGCCATTTCCTTCTCATGGAGCCGTACTGTCTCCCATTCTCTTCGCATTCCTCAACACCTGGCAATATGTGGCAACAGGTGTCGAGTGTTCTGTGT
CATTGATGCTACAAAGATGAAAAATCAGCTCACAGTATCAATAACTGGAGCTACAGGCTTTATTGGCCGAAGGCTTGTGCAAAGGCTACATGCAGATAAACACAACATTC
GAGTTTTAACACGTTCTAAATCTAAGGCCGAGTTGATATTTCCGGCTAGGGAGTTTCCAGGAATCATGATTGCAGAGGAGCCGGGGTGGAAAGACTGCATCCAAGGTTCA
GATGGAGTTGTTAACTTGGCTGGCATGCCTATAAGTACCAGGTGGTCTTCTGAGATCAAGAAAGAGATCAAGCAAAGCAGGATCAGAGTCACCTCAAAGGTTGTAAGCTT
AATCAATGACGCACCGGATGCAGCTCGCCCTACGGTTTTGGTTAGCGCAACAGCTGTTGGTTACTATGGTACTAGTGAGACAGCAACATTCGATGAACGAAGTCCATCCG
GAAATGACTACTTAGCAGAGGTTTGTAGGGAATGGGAAGCAACAGCCCTGGGAGTAAACAAGAACGTTAGAGTGGCTCTTATCCGTATAGGTGTTGTTCTTGGTAAAGAA
GGTGGTGCTTTAGCCAAAATGATACCTCTCTTCATGATGTTCGCTGGAGGCCCTCTGGGATCTGGAAAACAATGGTTTTCGTGGATTCATTTGGATGACATTGTGAACTT
AATATATGAAGCTCTGATCAATCCATCTTATCAAGGGGTTATAAATGGAACAGCACCGAACCCAGTTACGTTGGGTGAATTATGTAAAGGATTGGGAGCTGAGATGGGAA
GACCTTCATGGCTCCCAGTACCTGACTTTGCTCTCAAAGCCGTACTTGGAGAAGGAGCTTCTGTGGTTTTGGAAGGGCAAAGGGTGGTTCCTACCAGAGCCAAGGAATTG
GGTTTTTCGTACAAGTACCCTTCTGTCAAGGATGCACTCAAGTCCATTCTTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCTCCCTCCCGCCATTTCCTTCTCATGGAGCCGTACTGTCTCCCATTCTCTTCGCATTCCTCAACACCTGGCAATATGTGGCAACAGGTGTCGAGTGTTCTGTGT
CATTGATGCTACAAAGATGAAAAATCAGCTCACAGTATCAATAACTGGAGCTACAGGCTTTATTGGCCGAAGGCTTGTGCAAAGGCTACATGCAGATAAACACAACATTC
GAGTTTTAACACGTTCTAAATCTAAGGCCGAGTTGATATTTCCGGCTAGGGAGTTTCCAGGAATCATGATTGCAGAGGAGCCGGGGTGGAAAGACTGCATCCAAGGTTCA
GATGGAGTTGTTAACTTGGCTGGCATGCCTATAAGTACCAGGTGGTCTTCTGAGATCAAGAAAGAGATCAAGCAAAGCAGGATCAGAGTCACCTCAAAGGTTGTAAGCTT
AATCAATGACGCACCGGATGCAGCTCGCCCTACGGTTTTGGTTAGCGCAACAGCTGTTGGTTACTATGGTACTAGTGAGACAGCAACATTCGATGAACGAAGTCCATCCG
GAAATGACTACTTAGCAGAGGTTTGTAGGGAATGGGAAGCAACAGCCCTGGGAGTAAACAAGAACGTTAGAGTGGCTCTTATCCGTATAGGTGTTGTTCTTGGTAAAGAA
GGTGGTGCTTTAGCCAAAATGATACCTCTCTTCATGATGTTCGCTGGAGGCCCTCTGGGATCTGGAAAACAATGGTTTTCGTGGATTCATTTGGATGACATTGTGAACTT
AATATATGAAGCTCTGATCAATCCATCTTATCAAGGGGTTATAAATGGAACAGCACCGAACCCAGTTACGTTGGGTGAATTATGTAAAGGATTGGGAGCTGAGATGGGAA
GACCTTCATGGCTCCCAGTACCTGACTTTGCTCTCAAAGCCGTACTTGGAGAAGGAGCTTCTGTGGTTTTGGAAGGGCAAAGGGTGGTTCCTACCAGAGCCAAGGAATTG
GGTTTTTCGTACAAGTACCCTTCTGTCAAGGATGCACTCAAGTCCATTCTTTCCTAA
Protein sequenceShow/hide protein sequence
MDLPPAISFSWSRTVSHSLRIPQHLAICGNRCRVFCVIDATKMKNQLTVSITGATGFIGRRLVQRLHADKHNIRVLTRSKSKAELIFPAREFPGIMIAEEPGWKDCIQGS
DGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPTVLVSATAVGYYGTSETATFDERSPSGNDYLAEVCREWEATALGVNKNVRVALIRIGVVLGKE
GGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALINPSYQGVINGTAPNPVTLGELCKGLGAEMGRPSWLPVPDFALKAVLGEGASVVLEGQRVVPTRAKEL
GFSYKYPSVKDALKSILS