CuGenDBv2

Lag0001535 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001535
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: epimerase family protein SDR39U1 homolog, chloroplastic-like
Genome location: chr4:32544616..32552454
RNA-Seq Expression: Lag0001535
Synteny: Lag0001535
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR001509 - NAD-dependent epimerase/dehydratase
IPR010099 - Epimerase family protein SDR39U1
IPR013549 - Domain of unknown function DUF1731
IPR036291 - NAD(P)-binding domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588414.1 Epimerase family protein SDR39U1-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value 6.5e-182, 92.24% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        MELS ATS S S TVSH +RIPQHLAICGKR +VFCAID TEMKNQLTVSITGATGFIG+RLV++L+ADNHNIRVLTRSKSKA+LIFPAREFPGIVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LIN+APDAARP VLVS+TAVGYYG+SETAIFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMM+AGGP+GSGKQWFSWIHLDDIV+LIYEAL+NPSYKGVINGTAPNPVKLSELCERLG AMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLG+GASVVLEGQRV+PARAKELGFSFKYPSVK+ALKAILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
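
The middle row of each alignment block follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a minimal sketch of how a % identity figure relates to that row, the snippet below counts identities and positives for the last block of the alignment above. The function name `midline_stats` is illustrative and not part of CuGenDB or BLAST; a whole-hit figure would need the same counts summed over every block of that hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one BLAST alignment block from its query and midline rows.

    Assumes the usual pairwise convention: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    """
    length = len(query)                      # aligned columns in this block
    midline = midline.ljust(length)          # restore trailing spaces if stripped
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# Last block of the KAG6588414.1 alignment, copied from the record above.
QUERY = "PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS"
MIDLINE = "PDFALKAVLG+GASVVLEGQRV+PARAKELGFSFKYPSVK+ALKAILS"

stats = midline_stats(QUERY, MIDLINE)
print(stats)
```

The reported % identity for a hit is computed over the full alignment, so a single block will generally not reproduce it.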

XP_022135861.1 epimerase family protein SDR39U1 homolog, chloroplastic isoform X2 [Momordica charantia] (e value 6.1e-180, 91.67% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        MEL   TSFS S  VSH L    HLAICGKRFRVFCAIDA EMKNQLTVSITGATGFIG+RLVQ+LHADNHNIRVLTRSKSKAELIFPAREFP IVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLIND PDAARP VLVSATAVGYYG+SETA+FDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNK+VRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEAL NPSYKGVINGTAPNPVKL+ELC+RLGDAMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        P+FALKAVLGEGA+VVLEGQRV+PARAKELGFSFKYPSV++AL AILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

XP_022924930.1 epimerase family protein SDR39U1 homolog, chloroplastic-like [Cucurbita moschata] (e value 1.9e-181, 91.95% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        MELS A+S S S TVSH +RIPQHLAICGKR +VFCAID TEMKNQLTVSITGATGFIG+RLV++L+ADNHNIRVLTRSKSKA+LIFPAREFPGIVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LIN+APDAARP VLVS+TAVGYYG+SETAIFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMM+AGGP+GSGKQWFSWIHLDDIV+LIYEAL+NPSYKGVINGTAPNPVKLSELCERLG AMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLG+GASVVLEGQRV+PARAKELGFSFKYPSVK+ALKAILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

XP_023530645.1 epimerase family protein SDR39U1 homolog, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value 1.7e-182, 92.53% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        MELS ATS S S TVSH +RIPQHLAICGKR +VFCAID TEMKNQLTVSITGATGFIG+RLV++L+ADNHNIRVLTRSKSKA+LIFPAREFPGIVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LINDAPDAARP VLVS+TAVGYYG+SETAIFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMM+AGGP+GSGKQWFSWIHLDDIV+LIYEAL+NPSYKGVINGTAPNPVKLSELCERLG AMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLG+GASVVLEGQRV+PARAKELGFSFKYPSVK+ALKAILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

XP_023554067.1 epimerase family protein SDR39U1 homolog, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo] (e value 4.7e-180, 91.38% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        ME+  A SFS S TVSH +RIPQH+AICGKRFRVFCAIDATEMKNQLTVSITGATGFIG+RLVQ+LH DNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAG+PISTRWS EIKKEIKQSRIRVTSKVVSLINDAPD ARP VLVSATAVGYYG+SETAIFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
          NKNVRVALIRIGVVLGKEGGALAKMIPLF +FAGGPLGSG+QWFSWIHLDDIVNLIYEAL NPSY GVINGTAPNPVKL+ELCERLG AMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLGEGASVVLEGQRV+P RAKELGF+FKYP+VKEALKAILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K2D2 Uncharacterized protein (e value 2.5e-179, 90.8% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        M+L  A SFS S TVSH LRIPQHLAICG RFRVFCAIDAT+MKNQLTVSITGATGFIG+RLVQ+LHAD HNIRVLTRSKSKAELIFPAREFPGI+IAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWK+CIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARP VLVSATAVGYYG+SETA FDERSPSGNDYLA+VCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEAL+NPSY+GVINGTAPNPV L ELC+ LG  MGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLGEGASVVLEGQ+V+P RAKELGFS+KYPSVK+ALK+ILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

A0A6J1C1X5 epimerase family protein SDR39U1 homolog, chloroplastic isoform X2 (e value 3.0e-180, 91.67% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        MEL   TSFS S  VSH L    HLAICGKRFRVFCAIDA EMKNQLTVSITGATGFIG+RLVQ+LHADNHNIRVLTRSKSKAELIFPAREFP IVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLIND PDAARP VLVSATAVGYYG+SETA+FDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNK+VRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEAL NPSYKGVINGTAPNPVKL+ELC+RLGDAMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        P+FALKAVLGEGA+VVLEGQRV+PARAKELGFSFKYPSV++AL AILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

A0A6J1EGF9 epimerase family protein SDR39U1 homolog, chloroplastic-like (e value 9.2e-182, 91.95% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        MELS A+S S S TVSH +RIPQHLAICGKR +VFCAID TEMKNQLTVSITGATGFIG+RLV++L+ADNHNIRVLTRSKSKA+LIFPAREFPGIVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LIN+APDAARP VLVS+TAVGYYG+SETAIFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMM+AGGP+GSGKQWFSWIHLDDIV+LIYEAL+NPSYKGVINGTAPNPVKLSELCERLG AMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLG+GASVVLEGQRV+PARAKELGFSFKYPSVK+ALKAILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

A0A6J1GNA8 epimerase family protein SDR39U1 homolog, chloroplastic-like isoform X2 (e value 6.6e-180, 90.8% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        M+   A SFS + TVSH +RIPQH+AICGKRFR FCAIDATEMKNQLTVSITGATGFIG+RLVQ+LH DNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAG+PISTRWS EIKKEIKQSRIRVTSKVVSLINDAPD ARP VLVSATAVGYYG+SETAIFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        G NKNVRVALIRIGVVLGKEGGALAKMIPLF +FAGGPLGSG+QWFSWIHLDDIVNLIYEAL NPSY GVINGTAPNPVKL+ELCERLG AMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLGEGASVVLEGQRV+P RAKELGF+FKYP+VKEALKAILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

A0A6J1I7C1 epimerase family protein SDR39U1 homolog, chloroplastic-like (e value 5.0e-180, 92.24% identity)
Query:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
        M+LS ATS S S TVSH +RIPQHLAICGKR +VFCAID TEMKNQLTVSITGATGFIG+RLV++L+ADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE
Subjt:  MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEE

Query:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL
        PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVV LIN+APDAARP VLVS+TAVGYYG+SETAIFDERSPSGNDYLAEVCREWEATAL
Subjt:  PGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATAL

Query:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV
        GVNKNVRVALIRIGVVLGK GGALAKMIPLFMM+AGGPLGSGKQWFSWIHLDDIV+LIYEAL+NPSYKGVINGTAPNPVKLSELCERLG AMGRPSWLPV
Subjt:  GVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPV

Query:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS
        PDFALKAVLG+GASVVLEGQRV+PARAKELGFSFKYPSVK+ALKAILS
Subjt:  PDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAILS

SwissProt top hits (e value, % identity, alignment)
P73467 Epimerase family protein slr1223 (e value 2.8e-79, 49.84% identity)
Query:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGI-VIAEEP----GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVT
        + + +TGATGF+G  LV  LH   H + +L RS SKA+ +F    FP +  IA E      W+  + G D V+NLAG PIS RW+   K EI  SR   T
Subjt:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGI-VIAEEP----GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVT

Query:  SKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALGVNK-NVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLG
         K+V  I  A    +P V++S +A+GYYG+SETA F E S  G+D+LAEVC+ WE  A  V +  VR+ + RIG+VLG +GGALAKM+P F +FAGGPLG
Subjt:  SKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALGVNK-NVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLG

Query:  SGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVPDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVK
        SG+QWFSWI   D++ LI +AL + + +G  N TAPNPVK+ E C  LG  + RPSWLPVPD AL+ +LGEGA +VLEGQ VLP    +  F F+ P ++
Subjt:  SGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVPDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVK

Query:  EALKAIL
         +L+ IL
Subjt:  EALKAIL

P77775 Epimerase family protein YfcH (e value 6.4e-55, 39.34% identity)
Query:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPIS-TRWSSEIKKEIKQSRIRVTSKVV
        + + ITG TG IG+ L+ +L    H I V+TR+  KA  +      P + + +    +  + G D V+NLAG PI+  RW+ E K+ + QSR  +T K+V
Subjt:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPIS-TRWSSEIKKEIKQSRIRVTSKVV

Query:  SLIN--DAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALGVNKN-VRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG
         LIN  D P    P+VL+S +A GYYG     +  E  P  N++  ++C  WE  A     +  RV L+R GVVL  +GG L KM+P F +  GGP+GSG
Subjt:  SLIN--DAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALGVNKN-VRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG

Query:  KQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVPDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEA
        +Q+ +WIH+DD+VN I   L N   +G  N  +P PV+  +    LG A+ RP+ L VP  A++ ++GE + +VL GQR LP R +E GF+F++  ++EA
Subjt:  KQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVPDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEA

Query:  LKAIL
        L  ++
Subjt:  LKAIL

Q5M8N4 Epimerase family protein SDR39U1 (e value 3.7e-47, 34.2% identity)
Query:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK
        + V + G TGFIG  + Q L    H +++++R      + +      G+ +             D V+NLAG  I     RW+   +KE+  SR+  T  
Subjt:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK

Query:  VVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG
        +   I +   A  P   +  T V YY  S T  +DE SP GN D+ + +  +WEA A    ++ R  ++R GVVLG+ GGA++ M+  F +  GGP+GSG
Subjt:  VVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG

Query:  KQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAP-NPVKLSELCERLGDAMGRPSWLPVPDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKE
        +Q+F WIH+ D+  ++  AL     +GV+NG AP +    +E  + LG A+GRP+++PVP   ++AV GE A ++LEGQ+V+P R    G+ + +P ++ 
Subjt:  KQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAP-NPVKLSELCERLGDAMGRPSWLPVPDFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKE

Query:  ALKAILS
        ALK +++
Subjt:  ALKAILS

Q9NRG7 Epimerase family protein SDR39U1 (e value 2.4e-46, 35.18% identity)
Query:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK
        + V + G TGFIG  L Q L+A  H + +++R      + +      G            +   D  VNLAG  I     RW+   +KE+  SR+  T  
Subjt:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPIST---RWSSEIKKEIKQSRIRVTSK

Query:  VVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG
        +   I  AP   +  VLV  T V YY  S TA +DE SP G+ D+ + +  +WEA A     + R  ++R GVVLG+ GGA+  M+  F +  GGP+GSG
Subjt:  VVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGN-DYLAEVCREWEATALGVNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSG

Query:  KQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVPDFALKAVLG-EGASVVLEGQRVLPARAKELGFSFKYPSVKE
         Q+F WIH+ D+  ++  AL      GV+NG AP+    +E  + LG A+GR +++P+P   ++AV G + A ++LEGQ+V+P R    G+ + +P +  
Subjt:  KQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVPDFALKAVLG-EGASVVLEGQRVLPARAKELGFSFKYPSVKE

Query:  ALKAILS
        ALK I++
Subjt:  ALKAILS

Q9SJU9 Epimerase family protein SDR39U1 homolog, chloroplastic (e value 3.0e-145, 73.7% identity)
Query:  LSRATSFSCSPTVSHPLRIPQHLAICG-KRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEP
        L   TS S S  +S  L +P+  ++ G +RF V C   +++ ++Q+TVS+TGATGFIG+RLVQ+L ADNH IRVLTRSKSKAE IFPA++FPGIVIAEE 
Subjt:  LSRATSFSCSPTVSHPLRIPQHLAICG-KRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEP

Query:  GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALG
         WK+C+QGS  VVNLAG+PISTRWS EIKKEIK SRIRVTSKVV LIN++P  ARP VLVSATAVGYYG+SET +FDE SPSG DYLAEVCREWE TAL 
Subjt:  GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALG

Query:  VNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVP
         NK+VRVALIRIGVVLGK+GGALA MIP F MFAGGPLGSG+QWFSWIH+DD+VNLIYEAL NPSYKGVINGTAPNPV+L E+C++LG  + RPSWLPVP
Subjt:  VNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVP

Query:  DFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAIL
        DFALKA+LGEGA+VVLEGQ+VLP RAKELGF FKY  VK+AL+AI+
Subjt:  DFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAIL

Arabidopsis top hits (e value, % identity, alignment)
AT2G21280.1 NAD(P)-binding Rossmann-fold superfamily protein (e value 2.1e-146, 73.7% identity)
Query:  LSRATSFSCSPTVSHPLRIPQHLAICG-KRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEP
        L   TS S S  +S  L +P+  ++ G +RF V C   +++ ++Q+TVS+TGATGFIG+RLVQ+L ADNH IRVLTRSKSKAE IFPA++FPGIVIAEE 
Subjt:  LSRATSFSCSPTVSHPLRIPQHLAICG-KRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEP

Query:  GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALG
         WK+C+QGS  VVNLAG+PISTRWS EIKKEIK SRIRVTSKVV LIN++P  ARP VLVSATAVGYYG+SET +FDE SPSG DYLAEVCREWE TAL 
Subjt:  GWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALG

Query:  VNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVP
         NK+VRVALIRIGVVLGK+GGALA MIP F MFAGGPLGSG+QWFSWIH+DD+VNLIYEAL NPSYKGVINGTAPNPV+L E+C++LG  + RPSWLPVP
Subjt:  VNKNVRVALIRIGVVLGKEGGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVP

Query:  DFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAIL
        DFALKA+LGEGA+VVLEGQ+VLP RAKELGF FKY  VK+AL+AI+
Subjt:  DFALKAVLGEGASVVLEGQRVLPARAKELGFSFKYPSVKEALKAIL

AT4G33360.1 NAD(P)-binding Rossmann-fold superfamily protein (e value 9.6e-06, 23.25% identity)
Query:  TEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRV
        TE +N + + +TG+TG++G RL   L    H++R L R  S    + P  E     + +     D   G D V + A   +   W  +  + I  + +  
Subjt:  TEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRV

Query:  TSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--L
           V+  + +     +   ++  ++    GS++ ++ +E       +    C E+E     A  + +N   + V + L+  GV+ G      A M+   L
Subjt:  TSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--L

Query:  FMMFAG---GPLGSGKQWFSWIHLDDIV
           F G   G +GSG   +S+ H+DD+V
Subjt:  FMMFAG---GPLGSGKQWFSWIHLDDIV

AT4G33360.2 NAD(P)-binding Rossmann-fold superfamily protein (e value 9.6e-06, 23.25% identity)
Query:  TEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRV
        TE +N + + +TG+TG++G RL   L    H++R L R  S    + P  E     + +     D   G D V + A   +   W  +  + I  + +  
Subjt:  TEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRV

Query:  TSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--L
           V+  + +     +   ++  ++    GS++ ++ +E       +    C E+E     A  + +N   + V + L+  GV+ G      A M+   L
Subjt:  TSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--L

Query:  FMMFAG---GPLGSGKQWFSWIHLDDIV
           F G   G +GSG   +S+ H+DD+V
Subjt:  FMMFAG---GPLGSGKQWFSWIHLDDIV

AT4G33360.3 NAD(P)-binding Rossmann-fold superfamily protein (e value 3.6e-05, 22.52% identity)
Query:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS
        + + +TG+TG++G RL   L    H++R L R  S    + P  E     + +     D   G D V + A   +   W  +  + I  + +     V+ 
Subjt:  LTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGSDGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVS

Query:  LINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG
         + +     +   ++  ++    GS++ ++ +E       +    C E+E     A  + +N   + V + L+  GV+ G      A M+   L   F G
Subjt:  LINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWE-----ATALGVN---KNVRVALIRIGVVLGKEGGALAKMIP--LFMMFAG

Query:  ---GPLGSGKQWFSWIHLDDIV
           G +GSG   +S+ H+DD+V
Subjt:  ---GPLGSGKQWFSWIHLDDIV


Sequences
CDS sequence
ATGGAGCTCTCTCGCGCCACTTCTTTCTCATGCAGCCCTACTGTCTCCCATCCTCTTCGCATCCCTCAACACCTGGCAATATGTGGCAAGAGGTTCCGGGTTTTCTGCGC
CATTGATGCAACAGAGATGAAAAATCAGCTCACTGTATCAATAACTGGAGCTACAGGCTTCATTGGTAAAAGACTTGTGCAAAAGCTACATGCAGATAACCATAATATTC
GAGTTTTGACACGTTCTAAATCCAAGGCCGAGTTGATTTTTCCGGCTAGGGAGTTTCCAGGAATCGTAATTGCAGAGGAGCCAGGGTGGAAAGATTGCATCCAAGGTTCA
GATGGTGTTGTTAATTTGGCTGGGATGCCCATAAGTACCAGGTGGTCTTCTGAGATCAAGAAAGAGATCAAGCAAAGCAGGATCAGAGTCACCTCAAAGGTTGTAAGCTT
AATTAATGATGCACCAGATGCAGCTCGACCGGCAGTTTTGGTTAGCGCAACAGCTGTTGGTTACTACGGCTCTAGTGAAACAGCAATATTTGATGAGCGAAGTCCATCCG
GAAATGACTACTTGGCTGAGGTTTGTAGGGAATGGGAAGCAACAGCTCTGGGAGTTAACAAGAATGTCAGAGTGGCTCTTATTCGTATTGGTGTTGTTCTTGGTAAAGAA
GGTGGTGCTTTAGCCAAAATGATCCCTCTCTTCATGATGTTTGCTGGAGGCCCTTTGGGATCTGGAAAACAATGGTTTTCCTGGATTCATTTGGATGACATTGTGAACCT
AATATATGAAGCTCTGGTCAATCCATCTTATAAGGGTGTTATAAATGGAACGGCGCCGAACCCAGTTAAATTGTCGGAATTATGTGAACGGTTGGGAGACGCCATGGGCA
GACCTTCATGGCTTCCCGTACCTGACTTTGCCCTTAAAGCCGTGCTTGGAGAAGGAGCTTCTGTGGTTTTGGAAGGGCAACGGGTTCTTCCAGCAAGAGCCAAAGAATTG
GGTTTTTCATTCAAGTACCCCTCAGTGAAAGAGGCACTCAAGGCCATTCTTTCCTAA
mRNA sequence
ATGGAGCTCTCTCGCGCCACTTCTTTCTCATGCAGCCCTACTGTCTCCCATCCTCTTCGCATCCCTCAACACCTGGCAATATGTGGCAAGAGGTTCCGGGTTTTCTGCGC
CATTGATGCAACAGAGATGAAAAATCAGCTCACTGTATCAATAACTGGAGCTACAGGCTTCATTGGTAAAAGACTTGTGCAAAAGCTACATGCAGATAACCATAATATTC
GAGTTTTGACACGTTCTAAATCCAAGGCCGAGTTGATTTTTCCGGCTAGGGAGTTTCCAGGAATCGTAATTGCAGAGGAGCCAGGGTGGAAAGATTGCATCCAAGGTTCA
GATGGTGTTGTTAATTTGGCTGGGATGCCCATAAGTACCAGGTGGTCTTCTGAGATCAAGAAAGAGATCAAGCAAAGCAGGATCAGAGTCACCTCAAAGGTTGTAAGCTT
AATTAATGATGCACCAGATGCAGCTCGACCGGCAGTTTTGGTTAGCGCAACAGCTGTTGGTTACTACGGCTCTAGTGAAACAGCAATATTTGATGAGCGAAGTCCATCCG
GAAATGACTACTTGGCTGAGGTTTGTAGGGAATGGGAAGCAACAGCTCTGGGAGTTAACAAGAATGTCAGAGTGGCTCTTATTCGTATTGGTGTTGTTCTTGGTAAAGAA
GGTGGTGCTTTAGCCAAAATGATCCCTCTCTTCATGATGTTTGCTGGAGGCCCTTTGGGATCTGGAAAACAATGGTTTTCCTGGATTCATTTGGATGACATTGTGAACCT
AATATATGAAGCTCTGGTCAATCCATCTTATAAGGGTGTTATAAATGGAACGGCGCCGAACCCAGTTAAATTGTCGGAATTATGTGAACGGTTGGGAGACGCCATGGGCA
GACCTTCATGGCTTCCCGTACCTGACTTTGCCCTTAAAGCCGTGCTTGGAGAAGGAGCTTCTGTGGTTTTGGAAGGGCAACGGGTTCTTCCAGCAAGAGCCAAAGAATTG
GGTTTTTCATTCAAGTACCCCTCAGTGAAAGAGGCACTCAAGGCCATTCTTTCCTAA
Protein sequence
MELSRATSFSCSPTVSHPLRIPQHLAICGKRFRVFCAIDATEMKNQLTVSITGATGFIGKRLVQKLHADNHNIRVLTRSKSKAELIFPAREFPGIVIAEEPGWKDCIQGS
DGVVNLAGMPISTRWSSEIKKEIKQSRIRVTSKVVSLINDAPDAARPAVLVSATAVGYYGSSETAIFDERSPSGNDYLAEVCREWEATALGVNKNVRVALIRIGVVLGKE
GGALAKMIPLFMMFAGGPLGSGKQWFSWIHLDDIVNLIYEALVNPSYKGVINGTAPNPVKLSELCERLGDAMGRPSWLPVPDFALKAVLGEGASVVLEGQRVLPARAKEL
GFSFKYPSVKEALKAILS
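
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a hedged illustration, not a CuGenDB tool: `translate` and `CODON_TABLE` are names introduced here, the standard (non-organellar) code is assumed, and only the first 30 bases and the final line of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG-ordered layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS and its final line, copied from the record above.
CDS_START = "ATGGAGCTCTCTCGCGCCACTTCTTTCTCA"
CDS_END = "GGTTTTTCATTCAAGTACCCCTCAGTGAAAGAGGCACTCAAGGCCATTCTTTCCTAA"

print(translate(CDS_START))
print(translate(CDS_END))
```

The final CDS line can be translated on its own because the lines before it total 990 bases, a multiple of three, so it begins in frame.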