CuGenDBv2

Sgr028065 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028065
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2
Genome location: tig00153056:3077168..3079262
RNA-Seq Expression: Sgr028065
Synteny: Sgr028065
Gene Ontology terms: NA
InterPro domains: IPR027443 - Isopenicillin N synthase-like superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAE8649785.1 hypothetical protein Csa_012409 [Cucumis sativus]; e value: 1.9e-262; % identity: 93.87
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSESFKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAAC SAEL+QSNDSREWCRTSGYY DAQM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVEP+N M+L PAGLPDIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLSVCC+GRPSFHGEHHHKLTAQEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        LAMY +DH++QIDKSLITL K+DKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKM+RRKNNS+ KPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CESHIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

XP_004144636.1 uncharacterized protein LOC101216737 isoform X1 [Cucumis sativus]; e value: 1.9e-262; % identity: 93.87
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSESFKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAAC SAEL+QSNDSREWCRTSGYY DAQM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVEP+N M+L PAGLPDIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLSVCC+GRPSFHGEHHHKLTAQEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        LAMY +DH++QIDKSLITL K+DKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKM+RRKNNS+ KPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CESHIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

XP_022132227.1 uncharacterized protein LOC111005137 [Momordica charantia]; e value: 3.0e-268; % identity: 95.77
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAEL+Q+NDSREWCRTSGYY D+QM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQE YDYRPGLTPVEP+N+++L PAGLPDIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        L+MYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCS SFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKM+RRKNNST KPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        KECESHIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

XP_022936950.1 uncharacterized protein LOC111443386 [Cucurbita moschata]; e value: 3.7e-263; % identity: 93.45
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSE+FKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAEL+Q+NDSREWCRTSGYY D+QM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVEP+N M+L PAGL DIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLS CC+GRPSFHGEHHHKLT QEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        LAMY +DHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KM+RRKNNS+ KPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CE+HIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

XP_023536343.1 uncharacterized protein LOC111797542 [Cucurbita pepo subsp. pepo]; e value: 1.4e-262; % identity: 93.02
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSE+FKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAEL+Q+ND+REWCRTSGYY D+QM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVEP+N M+L PAGL DIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLS CC+GRPSFHGEHHHKLT QEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        LAMY +DHEHQIDKSL+TLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KM+RRKNNS+ KPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CE+HIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CNW5 uncharacterized protein LOC103503048 isoform X1; e value: 4.2e-260; % identity: 92.6
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSESFKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAAC SAEL+Q+NDSREWCRTSGYY D QM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVEP++ M+L PAGLPDIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLSVCC+GRPSFHGEHHHKLTAQEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        L+MY +DH++QIDKSLITL KSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMT+LSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKM+RRKN+S+ KPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CE+HIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

A0A5A7UD01 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2; e value: 4.2e-260; % identity: 92.6
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSESFKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAAC SAEL+Q+NDSREWCRTSGYY D QM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVEP++ M+L PAGLPDIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLSVCC+GRPSFHGEHHHKLTAQEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        L+MY +DH++QIDKSLITL KSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMT+LSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKM+RRKN+S+ KPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CE+HIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

A0A6J1BT97 uncharacterized protein LOC111005137; e value: 1.4e-268; % identity: 95.77
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAEL+Q+NDSREWCRTSGYY D+QM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQE YDYRPGLTPVEP+N+++L PAGLPDIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        L+MYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCS SFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKM+RRKNNST KPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        KECESHIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

A0A6J1FF60 uncharacterized protein LOC111443386; e value: 1.8e-263; % identity: 93.45
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSE+FKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAEL+Q+NDSREWCRTSGYY D+QM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVEP+N M+L PAGL DIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLS CC+GRPSFHGEHHHKLT QEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        LAMY +DHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KM+RRKNNS+ KPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CE+HIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

A0A6J1IKE1 uncharacterized protein LOC111477056; e value: 5.8e-262; % identity: 93.02
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM
        MAGNGLPSLGRVKLTDI PSEGVPSE+FKLSVSTLS SLAQYSAAIIQFPACDGALLRSGLDS+RLYFHQRAACPSAEL+Q+NDSREWCRTSGYY DAQM
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQM

Query:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ
        WQETYDYRPGLTPVE +N M+L PAGL DIFA YGKASRIILDAISFYLNLRSSPFTE+LDNVPLRSRE+SSSVLS CC+GRPSFHGEHHHKLT QEDSQ
Subjt:  WQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQ

Query:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC
        LAMY +DHEHQIDKSLITLVK+DKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG+MYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KM+RRKNNS+ KPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        K+CE+HIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHD+ELSLTEPGQVGQQSTN
Subjt:  KECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G12940.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e value: 4.1e-207; % identity: 73.53
Query:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSA-ELIQSNDSREWCRTSGYYADAQ
        MAGNG+P+LGRVK+ D+VPSEG+PS+S+KL+V+TLSQSLAQYSAAIIQFPA DGALLRSGLDSARLYFHQR + P+   +I +NDS+EWC+TSGYYAD Q
Subjt:  MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSA-ELIQSNDSREWCRTSGYYADAQ

Query:  MWQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDS
         WQE+Y+YRPGLTP EP+N+M+  PAGLPDIFA  GKA+R++LDAI FYLNLRS PFTE+LDNVPLR+ EVSSSVLSVCC+ RPSFHG  HH LT  ED 
Subjt:  MWQETYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDS

Query:  QLAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLS
        QL +Y +DH+HQ+DKSLI+ VKSDKAGL I+D +G+WILVD DLGPQ+A+VYPGLALYQATAGYV+PA+ RTD+N++QG++ GR SL+FKLMPKSMT+LS
Subjt:  QLAMYPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLS

Query:  CSEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKM--KRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRF
        CSEMRAAGHGV+ QFQLPV VDDFMQRS S D+LFNR   Q+F    SQDGS+K   KRRK++S  KPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRF
Subjt:  CSEMRAAGHGVDVQFQLPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKM--KRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRF

Query:  CNLKECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN
        CNLKECE++ + ++SPCA+ R EIGWP GVPFVHPHDLPNKAKIGFLE YEPGW+ +HD+E SL+E  Q  Q  TN
Subjt:  CNLKECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDIELSLTEPGQVGQQSTN

AT3G19895.1 RING/U-box superfamily protein; e value: 4.3e-39; % identity: 35.71
Query:  NGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQMWQE
        +G P L RV+L++I+P EG PS  +  +V  LS SL +Y+A++I+  + D AL+R GL++ARLYF  R+   S    + N                    
Subjt:  NGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQMWQE

Query:  TYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQLAM
           YR G +  +    +D +P  + +IF   GK +R  L AI+ +L LRS  F  +LD+ PL   EVSSSVL          +G+H     A     L+ 
Subjt:  TYDYRPGLTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQLAM

Query:  YPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG-NMYGRCSLSFKLMPKSMTSLSCSE
             + +++K L+TL  SD  G+ + D NGRW   D   G  D ++  G AL  ATAG    A  RT  +++   +  GR SL+F+LMPKS   L CS 
Subjt:  YPADHEHQIDKSLITLVKSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQG-NMYGRCSLSFKLMPKSMTSLSCSE

Query:  MRAAGHGVDVQFQLPVPVDDFMQR-SPSTDQLFNRP
        + AAGH V  Q  +PV V  FM       D L N P
Subjt:  MRAAGHGVDVQFQLPVPVDDFMQR-SPSTDQLFNRP


Sequences
CDS sequence
ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTTAAGCTTACTGATATAGTACCATCTGAAGGCGTTCCTTCTGAGTCTTTCAAACTGTCGGTCTCAACACTGTCACA
GTCACTAGCTCAATATTCTGCTGCCATTATCCAATTCCCTGCATGTGATGGGGCTCTTCTAAGGTCTGGTTTAGATTCTGCTCGCCTCTACTTCCACCAGAGAGCTGCAT
GTCCATCTGCTGAATTGATCCAAAGCAATGATTCGCGGGAGTGGTGCAGAACTTCTGGTTACTATGCAGATGCTCAGATGTGGCAAGAAACGTATGATTATAGGCCTGGA
CTGACTCCAGTTGAGCCCAACAATGCGATGGATTTAACACCTGCAGGTTTGCCAGACATATTTGCCTTCTATGGAAAGGCATCTCGAATTATTTTGGATGCAATCAGCTT
CTATCTAAACTTGCGCAGCTCTCCTTTCACAGAAGTACTCGATAATGTTCCCCTAAGAAGTAGGGAGGTATCATCTTCTGTGTTGTCTGTGTGCTGTCATGGGAGGCCCT
CATTTCATGGAGAGCATCACCACAAATTAACTGCTCAAGAGGATAGCCAGTTGGCTATGTATCCTGCTGACCATGAGCATCAAATTGATAAAAGCCTTATTACTCTGGTC
AAGTCGGATAAGGCAGGTTTACTAATAAAAGATTTCAATGGTAGATGGATTCTTGTGGATGGAGATCTTGGTCCTCAAGATGCCATAGTTTACCCTGGACTTGCACTCTA
TCAAGCAACTGCGGGGTATGTGAATCCCGCTTTGCTCAGAACAGATGTAAATAATATTCAAGGTAATATGTACGGACGGTGTTCTCTGTCATTCAAACTCATGCCTAAAT
CCATGACTAGCCTCAGTTGTTCAGAAATGAGAGCAGCTGGTCATGGGGTTGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTCATGCAGAGATCGCCCTCAACT
GATCAACTCTTTAACCGCCCGAATTTCCAGAATTTCAGTTTCTCTACATCCCAAGATGGATCCATAAAAATGAAGAGGAGAAAGAATAACTCAACTGCCAAACCTCTACC
CCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAGAGTACAGGACATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAAGAATGTG
AGAGTCACATTCACACGTTAGATAGCCCATGTGCCAGCACAAGATTGGAGATTGGATGGCCTCCTGGAGTGCCATTCGTTCATCCTCATGATCTGCCTAATAAGGCAAAA
ATTGGTTTTCTTGAAGCTTACGAGCCTGGTTGGACAGCTAGTCATGACATTGAATTAAGTCTTACTGAACCTGGACAAGTGGGTCAACAGTCAACCAACTGTAAGTGGAA
CTCCGCCATATTCATATATGCATATCATCAACTGGTTCTTGTTCTTTACTTGAAATCAGGAATGCTTTGCAATCAGTATGAATAA
mRNA sequence
ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTTAAGCTTACTGATATAGTACCATCTGAAGGCGTTCCTTCTGAGTCTTTCAAACTGTCGGTCTCAACACTGTCACA
GTCACTAGCTCAATATTCTGCTGCCATTATCCAATTCCCTGCATGTGATGGGGCTCTTCTAAGGTCTGGTTTAGATTCTGCTCGCCTCTACTTCCACCAGAGAGCTGCAT
GTCCATCTGCTGAATTGATCCAAAGCAATGATTCGCGGGAGTGGTGCAGAACTTCTGGTTACTATGCAGATGCTCAGATGTGGCAAGAAACGTATGATTATAGGCCTGGA
CTGACTCCAGTTGAGCCCAACAATGCGATGGATTTAACACCTGCAGGTTTGCCAGACATATTTGCCTTCTATGGAAAGGCATCTCGAATTATTTTGGATGCAATCAGCTT
CTATCTAAACTTGCGCAGCTCTCCTTTCACAGAAGTACTCGATAATGTTCCCCTAAGAAGTAGGGAGGTATCATCTTCTGTGTTGTCTGTGTGCTGTCATGGGAGGCCCT
CATTTCATGGAGAGCATCACCACAAATTAACTGCTCAAGAGGATAGCCAGTTGGCTATGTATCCTGCTGACCATGAGCATCAAATTGATAAAAGCCTTATTACTCTGGTC
AAGTCGGATAAGGCAGGTTTACTAATAAAAGATTTCAATGGTAGATGGATTCTTGTGGATGGAGATCTTGGTCCTCAAGATGCCATAGTTTACCCTGGACTTGCACTCTA
TCAAGCAACTGCGGGGTATGTGAATCCCGCTTTGCTCAGAACAGATGTAAATAATATTCAAGGTAATATGTACGGACGGTGTTCTCTGTCATTCAAACTCATGCCTAAAT
CCATGACTAGCCTCAGTTGTTCAGAAATGAGAGCAGCTGGTCATGGGGTTGATGTTCAGTTCCAGCTTCCAGTACCAGTGGATGACTTCATGCAGAGATCGCCCTCAACT
GATCAACTCTTTAACCGCCCGAATTTCCAGAATTTCAGTTTCTCTACATCCCAAGATGGATCCATAAAAATGAAGAGGAGAAAGAATAACTCAACTGCCAAACCTCTACC
CCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTCTGAAGGAGAGAGTACAGGACATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAATCTGAAAGAATGTG
AGAGTCACATTCACACGTTAGATAGCCCATGTGCCAGCACAAGATTGGAGATTGGATGGCCTCCTGGAGTGCCATTCGTTCATCCTCATGATCTGCCTAATAAGGCAAAA
ATTGGTTTTCTTGAAGCTTACGAGCCTGGTTGGACAGCTAGTCATGACATTGAATTAAGTCTTACTGAACCTGGACAAGTGGGTCAACAGTCAACCAACTGTAAGTGGAA
CTCCGCCATATTCATATATGCATATCATCAACTGGTTCTTGTTCTTTACTTGAAATCAGGAATGCTTTGCAATCAGTATGAATAA
Protein sequence
MAGNGLPSLGRVKLTDIVPSEGVPSESFKLSVSTLSQSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRAACPSAELIQSNDSREWCRTSGYYADAQMWQETYDYRPG
LTPVEPNNAMDLTPAGLPDIFAFYGKASRIILDAISFYLNLRSSPFTEVLDNVPLRSREVSSSVLSVCCHGRPSFHGEHHHKLTAQEDSQLAMYPADHEHQIDKSLITLV
KSDKAGLLIKDFNGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGNMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQLPVPVDDFMQRSPST
DQLFNRPNFQNFSFSTSQDGSIKMKRRKNNSTAKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNLKECESHIHTLDSPCASTRLEIGWPPGVPFVHPHDLPNKAK
IGFLEAYEPGWTASHDIELSLTEPGQVGQQSTNCKWNSAIFIYAYHQLVLVLYLKSGMLCNQYE