; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037653 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037653
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2
Genome locationchr2:8056333..8059744
RNA-Seq ExpressionLag0037653
SyntenyLag0037653
Gene Ontology termsNA
InterPro domainsIPR027443 - Isopenicillin N synthase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649785.1 hypothetical protein Csa_012409 [Cucumis sativus]2.8e-26796.62Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSESFK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR AC SAELMQ+NDSREWCRTSGYYVDAQM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQ+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        LAMYTSDH++QIDKSLITL K+DKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

XP_004144636.1 uncharacterized protein LOC101216737 isoform X1 [Cucumis sativus]2.8e-26796.62Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSESFK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR AC SAELMQ+NDSREWCRTSGYYVDAQM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQ+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        LAMYTSDH++QIDKSLITL K+DKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

XP_022936950.1 uncharacterized protein LOC111443386 [Cucurbita moschata]6.6e-26996.62Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSE+FK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR ACPSAELMQNNDSREWCRTSGYYVD+QM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVEPSNGMELPPAGL DIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLS CCYGRPSFHGEHHHKLT Q+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        LAMYTSDHEHQIDKSLITLVKSDKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCE+HIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

XP_022976775.1 uncharacterized protein LOC111477056 [Cucurbita maxima]6.1e-26795.98Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDI PSEGVPSE+FK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDS+RLYFHQR ACPSAELMQNNDSREWCRTSGYYVDAQM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVE SNGMELPPAGL DIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLS CCYGRPSFHGEHHHKLT Q+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        LAMYTSDHEHQIDKSLITLVK+DKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCE+HIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

XP_023536343.1 uncharacterized protein LOC111797542 [Cucurbita pepo subsp. pepo]2.5e-26896.19Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSE+FK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR ACPSAELMQNND+REWCRTSGYYVD+QM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVEPSNGMELPPAGL DIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLS CCYGRPSFHGEHHHKLT Q+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        LAMYTSDHEHQIDKSL+TLVKSDKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCE+HIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

TrEMBL top hitse value%identityAlignment
A0A1S3CNW5 uncharacterized protein LOC103503048 isoform X19.6e-26695.57Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSESFK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR AC SAELMQNNDSREWCRTSGYYVD QM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVEPS+GMELPPAGLPDIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQ+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        L+MY SDH++QIDKSLITL KSDKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMT+LSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKMRRRKN+SSTKPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTNL
        KDCE+HIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTNL
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTNL

A0A5A7UD01 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 22.8e-26595.56Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSESFK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR AC SAELMQNNDSREWCRTSGYYVD QM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVEPS+GMELPPAGLPDIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQ+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        L+MY SDH++QIDKSLITL KSDKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMT+LSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRS STDQLFNRPNFQNFSFSTSQDGSIKMRRRKN+SSTKPLPPSKRLRLEAQRVLKE+VQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCE+HIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWT SHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

A0A6J1BT97 uncharacterized protein LOC1110051374.3e-26695.56Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSESFK SVSTLS SLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR ACPSAELMQNNDSREWCRTSGYYVD+QM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQE YDYRPGLTPVEPSN +ELPPAGLPDIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLSVCC+GRPSFHGEHHHKLTAQ+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        L+MY +DHEHQIDKSLITLVKSDKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCS SFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNS+TKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        K+CESHIHTLDSPCASTR+EIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHD+ELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

A0A6J1FF60 uncharacterized protein LOC1114433863.2e-26996.62Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDIAPSEGVPSE+FK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQR ACPSAELMQNNDSREWCRTSGYYVD+QM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVEPSNGMELPPAGL DIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLS CCYGRPSFHGEHHHKLT Q+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        LAMYTSDHEHQIDKSLITLVKSDKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCE+HIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

A0A6J1IKE1 uncharacterized protein LOC1114770563.0e-26795.98Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM
        MAGNGLPSLGRVKLTDI PSEGVPSE+FK SVSTLSHSLAQYSAAIIQFPACDGALLRSGLDS+RLYFHQR ACPSAELMQNNDSREWCRTSGYYVDAQM
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQM

Query:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ
        WQETYDYRPGLTPVE SNGMELPPAGL DIFALYGKASRIILDAISF+LNLRSSPFTEILDNVPLRSREISSSVLS CCYGRPSFHGEHHHKLT Q+DSQ
Subjt:  WQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQ

Query:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
        LAMYTSDHEHQIDKSLITLVK+DKAGLLIKD +GRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC
Subjt:  LAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSC

Query:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL
        SEMRAAGHGVDVQFQ+PVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGS+KMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRFCNL
Subjt:  SEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNL

Query:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        KDCE+HIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAK+GFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
Subjt:  KDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G12940.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.9e-20673.11Show/hide
Query:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSA-ELMQNNDSREWCRTSGYYVDAQ
        MAGNG+P+LGRVK+ D+ PSEG+PS+S+K +V+TLS SLAQYSAAIIQFPA DGALLRSGLDSARLYFHQR + P+   ++  NDS+EWC+TSGYY D Q
Subjt:  MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSA-ELMQNNDSREWCRTSGYYVDAQ

Query:  MWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDS
         WQE+Y+YRPGLTP EPSN ME PPAGLPDIFAL GKA+R++LDAI F+LNLRS PFTEILDNVPLR+ E+SSSVLSVCCY RPSFHG  HH LT  +D 
Subjt:  MWQETYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDS

Query:  QLAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLS
        QL +Y SDH+HQ+DKSLI+ VKSDKAGL I+D HG+WILVD DLGPQ+A+VYPGLALYQATAGYV+PA+ RTD+N++QGS+ GR SL+FKLMPKSMT+LS
Subjt:  QLAMYTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLS

Query:  CSEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKM--RRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRF
        CSEMRAAGHGV+ QFQ+PV VDDFMQRS S D+LFNR   Q+F    SQDGS+K   +RRK++S  KPLPPSKRLRLEAQRVLKERVQ+IADKKGIKLRF
Subjt:  CSEMRAAGHGVDVQFQVPVPVDDFMQRSPSTDQLFNRPNFQNFSFSTSQDGSIKM--RRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRF

Query:  CNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN
        CNLK+CE++ + ++SPCA+ R EIGWP GVPFVHPHDLPNKAKIGFLE YEPGW+ +HD+E SL+E  Q  Q  TN
Subjt:  CNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAKIGFLEAYEPGWTASHDVELSLTEPGQVGQQSTN

AT3G19895.1 RING/U-box superfamily protein1.4e-3835.42Show/hide
Query:  NGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQMWQE
        +G P L RV+L++I P EG PS  + ++V  LS SL +Y+A++I+  + D AL+R GL++ARLYF                     RT    V  +  + 
Subjt:  NGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQMWQE

Query:  TYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQLAM
           YR G +  +    ++  P  + +IF   GK +R  L AI+  L LRS  F  +LD+ PL   E+SSSVL +  Y   S     H    A     L+ 
Subjt:  TYDYRPGLTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQLAM

Query:  YTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGS-MYGRCSLSFKLMPKSMTSLSCSE
             + +++K L+TL  SD  G+ + D +GRW   D   G  D ++  G AL  ATAG    A  RT  +++  +   GR SL+F+LMPKS   L CS 
Subjt:  YTSDHEHQIDKSLITLVKSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGS-MYGRCSLSFKLMPKSMTSLSCSE

Query:  MRAAGHGVDVQFQVPVPVDDFMQR-SPSTDQLFNRP
        + AAGH V  Q  VPV V  FM       D L N P
Subjt:  MRAAGHGVDVQFQVPVPVDDFMQR-SPSTDQLFNRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAGCTTACCGATATAGCACCATCTGAAGGCGTTCCTTCTGAGTCTTTCAAACAGTCGGTCTCAACTCTGTCACA
TTCTCTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCATGTGATGGAGCTCTTTTAAGATCTGGTTTAGATTCTGCTCGCCTTTACTTCCACCAGAGAACTGCAT
GTCCGTCTGCTGAGTTGATGCAAAACAACGATTCACGGGAATGGTGCAGAACATCTGGTTACTATGTGGATGCTCAAATGTGGCAAGAAACGTATGACTATAGGCCTGGA
CTGACTCCAGTAGAGCCCAGCAATGGAATGGAGTTACCACCTGCAGGTTTGCCAGATATATTTGCACTATATGGAAAGGCATCTCGAATTATTTTGGATGCAATCAGCTT
CTTTTTAAACTTGCGCAGCTCTCCTTTCACAGAAATACTCGATAATGTTCCCCTAAGAAGTAGGGAGATATCATCTTCTGTGTTGTCTGTGTGTTGTTATGGGAGGCCCT
CATTTCATGGAGAACATCACCATAAATTAACTGCTCAAGATGATAGCCAGTTGGCTATGTATACGTCTGACCACGAGCATCAAATTGATAAAAGTCTTATTACTCTGGTC
AAGTCGGATAAGGCAGGTTTACTAATAAAAGATTCCCATGGTAGATGGATTCTTGTGGATGGAGATCTTGGTCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTA
TCAAGCAACTGCGGGGTATGTGAATCCTGCTTTGCTTAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTGTCATTCAAACTCATGCCTAAAT
CCATGACCAGCCTGAGTTGTTCAGAAATGAGGGCAGCTGGCCATGGAGTAGATGTTCAGTTCCAGGTTCCAGTACCAGTGGATGACTTCATGCAAAGATCACCCTCAACT
GACCAACTCTTTAACCGCCCTAATTTCCAGAATTTCAGTTTCTCTACATCCCAAGACGGATCCATAAAAATGAGAAGGAGAAAGAATAATTCAAGTACCAAACCTCTACC
CCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTTTGAAGGAGAGAGTACAGGACATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAACCTGAAGGATTGTG
AGAGTCACATTCACACCTTAGATAGCCCTTGTGCTAGCACAAGAATGGAGATTGGATGGCCTCCTGGGGTGCCCTTCGTTCACCCTCATGATCTACCTAATAAGGCAAAA
ATTGGTTTTCTTGAAGCTTACGAGCCTGGTTGGACAGCTAGTCATGATGTTGAATTAAGTCTTACTGAACCTGGACAAGTGGGTCAACAGTCAACCAACTTGGGCATCAT
CGACAGAGATTACTTTGGTCCACAGGTGAATTGGAGCCTAGAGAATCAGTTCATGTTCTGTACAGCAAGGAATACTCAAGGCCTCGTAGCTTGTGACTTCAAAACCAGTT
CCAATCATGGCTGTTCATCAGCTCATCATACCTGTGCTAAGATTGTTGTTTCAATGACCAACGCCGAGAAATCTTGCCTATTTGCATCTCCAGCAAAGACTGTACATATG
TCATTTTCTGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGGCAATGGCCTGCCATCATTGGGTCGTGTGAAGCTTACCGATATAGCACCATCTGAAGGCGTTCCTTCTGAGTCTTTCAAACAGTCGGTCTCAACTCTGTCACA
TTCTCTAGCTCAATACTCTGCTGCCATCATTCAATTCCCTGCATGTGATGGAGCTCTTTTAAGATCTGGTTTAGATTCTGCTCGCCTTTACTTCCACCAGAGAACTGCAT
GTCCGTCTGCTGAGTTGATGCAAAACAACGATTCACGGGAATGGTGCAGAACATCTGGTTACTATGTGGATGCTCAAATGTGGCAAGAAACGTATGACTATAGGCCTGGA
CTGACTCCAGTAGAGCCCAGCAATGGAATGGAGTTACCACCTGCAGGTTTGCCAGATATATTTGCACTATATGGAAAGGCATCTCGAATTATTTTGGATGCAATCAGCTT
CTTTTTAAACTTGCGCAGCTCTCCTTTCACAGAAATACTCGATAATGTTCCCCTAAGAAGTAGGGAGATATCATCTTCTGTGTTGTCTGTGTGTTGTTATGGGAGGCCCT
CATTTCATGGAGAACATCACCATAAATTAACTGCTCAAGATGATAGCCAGTTGGCTATGTATACGTCTGACCACGAGCATCAAATTGATAAAAGTCTTATTACTCTGGTC
AAGTCGGATAAGGCAGGTTTACTAATAAAAGATTCCCATGGTAGATGGATTCTTGTGGATGGAGATCTTGGTCCTCAAGATGCTATAGTTTATCCTGGACTTGCACTCTA
TCAAGCAACTGCGGGGTATGTGAATCCTGCTTTGCTTAGAACAGATGTGAATAATATTCAAGGTAGTATGTATGGACGGTGTTCCTTGTCATTCAAACTCATGCCTAAAT
CCATGACCAGCCTGAGTTGTTCAGAAATGAGGGCAGCTGGCCATGGAGTAGATGTTCAGTTCCAGGTTCCAGTACCAGTGGATGACTTCATGCAAAGATCACCCTCAACT
GACCAACTCTTTAACCGCCCTAATTTCCAGAATTTCAGTTTCTCTACATCCCAAGACGGATCCATAAAAATGAGAAGGAGAAAGAATAATTCAAGTACCAAACCTCTACC
CCCTTCTAAGAGATTACGGCTTGAGGCACAGAGAGTTTTGAAGGAGAGAGTACAGGACATTGCAGATAAGAAGGGTATCAAATTGAGGTTCTGTAACCTGAAGGATTGTG
AGAGTCACATTCACACCTTAGATAGCCCTTGTGCTAGCACAAGAATGGAGATTGGATGGCCTCCTGGGGTGCCCTTCGTTCACCCTCATGATCTACCTAATAAGGCAAAA
ATTGGTTTTCTTGAAGCTTACGAGCCTGGTTGGACAGCTAGTCATGATGTTGAATTAAGTCTTACTGAACCTGGACAAGTGGGTCAACAGTCAACCAACTTGGGCATCAT
CGACAGAGATTACTTTGGTCCACAGGTGAATTGGAGCCTAGAGAATCAGTTCATGTTCTGTACAGCAAGGAATACTCAAGGCCTCGTAGCTTGTGACTTCAAAACCAGTT
CCAATCATGGCTGTTCATCAGCTCATCATACCTGTGCTAAGATTGTTGTTTCAATGACCAACGCCGAGAAATCTTGCCTATTTGCATCTCCAGCAAAGACTGTACATATG
TCATTTTCTGGGTAA
Protein sequenceShow/hide protein sequence
MAGNGLPSLGRVKLTDIAPSEGVPSESFKQSVSTLSHSLAQYSAAIIQFPACDGALLRSGLDSARLYFHQRTACPSAELMQNNDSREWCRTSGYYVDAQMWQETYDYRPG
LTPVEPSNGMELPPAGLPDIFALYGKASRIILDAISFFLNLRSSPFTEILDNVPLRSREISSSVLSVCCYGRPSFHGEHHHKLTAQDDSQLAMYTSDHEHQIDKSLITLV
KSDKAGLLIKDSHGRWILVDGDLGPQDAIVYPGLALYQATAGYVNPALLRTDVNNIQGSMYGRCSLSFKLMPKSMTSLSCSEMRAAGHGVDVQFQVPVPVDDFMQRSPST
DQLFNRPNFQNFSFSTSQDGSIKMRRRKNNSSTKPLPPSKRLRLEAQRVLKERVQDIADKKGIKLRFCNLKDCESHIHTLDSPCASTRMEIGWPPGVPFVHPHDLPNKAK
IGFLEAYEPGWTASHDVELSLTEPGQVGQQSTNLGIIDRDYFGPQVNWSLENQFMFCTARNTQGLVACDFKTSSNHGCSSAHHTCAKIVVSMTNAEKSCLFASPAKTVHM
SFSG