; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G019480 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G019480
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPolyketide cyclase
Genome locationchr01:25152314..25162278
RNA-Seq ExpressionLsi01G019480
SyntenyLsi01G019480
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442209.1 PREDICTED: uncharacterized protein LOC103486131 [Cucumis melo]0.0e+0088.92Show/hide
Query:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVS
        MNTMIVCRAL F LGPPLPLTS VY TQTEYCQTSSSSLPLRTKCVSLSAA+GFEWNS+QYFA G NLKR SGVYGGR DGEEGEAERER+VRCEVEVVS
Subjt:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVS

Query:  WRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK
        WRERRIRADIFVHSGIESVWN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQE LNSDGSREL FSMVDGDFKKFEGK
Subjt:  WRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK

Query:  WSIKAGTR-SSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNL
        WSIKAGTR SSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGGQRVG+ KDSK+VVLSNTLNGATC K+EIVQENSRG NSNSNL
Subjt:  WSIKAGTR-SSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNL

Query:  GPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC
        GP+P LSNELNTNWGVFGKVCRLDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGC
Subjt:  GPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC

Query:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFE
        KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE
Subjt:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFE

Query:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIAS
           +G+ EE SVP + NQSNGYTTTAEGVS + GR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIAS
Subjt:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIAS

Query:  LMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD
        LMNLSLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+  NDVD
Subjt:  LMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD

Query:  AESKTPSKPYISQDTEKWLTGLKYLDINWVE
         ESK PSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  AESKTPSKPYISQDTEKWLTGLKYLDINWVE

XP_011654397.2 uncharacterized protein LOC101212159 [Cucumis sativus]0.0e+0089.27Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWRE
        MIVCRAL F LGPPLPLTS V  TQTEY QTSSSSLPLRTKCVSLSAA+GFEWN TQYFA G NLKR SGVYGGREDGEEGEAERER+VRCEVEVVSWRE
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWRE

Query:  RRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI
        RRIRAD+FVHSGIESVWN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSREL FSMVDGDFKKFEGKWSI
Subjt:  RRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI

Query:  KAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLP
         AGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGGQRVG+ KDSK VVLSNTLNGATC K+EIVQENSRG NSNSNLG +P
Subjt:  KAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLP

Query:  SLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL
         LSNELNTNWGVFGKVCRLDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGCKGLL
Subjt:  SLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKR LK SFEA D+
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDE

Query:  GDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNL
        GDSEE SV  RNNQSNGYTTTAEGVSD+ GR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+ VND D ESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWVE
         PSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWVE

XP_022925024.1 uncharacterized protein LOC111432394 isoform X1 [Cucurbita moschata]0.0e+0084.07Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWR
        MIVCR LRFNLGP LP  S VY  Q EYC T SSSSL LRTKCVS+SAAEGF+WNS++YF    +LKRGSGVYGGR+   EGE ERER+V CEVEVVSWR
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWR

Query:  ERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
        ER+IRA+IFV+SGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
Subjt:  ERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS

Query:  IKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPL
        +KAGTRSSPT+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGGQRVG+++DSKS++LSNT+NGA CEK+E++QE     NS+SNLG L
Subjt:  IKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPL

Query:  PSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL
        P LSNELN+NWGVFGKVC+LDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGL
Subjt:  PSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL

Query:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFD
        LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F+
Subjt:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFD

Query:  EGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMN
        +GDSEE S  ++NNQ N +TTT E VSDV GR S R RPK+PGLQRD+EVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIASLMN
Subjt:  EGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMN

Query:  LSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES
        LSLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRH NRQPSFAKDRKNDYL VNDVD+ES
Subjt:  LSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES

Query:  KTPSKPYISQDTEKWLTGLKYLDINWVE
        KTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KTPSKPYISQDTEKWLTGLKYLDINWVE

XP_023517467.1 uncharacterized protein LOC111781223 [Cucurbita pepo subsp. pepo]0.0e+0084.18Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWRE
        MIV   LRFNLGP LP TS VY  Q EYC TSSS L LRTKCVS+SAAEGF+WNS++YF    +LKRGSGVYGGR+   EGE ERER+V CEVEVVSWRE
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWRE

Query:  RRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI
        R+IRA+IFV+SGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS+
Subjt:  RRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI

Query:  KAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLP
        KAGTRSSPT+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGGQRVG+++DSKS++LSNT+NGA CEK+E++QE     NS+SNLG LP
Subjt:  KAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLP

Query:  SLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL
         LSNELN+NWGVFGKVC+LDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGLL
Subjt:  SLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F++
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDE

Query:  GDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNL
        GDSEE S  ++NNQ NG+TTT E VSD+ GR S RPRPK+PGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDK DNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DYL VNDVDAESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWVE
        TPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWVE

XP_038882723.1 uncharacterized protein LOC120073881 [Benincasa hispida]0.0e+0090.55Show/hide
Query:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVS
        MNTMIVCRAL F LGPP PLTS VY TQTEY QTS SSLP RTKCVSLSAAEGFEWNSTQYF  GCNLKRG+ VYGGREDGEEGE ERER+VRCEVEVVS
Subjt:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVS

Query:  WRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK
        WRERRIRADIFV SGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSREL FSMVDGDFKKFEGK
Subjt:  WRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK

Query:  WSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLG
        WSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGGQRVG+TKDSKSVVLSNT+ GATCEK+E+VQENSRG NSNSNLG
Subjt:  WSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLG

Query:  PLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCK
        PLP LSNELNTNWGVFGKVCRLDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGCK
Subjt:  PLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCK

Query:  GLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEA
        GLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SF A
Subjt:  GLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEA

Query:  FDEGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASL
        FDEGDSEET V HRNNQSNGY TTA GVS+V GRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIASL
Subjt:  FDEGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASL

Query:  MNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDA
        MNLSLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVS LLSLKVRHPNRQPSFA DRKNDYLAVNDVDA
Subjt:  MNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDA

Query:  ESKTPSKPYISQDTEKWLTGLKYLDINWVE
        ESKTPSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  ESKTPSKPYISQDTEKWLTGLKYLDINWVE

TrEMBL top hitse value%identityAlignment
A0A0A0KYT4 Uncharacterized protein0.0e+0089Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWRE
        MIVCRAL F LGPPLPLTS V  TQTEY QTSSSSLPLRTKCVSLSAA+GFEWN TQYFA G NLKR SGVYGGREDGEEGEAERER+VRCEVEVVSWRE
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWRE

Query:  RRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI
        RRIRAD+FVHSGIESVWN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSREL FSMVDGDFKKFEGKWSI
Subjt:  RRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI

Query:  KAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLP
         AGTRSSPTMLSYEVNVIPRFNFPAILLE+IIRSDLPVNLRALA RAEEK EGGQRVG+ KDSK VVLSNTLNGATC K+EIVQENSRG NSNSNLG +P
Subjt:  KAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLP

Query:  SLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL
         LSNELNTNWGVFGKVCRLDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGCKGLL
Subjt:  SLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKR LK SFEA D+
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDE

Query:  GDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNL
        GDSEE SV  RNNQSNGYTTTAEGVSD+ GR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+ VND D ESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWVE
         PSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWVE

A0A1S3B5Y3 uncharacterized protein LOC1034861310.0e+0088.92Show/hide
Query:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVS
        MNTMIVCRAL F LGPPLPLTS VY TQTEYCQTSSSSLPLRTKCVSLSAA+GFEWNS+QYFA G NLKR SGVYGGR DGEEGEAERER+VRCEVEVVS
Subjt:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVS

Query:  WRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK
        WRERRIRADIFVHSGIESVWN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQE LNSDGSREL FSMVDGDFKKFEGK
Subjt:  WRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK

Query:  WSIKAGTR-SSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNL
        WSIKAGTR SSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGGQRVG+ KDSK+VVLSNTLNGATC K+EIVQENSRG NSNSNL
Subjt:  WSIKAGTR-SSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNL

Query:  GPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC
        GP+P LSNELNTNWGVFGKVCRLDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGC
Subjt:  GPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC

Query:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFE
        KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE
Subjt:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFE

Query:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIAS
           +G+ EE SVP + NQSNGYTTTAEGVS + GR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIAS
Subjt:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIAS

Query:  LMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD
        LMNLSLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+  NDVD
Subjt:  LMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD

Query:  AESKTPSKPYISQDTEKWLTGLKYLDINWVE
         ESK PSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  AESKTPSKPYISQDTEKWLTGLKYLDINWVE

A0A6J1DL18 uncharacterized protein LOC111022083 isoform X10.0e+0082.93Show/hide
Query:  MIVCRALRFNLGP----------PLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVR
        MIVCRALRFNLG           P PLTS VY  Q EYCQT SSSLPLR+KCVSLSAAEGF+W+S++YFA  CNLK  S   GG EDG EG  + ER V 
Subjt:  MIVCRALRFNLGP----------PLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVR

Query:  CEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGD
        CEV+V+SWRERRIRADI V++ IESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGD
Subjt:  CEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGD

Query:  FKKFEGKWSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGS
        FKKFEGKWSIKAGTRSSPT LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEE  EGG+RVG T+DSKS+VL+NT+NGA+CE +E+ QE SR S
Subjt:  FKKFEGKWSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGS

Query:  NSNSNLGPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVR
        NSNSNLGPLP LSNELN+NWGVFGKVCRLDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVR
Subjt:  NSNSNLGPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVR

Query:  ILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRG
        ILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRG
Subjt:  ILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRG

Query:  LKKSFEAFDEG-DSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMG
           SFEAFDEG  SEE S  + N+Q NGYT   EGVSD  G++SCRP+PKV GLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMG
Subjt:  LKKSFEAFDEG-DSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMG

Query:  GFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDY
        GFRRIAS+MNLSLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKND 
Subjt:  GFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDY

Query:  LAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        LA N  DAE+KT S+PYISQDTEKWL+GLKYLDINWVE
Subjt:  LAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

A0A6J1EAX7 uncharacterized protein LOC111432394 isoform X10.0e+0084.07Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWR
        MIVCR LRFNLGP LP  S VY  Q EYC T SSSSL LRTKCVS+SAAEGF+WNS++YF    +LKRGSGVYGGR+   EGE ERER+V CEVEVVSWR
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWR

Query:  ERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
        ER+IRA+IFV+SGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
Subjt:  ERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS

Query:  IKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPL
        +KAGTRSSPT+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGGQRVG+++DSKS++LSNT+NGA CEK+E++QE     NS+SNLG L
Subjt:  IKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPL

Query:  PSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL
        P LSNELN+NWGVFGKVC+LDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGL
Subjt:  PSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL

Query:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFD
        LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F+
Subjt:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFD

Query:  EGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMN
        +GDSEE S  ++NNQ N +TTT E VSDV GR S R RPK+PGLQRD+EVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIASLMN
Subjt:  EGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMN

Query:  LSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES
        LSLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRH NRQPSFAKDRKNDYL VNDVD+ES
Subjt:  LSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES

Query:  KTPSKPYISQDTEKWLTGLKYLDINWVE
        KTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KTPSKPYISQDTEKWLTGLKYLDINWVE

A0A6J1HQY2 uncharacterized protein LOC111465941 isoform X20.0e+0084.07Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWR
        MIVCR LRFNLGP LP  S VY  Q EYC T SSSSL LRTKCVS+SAAEGF+WNS++YF    +LKRGSGVYGGR+   EGE ERER+V CEVEVVSWR
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWR

Query:  ERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
        ER+IRA IFV+SGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
Subjt:  ERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS

Query:  IKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPL
        +KAGTRSSPT+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGGQRVG+++DSKS++LSNT+NGA CEK+E++ E     NS+SNLG L
Subjt:  IKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPL

Query:  PSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL
        P LSNELN+NWGVFGKVC+LDKRCMVDE          ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGL
Subjt:  PSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL

Query:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFD
        LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLG+HHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F+
Subjt:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFD

Query:  EGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMN
        +GDSEE S  ++NNQ  G+TTT E VSD+ GR S RPR K+PGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRM GRVDIEKAITRMGGFRRIASLMN
Subjt:  EGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMN

Query:  LSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES
        LSLAYKHRKPKGYWDKFDNLQEE         ++P         E  GRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK DYL VNDVDAES
Subjt:  LSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES

Query:  KTPSKPYISQDTEKWLTGLKYLDINWVE
        KTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KTPSKPYISQDTEKWLTGLKYLDINWVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01650.1 Polyketide cyclase / dehydrase and lipid transport protein1.1e-1935.71Show/hide
Query:  VRCEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQR-ALYWHIEARVVLDLQ----ELLNSDGSRELH
        V  E++ +    RRIR+ I + + ++SVW+ LTDYE+L+DFIP LV S  +      R+ L Q G Q  AL     A+ VLD      E+L     RE+ 
Subjt:  VRCEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQR-ALYWHIEARVVLDLQ----ELLNSDGSRELH

Query:  FSMVDGDFKKFEGKWSIK------------AGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEG
        F MV+GDF+ FEGKWSI+               +   T L+Y V+V P+   P  L+E  +  ++  NL ++   A++  EG
Subjt:  FSMVDGDFKKFEGKWSIK------------AGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEG

AT4G01650.2 Polyketide cyclase / dehydrase and lipid transport protein3.1e-1935.2Show/hide
Query:  EDGEEGE-AERERNVRCEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQR-ALYWHIEARVVLDLQ--
        EDG+  E    +  V  E++ +    RRIR+ I + + ++SVW+ LTDYE+L+DFIP LV S  +      R+ L Q G Q  AL     A+ VLD    
Subjt:  EDGEEGE-AERERNVRCEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQR-ALYWHIEARVVLDLQ--

Query:  --ELLNSDGSRELHFSMVDGDFKKFEGKWSIK------------AGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEG
          E+L     RE+ F MV+GDF+ FEGKWSI+               +   T L+Y V+V P+   P  L+E  +  ++  NL ++   A++  EG
Subjt:  --ELLNSDGSRELHFSMVDGDFKKFEGKWSIK------------AGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEG

AT5G08720.1 CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031)1.5e-23666.22Show/hide
Query:  GSGVYGGREDGEEGEAER-ERNVRCEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARV
        G G  G R D   G  ER ER VRCEV+V+SWRERRIR +I+V S  +SVWN LTDYERLADFIPNLV SGRIPCPHPGRIWLEQRGLQRALYWHIEARV
Subjt:  GSGVYGGREDGEEGEAER-ERNVRCEVEVVSWRERRIRADIFVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARV

Query:  VLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSV
        VLDL E L+S   RELHFSMVDGDFKKFEGKWS+K+G RS  T+LSYEVNVIPRFNFPAI LERIIRSDLPVNLRA+A +AE+  +   +    +D   +
Subjt:  VLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSIKAGTRSSPTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSV

Query:  VLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAY
        + S        E + +  E S  S+  S      + SNELN NWGV+GK C+LDK C VDE          ENGGVHRC VASITVKAPV EVW VLT+Y
Subjt:  VLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLPSLSNELNTNWGVFGKVCRLDKRCMVDE----------ENGGVHRCVVASITVKAPVREVWNVLTAY

Query:  ESLPEVVPNLAISKILSRDSNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEAL
        ESLPE+VPNLAISKILSRD+NKVRILQEGCKGLLYMVLHAR VLDL E  EQEI FEQVEGDFDSL GKW FEQLG+HHTLLKY+VES+M KD+FLSEA+
Subjt:  ESLPEVVPNLAISKILSRDSNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTLLKYSVESRMHKDTFLSEAL

Query:  MEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDEGDSEETSVPHRNNQSNGYTTTAEGV-SDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEG
        MEEV+YEDLPSNLCAIRD IEKRG K S     E    ET        S+    + E V ++  G D  + R ++PGLQRDIEVLK+E+LKFISEHGQEG
Subjt:  MEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDEGDSEETSVPHRNNQSNGYTTTAEGV-SDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEG

Query:  FMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVS
        FMPMRKQLR+ GRVDIEKAITRMGGFRRIA +MNLSLAYKHRKPKGYWD  +NLQEE         ++P         E  GRYDIARALEKWGGLHEVS
Subjt:  FMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEA-------FLEPP--------EVWGRYDIARALEKWGGLHEVS

Query:  RLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESKTP----SKPYISQDTEKWLTGLKYLDINWVE
        RLL+L VRHPNRQ +  KD  N  L     +A+  +     +KPY+SQDTEKWL  LK LDINWV+
Subjt:  RLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESKTP----SKPYISQDTEKWLTGLKYLDINWVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACACCATGATTGTTTGCAGAGCTTTGCGGTTCAATTTGGGGCCGCCATTGCCGTTAACATCCAGCGTCTATGTCACACAAACGGAGTATTGCCAAACTTCCTCTTC
CTCTCTTCCATTGCGCACTAAATGCGTCTCCCTTTCTGCTGCTGAAGGATTTGAGTGGAACTCGACCCAGTATTTTGCTAACGGCTGTAATTTGAAGAGGGGAAGTGGGG
TTTACGGTGGTCGAGAAGATGGTGAAGAGGGTGAGGCAGAGAGGGAGAGAAATGTGCGTTGTGAAGTGGAAGTTGTGTCGTGGAGGGAGCGCCGGATTCGGGCTGATATT
TTTGTTCATTCTGGGATTGAATCGGTTTGGAATGCTCTTACGGATTATGAGCGGCTTGCCGATTTCATACCCAATCTTGTTTCCAGTGGAAGAATTCCTTGTCCACATCC
TGGGCGGATATGGTTGGAACAAAGAGGTCTGCAACGGGCATTGTATTGGCATATTGAAGCTCGAGTTGTCTTGGATCTTCAAGAGCTTCTAAATTCGGATGGTAGTCGTG
AACTCCATTTTTCCATGGTTGATGGGGACTTCAAAAAGTTTGAAGGCAAATGGTCCATAAAAGCTGGTACAAGGTCATCTCCAACAATGCTGTCATATGAAGTTAATGTG
ATACCAAGATTCAATTTTCCTGCCATTCTTCTAGAAAGAATAATTAGATCAGACCTTCCGGTGAATCTTCGGGCCTTGGCCTGTAGAGCCGAAGAGAAACCTGAAGGGGG
TCAAAGAGTAGGACACACTAAAGACTCCAAGTCCGTGGTTCTCTCTAATACACTTAATGGTGCTACATGTGAAAAGAATGAAATAGTACAGGAAAATTCTAGAGGGAGTA
ATTCTAATTCCAATTTAGGACCCCTGCCCTCTTTATCTAATGAATTGAATACCAATTGGGGAGTTTTTGGAAAAGTTTGCCGACTTGACAAGCGTTGCATGGTTGATGAA
GAAAATGGAGGTGTCCATCGTTGTGTGGTAGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTTCTGACTGCTTATGAAAGTCTTCCCGAAGTAGTTCC
GAATCTAGCAATCAGCAAGATACTGTCAAGAGATAGTAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTCTACTGTATATGGTTCTGCATGCCCGTGTAGTTTTGG
ATTTGTGTGAACAGCTTGAACAGGAGATTAGCTTTGAACAGGTTGAAGGAGACTTTGACTCTCTTAGCGGAAAATGGCATTTTGAGCAGTTAGGAAATCATCATACCCTT
TTGAAATACTCGGTGGAGTCGAGAATGCACAAGGACACTTTTCTTTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGATCTTCCTTCAAACTTATGTGCAATTCGAGA
CTCCATTGAGAAAAGGGGTTTGAAAAAATCTTTTGAAGCATTTGATGAAGGTGATTCAGAGGAGACAAGTGTGCCACATCGAAACAATCAATCCAATGGCTATACGACAA
CAGCTGAAGGAGTTTCAGATGTCATTGGCAGAGATTCATGCAGACCAAGGCCCAAAGTTCCAGGCTTACAAAGAGATATTGAAGTTCTCAAAGCAGAGGTGCTCAAGTTT
ATTTCAGAACATGGGCAGGAAGGATTTATGCCAATGAGAAAGCAACTTCGCATGCAGGGAAGGGTAGATATTGAAAAGGCAATCACACGTATGGGTGGATTCAGAAGGAT
TGCATCACTTATGAATCTTTCTCTGGCTTATAAGCACCGCAAGCCGAAGGGTTACTGGGATAAATTTGACAATTTGCAGGAAGAGGCCTTTCTGGAACCTCCAGAAGTTT
GGGGGAGGTACGACATTGCACGGGCACTCGAGAAATGGGGTGGACTACACGAAGTTTCTCGTCTTTTGTCACTGAAAGTGAGACATCCTAATAGACAACCAAGCTTTGCC
AAAGATAGAAAGAATGATTATTTAGCTGTAAATGATGTTGATGCTGAAAGTAAAACTCCATCTAAACCCTATATTTCTCAGGACACAGAAAAATGGCTTACAGGACTAAA
ATATTTGGATATTAATTGGGTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
AACATATTGGTCACAATCTGTCTCTCACTGTCATCTCCATTCTCACTCTCACTCCACTCTTTACTCTGCCATTAACCAATTTTCAGTTTCTTTTTTTTGATGTCTTCATG
AACACCATGATTGTTTGCAGAGCTTTGCGGTTCAATTTGGGGCCGCCATTGCCGTTAACATCCAGCGTCTATGTCACACAAACGGAGTATTGCCAAACTTCCTCTTCCTC
TCTTCCATTGCGCACTAAATGCGTCTCCCTTTCTGCTGCTGAAGGATTTGAGTGGAACTCGACCCAGTATTTTGCTAACGGCTGTAATTTGAAGAGGGGAAGTGGGGTTT
ACGGTGGTCGAGAAGATGGTGAAGAGGGTGAGGCAGAGAGGGAGAGAAATGTGCGTTGTGAAGTGGAAGTTGTGTCGTGGAGGGAGCGCCGGATTCGGGCTGATATTTTT
GTTCATTCTGGGATTGAATCGGTTTGGAATGCTCTTACGGATTATGAGCGGCTTGCCGATTTCATACCCAATCTTGTTTCCAGTGGAAGAATTCCTTGTCCACATCCTGG
GCGGATATGGTTGGAACAAAGAGGTCTGCAACGGGCATTGTATTGGCATATTGAAGCTCGAGTTGTCTTGGATCTTCAAGAGCTTCTAAATTCGGATGGTAGTCGTGAAC
TCCATTTTTCCATGGTTGATGGGGACTTCAAAAAGTTTGAAGGCAAATGGTCCATAAAAGCTGGTACAAGGTCATCTCCAACAATGCTGTCATATGAAGTTAATGTGATA
CCAAGATTCAATTTTCCTGCCATTCTTCTAGAAAGAATAATTAGATCAGACCTTCCGGTGAATCTTCGGGCCTTGGCCTGTAGAGCCGAAGAGAAACCTGAAGGGGGTCA
AAGAGTAGGACACACTAAAGACTCCAAGTCCGTGGTTCTCTCTAATACACTTAATGGTGCTACATGTGAAAAGAATGAAATAGTACAGGAAAATTCTAGAGGGAGTAATT
CTAATTCCAATTTAGGACCCCTGCCCTCTTTATCTAATGAATTGAATACCAATTGGGGAGTTTTTGGAAAAGTTTGCCGACTTGACAAGCGTTGCATGGTTGATGAAGAA
AATGGAGGTGTCCATCGTTGTGTGGTAGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTTCTGACTGCTTATGAAAGTCTTCCCGAAGTAGTTCCGAA
TCTAGCAATCAGCAAGATACTGTCAAGAGATAGTAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTCTACTGTATATGGTTCTGCATGCCCGTGTAGTTTTGGATT
TGTGTGAACAGCTTGAACAGGAGATTAGCTTTGAACAGGTTGAAGGAGACTTTGACTCTCTTAGCGGAAAATGGCATTTTGAGCAGTTAGGAAATCATCATACCCTTTTG
AAATACTCGGTGGAGTCGAGAATGCACAAGGACACTTTTCTTTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGATCTTCCTTCAAACTTATGTGCAATTCGAGACTC
CATTGAGAAAAGGGGTTTGAAAAAATCTTTTGAAGCATTTGATGAAGGTGATTCAGAGGAGACAAGTGTGCCACATCGAAACAATCAATCCAATGGCTATACGACAACAG
CTGAAGGAGTTTCAGATGTCATTGGCAGAGATTCATGCAGACCAAGGCCCAAAGTTCCAGGCTTACAAAGAGATATTGAAGTTCTCAAAGCAGAGGTGCTCAAGTTTATT
TCAGAACATGGGCAGGAAGGATTTATGCCAATGAGAAAGCAACTTCGCATGCAGGGAAGGGTAGATATTGAAAAGGCAATCACACGTATGGGTGGATTCAGAAGGATTGC
ATCACTTATGAATCTTTCTCTGGCTTATAAGCACCGCAAGCCGAAGGGTTACTGGGATAAATTTGACAATTTGCAGGAAGAGGCCTTTCTGGAACCTCCAGAAGTTTGGG
GGAGGTACGACATTGCACGGGCACTCGAGAAATGGGGTGGACTACACGAAGTTTCTCGTCTTTTGTCACTGAAAGTGAGACATCCTAATAGACAACCAAGCTTTGCCAAA
GATAGAAAGAATGATTATTTAGCTGTAAATGATGTTGATGCTGAAAGTAAAACTCCATCTAAACCCTATATTTCTCAGGACACAGAAAAATGGCTTACAGGACTAAAATA
TTTGGATATTAATTGGGTTGAGTAGTGTACATATTCAAAGCTACAAATGTATATATATATATTCAAGAGTATCTGTAGTGATTGACCATTTTTTATAATGTATTATAGAA
AATGTCATTAAAAA
Protein sequenceShow/hide protein sequence
MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFANGCNLKRGSGVYGGREDGEEGEAERERNVRCEVEVVSWRERRIRADI
FVHSGIESVWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSIKAGTRSSPTMLSYEVNV
IPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGQRVGHTKDSKSVVLSNTLNGATCEKNEIVQENSRGSNSNSNLGPLPSLSNELNTNWGVFGKVCRLDKRCMVDE
ENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGNHHTL
LKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKKSFEAFDEGDSEETSVPHRNNQSNGYTTTAEGVSDVIGRDSCRPRPKVPGLQRDIEVLKAEVLKF
ISEHGQEGFMPMRKQLRMQGRVDIEKAITRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEAFLEPPEVWGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFA
KDRKNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE