; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G001200 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G001200
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPolyketide cyclase / dehydrase and lipid transport protein
Genome locationCG_Chr04:3758618..3768745
RNA-Seq ExpressionClCG04G001200
SyntenyClCG04G001200
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442209.1 PREDICTED: uncharacterized protein LOC103486131 [Cucumis melo]0.0e+0093.3Show/hide
Query:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVS
        MNTMIVCRAL F LGPPLPLTS VY TQTEYCQTSSSSLPLRTKCVSLSAA+GFEWNS+QYFAKG NLKR SGVYGGR DGEEGEAERERDVRCEVEVVS
Subjt:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVS

Query:  WRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK
        WRERRIRADIFVHSGIES+WN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQE LNSDGSREL FSMVDGDFKKFEGK
Subjt:  WRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK

Query:  WSIKGGTRSSS-TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNL
        WSIK GTRSSS TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGG+RVG+ +DSK++VLSNTLNGATC K+E+VQENSRGGNSNSNL
Subjt:  WSIKGGTRSSS-TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNL

Query:  GPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC
        GP+PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGC
Subjt:  GPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC

Query:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFE
        KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE
Subjt:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFE

Query:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIAS
           +G+ EE SVP + NQSNGYTTTAEGVS +NGR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIAS
Subjt:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIAS

Query:  LMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD
        LMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+  NDVD
Subjt:  LMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD

Query:  AESKTPSKPYISQDTEKWLTGLKYLDINWAE
         ESK PSKPYISQDTEKWLTGLKYLDINW E
Subjt:  AESKTPSKPYISQDTEKWLTGLKYLDINWAE

XP_011654397.2 uncharacterized protein LOC101212159 [Cucumis sativus]0.0e+0093.4Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWRE
        MIVCRAL F LGPPLPLTS V  TQTEY QTSSSSLPLRTKCVSLSAA+GFEWN TQYFAKG NLKR SGVYGGREDGEEGEAERERDVRCEVEVVSWRE
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWRE

Query:  RRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI
        RRIRAD+FVHSGIES+WN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSREL FSMVDGDFKKFEGKWSI
Subjt:  RRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI

Query:  KGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLP
          GTRSS TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGG+RVG+ +DSK +VLSNTLNGATC K+E+VQENSRGGNSNSNLG +P
Subjt:  KGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLP

Query:  PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL
        PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGCKGLL
Subjt:  PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKR LK SFEA D+
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDE

Query:  GDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
        GDSEE SV  RNNQSNGYTTTAEGVS++NGR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+ VND D ESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWAE
         PSKPYISQDTEKWLTGLKYLDINW E
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWAE

XP_022925024.1 uncharacterized protein LOC111432394 isoform X1 [Cucurbita moschata]0.0e+0088.74Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWR
        MIVCR LRFNLGP LP  S VY  Q EYC T SSSSL LRTKCVS+SAAEGF+WNS++YF K  +LKRGSGVYGGR+   EGE ERERDV CEVEVVSWR
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWR

Query:  ERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
        ER+IRA+IFV+SGIES+WNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
Subjt:  ERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS

Query:  IKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPL
        +K GTRSS T+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGG+RVG++EDSKSM+LSNT+NGA CEK+E++QE     NS+SNLG L
Subjt:  IKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPL

Query:  PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL
        PPLSNELN+NWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGL
Subjt:  PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL

Query:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFD
        LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F+
Subjt:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFD

Query:  EGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN
        +GDSEE S  ++NNQ N +TTT E VS+VNGR S R RPK+PGLQRD+EVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN
Subjt:  EGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN

Query:  LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES
        LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRH NRQPSFAKDRKNDYL VNDVD+ES
Subjt:  LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES

Query:  KTPSKPYISQDTEKWLTGLKYLDINWAE
        KTPSKPYISQDTEKWL GLKYLDINW E
Subjt:  KTPSKPYISQDTEKWLTGLKYLDINWAE

XP_023517467.1 uncharacterized protein LOC111781223 [Cucurbita pepo subsp. pepo]0.0e+0088.86Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWRE
        MIV   LRFNLGP LP TS VY  Q EYC TSSS L LRTKCVS+SAAEGF+WNS++YF K  +LKRGSGVYGGR+   EGE ERERDV CEVEVVSWRE
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWRE

Query:  RRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI
        R+IRA+IFV+SGIES+WNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS+
Subjt:  RRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI

Query:  KGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLP
        K GTRSS T+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGG+RVG++EDSKSM+LSNT+NGA CEK+E++QE     NS+SNLG LP
Subjt:  KGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLP

Query:  PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL
        PLSNELN+NWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGLL
Subjt:  PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F++
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDE

Query:  GDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
        GDSEE S  ++NNQ NG+TTT E VS++NGR S RPRPK+PGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDK DNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DYL VNDVDAESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWAE
        TPSKPYISQDTEKWL GLKYLDINW E
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWAE

XP_038882723.1 uncharacterized protein LOC120073881 [Benincasa hispida]0.0e+0095.07Show/hide
Query:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVS
        MNTMIVCRAL F LGPP PLTS VY TQTEY QTS SSLP RTKCVSLSAAEGFEWNSTQYF KGCNLKRG+ VYGGREDGEEGE ERERDVRCEVEVVS
Subjt:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVS

Query:  WRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK
        WRERRIRADIFV SGIES+WNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSREL FSMVDGDFKKFEGK
Subjt:  WRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK

Query:  WSIKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLG
        WSIK GTRSS TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGG+RVG+T+DSKS+VLSNT+ GATCEK+EMVQENSRGGNSNSNLG
Subjt:  WSIKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLG

Query:  PLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCK
        PLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGCK
Subjt:  PLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCK

Query:  GLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEA
        GLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SF A
Subjt:  GLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEA

Query:  FDEGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL
        FDEGDSEET V HRNNQSNGY TTA GVSNV+GRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL
Subjt:  FDEGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASL

Query:  MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDA
        MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVS LLSLKVRHPNRQPSFA DRKNDYLAVNDVDA
Subjt:  MNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDA

Query:  ESKTPSKPYISQDTEKWLTGLKYLDINWAE
        ESKTPSKPYISQDTEKWLTGLKYLDINW E
Subjt:  ESKTPSKPYISQDTEKWLTGLKYLDINWAE

TrEMBL top hitse value%identityAlignment
A0A0A0KYT4 Uncharacterized protein0.0e+0093.12Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWRE
        MIVCRAL F LGPPLPLTS V  TQTEY QTSSSSLPLRTKCVSLSAA+GFEWN TQYFAKG NLKR SGVYGGREDGEEGEAERERDVRCEVEVVSWRE
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWRE

Query:  RRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI
        RRIRAD+FVHSGIES+WN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSREL FSMVDGDFKKFEGKWSI
Subjt:  RRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSI

Query:  KGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLP
          GTRSS TMLSYEVNVIPRFNFPAILLE+IIRSDLPVNLRALA RAEEK EGG+RVG+ +DSK +VLSNTLNGATC K+E+VQENSRGGNSNSNLG +P
Subjt:  KGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLP

Query:  PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL
        PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGCKGLL
Subjt:  PLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKR LK SFEA D+
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDE

Query:  GDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
        GDSEE SV  RNNQSNGYTTTAEGVS++NGR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+ VND D ESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWAE
         PSKPYISQDTEKWLTGLKYLDINW E
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWAE

A0A1S3B5Y3 uncharacterized protein LOC1034861310.0e+0093.3Show/hide
Query:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVS
        MNTMIVCRAL F LGPPLPLTS VY TQTEYCQTSSSSLPLRTKCVSLSAA+GFEWNS+QYFAKG NLKR SGVYGGR DGEEGEAERERDVRCEVEVVS
Subjt:  MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVS

Query:  WRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK
        WRERRIRADIFVHSGIES+WN LTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQE LNSDGSREL FSMVDGDFKKFEGK
Subjt:  WRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGK

Query:  WSIKGGTRSSS-TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNL
        WSIK GTRSSS TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEK EGG+RVG+ +DSK++VLSNTLNGATC K+E+VQENSRGGNSNSNL
Subjt:  WSIKGGTRSSS-TMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNL

Query:  GPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC
        GP+PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRILQEGC
Subjt:  GPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGC

Query:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFE
        KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE
Subjt:  KGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFE

Query:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIAS
           +G+ EE SVP + NQSNGYTTTAEGVS +NGR S RPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIAS
Subjt:  AFDEGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIAS

Query:  LMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD
        LMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+  NDVD
Subjt:  LMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVD

Query:  AESKTPSKPYISQDTEKWLTGLKYLDINWAE
         ESK PSKPYISQDTEKWLTGLKYLDINW E
Subjt:  AESKTPSKPYISQDTEKWLTGLKYLDINWAE

A0A6J1DL18 uncharacterized protein LOC111022083 isoform X10.0e+0087.4Show/hide
Query:  MIVCRALRFNLGP----------PLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVR
        MIVCRALRFNLG           P PLTS VY  Q EYCQT SSSLPLR+KCVSLSAAEGF+W+S++YFAK CNLK  S   GG EDG EG  + ER V 
Subjt:  MIVCRALRFNLGP----------PLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVR

Query:  CEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGD
        CEV+V+SWRERRIRADI V++ IES+WNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGD
Subjt:  CEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGD

Query:  FKKFEGKWSIKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGG
        FKKFEGKWSIK GTRSS T LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEE  EGGRRVG TEDSKSMVL+NT+NGA+CE +E+ QE SR  
Subjt:  FKKFEGKWSIKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGG

Query:  NSNSNLGPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVR
        NSNSNLGPLPPLSNELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVR
Subjt:  NSNSNLGPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVR

Query:  ILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRG
        ILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRG
Subjt:  ILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRG

Query:  LKISFEAFDEG-DSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMG
           SFEAFDEG  SEE S  + N+Q NGYT   EGVS+ NG++SCRP+PKV GLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMG
Subjt:  LKISFEAFDEG-DSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMG

Query:  GFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDY
        GFRRIAS+MNLSLAYKHRKPKGYWDKFDNLQEEINRFQ SWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKND 
Subjt:  GFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDY

Query:  LAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWAE
        LA N  DAE+KT S+PYISQDTEKWL+GLKYLDINW E
Subjt:  LAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWAE

A0A6J1EAX7 uncharacterized protein LOC111432394 isoform X10.0e+0088.74Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWR
        MIVCR LRFNLGP LP  S VY  Q EYC T SSSSL LRTKCVS+SAAEGF+WNS++YF K  +LKRGSGVYGGR+   EGE ERERDV CEVEVVSWR
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWR

Query:  ERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
        ER+IRA+IFV+SGIES+WNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
Subjt:  ERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS

Query:  IKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPL
        +K GTRSS T+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGG+RVG++EDSKSM+LSNT+NGA CEK+E++QE     NS+SNLG L
Subjt:  IKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPL

Query:  PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL
        PPLSNELN+NWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGL
Subjt:  PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL

Query:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFD
        LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F+
Subjt:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFD

Query:  EGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN
        +GDSEE S  ++NNQ N +TTT E VS+VNGR S R RPK+PGLQRD+EVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN
Subjt:  EGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN

Query:  LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES
        LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRH NRQPSFAKDRKNDYL VNDVD+ES
Subjt:  LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES

Query:  KTPSKPYISQDTEKWLTGLKYLDINWAE
        KTPSKPYISQDTEKWL GLKYLDINW E
Subjt:  KTPSKPYISQDTEKWLTGLKYLDINWAE

A0A6J1HQY2 uncharacterized protein LOC111465941 isoform X20.0e+0088.74Show/hide
Query:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWR
        MIVCR LRFNLGP LP  S VY  Q EYC T SSSSL LRTKCVS+SAAEGF+WNS++YF K  +LKRGSGVYGGR+   EGE ERERDV CEVEVVSWR
Subjt:  MIVCRALRFNLGPPLPLTSSVYVTQTEYCQT-SSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWR

Query:  ERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
        ER+IRA IFV+SGIES+WNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS
Subjt:  ERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWS

Query:  IKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPL
        +K GTRSS T+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE   EGG+RVG++EDSKSM+LSNT+NGA CEK+E++ E     NS+SNLG L
Subjt:  IKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPL

Query:  PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL
        PPLSNELN+NWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSR+SNKVRI+QEGCKGL
Subjt:  PPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGL

Query:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFD
        LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLK SFE+F+
Subjt:  LYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFD

Query:  EGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN
        +GDSEE S  ++NNQ  G+TTT E VS++NGR S RPR K+PGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN
Subjt:  EGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMN

Query:  LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES
        LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK DYL VNDVDAES
Subjt:  LSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES

Query:  KTPSKPYISQDTEKWLTGLKYLDINWAE
        KTPSKPYISQDTEKWL GLKYLDINW E
Subjt:  KTPSKPYISQDTEKWLTGLKYLDINWAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01650.1 Polyketide cyclase / dehydrase and lipid transport protein7.1e-1932.89Show/hide
Query:  CNLKRGSGVYGGREDGEEGEAERERD-----------------VRCEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPG
        C     S  +   ED  E E + E D                 V  E++ +    RRIR+ I + + ++S+W+ LTDYE+L+DFIP LV S  +      
Subjt:  CNLKRGSGVYGGREDGEEGEAERERD-----------------VRCEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPG

Query:  RIWLEQRGLQR-ALYWHIEARVVLDLQ----ELLNSDGSRELHFSMVDGDFKKFEGKWSI----KG--------GTRSSSTMLSYEVNVIPRFNFPAILL
        R+ L Q G Q  AL     A+ VLD      E+L     RE+ F MV+GDF+ FEGKWSI    KG          +   T L+Y V+V P+   P  L+
Subjt:  RIWLEQRGLQR-ALYWHIEARVVLDLQ----ELLNSDGSRELHFSMVDGDFKKFEGKWSI----KG--------GTRSSSTMLSYEVNVIPRFNFPAILL

Query:  ERIIRSDLPVNLRALACRAEEKPEG
        E  +  ++  NL ++   A++  EG
Subjt:  ERIIRSDLPVNLRALACRAEEKPEG

AT4G01650.2 Polyketide cyclase / dehydrase and lipid transport protein5.4e-1936.22Show/hide
Query:  EDGEEGEAERERD-VRCEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQR-ALYWHIEARVVLDLQ--
        EDG+  E     D V  E++ +    RRIR+ I + + ++S+W+ LTDYE+L+DFIP LV S  +      R+ L Q G Q  AL     A+ VLD    
Subjt:  EDGEEGEAERERD-VRCEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQR-ALYWHIEARVVLDLQ--

Query:  --ELLNSDGSRELHFSMVDGDFKKFEGKWSI----KG--------GTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEG
          E+L     RE+ F MV+GDF+ FEGKWSI    KG          +   T L+Y V+V P+   P  L+E  +  ++  NL ++   A++  EG
Subjt:  --ELLNSDGSRELHFSMVDGDFKKFEGKWSI----KG--------GTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEG

AT5G08720.1 CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031)5.2e-26471.02Show/hide
Query:  GSGVYGGREDGEEGEAER-ERDVRCEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARV
        G G  G R D   G  ER ER VRCEV+V+SWRERRIR +I+V S  +S+WN LTDYERLADFIPNLV SGRIPCPHPGRIWLEQRGLQRALYWHIEARV
Subjt:  GSGVYGGREDGEEGEAER-ERDVRCEVEVVSWRERRIRADIFVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARV

Query:  VLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSIKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSM
        VLDL E L+S   RELHFSMVDGDFKKFEGKWS+K G RS  T+LSYEVNVIPRFNFPAI LERIIRSDLPVNLRA+A +AE+  +   +    ED   +
Subjt:  VLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSIKGGTRSSSTMLSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSM

Query:  VLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAY
        + S        E + +  E S      S++G L   SNELN NWGV+GK C+LDK C VDEVHLRRFDGLLENGGVHRC VASITVKAPV EVW VLT+Y
Subjt:  VLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLPPLSNELNTNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAY

Query:  ESLPEVVPNLAISKILSRDSNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEAL
        ESLPE+VPNLAISKILSRD+NKVRILQEGCKGLLYMVLHAR VLDL E  EQEI FEQVEGDFDSL GKW FEQLGSHHTLLKY+VES+M KD+FLSEA+
Subjt:  ESLPEVVPNLAISKILSRDSNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEAL

Query:  MEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDEGDSEETSVPHRNNQSNGYTTTAEGV-SNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEG
        MEEV+YEDLPSNLCAIRD IEKRG K S     E    ET        S+    + E V +N +G D  + R ++PGLQRDIEVLK+E+LKFISEHGQEG
Subjt:  MEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDEGDSEETSVPHRNNQSNGYTTTAEGV-SNVNGRDSCRPRPKVPGLQRDIEVLKAEVLKFISEHGQEG

Query:  FMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVS
        FMPMRKQLR+HGRVDIEKAITRMGGFRRIA +MNLSLAYKHRKPKGYWD  +NLQEEI RFQ+SWGMDPS+MPSRKSFERAGRYDIARALEKWGGLHEVS
Subjt:  FMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVS

Query:  RLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESKTP----SKPYISQDTEKWLTGLKYLDINWAE
        RLL+L VRHPNRQ +  KD  N  L     +A+  +     +KPY+SQDTEKWL  LK LDINW +
Subjt:  RLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESKTP----SKPYISQDTEKWLTGLKYLDINWAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACACCATGATTGTTTGCAGAGCTTTGAGGTTCAATTTGGGGCCGCCATTGCCACTAACATCCAGCGTCTATGTCACACAAACGGAGTATTGCCAAACTTCCTCTTC
CTCTCTTCCATTGCGCACCAAATGCGTCTCCCTTTCTGCTGCCGAAGGATTTGAGTGGAACTCGACCCAGTATTTTGCCAAGGGCTGTAATTTGAAGAGGGGAAGTGGGG
TTTACGGTGGTCGAGAAGATGGTGAAGAGGGTGAGGCAGAGAGGGAGAGAGATGTGCGTTGTGAAGTGGAAGTTGTGTCGTGGAGGGAGCGCCGGATTCGGGCTGATATT
TTTGTTCATTCTGGGATTGAATCGATTTGGAATGCTCTTACGGATTATGAGCGGCTTGCCGATTTCATACCCAATCTTGTTTCCAGTGGGAGAATTCCTTGTCCACATCC
TGGTCGGATATGGTTGGAACAAAGAGGTCTGCAACGGGCGCTGTATTGGCATATTGAAGCTCGAGTTGTCTTGGATCTTCAAGAGCTTCTAAATTCGGATGGTAGTCGTG
AACTCCATTTTTCCATGGTTGATGGGGACTTTAAAAAGTTTGAAGGCAAATGGTCCATAAAAGGCGGAACAAGGTCATCTTCAACAATGTTGTCATATGAAGTTAATGTG
ATACCAAGATTCAATTTTCCTGCCATTCTTCTTGAAAGAATAATTAGATCAGACCTTCCTGTGAATCTTCGGGCCTTGGCCTGTAGAGCTGAAGAGAAACCTGAAGGGGG
TCGAAGAGTAGGACACACTGAAGACTCCAAGTCCATGGTTCTCTCTAATACACTTAATGGGGCTACATGTGAAAAGAATGAGATGGTACAGGAAAATTCTAGAGGGGGTA
ATTCTAATTCCAATTTAGGACCCTTGCCCCCGTTATCTAATGAATTGAATACCAATTGGGGAGTTTTTGGAAAAGTTTGCCGACTTGACAAGCGTTGCATGGTTGATGAA
GTTCATCTTCGCAGATTTGATGGTTTGTTGGAAAATGGAGGCGTCCATCGTTGTGTGGTAGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTCCTGAC
TGCTTATGAAAGTCTTCCCGAAGTAGTTCCAAATCTAGCAATCAGCAAGATACTGTCAAGAGATAGCAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTCTCCTGT
ATATGGTTCTGCATGCCCGTGTTGTTTTGGACTTGTGTGAGCAGCTTGAACAAGAGATTAGCTTTGAACAGGTTGAAGGAGACTTCGACTCTCTTAGCGGAAAATGGCAT
TTTGAGCAGTTAGGAAGTCATCATACCCTGTTGAAATATTCGGTGGAGTCGAGAATGCACAAAGACACTTTTCTTTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGA
TCTTCCTTCAAACTTATGTGCAATTCGGGACTCCATTGAGAAAAGGGGTTTGAAAATTTCTTTTGAAGCATTTGATGAAGGTGATTCAGAGGAGACAAGTGTGCCACATC
GAAACAATCAATCCAATGGCTATACGACAACAGCTGAAGGAGTTTCAAATGTCAATGGGAGAGATTCATGCAGACCAAGGCCCAAAGTTCCAGGCTTACAAAGAGATATT
GAAGTTCTCAAAGCAGAGGTGCTCAAGTTTATTTCAGAACATGGGCAGGAAGGATTTATGCCAATGAGAAAGCAACTTCGAATGCACGGAAGGGTAGATATAGAAAAGGC
AATCACCCGTATGGGTGGATTCAGAAGGATTGCATCACTTATGAATCTTTCACTGGCTTATAAGCACCGCAAGCCGAAGGGTTATTGGGATAAATTTGACAATTTGCAGG
AAGAGATAAATCGGTTCCAGAAGAGCTGGGGAATGGATCCATCATACATGCCCAGTAGGAAGTCCTTTGAACGAGCAGGGAGGTACGACATTGCACGGGCACTCGAGAAA
TGGGGCGGCCTACACGAAGTTTCTCGTCTTTTGTCACTAAAAGTGAGACATCCTAATAGACAGCCAAGCTTTGCCAAAGATAGAAAGAATGATTATTTAGCTGTAAATGA
TGTTGATGCTGAAAGTAAAACTCCATCTAAACCCTATATTTCTCAGGACACAGAAAAATGGCTTACAGGACTAAAATATTTGGATATTAATTGGGCTGAGTAA
mRNA sequenceShow/hide mRNA sequence
CTCTGCCTCTCACTGTCATCACCATTCTCACTCTCACTCCACTCTCTACTCTGCCAAACCAATTCTCAGTTTCTCTTTTTGATGTCTTCATGAACACCATGATTGTTTGC
AGAGCTTTGAGGTTCAATTTGGGGCCGCCATTGCCACTAACATCCAGCGTCTATGTCACACAAACGGAGTATTGCCAAACTTCCTCTTCCTCTCTTCCATTGCGCACCAA
ATGCGTCTCCCTTTCTGCTGCCGAAGGATTTGAGTGGAACTCGACCCAGTATTTTGCCAAGGGCTGTAATTTGAAGAGGGGAAGTGGGGTTTACGGTGGTCGAGAAGATG
GTGAAGAGGGTGAGGCAGAGAGGGAGAGAGATGTGCGTTGTGAAGTGGAAGTTGTGTCGTGGAGGGAGCGCCGGATTCGGGCTGATATTTTTGTTCATTCTGGGATTGAA
TCGATTTGGAATGCTCTTACGGATTATGAGCGGCTTGCCGATTTCATACCCAATCTTGTTTCCAGTGGGAGAATTCCTTGTCCACATCCTGGTCGGATATGGTTGGAACA
AAGAGGTCTGCAACGGGCGCTGTATTGGCATATTGAAGCTCGAGTTGTCTTGGATCTTCAAGAGCTTCTAAATTCGGATGGTAGTCGTGAACTCCATTTTTCCATGGTTG
ATGGGGACTTTAAAAAGTTTGAAGGCAAATGGTCCATAAAAGGCGGAACAAGGTCATCTTCAACAATGTTGTCATATGAAGTTAATGTGATACCAAGATTCAATTTTCCT
GCCATTCTTCTTGAAAGAATAATTAGATCAGACCTTCCTGTGAATCTTCGGGCCTTGGCCTGTAGAGCTGAAGAGAAACCTGAAGGGGGTCGAAGAGTAGGACACACTGA
AGACTCCAAGTCCATGGTTCTCTCTAATACACTTAATGGGGCTACATGTGAAAAGAATGAGATGGTACAGGAAAATTCTAGAGGGGGTAATTCTAATTCCAATTTAGGAC
CCTTGCCCCCGTTATCTAATGAATTGAATACCAATTGGGGAGTTTTTGGAAAAGTTTGCCGACTTGACAAGCGTTGCATGGTTGATGAAGTTCATCTTCGCAGATTTGAT
GGTTTGTTGGAAAATGGAGGCGTCCATCGTTGTGTGGTAGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTCCTGACTGCTTATGAAAGTCTTCCCGA
AGTAGTTCCAAATCTAGCAATCAGCAAGATACTGTCAAGAGATAGCAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTCTCCTGTATATGGTTCTGCATGCCCGTG
TTGTTTTGGACTTGTGTGAGCAGCTTGAACAAGAGATTAGCTTTGAACAGGTTGAAGGAGACTTCGACTCTCTTAGCGGAAAATGGCATTTTGAGCAGTTAGGAAGTCAT
CATACCCTGTTGAAATATTCGGTGGAGTCGAGAATGCACAAAGACACTTTTCTTTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGATCTTCCTTCAAACTTATGTGC
AATTCGGGACTCCATTGAGAAAAGGGGTTTGAAAATTTCTTTTGAAGCATTTGATGAAGGTGATTCAGAGGAGACAAGTGTGCCACATCGAAACAATCAATCCAATGGCT
ATACGACAACAGCTGAAGGAGTTTCAAATGTCAATGGGAGAGATTCATGCAGACCAAGGCCCAAAGTTCCAGGCTTACAAAGAGATATTGAAGTTCTCAAAGCAGAGGTG
CTCAAGTTTATTTCAGAACATGGGCAGGAAGGATTTATGCCAATGAGAAAGCAACTTCGAATGCACGGAAGGGTAGATATAGAAAAGGCAATCACCCGTATGGGTGGATT
CAGAAGGATTGCATCACTTATGAATCTTTCACTGGCTTATAAGCACCGCAAGCCGAAGGGTTATTGGGATAAATTTGACAATTTGCAGGAAGAGATAAATCGGTTCCAGA
AGAGCTGGGGAATGGATCCATCATACATGCCCAGTAGGAAGTCCTTTGAACGAGCAGGGAGGTACGACATTGCACGGGCACTCGAGAAATGGGGCGGCCTACACGAAGTT
TCTCGTCTTTTGTCACTAAAAGTGAGACATCCTAATAGACAGCCAAGCTTTGCCAAAGATAGAAAGAATGATTATTTAGCTGTAAATGATGTTGATGCTGAAAGTAAAAC
TCCATCTAAACCCTATATTTCTCAGGACACAGAAAAATGGCTTACAGGACTAAAATATTTGGATATTAATTGGGCTGAGTAATGAACATATACAAAGCTACAAATGTATA
TATATATGGGAATTCTGATTGTATAGCAACTTTT
Protein sequenceShow/hide protein sequence
MNTMIVCRALRFNLGPPLPLTSSVYVTQTEYCQTSSSSLPLRTKCVSLSAAEGFEWNSTQYFAKGCNLKRGSGVYGGREDGEEGEAERERDVRCEVEVVSWRERRIRADI
FVHSGIESIWNALTDYERLADFIPNLVSSGRIPCPHPGRIWLEQRGLQRALYWHIEARVVLDLQELLNSDGSRELHFSMVDGDFKKFEGKWSIKGGTRSSSTMLSYEVNV
IPRFNFPAILLERIIRSDLPVNLRALACRAEEKPEGGRRVGHTEDSKSMVLSNTLNGATCEKNEMVQENSRGGNSNSNLGPLPPLSNELNTNWGVFGKVCRLDKRCMVDE
VHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRDSNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWH
FEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKISFEAFDEGDSEETSVPHRNNQSNGYTTTAEGVSNVNGRDSCRPRPKVPGLQRDI
EVLKAEVLKFISEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEK
WGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWAE