CuGenDBv2

Spg002300 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg002300
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Polyketide_cyc domain-containing protein
Genome location: scaffold1:30872390..30880071
RNA-Seq Expression: Spg002300
Synteny: Spg002300
Gene Ontology terms: NA
InterPro domains: IPR005031 - Coenzyme Q-binding protein COQ10, START domain; IPR023393 - START-like domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG6595620.1 hypothetical protein SDJN03_12173, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.2e-276 | % identity: 89.67
Query:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE
        G+   +E K +L    A  RSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE +SEG Q+VGN++DSKSM+LSNTING  CEKDELLQE
Subjt:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE

Query:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
        N     +S++ G LPPLSNELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
Subjt:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE

Query:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
        SNKVRI+QEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
Subjt:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS

Query:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI
        IEKRGLKNS E+F++GDSEEKS+S++NNQFN  TTT E VSDVNGR+S R RPK+PGLQRD+EVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAI
Subjt:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI

Query:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
        TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
Subjt:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR

Query:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        KNDYL VNDVD+ESKTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

XP_011654397.2 uncharacterized protein LOC101212159 [Cucumis sativus] | e value: 2.5e-279 | % identity: 92.79
Query:  AKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYSRGGNSNSN-GPLP
        A  RSSPT+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEE SEG Q+VGN KDSK +VLSNT+NG  C KDE++QEN SRGGNSNSN G +P
Subjt:  AKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYSRGGNSNSN-GPLP

Query:  PLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLL
        PLSNELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLL
Subjt:  PLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKNSVEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKR LKNS EA D+
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKNSVEAFDE

Query:  GDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
        GDSEEKS S RNNQ NG TTTAEGVSD+NGR S RPRPKVPGLQRDIEVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+ VND D ESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWVE
         PSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWVE

XP_022925024.1 uncharacterized protein LOC111432394 isoform X1 [Cucurbita moschata] | e value: 1.3e-275 | % identity: 89.48
Query:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE
        G+   +E K +L    A  RSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE +SEG Q+VGN++DSKSM+LSNTING  CEKDELLQE
Subjt:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE

Query:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
        N     +S++ G LPPLSNELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
Subjt:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE

Query:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
        SNKVRI+QEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
Subjt:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS

Query:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI
        IEKRGLKNS E+F++GDSEEKS+S++NNQFN  TTT E VSDVNGR+S R RPK+PGLQRD+EVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAI
Subjt:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI

Query:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
        TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRH NRQPSFAKDR
Subjt:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR

Query:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        KNDYL VNDVD+ESKTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

XP_023517467.1 uncharacterized protein LOC111781223 [Cucurbita pepo subsp. pepo] | e value: 1.2e-276 | % identity: 89.67
Query:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE
        G+   +E K +L    A  RSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE +SEG Q+VGN++DSKSM+LSNTING  CEKDELLQE
Subjt:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE

Query:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
        N     +S++ G LPPLSNELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
Subjt:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE

Query:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
        SNKVRI+QEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
Subjt:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS

Query:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI
        IEKRGLKNS E+F++GDSEEKS+S++NNQ NG TTT E VSD+NGR+S RPRPK+PGLQRDIEVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAI
Subjt:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI

Query:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
        TRMGGFRRIASLMNLSLAYKHRKPKGYWDK DNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
Subjt:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR

Query:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        K+DYL VNDVDAESKTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

XP_038882723.1 uncharacterized protein LOC120073881 [Benincasa hispida] | e value: 2.2e-283 | % identity: 92.41
Query:  GNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYS
        G W  K       A  RSSPT+LSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEE SEG Q+VGNTKDSKS+VLSNT+ G  CEKDE++QEN S
Subjt:  GNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYS

Query:  RGGNSNSN-GPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESN
        RGGNSNSN GPLPPLSNELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESN
Subjt:  RGGNSNSN-GPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESN

Query:  KVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIE
        KVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIE
Subjt:  KVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIE

Query:  KRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAITR
        KRGLKNS  AFDEGDSEE   SHRNNQ NG  TTA GVS+V+GR+SCRPRPKVPGLQRDIEVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAITR
Subjt:  KRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAITR

Query:  MGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKN
        MGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVS LLSLKVRHPNRQPSFA DRKN
Subjt:  MGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKN

Query:  DYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        DYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  DYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KYT4 Uncharacterized protein | e value: 5.1e-278 | % identity: 92.41
Query:  AKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYSRGGNSNSN-GPLP
        A  RSSPT+LSYEVNVIPRFNFPAILLE+IIRSDLPVNLRALA RAEE SEG Q+VGN KDSK +VLSNT+NG  C KDE++QEN SRGGNSNSN G +P
Subjt:  AKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYSRGGNSNSN-GPLP

Query:  PLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLL
        PLSNELN+NWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLL
Subjt:  PLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGLL

Query:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKNSVEAFDE
        YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKR LKNS EA D+
Subjt:  YMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEKRGLKNSVEAFDE

Query:  GDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
        GDSEEKS S RNNQ NG TTTAEGVSD+NGR S RPRPKVPGLQRDIEVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL
Subjt:  GDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAITRMGGFRRIASLMNL

Query:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK
        SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK+DY+ VND D ESK
Subjt:  SLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAESK

Query:  TPSKPYISQDTEKWLTGLKYLDINWVE
         PSKPYISQDTEKWLTGLKYLDINWVE
Subjt:  TPSKPYISQDTEKWLTGLKYLDINWVE

A0A6J1EAX7 uncharacterized protein LOC111432394 isoform X1 | e value: 6.2e-276 | % identity: 89.48
Query:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE
        G+   +E K +L    A  RSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE +SEG Q+VGN++DSKSM+LSNTING  CEKDELLQE
Subjt:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE

Query:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
        N     +S++ G LPPLSNELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
Subjt:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE

Query:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
        SNKVRI+QEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
Subjt:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS

Query:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI
        IEKRGLKNS E+F++GDSEEKS+S++NNQFN  TTT E VSDVNGR+S R RPK+PGLQRD+EVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAI
Subjt:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI

Query:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
        TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRH NRQPSFAKDR
Subjt:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR

Query:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        KNDYL VNDVD+ESKTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

A0A6J1EB31 uncharacterized protein LOC111432394 isoform X3 | e value: 6.2e-276 | % identity: 89.48
Query:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE
        G+   +E K +L    A  RSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE +SEG Q+VGN++DSKSM+LSNTING  CEKDELLQE
Subjt:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE

Query:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
        N     +S++ G LPPLSNELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
Subjt:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE

Query:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
        SNKVRI+QEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
Subjt:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS

Query:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI
        IEKRGLKNS E+F++GDSEEKS+S++NNQFN  TTT E VSDVNGR+S R RPK+PGLQRD+EVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAI
Subjt:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI

Query:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
        TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRH NRQPSFAKDR
Subjt:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR

Query:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        KNDYL VNDVD+ESKTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

A0A6J1HNN8 uncharacterized protein LOC111465941 isoform X3 | e value: 1.8e-275 | % identity: 89.48
Query:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE
        G+   +E K +L    A  RSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE +SEG Q+VGN++DSKSM+LSNTING  CEKDELL E
Subjt:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE

Query:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
        N     +S++ G LPPLSNELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
Subjt:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE

Query:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
        SNKVRI+QEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
Subjt:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS

Query:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI
        IEKRGLKNS E+F++GDSEEKS+S++NNQF G TTT E VSD+NGR+S RPR K+PGLQRDIEVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAI
Subjt:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI

Query:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
        TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
Subjt:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR

Query:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        K DYL VNDVDAESKTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

A0A6J1HSZ7 uncharacterized protein LOC111465941 isoform X1 | e value: 1.8e-275 | % identity: 89.48
Query:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE
        G+   +E K +L    A  RSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAE +SEG Q+VGN++DSKSM+LSNTING  CEKDELL E
Subjt:  GNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQE

Query:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
        N     +S++ G LPPLSNELNSNWGVFGKVC+LDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE
Subjt:  NYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRE

Query:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
        SNKVRI+QEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSL+GKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS
Subjt:  SNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDS

Query:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI
        IEKRGLKNS E+F++GDSEEKS+S++NNQF G TTT E VSD+NGR+S RPR K+PGLQRDIEVLKAEVLKFI EHGQEGFMPMRKQLRMHGRVDIEKAI
Subjt:  IEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAI

Query:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
        TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR
Subjt:  TRMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDR

Query:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE
        K DYL VNDVDAESKTPSKPYISQDTEKWL GLKYLDINWVE
Subjt:  KNDYLAVNDVDAESKTPSKPYISQDTEKWLTGLKYLDINWVE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT4G01650.1 Polyketide cyclase / dehydrase and lipid transport protein | e value: 2.2e-15 | % identity: 34.73
Query:  RCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGL-LYMVLHARVVLDLCE-QLE-------QEISFEQVEGDFDSLSG
        R + + I ++A +  VW+VLT YE L + +P L +S+++ +E N+VR+ Q G + L L +  +A+ VLD  E +LE       +EI F+ VEGDF    G
Subjt:  RCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGL-LYMVLHARVVLDLCE-QLE-------QEISFEQVEGDFDSLSG

Query:  KWHFEQL--GSH-----------HTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEK
        KW  EQL  G H            T L Y+V+  +    +L   L+E  + +++ +NL +IRD+ +K
Subjt:  KWHFEQL--GSH-----------HTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEK

AT4G01650.2 Polyketide cyclase / dehydrase and lipid transport protein | e value: 2.2e-15 | % identity: 34.73
Query:  RCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGL-LYMVLHARVVLDLCE-QLE-------QEISFEQVEGDFDSLSG
        R + + I ++A +  VW+VLT YE L + +P L +S+++ +E N+VR+ Q G + L L +  +A+ VLD  E +LE       +EI F+ VEGDF    G
Subjt:  RCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNKVRILQEGCKGL-LYMVLHARVVLDLCE-QLE-------QEISFEQVEGDFDSLSG

Query:  KWHFEQL--GSH-----------HTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEK
        KW  EQL  G H            T L Y+V+  +    +L   L+E  + +++ +NL +IRD+ +K
Subjt:  KWHFEQL--GSH-----------HTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEK

AT5G08720.1 CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031) | e value: 2.6e-197 | % identity: 68.26
Query:  GNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYS
        G W  K  +       RS  T+LSYEVNVIPRFNFPAI LERIIRSDLPVNLRA+A +AE+  +   K    +D   ++ S        E D L  E   
Subjt:  GNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGNTKDSKSMVLSNTINGVVCEKDELLQENYS

Query:  RGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNK
            ++S G L   SNELN+NWGV+GK C+LDK C VDEVHLRRFDGLLENGGVHRC VASITVKAPV EVW VLT+YESLPE+VPNLAISKILSR++NK
Subjt:  RGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESLPEVVPNLAISKILSRESNK

Query:  VRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEK
        VRILQEGCKGLLYMVLHAR VLDL E  EQEI FEQVEGDFDSL GKW FEQLGSHHTLLKY+VES+M KD+FLSEA+MEEV+YEDLPSNLCAIRD IEK
Subjt:  VRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNLCAIRDSIEK

Query:  RGLKNSVEAFDE--GDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAIT
        RG K+S     E    SEE  +S R      S  T     D  G +  + R ++PGLQRDIEVLK+E+LKFI EHGQEGFMPMRKQLR+HGRVDIEKAIT
Subjt:  RGLKNSVEAFDE--GDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAIT

Query:  RMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK
        RMGGFRRIA +MNLSLAYKHRKPKGYWD  +NLQEEI RFQ+SWGMDPS+MPSRKSFERAGRYDIARALEKWGGLHEVSRLL+L VRHPNRQ +  KD  
Subjt:  RMGGFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRK

Query:  NDYLAVNDVDAESKTP----SKPYISQDTEKWLTGLKYLDINWVE
        N  L     +A+  +     +KPY+SQDTEKWL  LK LDINWV+
Subjt:  NDYLAVNDVDAESKTP----SKPYISQDTEKWLTGLKYLDINWVE


Sequences
CDS sequence
ATGGGCGGCCGGCGAACGGAAGAAAGCTGCTGTTTTCGTGAAGAAGAGGTGGAGAAAAACGCTGAGTTCGTGGGTGGGGTGGTGAAGCGTGAAACGAAAGGAGGAAAAGG
CAATAATGGGAATTGGGAAAATAAAAAAACCCTAAGTGGAGCTGGAGCAAAACCCAGGTCATCCCCAACAATTTTGTCATATGAAGTTAATGTGATACCAAGATTCAATT
TTCCTGCCATTCTTCTAGAAAGAATAATTAGATCAGACCTTCCTGTGAATCTACGGGCCTTGGCTTGTAGAGCCGAAGAGAATTCTGAAGGAGATCAAAAAGTAGGAAAC
ACTAAAGATTCCAAGTCCATGGTTCTCTCTAATACAATTAATGGTGTTGTATGTGAGAAGGATGAATTATTACAGGAAAATTATTCGAGAGGGGGTAATTCTAATTCCAA
TGGACCCTTGCCCCCATTATCTAATGAATTGAATAGCAACTGGGGAGTTTTTGGAAAAGTTTGCAGACTTGACAAACGTTGCATGGTTGATGAAGTTCATCTTCGCAGAT
TTGATGGTTTGTTGGAAAATGGAGGCGTTCACCGTTGTGTGGTTGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTCCTGACTGCTTACGAAAGTCTT
CCTGAAGTAGTTCCAAATCTAGCAATCAGCAAAATACTATCAAGAGAAAGCAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTTTACTATATATGGTTCTCCATGC
GCGTGTAGTTTTGGACTTGTGTGAACAGCTTGAACAAGAGATTAGCTTTGAACAGGTTGAAGGAGACTTTGACTCTCTTAGCGGAAAATGGCATTTTGAGCAGTTAGGAA
GTCATCATACCCTCTTGAAATACTCTGTGGAGTCGAGAATGCACAAAGATACCTTTCTCTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGATCTTCCTTCAAACTTA
TGTGCAATTCGAGACTCTATCGAGAAAAGGGGTTTGAAAAATTCTGTCGAAGCGTTTGATGAAGGTGATTCAGAGGAGAAAAGTGCCTCACATCGAAACAATCAATTCAA
TGGCTCTACGACAACAGCTGAGGGAGTCTCAGATGTCAATGGGAGAAATTCATGCAGACCAAGGCCCAAAGTTCCAGGCTTACAAAGAGATATTGAAGTTCTCAAAGCAG
AGGTGCTCAAGTTTATTTTAGAACATGGGCAGGAAGGATTTATGCCAATGAGGAAGCAACTTCGCATGCATGGAAGGGTAGATATCGAAAAGGCAATCACACGCATGGGT
GGATTCAGAAGGATTGCATCACTTATGAATCTTTCTCTGGCTTATAAGCACCGCAAGCCAAAGGGTTACTGGGACAAATTTGACAATTTGCAGGAAGAGATAAATCGATT
CCAGAAGAGCTGGGGAATGGATCCATCATACATGCCCAGTAGGAAGTCCTTTGAACGTGCAGGGAGGTACGACATCGCACGGGCACTCGAGAAATGGGGCGGTTTACACG
AAGTTTCTCGTCTTTTGTCACTAAAAGTGAGACATCCTAATAGACAACCAAGCTTTGCCAAAGATAGAAAGAATGATTATTTAGCTGTAAATGATGTTGATGCTGAAAGT
AAAACTCCATCTAAGCCCTATATTTCTCAGGACACAGAAAAATGGCTCACAGGATTAAAATATTTGGATATTAATTGGGTAGAGTAG
mRNA sequence
ATGGGCGGCCGGCGAACGGAAGAAAGCTGCTGTTTTCGTGAAGAAGAGGTGGAGAAAAACGCTGAGTTCGTGGGTGGGGTGGTGAAGCGTGAAACGAAAGGAGGAAAAGG
CAATAATGGGAATTGGGAAAATAAAAAAACCCTAAGTGGAGCTGGAGCAAAACCCAGGTCATCCCCAACAATTTTGTCATATGAAGTTAATGTGATACCAAGATTCAATT
TTCCTGCCATTCTTCTAGAAAGAATAATTAGATCAGACCTTCCTGTGAATCTACGGGCCTTGGCTTGTAGAGCCGAAGAGAATTCTGAAGGAGATCAAAAAGTAGGAAAC
ACTAAAGATTCCAAGTCCATGGTTCTCTCTAATACAATTAATGGTGTTGTATGTGAGAAGGATGAATTATTACAGGAAAATTATTCGAGAGGGGGTAATTCTAATTCCAA
TGGACCCTTGCCCCCATTATCTAATGAATTGAATAGCAACTGGGGAGTTTTTGGAAAAGTTTGCAGACTTGACAAACGTTGCATGGTTGATGAAGTTCATCTTCGCAGAT
TTGATGGTTTGTTGGAAAATGGAGGCGTTCACCGTTGTGTGGTTGCTAGCATAACAGTGAAAGCTCCTGTTCGTGAAGTCTGGAATGTCCTGACTGCTTACGAAAGTCTT
CCTGAAGTAGTTCCAAATCTAGCAATCAGCAAAATACTATCAAGAGAAAGCAACAAAGTTCGCATTCTTCAGGAAGGATGCAAGGGTTTACTATATATGGTTCTCCATGC
GCGTGTAGTTTTGGACTTGTGTGAACAGCTTGAACAAGAGATTAGCTTTGAACAGGTTGAAGGAGACTTTGACTCTCTTAGCGGAAAATGGCATTTTGAGCAGTTAGGAA
GTCATCATACCCTCTTGAAATACTCTGTGGAGTCGAGAATGCACAAAGATACCTTTCTCTCTGAGGCTCTAATGGAAGAGGTTGTATATGAAGATCTTCCTTCAAACTTA
TGTGCAATTCGAGACTCTATCGAGAAAAGGGGTTTGAAAAATTCTGTCGAAGCGTTTGATGAAGGTGATTCAGAGGAGAAAAGTGCCTCACATCGAAACAATCAATTCAA
TGGCTCTACGACAACAGCTGAGGGAGTCTCAGATGTCAATGGGAGAAATTCATGCAGACCAAGGCCCAAAGTTCCAGGCTTACAAAGAGATATTGAAGTTCTCAAAGCAG
AGGTGCTCAAGTTTATTTTAGAACATGGGCAGGAAGGATTTATGCCAATGAGGAAGCAACTTCGCATGCATGGAAGGGTAGATATCGAAAAGGCAATCACACGCATGGGT
GGATTCAGAAGGATTGCATCACTTATGAATCTTTCTCTGGCTTATAAGCACCGCAAGCCAAAGGGTTACTGGGACAAATTTGACAATTTGCAGGAAGAGATAAATCGATT
CCAGAAGAGCTGGGGAATGGATCCATCATACATGCCCAGTAGGAAGTCCTTTGAACGTGCAGGGAGGTACGACATCGCACGGGCACTCGAGAAATGGGGCGGTTTACACG
AAGTTTCTCGTCTTTTGTCACTAAAAGTGAGACATCCTAATAGACAACCAAGCTTTGCCAAAGATAGAAAGAATGATTATTTAGCTGTAAATGATGTTGATGCTGAAAGT
AAAACTCCATCTAAGCCCTATATTTCTCAGGACACAGAAAAATGGCTCACAGGATTAAAATATTTGGATATTAATTGGGTAGAGTAG
Protein sequence
MGGRRTEESCCFREEEVEKNAEFVGGVVKRETKGGKGNNGNWENKKTLSGAGAKPRSSPTILSYEVNVIPRFNFPAILLERIIRSDLPVNLRALACRAEENSEGDQKVGN
TKDSKSMVLSNTINGVVCEKDELLQENYSRGGNSNSNGPLPPLSNELNSNWGVFGKVCRLDKRCMVDEVHLRRFDGLLENGGVHRCVVASITVKAPVREVWNVLTAYESL
PEVVPNLAISKILSRESNKVRILQEGCKGLLYMVLHARVVLDLCEQLEQEISFEQVEGDFDSLSGKWHFEQLGSHHTLLKYSVESRMHKDTFLSEALMEEVVYEDLPSNL
CAIRDSIEKRGLKNSVEAFDEGDSEEKSASHRNNQFNGSTTTAEGVSDVNGRNSCRPRPKVPGLQRDIEVLKAEVLKFILEHGQEGFMPMRKQLRMHGRVDIEKAITRMG
GFRRIASLMNLSLAYKHRKPKGYWDKFDNLQEEINRFQKSWGMDPSYMPSRKSFERAGRYDIARALEKWGGLHEVSRLLSLKVRHPNRQPSFAKDRKNDYLAVNDVDAES
KTPSKPYISQDTEKWLTGLKYLDINWVE
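
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 0. The function name `translate` and the table-building approach are illustrative choices, not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions. Assumption: the
# record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines
    can be pasted in directly. Ambiguous bases (e.g. N) are not handled
    and raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 13 codons of the CDS listed for Spg002300.
    print(translate("ATGGGCGGCCGGCGAACGGAAGAAAGCTGCTGTTTTCGT"))  # prints MGGRRTEESCCFR
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence, after joining its wrapped lines, is one way to confirm that the two listings agree.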