; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022254 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022254
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDUF4378 domain-containing protein
Genome locationChr05:22373790..22380620
RNA-Seq ExpressionHG10022254
SyntenyHG10022254
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR001854 - Ribosomal protein L29/L35
IPR025486 - Domain of unknown function DUF4378
IPR036049 - Ribosomal protein L29/L35 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]0.0e+0075.58Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +P
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRH IM EKHFSTGKVIQTSK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+QGQAH SEDL  L  SRPFLE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN
        W +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KEK+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+ETIYD++ R E      
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN

Query:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK
            LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHYA LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVK
Subjt:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK

Query:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT
        RLD+L K  HR RFDST+ VTTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCPDSAS HVDNDGCMVG DLKT
Subjt:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT

Query:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS
        TVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA                                                             TS
Subjt:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS

Query:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM
        IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIM
Subjt:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM

Query:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP
        STFKFKDSR FSYLVDVL+EASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL ELFQSFVGVPEWAKPVSRRFRP
Subjt:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP

Query:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        L+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDLGDEIDSIC+E+ERLL NELVAEF  I++
Subjt:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]0.0e+0075.46Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +P
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRH IM EKHFSTGKVIQTSK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+QGQAH SEDL  L  SRPFLE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN
        W +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KEK+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+E IYD++ R E      
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN

Query:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK
            LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHYA LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVK
Subjt:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK

Query:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT
        RLD+L K  HR RFDST+ VTTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCPDSAS HVDNDGCMVG DLKT
Subjt:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT

Query:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS
        TVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA                                                             TS
Subjt:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS

Query:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM
        IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIM
Subjt:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM

Query:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP
        STFKFKDSR FSYLVDVL+EASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL ELFQSFVGVPEWAKPVSRRFRP
Subjt:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP

Query:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        L+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDLGDEIDSIC+E+ERLL NELVAEF  I++
Subjt:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

XP_011655343.1 uncharacterized protein LOC101203594 [Cucumis sativus]0.0e+0075.81Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        MEPRQ TASVLE LMGFDE QSQH A RH KVFSDDYLQRVASIGI KKK  SRCHPFRM +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAAY P
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRH    EKHFSTGKVIQTSK FQDLP VLDSMDISPRPTRGKN +F+QAK+  SVS AHY      NDAGTKFKDRKQGQAH SEDL  L  SRPFLE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN
        W +KL FSSS   SLKGSHL+ DKCK CH+S+NGK IAKEK+RT   +L+PIKQ S+VSSILDGSRRT +  F NLHLKTSR+ETIYD+V      C+  
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN

Query:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK
        +S  LSN TAE KHSC FSVESY+ARES E+VIEE+RKT NLM  T+GRKMNEMPT+P YA LPSDLNCKPV YDFQKH CS+KEHLHSGSPLCLS KVK
Subjt:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK

Query:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT
        RLD+L K  HR RFDSTS VTTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCPDSAS H DNDGCMVG D KT
Subjt:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT

Query:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS
        TV+KKD CDQ SLNCL PRSKVVFCTQN  VK+ NQA                                                             TS
Subjt:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS

Query:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM
        IQQEGLAF+HYPSKE+DSIV+LEEA+QPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDD GEGSIC SD+I+DIM
Subjt:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM

Query:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP
        STFKFKDSR FSYLVDVL+EASLHCKNLE GSV  H QE  VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL ELFQSFVGVPEWAKPVSRRFRP
Subjt:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP

Query:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        LLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDLGDEI+SICRE+E LL NELVAEF  I++
Subjt:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]0.0e+0078.63Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        ME RQ T SVLE LMGFDERQ QH APRH +V SDDYLQRVASIGI KKK  SRCHPFRM VEE TELFNS KVENNFSRCNELWE EK DS+LSA  MP
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRHTIM EKHFSTGKVIQTSKDFQ+LP VLDSMDISPRPTRGKN IFNQAKN PSVSK HY      NDAGTK KDRK GQ HSSEDLDFL  SRP LE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSS-------RNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRK
        WRDKL FSSSS TSL+GSHL+NDKCKDC SS       +NGK IAKE +RTMEYALQPIKQ S+VSSILD SRRTT+H FVNLHLK SR  TIYDDV R 
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSS-------RNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRK

Query:  ETKCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPL
        ETK +RNSSP LSN TA+YKHSCFFSVESY+ARESRE+V EE+RKTENL+  T+GR+MNEMPTLPH+A LPSDLNCKPV +DFQKH CSNKEH HSGSPL
Subjt:  ETKCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPL

Query:  CLSQKVKRLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCM
        CLS KVKRLDQL KNSHR RFDSTS VTTRSRTRSRYEALRNTWFLKHEGPG WLQC P NRSSNKKDASEP+LKLSSKKLKIFPCPDSASDHVDND CM
Subjt:  CLSQKVKRLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCM

Query:  VGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNS
        VGDDLKT VEKKD CDQ SLNCLSPRSK VFCTQN  VK+ NQA                                                        
Subjt:  VGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNS

Query:  CRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGS
             TSIQQEGL FEHYPSKEQDSIV+LEEA+QPSPVSVLEPLFK+ETLFSSES GIN RDL+MQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC S
Subjt:  CRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGS

Query:  DEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKP
        +EI+DIMSTFKFKDSRDFSYLVDVL+EASLHCK+LETGSV  H QEHQVISP VFE LEKKFGEQ SWRRSERKLLFDRINSGLVELFQSF GVPEWAKP
Subjt:  DEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKP

Query:  VSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        VSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICRE+ERLL NELVAEF  I++
Subjt:  VSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]0.0e+0078.51Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        ME RQ T SVLE LMGFDERQ QH APRH +V SDDYLQRVASIGI KKK  SRCHPFRM VEE TELFNS KVENNFSRCNELWE EK DS+LSA  MP
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRHTIM EKHFSTGKVIQTSKDFQ+LP VLDSMDISPRPTRGKN IFNQAKN PSVSK HY      NDAGTK KDRK GQ HSSEDLDFL  SRP LE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSS-------RNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRK
        WRDKL FSSSS TSL+GSHL+NDKCKDC SS       +NGK IAKE +RTMEYALQPIKQ S+VSSILD SRRTT+H FVNLHLK SR  TIYDDV R 
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSS-------RNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRK

Query:  ETKCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPL
        ETK +RNSSP LSN TA+YKHSCFFSVESY+ARESRE+V EE+RKTENL+  T+GR+MNEMPTLPH+A LPSDLNCKPV +DFQKH CSNKEH HSGSPL
Subjt:  ETKCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPL

Query:  CLSQKVKRLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCM
        CLS KVKRLDQL KNSHR RFDSTS VTTRSRTRSRYEALRNTWFLKHEGPG WLQC P NRSSNKKDASEP+LKLSSKKLKIFPCPDSASDHVDND CM
Subjt:  CLSQKVKRLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCM

Query:  VGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNS
        VGDDLKT VEKKD CDQ SLNCLSPRSK VFCTQN  VK+ NQA                                                        
Subjt:  VGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNS

Query:  CRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGS
             TSIQQEGL FEHYPSKEQDSIV+LEEA+QPSPVSVLEPLFK+ETLFSSES GIN  DL+MQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC S
Subjt:  CRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGS

Query:  DEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKP
        +EI+DIMSTFKFKDSRDFSYLVDVL+EASLHCK+LETGSV  H QEHQVISP VFE LEKKFGEQ SWRRSERKLLFDRINSGLVELFQSF GVPEWAKP
Subjt:  DEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKP

Query:  VSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        VSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICRE+ERLL NELVAEF  I++
Subjt:  VSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

TrEMBL top hitse value%identityAlignment
A0A0A0KNN6 DUF4378 domain-containing protein0.0e+0075.81Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        MEPRQ TASVLE LMGFDE QSQH A RH KVFSDDYLQRVASIGI KKK  SRCHPFRM +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAAY P
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRH    EKHFSTGKVIQTSK FQDLP VLDSMDISPRPTRGKN +F+QAK+  SVS AHY      NDAGTKFKDRKQGQAH SEDL  L  SRPFLE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN
        W +KL FSSS   SLKGSHL+ DKCK CH+S+NGK IAKEK+RT   +L+PIKQ S+VSSILDGSRRT +  F NLHLKTSR+ETIYD+V      C+  
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN

Query:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK
        +S  LSN TAE KHSC FSVESY+ARES E+VIEE+RKT NLM  T+GRKMNEMPT+P YA LPSDLNCKPV YDFQKH CS+KEHLHSGSPLCLS KVK
Subjt:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK

Query:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT
        RLD+L K  HR RFDSTS VTTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCPDSAS H DNDGCMVG D KT
Subjt:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT

Query:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS
        TV+KKD CDQ SLNCL PRSKVVFCTQN  VK+ NQA                                                             TS
Subjt:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS

Query:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM
        IQQEGLAF+HYPSKE+DSIV+LEEA+QPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDD GEGSIC SD+I+DIM
Subjt:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM

Query:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP
        STFKFKDSR FSYLVDVL+EASLHCKNLE GSV  H QE  VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL ELFQSFVGVPEWAKPVSRRFRP
Subjt:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP

Query:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        LLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDLGDEI+SICRE+E LL NELVAEF  I++
Subjt:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

A0A1S4E497 uncharacterized protein LOC1035016590.0e+0075.46Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +P
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRH IM EKHFSTGKVIQTSK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+QGQAH SEDL  L  SRPFLE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN
        W +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KEK+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+E IYD++ R E      
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN

Query:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK
            LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHYA LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVK
Subjt:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK

Query:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT
        RLD+L K  HR RFDST+ VTTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCPDSAS HVDNDGCMVG DLKT
Subjt:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT

Query:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS
        TVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA                                                             TS
Subjt:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS

Query:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM
        IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIM
Subjt:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM

Query:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP
        STFKFKDSR FSYLVDVL+EASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL ELFQSFVGVPEWAKPVSRRFRP
Subjt:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP

Query:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        L+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDLGDEIDSIC+E+ERLL NELVAEF  I++
Subjt:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

A0A5D3C1E7 DUF4378 domain-containing protein0.0e+0075.58Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +P
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMP

Query:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE
        LTRH IM EKHFSTGKVIQTSK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+QGQAH SEDL  L  SRPFLE
Subjt:  LTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLE

Query:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN
        W +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KEK+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+ETIYD++ R E      
Subjt:  WRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN

Query:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK
            LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHYA LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVK
Subjt:  SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVK

Query:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT
        RLD+L K  HR RFDST+ VTTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCPDSAS HVDNDGCMVG DLKT
Subjt:  RLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKT

Query:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS
        TVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA                                                             TS
Subjt:  TVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS

Query:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM
        IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIM
Subjt:  IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIM

Query:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP
        STFKFKDSR FSYLVDVL+EASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL ELFQSFVGVPEWAKPVSRRFRP
Subjt:  STFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRP

Query:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV
        L+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDLGDEIDSIC+E+ERLL NELVAEF  I++
Subjt:  LLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKV

A0A6J1BX36 uncharacterized protein LOC1110062940.0e+0068.56Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFS---RCNELWEREKVDSTLSAA
        M  +Q TASVLE LMGF+E+QS H   RH +V S+ YLQR ASIG+ KKK  S+CHPFR  VEE  ELFN+L V ++F     CNEL  REK  S LS+A
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFS---RCNELWEREKVDSTLSAA

Query:  YMPLTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRP
         MPLTRH  M  +HF T K+IQTS D Q+LP V DSMDISPRPTR K YIFN  +N  S+SK+H+      NDAGTKF +RKQGQA + +D D L  S P
Subjt:  YMPLTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQGQAHSSEDLDFLMPSRP

Query:  FLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKR-TMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETK
         LEW+DKL FSSSS TSLKGSHL+++KCK  H S+NGK +AKEK+R TM   ++PIKQPS+VS ILD S R T+H+FVNL +K SR+E+IYDDV RKET+
Subjt:  FLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKR-TMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETK

Query:  CKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLS
         +   SP LSN  AEYKHSC FSVESY+AR  RE+ IEE+++T+ L+L  +G    EMP L H+A LP+DLNCKPV YDFQKH CSNKEHLHSGSPLCLS
Subjt:  CKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLS

Query:  QKVKRLDQLSKNSHRSRFDSTSVVTT-RSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVG
         K +RLDQ+SKNSHR RF S + VTT RSRTRSRYE+LRNTWFLK EG  TWLQC P ++SS+ KDAS+PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  QKVKRLDQLSKNSHRSRFDSTSVVTT-RSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVG

Query:  DDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDF-QDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSC
          L+T VEKK LC+Q+S+N LS R+ VVFC +NN     N+AIEC+LKSD+  DN SGMASN LAVKTDD   PTV+KQEP+S+S  + ET GDSSTNS 
Subjt:  DDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDF-QDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSC

Query:  RAMYTSIQQ--------EGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDD-G
        R    SIQQ        EG  FEHYP KE DSIV+LEEAYQPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLMSDSPG+NSEGH++FVSSDDD G
Subjt:  RAMYTSIQQ--------EGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDD-G

Query:  GEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFV
        GEGS C S+EI+DIMSTFKFKDSRDFSYL+DVL+EA L+C NL+ G V   GQE  VISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL+ELFQS V
Subjt:  GEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFV

Query:  GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARI
        GVPEWAKPVSRRFRPLL+ EM+EEELWILLDSQERE+NKDLVDKQFGKEIGWIDLG+EI+SICRE+ERLL  EL+AEF  I
Subjt:  GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARI

A0A6J1JSS4 uncharacterized protein LOC1114871972.7e-22154.61Show/hide
Query:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASI-GILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYM
        ME  Q +ASVLE LMGFDE QS+HRA    +  S+ YLQRVASI G  KKKS SRC PFRM +EE  E+         FS  N LWERE           
Subjt:  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASI-GILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYM

Query:  PLTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHYNDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLEWRDKL
          +    MNEKHFST ++I TSKDF DLP  +DSMDISPR TR K+  FN  +N P++SK                                        
Subjt:  PLTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHYNDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLEWRDKL

Query:  RFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRNSSPRL
                 L  +H                                                                        RK+           
Subjt:  RFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRNSSPRL

Query:  SNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQL
             EYK SCF SVESY+  ESRE+VIEE+RK  NLML  +GR MNEM  LPHYA  PSDLNCKPV YDF K  C NK+HLHSGSPLCLS K +R D+L
Subjt:  SNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQL

Query:  SKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKTTVEKK
        SK  HRSR DS   V  RSR RSRYEALRNTWFLK EG GTWLQ  PLN  SNKK+ASEP+ KLSSKKL+IFPCPDS SDHVDNDGC+VG+DLKT VEK 
Subjt:  SKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKTTVEKK

Query:  DLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTSIQQEG
         LCDQ S+N LS  S              N AIE                                  +P SLS I+ ET G SST SCRA  TSIQQ+G
Subjt:  DLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTSIQQEG

Query:  LAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKF
        L+F+ Y SKE DSIV LEE YQPSPVSVLE  FKEET  S ESSGINSR    +LELLM DSPGTNS+ H+LFVSS++DGGEGSIC SDEI DIMSTFKF
Subjt:  LAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKF

Query:  KDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRPLLNHE
        KDSRDFSYLVDV++EA LH +NLE G V  H QE  VISP+VFE LEKKFGEQ+SWRRSERKLLFDRINSGL ELFQSFVGVPEWAKPVSRRFRPLL+ E
Subjt:  KDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRPLLNHE

Query:  MIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAE
        M+E++LW LLDSQE+E NKDLVDKQFGKEIGWIDL DEI SICRE+E LL  ELVAE
Subjt:  MIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAE

SwissProt top hitse value%identityAlignment
O80626 60S ribosomal protein L35-26.4e-4790.16Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR+KSKADL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVISQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKKEMYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

Q9LZ41 60S ribosomal protein L35-43.2e-4689.34Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR KSK DL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVISQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKKEMYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

Q9M3D2 60S ribosomal protein L35-32.1e-4587.7Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR KSK+DL  QLK+LKAELA LRVAKVTGGA NK SKIKVVR SIAQVLTV SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKK+MYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

Q9M5L0 60S ribosomal protein L353.8e-4790.16Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELRQK+KA+LL QLKDLKAELALLRVAKVTGGA NK SKIKVVRLSIAQVLTVISQKQK ALREAYK KK LPLDLRPKKTRAIRRRLTK+Q S
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKKEMYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

Q9SF53 60S ribosomal protein L35-11.9e-4689.34Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR+KSK+DL  QLK+LKAELALLRVAKVTGGA NK SKIKVVR SIAQVLTV SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKKEMYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

Arabidopsis top hitse value%identityAlignment
AT2G39390.1 Ribosomal L29 family protein4.6e-4890.16Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR+KSKADL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVISQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKKEMYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

AT3G09500.1 Ribosomal L29 family protein1.3e-4789.34Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR+KSK+DL  QLK+LKAELALLRVAKVTGGA NK SKIKVVR SIAQVLTV SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKKEMYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

AT3G55170.1 Ribosomal L29 family protein1.5e-4687.7Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR KSK+DL  QLK+LKAELA LRVAKVTGGA NK SKIKVVR SIAQVLTV SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKK+MYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

AT3G55170.2 Ribosomal L29 family protein1.5e-4687.7Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR KSK+DL  QLK+LKAELA LRVAKVTGGA NK SKIKVVR SIAQVLTV SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKK+MYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV

AT5G02610.1 Ribosomal L29 family protein2.3e-4789.34Show/hide
Query:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS
        ARIKVHELR KSK DL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVISQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QAS
Subjt:  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQAS

Query:  LKTERQKKKEMYFPLRKYAIKV
        LKTER+KKKEMYFP+RKYAIKV
Subjt:  LKTERQKKKEMYFPLRKYAIKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTGTTACCGTATTCAGGAAGCAGAGAGATTGAGGAACCTAGAAATCACGGGAAGACAAATAAACTACTTGAGGTCGGTGGTTGTCTACCATAAAGTTTTGTCGGT
AGAGTTTCTCCAGATGGAACCTAGACAGTCTACAGCTAGTGTTCTTGAAGTATTGATGGGCTTTGATGAGCGGCAATCTCAGCACCGTGCTCCAAGGCATCCTAAAGTTT
TTTCTGACGATTATTTACAAAGGGTTGCATCCATTGGAATCTTGAAGAAGAAATCCTCCTCCAGATGTCATCCATTTAGGATGATAGTCGAAGAGTCAACAGAACTCTTC
AATTCTCTCAAAGTAGAAAATAACTTTAGCCGCTGCAACGAGTTATGGGAACGGGAGAAGGTCGATTCTACTTTATCAGCAGCGTATATGCCACTTACAAGACACACCAT
TATGAATGAAAAGCACTTTTCAACAGGTAAGGTGATACAGACTTCAAAGGATTTTCAAGATTTACCAGTGGTTCTTGATTCTATGGACATCTCACCAAGACCTACAAGAG
GAAAAAATTATATATTCAACCAGGCCAAAAATGAACCAAGTGTTTCAAAAGCACATTATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGGCAGGCACACTCG
TCAGAGGATCTAGATTTTTTGATGCCTTCAAGACCTTTTTTAGAGTGGAGAGATAAACTACGTTTTTCCTCCTCTTCATCAACTTCTTTGAAAGGCTCACATTTAATTAA
TGATAAATGCAAAGATTGTCATAGTTCTCGAAATGGAAAGCTTATTGCTAAAGAAAAAAAAAGGACTATGGAGTATGCACTGCAGCCCATCAAGCAACCATCTCGAGTTT
CAAGTATTCTGGATGGAAGTAGGAGAACAACGAAGCATAATTTTGTTAATTTACATTTGAAGACCTCAAGAACAGAAACCATATATGACGATGTGCGCAGAAAAGAAACC
AAGTGCAAAAGGAATTCTTCCCCCCGTTTATCTAATTGCACGGCGGAATACAAGCATTCCTGCTTCTTTTCAGTTGAGTCATACGAGGCTAGAGAATCCAGGGAGGAAGT
AATAGAAGAAGAAAGGAAGACGGAGAACTTGATGCTATTTACACGAGGTAGGAAAATGAATGAAATGCCTACACTGCCTCATTATGCAATTTTGCCCAGCGATTTGAATT
GCAAACCTGTCAATTATGATTTCCAGAAGCATGATTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCCAGAAGGTTAAGAGACTAGATCAACTC
AGTAAAAATTCCCATAGATCGAGATTTGATTCTACTTCTGTGGTGACTACAAGATCTAGAACCAGGAGCAGATACGAGGCCCTTCGAAATACATGGTTCTTAAAGCATGA
AGGTCCTGGTACTTGGCTACAATGCACGCCATTGAATAGAAGTTCCAATAAAAAGGATGCTTCAGAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAGATTTTTCCTT
GCCCTGATTCAGCAAGCGATCATGTTGACAACGATGGCTGTATGGTTGGTGATGATCTGAAGACCACAGTTGAGAAGAAAGACCTTTGTGATCAGAAATCTTTAAACTGT
TTATCACCAAGGAGCAAAGTTGTTTTCTGCACACAAAACAATGCCGTCAAACGAGAAAATCAAGCCATAGAGTGTGCTTTGAAGAGTGATTTTCAAGATAATCTTTCAGG
TATGGCTTCTAATTCATTGGCTGTAAAGACTGATGATGTAGCGAACCCTACTGTGGAGAAGCAAGAACCTAATTCATTGTCTGGCATCCTTTTAGAGACTCGTGGTGATT
CATCTACCAACTCTTGCCGTGCCATGTATACTTCTATTCAACAGGAAGGTCTTGCCTTTGAACACTACCCTAGCAAAGAGCAAGATTCTATTGTGAATTTGGAGGAGGCT
TATCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCTCAGGCATCAACAGTAGAGATTTAGTGATGCAACTTGAACT
TCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCTATATGCGGTTCTGATGAAATTAATGACA
TCATGAGCACATTCAAATTCAAAGATAGTAGAGATTTTTCATACCTTGTTGATGTATTGAACGAGGCAAGCTTACATTGTAAAAACCTGGAGACGGGTTCTGTTCCGTTA
CATGGTCAGGAACATCAGGTAATCAGCCCCACAGTCTTTGAGATCTTAGAGAAGAAGTTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAG
GATAAATTCTGGGTTAGTAGAACTCTTTCAGTCATTTGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTGCTTAACCATGAAATGATCGAGG
AAGAACTATGGATCTTGCTGGATAGCCAAGAAAGGGAAGTGAACAAGGATTTAGTAGATAAGCAGTTTGGAAAGGAGATTGGATGGATAGATCTTGGAGATGAGATTGAT
TCTATTTGTAGAGAAGTAGAGAGATTGTTGTTTAATGAGCTTGTTGCAGAGTTTGCGAGAATCAAGGTTCATGAGTTGAGGCAGAAATCGAAGGCGGATCTGTTGTTGCA
GCTTAAGGATCTTAAGGCGGAGCTTGCTCTCCTTCGAGTCGCCAAAGTCACCGGCGGTGCCCTGAACAAGCGATCCAAAATCAAGGTAGTGAGATTGTCAATCGCTCAAG
TCTTGACAGTAATTTCTCAGAAGCAGAAGTCTGCGTTGAGGGAAGCTTACAAGAAGAAGAAGCTATTGCCTCTCGATCTTCGCCCAAAGAAGACCAGGGCCATTCGTAGA
AGGCTCACCAAGTACCAGGCTTCTTTGAAAACTGAGAGACAGAAGAAGAAAGAAATGTACTTTCCATTGAGGAAATATGCTATTAAGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTACTGTTACCGTATTCAGGAAGCAGAGAGATTGAGGAACCTAGAAATCACGGGAAGACAAATAAACTACTTGAGGTCGGTGGTTGTCTACCATAAAGTTTTGTCGGT
AGAGTTTCTCCAGATGGAACCTAGACAGTCTACAGCTAGTGTTCTTGAAGTATTGATGGGCTTTGATGAGCGGCAATCTCAGCACCGTGCTCCAAGGCATCCTAAAGTTT
TTTCTGACGATTATTTACAAAGGGTTGCATCCATTGGAATCTTGAAGAAGAAATCCTCCTCCAGATGTCATCCATTTAGGATGATAGTCGAAGAGTCAACAGAACTCTTC
AATTCTCTCAAAGTAGAAAATAACTTTAGCCGCTGCAACGAGTTATGGGAACGGGAGAAGGTCGATTCTACTTTATCAGCAGCGTATATGCCACTTACAAGACACACCAT
TATGAATGAAAAGCACTTTTCAACAGGTAAGGTGATACAGACTTCAAAGGATTTTCAAGATTTACCAGTGGTTCTTGATTCTATGGACATCTCACCAAGACCTACAAGAG
GAAAAAATTATATATTCAACCAGGCCAAAAATGAACCAAGTGTTTCAAAAGCACATTATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGGCAGGCACACTCG
TCAGAGGATCTAGATTTTTTGATGCCTTCAAGACCTTTTTTAGAGTGGAGAGATAAACTACGTTTTTCCTCCTCTTCATCAACTTCTTTGAAAGGCTCACATTTAATTAA
TGATAAATGCAAAGATTGTCATAGTTCTCGAAATGGAAAGCTTATTGCTAAAGAAAAAAAAAGGACTATGGAGTATGCACTGCAGCCCATCAAGCAACCATCTCGAGTTT
CAAGTATTCTGGATGGAAGTAGGAGAACAACGAAGCATAATTTTGTTAATTTACATTTGAAGACCTCAAGAACAGAAACCATATATGACGATGTGCGCAGAAAAGAAACC
AAGTGCAAAAGGAATTCTTCCCCCCGTTTATCTAATTGCACGGCGGAATACAAGCATTCCTGCTTCTTTTCAGTTGAGTCATACGAGGCTAGAGAATCCAGGGAGGAAGT
AATAGAAGAAGAAAGGAAGACGGAGAACTTGATGCTATTTACACGAGGTAGGAAAATGAATGAAATGCCTACACTGCCTCATTATGCAATTTTGCCCAGCGATTTGAATT
GCAAACCTGTCAATTATGATTTCCAGAAGCATGATTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCCAGAAGGTTAAGAGACTAGATCAACTC
AGTAAAAATTCCCATAGATCGAGATTTGATTCTACTTCTGTGGTGACTACAAGATCTAGAACCAGGAGCAGATACGAGGCCCTTCGAAATACATGGTTCTTAAAGCATGA
AGGTCCTGGTACTTGGCTACAATGCACGCCATTGAATAGAAGTTCCAATAAAAAGGATGCTTCAGAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAGATTTTTCCTT
GCCCTGATTCAGCAAGCGATCATGTTGACAACGATGGCTGTATGGTTGGTGATGATCTGAAGACCACAGTTGAGAAGAAAGACCTTTGTGATCAGAAATCTTTAAACTGT
TTATCACCAAGGAGCAAAGTTGTTTTCTGCACACAAAACAATGCCGTCAAACGAGAAAATCAAGCCATAGAGTGTGCTTTGAAGAGTGATTTTCAAGATAATCTTTCAGG
TATGGCTTCTAATTCATTGGCTGTAAAGACTGATGATGTAGCGAACCCTACTGTGGAGAAGCAAGAACCTAATTCATTGTCTGGCATCCTTTTAGAGACTCGTGGTGATT
CATCTACCAACTCTTGCCGTGCCATGTATACTTCTATTCAACAGGAAGGTCTTGCCTTTGAACACTACCCTAGCAAAGAGCAAGATTCTATTGTGAATTTGGAGGAGGCT
TATCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCTCAGGCATCAACAGTAGAGATTTAGTGATGCAACTTGAACT
TCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCTATATGCGGTTCTGATGAAATTAATGACA
TCATGAGCACATTCAAATTCAAAGATAGTAGAGATTTTTCATACCTTGTTGATGTATTGAACGAGGCAAGCTTACATTGTAAAAACCTGGAGACGGGTTCTGTTCCGTTA
CATGGTCAGGAACATCAGGTAATCAGCCCCACAGTCTTTGAGATCTTAGAGAAGAAGTTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAG
GATAAATTCTGGGTTAGTAGAACTCTTTCAGTCATTTGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTGCTTAACCATGAAATGATCGAGG
AAGAACTATGGATCTTGCTGGATAGCCAAGAAAGGGAAGTGAACAAGGATTTAGTAGATAAGCAGTTTGGAAAGGAGATTGGATGGATAGATCTTGGAGATGAGATTGAT
TCTATTTGTAGAGAAGTAGAGAGATTGTTGTTTAATGAGCTTGTTGCAGAGTTTGCGAGAATCAAGGTTCATGAGTTGAGGCAGAAATCGAAGGCGGATCTGTTGTTGCA
GCTTAAGGATCTTAAGGCGGAGCTTGCTCTCCTTCGAGTCGCCAAAGTCACCGGCGGTGCCCTGAACAAGCGATCCAAAATCAAGGTAGTGAGATTGTCAATCGCTCAAG
TCTTGACAGTAATTTCTCAGAAGCAGAAGTCTGCGTTGAGGGAAGCTTACAAGAAGAAGAAGCTATTGCCTCTCGATCTTCGCCCAAAGAAGACCAGGGCCATTCGTAGA
AGGCTCACCAAGTACCAGGCTTCTTTGAAAACTGAGAGACAGAAGAAGAAAGAAATGTACTTTCCATTGAGGAAATATGCTATTAAGGTGTAG
Protein sequenceShow/hide protein sequence
MYCYRIQEAERLRNLEITGRQINYLRSVVVYHKVLSVEFLQMEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELF
NSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHYNDAGTKFKDRKQGQAHS
SEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKET
KCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQL
SKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNC
LSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEA
YQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPL
HGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEID
SICREVERLLFNELVAEFARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRR
RLTKYQASLKTERQKKKEMYFPLRKYAIKV