; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G072170 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G072170
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionHomeodomain-like superfamily protein
Genome locationCicolChr04:28812661..28815577
RNA-Seq ExpressionCcUC04G072170
SyntenyCcUC04G072170
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143433.1 uncharacterized protein LOC101221631 [Cucumis sativus]1.9e-23179.82Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS
        MP TNS ASSVDIKTLRRSPRFLPST QQ+F TT RSLRFL++NEISSPT PTF  +Q PIRQVHSSHAS  PSKNVSLKT KS+ VNTPKRTSK GVVS
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS

Query:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE
        SKN  S TGSKK S FENGF EKR PRRSPRLS APKIDNALEG  AKVSKSSI+SGG SNDLK+PSP VRRSPRFSNGVGGN+S G S  FSGQQ  LE
Subjt:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE

Query:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY
        KSSRKRE  SGS R  G L D+N DASVSSHG+KVA  E++KGNS DHE IATKAE  QVV+GEMEKKS   RKRKREDGVVGIRQGWTKEQEV+LQRAY
Subjt:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY

Query:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL
        YAAKPTP FWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPR R RT+S+K S +ELL  SE +LLN+DGAKSRKP  KSQKSHNAQKAVRYLLEKNFQGA+
Subjt:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL

Query:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI
        N EAD FSQLEPNINLSN TPLPSKQLSSI DL GNQGFLH RSLSNHKKPLSRFS+S ER VVSPPVLKQVKNR LHEKYIDQLH REAKRKSMSKCR 
Subjt:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI

Query:  SCISEAKGLKEVI--TRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE
        SCIS+  G  + I  TRTNDL+AAKNALISDARDAI QL+HL+ NA N++  F +DEDF DN DYDDED+
Subjt:  SCISEAKGLKEVI--TRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE

XP_008440503.1 PREDICTED: uncharacterized protein LOC103484910 [Cucumis melo]3.9e-23280.14Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS
        MP  NS ASSVDIKTLRRSPRFL ST QQ+ PTTRRSLRFL+KNEISSPT PTFR + SPIRQVHSSHAS  PS +VSLKT KS+ VNTPKR SK GVVS
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS

Query:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE
        SKN  S TGSKK S FEN F EK  PRRSPRLS APKIDNALEG   KVSKSSI+SGG  NDLK+PSP VRRSPRFSNGVGGN+SIGKS  FSGQQ  LE
Subjt:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE

Query:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY
        KSSRKRE  +GS R  G L D+N DASVSSHGEKVA GE+++GNS D E IATKAE TQVV+GEMEKKS   RKRKREDGVVGIRQGWTKEQEVALQRAY
Subjt:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY

Query:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL
        Y AKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPR R RT+S+KSS  ELL  SE +LLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGA+
Subjt:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL

Query:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI
        N EAD FSQLEPNINLSNHTPLPSKQLSS+ DL GNQGFLH RSLSNHKKPLSRFS+S ER VVSPPVLKQVKNR LHEKYIDQLHCREAKRKSMSKCR 
Subjt:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI

Query:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE
        SCIS+ +G K +  TRTNDL+AAKNALISDARDAI Q +HL+ANATN++ DF + EDF  N DYD+ED+
Subjt:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE

XP_022132864.1 uncharacterized protein LOC111005607 [Momordica charantia]8.4e-20371.85Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTP---QQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG
        MP TNS  SSV+IKTLRRSPRF   T    Q++FP TRRSLRFLQKN+IS+PT P  R S S IRQVHSSHA   P +NVSLKT KS+L NT  ++SK G
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTP---QQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG

Query:  VVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVS-KSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQ
        VVSSKN+ S TGSKKS++FENGF   R+PRRSPRLS APKI+NALEG  AKVS  SSITSG RS+DL SPSP VRRSPR +NGVG +QS GKS++FS QQ
Subjt:  VVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVS-KSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQ

Query:  DALEKSSRKR-EKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVA
        DALE+  R R +KSSGS +K+G LH  N D SVSS G+ VAEGE+RKGNS D E    K+  TQVV+GEM+KKS ARRKRKRE+ VVGIRQGWT+EQE A
Subjt:  DALEKSSRKR-EKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVA

Query:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKN
        L RAYYAAKPTP+FWKKVSKLVPGKSAQDCFDKVHS+HMTPPQPR RSR +S+KSSQIELLS SE KLLNLDGAK+RK SRK+QKSHNAQK VR+LLEKN
Subjt:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKN

Query:  FQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM
        +QGAL+CEADFFS LEPNINLS+ +P PSK+L S K LLGNQ FLHERSL NHKKP SRFSSS E VVVSPPVLKQVKNR+LHEKYIDQLH REAKRKS+
Subjt:  FQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM

Query:  SKCRISCISEAKGLKE-VITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDE
        S+C  +C  E K LKE    RTNDL+AAKNALISDAR+AIHQL+ L A++T++ +DF++  D  DN DY+ E
Subjt:  SKCRISCISEAKGLKE-VITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDE

XP_023517243.1 uncharacterized protein LOC111781067 [Cucurbita pepo subsp. pepo]4.3e-16748.9Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFL-PSTPQQKF--PTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG
        MP +NS ASSVDIK LRRSPR L P+ P  K   P+TRRSLRFLQK +IS PT P    S S IRQVH SH    PSKNVS KT K +LVNTPK++ KP 
Subjt:  MPRTNSTASSVDIKTLRRSPRFL-PSTPQQKF--PTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG

Query:  VVSSKNDKSKTGSKKSSSFENGFVEKRVP-----------------------------------------------------------------------
        VVSS+N  S +GSKKSS+FENGF   + P                                                                       
Subjt:  VVSSKNDKSKTGSKKSSSFENGFVEKRVP-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------RRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLK----------------------------SPSPKV-------------------
                 RRS RLSYAPKIDNALEG  AKVSKSSIT GG S DLK                            S +PK+                   
Subjt:  ---------RRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLK----------------------------SPSPKV-------------------

Query:  -----------------------------RRS------------------------------------PRFSNGVGGNQSIGKSQIFSGQQDALEKSSRK
                                     RRS                                    PR +N V G+QSI K QI  GQQDALEKSSRK
Subjt:  -----------------------------RRS------------------------------------PRFSNGVGGNQSIGKSQIFSGQQDALEKSSRK

Query:  REKSSGSCRKRGPLHDMNADASVSSHGEKVAE-GEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAK
        RE+S  S +K   L+  N     SSH E V E GE+RKGNS DHEGIAT+   T+VV GEMEKKS A RKRKREDGVVGIR GWTKEQE ALQRAYYAAK
Subjt:  REKSSGSCRKRGPLHDMNADASVSSHGEKVAE-GEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAK

Query:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEA
        PTPQFWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPR RSRTQSSKS QIEL SLSE KLLN +GAKSRKP RK+Q+S NAQK VRYLLEK FQ A++ EA
Subjt:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEA

Query:  DFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCIS
        D FSQLEPN N SNH+PLPSKQLS  KDL GNQGFLHERSLSNHKKPLSRFSSS ER VVSPPVLKQVKN+ALHEKYIDQLHCREAKRKSM+KC   CIS
Subjt:  DFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCIS

Query:  EAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE
        E KGLKEV   RTNDL+AAKNALISDARDAIHQL+HL+AN  ND  +F  D++ YDN D ++EDE
Subjt:  EAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE

XP_038882568.1 uncharacterized protein LOC120073795 [Benincasa hispida]1.0e-25685.44Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS
        MP T+STASSVDIKTLRRSPRFL STPQQ FPTTRRSLRFLQKNEISSPTPPTFR +QSPIRQVHSSHAS GPSK+VSLKT KSILVNTPKRTSKPGVVS
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS

Query:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE
        SKN+ S TGSKKSS+FENGF  KRVPRRSPRLSYAPKID+ALE     VSKSSI+SG  S+DLK+PSPKVRRSPRFSNGVGGNQ+IGKSQ FSGQQD +E
Subjt:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE

Query:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY
        KSSRKR+KSSG  RK   LHD+N DAS++SHGEKVAE EKRKGNS DHEGIATKAE T+VV+GEMEKKS ARRKRKREDGVV IRQGWTKEQEVALQRAY
Subjt:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY

Query:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL
        YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPR RSRT+ +KSS IELLSLSE KLLNLDG KSRKPSRKSQK+HNAQKAVRYLLEKNF+GAL
Subjt:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL

Query:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI
        NCEAD FSQLEPNINLSNHTPLPS+QLSSIKDL G+QGFLHERSLSNHKKPLSRFSSSA+RVV+SPPVLKQVKNRALHEKYIDQLHCREAKRKS+SKCR 
Subjt:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI

Query:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDES
        SCISE K LKE   TRTNDL+AAKNALISDARDAIHQL+HL+ANA ++V+DF+ DED YDN DYD+ED++
Subjt:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDES

TrEMBL top hitse value%identityAlignment
A0A0A0KIP3 Uncharacterized protein9.3e-23279.82Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS
        MP TNS ASSVDIKTLRRSPRFLPST QQ+F TT RSLRFL++NEISSPT PTF  +Q PIRQVHSSHAS  PSKNVSLKT KS+ VNTPKRTSK GVVS
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS

Query:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE
        SKN  S TGSKK S FENGF EKR PRRSPRLS APKIDNALEG  AKVSKSSI+SGG SNDLK+PSP VRRSPRFSNGVGGN+S G S  FSGQQ  LE
Subjt:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE

Query:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY
        KSSRKRE  SGS R  G L D+N DASVSSHG+KVA  E++KGNS DHE IATKAE  QVV+GEMEKKS   RKRKREDGVVGIRQGWTKEQEV+LQRAY
Subjt:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY

Query:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL
        YAAKPTP FWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPR R RT+S+K S +ELL  SE +LLN+DGAKSRKP  KSQKSHNAQKAVRYLLEKNFQGA+
Subjt:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL

Query:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI
        N EAD FSQLEPNINLSN TPLPSKQLSSI DL GNQGFLH RSLSNHKKPLSRFS+S ER VVSPPVLKQVKNR LHEKYIDQLH REAKRKSMSKCR 
Subjt:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI

Query:  SCISEAKGLKEVI--TRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE
        SCIS+  G  + I  TRTNDL+AAKNALISDARDAI QL+HL+ NA N++  F +DEDF DN DYDDED+
Subjt:  SCISEAKGLKEVI--TRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE

A0A1S3B194 uncharacterized protein LOC1034849101.9e-23280.14Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS
        MP  NS ASSVDIKTLRRSPRFL ST QQ+ PTTRRSLRFL+KNEISSPT PTFR + SPIRQVHSSHAS  PS +VSLKT KS+ VNTPKR SK GVVS
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS

Query:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE
        SKN  S TGSKK S FEN F EK  PRRSPRLS APKIDNALEG   KVSKSSI+SGG  NDLK+PSP VRRSPRFSNGVGGN+SIGKS  FSGQQ  LE
Subjt:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE

Query:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY
        KSSRKRE  +GS R  G L D+N DASVSSHGEKVA GE+++GNS D E IATKAE TQVV+GEMEKKS   RKRKREDGVVGIRQGWTKEQEVALQRAY
Subjt:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY

Query:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL
        Y AKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPR R RT+S+KSS  ELL  SE +LLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGA+
Subjt:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL

Query:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI
        N EAD FSQLEPNINLSNHTPLPSKQLSS+ DL GNQGFLH RSLSNHKKPLSRFS+S ER VVSPPVLKQVKNR LHEKYIDQLHCREAKRKSMSKCR 
Subjt:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI

Query:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE
        SCIS+ +G K +  TRTNDL+AAKNALISDARDAI Q +HL+ANATN++ DF + EDF  N DYD+ED+
Subjt:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE

A0A5D3CMV2 Uncharacterized protein1.9e-23280.14Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS
        MP  NS ASSVDIKTLRRSPRFL ST QQ+ PTTRRSLRFL+KNEISSPT PTFR + SPIRQVHSSHAS  PS +VSLKT KS+ VNTPKR SK GVVS
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVS

Query:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE
        SKN  S TGSKK S FEN F EK  PRRSPRLS APKIDNALEG   KVSKSSI+SGG  NDLK+PSP VRRSPRFSNGVGGN+SIGKS  FSGQQ  LE
Subjt:  SKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALE

Query:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY
        KSSRKRE  +GS R  G L D+N DASVSSHGEKVA GE+++GNS D E IATKAE TQVV+GEMEKKS   RKRKREDGVVGIRQGWTKEQEVALQRAY
Subjt:  KSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAY

Query:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL
        Y AKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPR R RT+S+KSS  ELL  SE +LLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGA+
Subjt:  YAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGAL

Query:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI
        N EAD FSQLEPNINLSNHTPLPSKQLSS+ DL GNQGFLH RSLSNHKKPLSRFS+S ER VVSPPVLKQVKNR LHEKYIDQLHCREAKRKSMSKCR 
Subjt:  NCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRI

Query:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE
        SCIS+ +G K +  TRTNDL+AAKNALISDARDAI Q +HL+ANATN++ DF + EDF  N DYD+ED+
Subjt:  SCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE

A0A6J1BUA6 uncharacterized protein LOC1110056074.1e-20371.85Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFLPSTP---QQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG
        MP TNS  SSV+IKTLRRSPRF   T    Q++FP TRRSLRFLQKN+IS+PT P  R S S IRQVHSSHA   P +NVSLKT KS+L NT  ++SK G
Subjt:  MPRTNSTASSVDIKTLRRSPRFLPSTP---QQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG

Query:  VVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVS-KSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQ
        VVSSKN+ S TGSKKS++FENGF   R+PRRSPRLS APKI+NALEG  AKVS  SSITSG RS+DL SPSP VRRSPR +NGVG +QS GKS++FS QQ
Subjt:  VVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVS-KSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQ

Query:  DALEKSSRKR-EKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVA
        DALE+  R R +KSSGS +K+G LH  N D SVSS G+ VAEGE+RKGNS D E    K+  TQVV+GEM+KKS ARRKRKRE+ VVGIRQGWT+EQE A
Subjt:  DALEKSSRKR-EKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVA

Query:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKN
        L RAYYAAKPTP+FWKKVSKLVPGKSAQDCFDKVHS+HMTPPQPR RSR +S+KSSQIELLS SE KLLNLDGAK+RK SRK+QKSHNAQK VR+LLEKN
Subjt:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKN

Query:  FQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM
        +QGAL+CEADFFS LEPNINLS+ +P PSK+L S K LLGNQ FLHERSL NHKKP SRFSSS E VVVSPPVLKQVKNR+LHEKYIDQLH REAKRKS+
Subjt:  FQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM

Query:  SKCRISCISEAKGLKE-VITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDE
        S+C  +C  E K LKE    RTNDL+AAKNALISDAR+AIHQL+ L A++T++ +DF++  D  DN DY+ E
Subjt:  SKCRISCISEAKGLKE-VITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDE

A0A6J1KQ47 uncharacterized protein LOC1114972414.0e-16648.21Show/hide
Query:  MPRTNSTASSVDIKTLRRSPRFL-PSTP--QQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG
        MP +NS ASSVDIK LRRSPR L P+ P  Q + P+TRRSLRFLQK +IS PT P  R S S IRQVH SH    PSKNVS KT K +LVNTPK++ KP 
Subjt:  MPRTNSTASSVDIKTLRRSPRFL-PSTP--QQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPG

Query:  VVSSKNDKSKTGSKKSSSFENGFVEKRVP-----------------------------------------------------------------------
        VVSS+N  S +G KKSS+F NGF   + P                                                                       
Subjt:  VVSSKNDKSKTGSKKSSSFENGFVEKRVP-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------RRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLK----------------------------SPSPKV-------------------
                 RRS RLSYAPKIDNA EG  AKVSK+SIT GGRS DLK                            S +PK+                   
Subjt:  ---------RRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLK----------------------------SPSPKV-------------------

Query:  -----------------------------RRS------------------------------------PRFSNGVGGNQSIGKSQIFSGQQDALEKSSRK
                                     RRS                                    PR +N V G+QSI KSQ+  GQQDALEKSSRK
Subjt:  -----------------------------RRS------------------------------------PRFSNGVGGNQSIGKSQIFSGQQDALEKSSRK

Query:  REKSSGSCRKRGPLHDMNADASVSSHGEKVAE-GEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAK
        RE+S    +K   L+  N     +SH E + E GE+RKGNS DHEGIAT+   T+VV GEMEKKS A RKRKREDGVVGIRQGWTKEQE ALQRAYYAAK
Subjt:  REKSSGSCRKRGPLHDMNADASVSSHGEKVAE-GEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAK

Query:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEA
        PTPQFWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPR RSRTQ SKS QIEL SLSE KLLN +GAKSRKP RK+Q+S NAQK VRYLLEK FQ A++ EA
Subjt:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEA

Query:  DFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCIS
        D FSQLEPN+N SNH+PLPSKQLS  KDL GNQGFLHERSLSNHKKPLSRFS+S ER VVSP VLKQVKN+ALHEKYIDQLHCREAKRKSM+KC   CIS
Subjt:  DFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCIS

Query:  EAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE
        E KGLKEV   RTNDL+AAKNALISDARDAIHQL+H++AN  ND  DF  D+D YDN D ++EDE
Subjt:  EAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNNDEDFYDNDDYDDEDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G53440.1 Homeodomain-like superfamily protein3.3e-4034.31Show/hide
Query:  KTHKSILVNTPKRTSKPGVVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNG
        KT + +L    KR     V SS +    + SKK +  ++GF      RRS RLS     +   E PK   + SS T   +S+   S S  +RRSPRFS+G
Subjt:  KTHKSILVNTPKRTSKPGVVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNG

Query:  VG-----GNQSIGK-----------SQIFSGQQ----DALEKSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQ
         G      + SIGK           ++  S ++    D L+   R R +     +K     D N +               R     + E    KA+  +
Subjt:  VG-----GNQSIGK-----------SQIFSGQQ----DALEKSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQ

Query:  VVNGEME----KKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIEL
            E E    ++ G + K K E G      GWT+E E+ALQ AY   KP+P FWKKV+K+VPGKSAQ+CFD+V+S  +TP Q + R  T++  S+ I  
Subjt:  VVNGEME----KKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIEL

Query:  LSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRF
         SLS SKLL  +  K++   R++  S   +K VR+LLEK          D FS LEPN   SN    P ++  S+  +L              + P+   
Subjt:  LSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRF

Query:  SSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNND
        SS     +VSPPVLKQVKN+ALHEKYID LH R+AKRK+ S    + ++  + ++ + I + + ++AAK+AL  D +DAI +L+ L+A  ++     ++ 
Subjt:  SSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNND

Query:  EDFYDNDDYDDED
        E  YD+ + D+ED
Subjt:  EDFYDNDDYDDED

AT3G53440.2 Homeodomain-like superfamily protein3.3e-4034.31Show/hide
Query:  KTHKSILVNTPKRTSKPGVVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNG
        KT + +L    KR     V SS +    + SKK +  ++GF      RRS RLS     +   E PK   + SS T   +S+   S S  +RRSPRFS+G
Subjt:  KTHKSILVNTPKRTSKPGVVSSKNDKSKTGSKKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNG

Query:  VG-----GNQSIGK-----------SQIFSGQQ----DALEKSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQ
         G      + SIGK           ++  S ++    D L+   R R +     +K     D N +               R     + E    KA+  +
Subjt:  VG-----GNQSIGK-----------SQIFSGQQ----DALEKSSRKREKSSGSCRKRGPLHDMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQ

Query:  VVNGEME----KKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIEL
            E E    ++ G + K K E G      GWT+E E+ALQ AY   KP+P FWKKV+K+VPGKSAQ+CFD+V+S  +TP Q + R  T++  S+ I  
Subjt:  VVNGEME----KKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHMTPPQPRSRSRTQSSKSSQIEL

Query:  LSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRF
         SLS SKLL  +  K++   R++  S   +K VR+LLEK          D FS LEPN   SN    P ++  S+  +L              + P+   
Subjt:  LSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFLHERSLSNHKKPLSRF

Query:  SSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNND
        SS     +VSPPVLKQVKN+ALHEKYID LH R+AKRK+ S    + ++  + ++ + I + + ++AAK+AL  D +DAI +L+ L+A  ++     ++ 
Subjt:  SSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCISEAKGLKEV-ITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVIDFNND

Query:  EDFYDNDDYDDED
        E  YD+ + D+ED
Subjt:  EDFYDNDDYDDED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCGCACAAACTCCACCGCCAGCTCCGTGGACATCAAAACTCTTCGAAGATCTCCAAGATTTCTCCCTTCTACTCCACAACAGAAATTTCCCACTACGCGAAGATC
TCTGAGGTTTCTGCAAAAGAATGAAATTTCATCTCCCACACCCCCGACCTTCCGCCTCTCTCAGTCCCCAATCCGGCAGGTACACTCTTCCCACGCTTCTCCCGGACCTT
CTAAGAATGTTTCTCTTAAAACCCATAAATCTATTCTCGTAAATACCCCGAAAAGAACGAGTAAACCTGGTGTTGTTTCGAGTAAAAACGACAAGTCGAAAACAGGGTCG
AAAAAGTCTTCGTCATTTGAAAATGGGTTTGTGGAAAAACGAGTTCCAAGACGATCTCCCAGGCTATCCTATGCGCCGAAAATTGACAATGCGCTCGAGGGTCCGAAGGC
GAAGGTTTCCAAGAGTTCAATTACTTCCGGAGGGCGATCGAATGATTTGAAAAGTCCAAGCCCTAAGGTGAGAAGATCTCCAAGATTCAGTAATGGAGTCGGGGGGAATC
AAAGTATCGGCAAGTCCCAAATTTTTTCGGGTCAGCAGGATGCTCTCGAGAAGAGTAGCCGAAAACGGGAGAAGTCTAGTGGTTCGTGTAGAAAAAGGGGGCCATTACAT
GATATGAATGCGGATGCAAGTGTGAGCTCCCATGGGGAGAAAGTAGCAGAAGGGGAGAAGAGGAAGGGAAATTCTACTGATCATGAGGGTATTGCAACAAAAGCGGAGGC
AACACAAGTGGTTAATGGAGAAATGGAAAAGAAGTCAGGGGCTAGAAGAAAAAGAAAGCGAGAGGATGGCGTGGTTGGGATTAGACAAGGGTGGACTAAGGAACAGGAAG
TCGCATTGCAGAGAGCTTATTATGCTGCAAAGCCCACACCGCAATTTTGGAAGAAGGTTTCCAAATTGGTGCCTGGAAAGTCTGCCCAAGATTGCTTTGATAAAGTTCAT
TCCGACCATATGACCCCTCCTCAACCTCGATCTCGATCCAGAACACAGAGCTCAAAATCATCTCAAATTGAACTCTTGTCCCTTTCAGAGAGTAAACTTCTAAATCTTGA
TGGTGCAAAGTCTAGAAAGCCTAGTCGCAAGAGTCAGAAAAGCCATAATGCGCAGAAAGCTGTGAGATATTTGTTAGAGAAGAATTTTCAGGGGGCCCTTAACTGTGAGG
CCGATTTCTTCTCACAACTTGAGCCAAATATTAATCTGTCTAACCATACCCCTCTTCCAAGTAAACAACTCTCTAGCATCAAGGATTTACTGGGAAACCAAGGATTCCTC
CACGAGAGATCTTTGTCGAATCACAAAAAACCGCTTTCAAGATTTAGCAGCTCAGCTGAGAGAGTTGTCGTTAGTCCACCAGTACTGAAACAGGTGAAGAACAGGGCCTT
GCACGAGAAGTATATCGACCAGTTACATTGTAGGGAAGCAAAGAGAAAATCAATGTCAAAATGCAGAATAAGTTGCATCTCCGAGGCGAAGGGTTTAAAGGAAGTCATTA
CAAGAACTAATGATCTTAAAGCTGCTAAAAATGCCTTGATTTCAGATGCAAGGGATGCGATCCATCAGTTACGACACTTGAAAGCCAATGCAACAAACGATGTTATTGAC
TTCAACAACGATGAAGATTTCTACGACAATGATGATTACGATGATGAAGATGAATCGTAA
mRNA sequenceShow/hide mRNA sequence
ATTTATTAGTTTCTTTTATTTTTTAATCTTTTTTTTAAAAAAATGAGAGCGCGTTTTCCTCAACATTCAAACCAAAGATGTCTGCAAAGTGTTCGTGAGAATGCCCCGCA
CAAACTCCACCGCCAGCTCCGTGGACATCAAAACTCTTCGAAGATCTCCAAGATTTCTCCCTTCTACTCCACAACAGAAATTTCCCACTACGCGAAGATCTCTGAGGTTT
CTGCAAAAGAATGAAATTTCATCTCCCACACCCCCGACCTTCCGCCTCTCTCAGTCCCCAATCCGGCAGGTACACTCTTCCCACGCTTCTCCCGGACCTTCTAAGAATGT
TTCTCTTAAAACCCATAAATCTATTCTCGTAAATACCCCGAAAAGAACGAGTAAACCTGGTGTTGTTTCGAGTAAAAACGACAAGTCGAAAACAGGGTCGAAAAAGTCTT
CGTCATTTGAAAATGGGTTTGTGGAAAAACGAGTTCCAAGACGATCTCCCAGGCTATCCTATGCGCCGAAAATTGACAATGCGCTCGAGGGTCCGAAGGCGAAGGTTTCC
AAGAGTTCAATTACTTCCGGAGGGCGATCGAATGATTTGAAAAGTCCAAGCCCTAAGGTGAGAAGATCTCCAAGATTCAGTAATGGAGTCGGGGGGAATCAAAGTATCGG
CAAGTCCCAAATTTTTTCGGGTCAGCAGGATGCTCTCGAGAAGAGTAGCCGAAAACGGGAGAAGTCTAGTGGTTCGTGTAGAAAAAGGGGGCCATTACATGATATGAATG
CGGATGCAAGTGTGAGCTCCCATGGGGAGAAAGTAGCAGAAGGGGAGAAGAGGAAGGGAAATTCTACTGATCATGAGGGTATTGCAACAAAAGCGGAGGCAACACAAGTG
GTTAATGGAGAAATGGAAAAGAAGTCAGGGGCTAGAAGAAAAAGAAAGCGAGAGGATGGCGTGGTTGGGATTAGACAAGGGTGGACTAAGGAACAGGAAGTCGCATTGCA
GAGAGCTTATTATGCTGCAAAGCCCACACCGCAATTTTGGAAGAAGGTTTCCAAATTGGTGCCTGGAAAGTCTGCCCAAGATTGCTTTGATAAAGTTCATTCCGACCATA
TGACCCCTCCTCAACCTCGATCTCGATCCAGAACACAGAGCTCAAAATCATCTCAAATTGAACTCTTGTCCCTTTCAGAGAGTAAACTTCTAAATCTTGATGGTGCAAAG
TCTAGAAAGCCTAGTCGCAAGAGTCAGAAAAGCCATAATGCGCAGAAAGCTGTGAGATATTTGTTAGAGAAGAATTTTCAGGGGGCCCTTAACTGTGAGGCCGATTTCTT
CTCACAACTTGAGCCAAATATTAATCTGTCTAACCATACCCCTCTTCCAAGTAAACAACTCTCTAGCATCAAGGATTTACTGGGAAACCAAGGATTCCTCCACGAGAGAT
CTTTGTCGAATCACAAAAAACCGCTTTCAAGATTTAGCAGCTCAGCTGAGAGAGTTGTCGTTAGTCCACCAGTACTGAAACAGGTGAAGAACAGGGCCTTGCACGAGAAG
TATATCGACCAGTTACATTGTAGGGAAGCAAAGAGAAAATCAATGTCAAAATGCAGAATAAGTTGCATCTCCGAGGCGAAGGGTTTAAAGGAAGTCATTACAAGAACTAA
TGATCTTAAAGCTGCTAAAAATGCCTTGATTTCAGATGCAAGGGATGCGATCCATCAGTTACGACACTTGAAAGCCAATGCAACAAACGATGTTATTGACTTCAACAACG
ATGAAGATTTCTACGACAATGATGATTACGATGATGAAGATGAATCGTAATACATTCCATTCCTTTATCTTCAGCTAAATGCCTTAGATAGCATGTATCTGTGTATAATG
AGATTTTTTTCCTTTCCTTCTTGTGTAGTTACTAACTTTCAGAAGTTTTAGTCTATGTTATCCGCAATTCTGGAAGCCTTTCTGTCTTTGCTTATGCTTATACAGTTGCT
TATAGAATGTTGTAAATTGTAATCATAAGCATAAACTGATTTGGGTTAATCAGAAGGGAAGGATGACATGAACAAGTGATACTATAATTTGAAGCCAACTAAGAAAAGAG
AG
Protein sequenceShow/hide protein sequence
MPRTNSTASSVDIKTLRRSPRFLPSTPQQKFPTTRRSLRFLQKNEISSPTPPTFRLSQSPIRQVHSSHASPGPSKNVSLKTHKSILVNTPKRTSKPGVVSSKNDKSKTGS
KKSSSFENGFVEKRVPRRSPRLSYAPKIDNALEGPKAKVSKSSITSGGRSNDLKSPSPKVRRSPRFSNGVGGNQSIGKSQIFSGQQDALEKSSRKREKSSGSCRKRGPLH
DMNADASVSSHGEKVAEGEKRKGNSTDHEGIATKAEATQVVNGEMEKKSGARRKRKREDGVVGIRQGWTKEQEVALQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVH
SDHMTPPQPRSRSRTQSSKSSQIELLSLSESKLLNLDGAKSRKPSRKSQKSHNAQKAVRYLLEKNFQGALNCEADFFSQLEPNINLSNHTPLPSKQLSSIKDLLGNQGFL
HERSLSNHKKPLSRFSSSAERVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCRISCISEAKGLKEVITRTNDLKAAKNALISDARDAIHQLRHLKANATNDVID
FNNDEDFYDNDDYDDEDES