; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh13G006350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh13G006350
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionU-box domain-containing protein 7-like
Genome locationCmo_Chr13:6812540..6815306
RNA-Seq ExpressionCmoCh13G006350
SyntenyCmoCh13G006350
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583966.1 U-box domain-containing protein 6, partial [Cucurbita argyrosperma subsp. sororia]1.4e-21597.62Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MPPSLFP+SYIKI FLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIR+REESYIRQYDPASSVLQ TVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
        VDLGVIPALVAMADSDQLAVRALI+LAND LLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP

Query:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
        ETKAFCLATLFNISTVL+NAETLISNGVVPTLLRFSSVKE SEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKC+ELSADIIMILGHQSW
Subjt:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW

Query:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
        AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSE+EVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
Subjt:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC

Query:  YSPTIRRTLVSSISSKSSPF
        YSPTIRRTLVSSISSKSSPF
Subjt:  YSPTIRRTLVSSISSKSSPF

KAG7019585.1 U-box domain-containing protein 6, partial [Cucurbita argyrosperma subsp. argyrosperma]4.0e-20794.76Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MPPSLFP+SYIKI FLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIR+REESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
        VDLGVIPALVAMADSDQ               N TVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP

Query:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
        ETKAFCLATLFNISTVL+NAETLISNGVVPTLLRFSSVKE SEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKC+ELSADIIMILGHQSW
Subjt:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW

Query:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
        AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
Subjt:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC

Query:  YSPTIRRTLVSSISSKSSPF
        YSPTIRRTLVSSISSKSSPF
Subjt:  YSPTIRRTLVSSISSKSSPF

XP_022927021.1 uncharacterized protein LOC111433976 [Cucurbita moschata]2.9e-221100Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
        VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP

Query:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
        ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
Subjt:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW

Query:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
        AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
Subjt:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC

Query:  YSPTIRRTLVSSISSKSSPF
        YSPTIRRTLVSSISSKSSPF
Subjt:  YSPTIRRTLVSSISSKSSPF

XP_023001724.1 uncharacterized protein LOC111495775 [Cucurbita maxima]3.0e-21095Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPS+ISRPEIR+REE+ IRQYD ASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
        VDLGVIPALVAMADSDQLAVRALIELANDTLLNK VMVEEGILSKLPKNTQFATMDSSSFEF ELL SLSCLANTQLFLASTEPV+SYLLTILN+SKSSP
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP

Query:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
        ET+AFCLATLFNISTVLENAETLISNGVVPTLLRFSSV+E SEKALPTLANLAVTSK KQALESNS FAEILVEILTWEEKPKCQELSADIIMILGHQSW
Subjt:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW

Query:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
        AQRERLGESCIAPALLGLALLGSSLAQ+RALKLLQWFKDEREARVGPHSGPQR GIVAVGSGLS++EVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
Subjt:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC

Query:  YSPTIRRTLVSSISSKSSPF
        YSPT+RR LVSSISSKSSPF
Subjt:  YSPTIRRTLVSSISSKSSPF

XP_023519234.1 uncharacterized protein LOC111782668 [Cucurbita pepo subsp. pepo]2.2e-21396.19Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIR+REESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
        VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPV+SYLLT+LNNSKSS 
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP

Query:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
        +TKAFCL TLFNISTVL+NAETLISNGVVPTLLRFSSV+E SEKALPTLANLAVTSKGKQALESNS    IL+EILTWEEKPKCQELSADIIMILGHQSW
Subjt:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW

Query:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
        AQRERLGESCIAPALLGLALLGSSLAQ+RALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSE+EVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
Subjt:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC

Query:  YSPTIRRTLVSSISSKSSPF
        YSPTIRRTLVSSISSKSSPF
Subjt:  YSPTIRRTLVSSISSKSSPF

TrEMBL top hitse value%identityAlignment
A0A6J1CE71 U-box domain-containing protein 7-like8.1e-16176.92Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSV-LQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKL
        M P +  YSY+KIRF+ RVR+FLRSK SRKRFR PSDPS+ISR ++R++EE  IR+YD A  + LQRTVKSLHFG+GEEK++AA +I RL+KESAKVRKL
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSV-LQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKL

Query:  MVDLGVIPALVAMADSDQL--------AVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLT
        MVDLGVIPALVAM DSDQL        AVRALIELAND+ LNKT+MVEEGILSKLPK  QF  +DSSS EFAELLLSLS LANTQLFLASTEPVV YL+T
Subjt:  MVDLGVIPALVAMADSDQL--------AVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLT

Query:  ILNNSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADI
        IL NS+S+P+TK  CLATLFNISTVLENAETLISNGVVPTLLRFS VKE SEK+LPTLANLAVTSKGKQALESNS+F +IL+ ILTWEEKPKCQELSA I
Subjt:  ILNNSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADI

Query:  IMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIIT
        IMIL HQS AQRERL +S I PALL LALLG+ LAQ+RALKLLQWFKDER+ RVGPHSGPQ G + A GS  +  E+EKGKRIMRSLV+QSLYKNMEIIT
Subjt:  IMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIIT

Query:  RRANAAGECYSPTIRRTLVSSISSKSSPF
        RRAN AGE    TIRRTLVSS SSKS PF
Subjt:  RRANAAGECYSPTIRRTLVSSISSKSSPF

A0A6J1EGI7 uncharacterized protein LOC1114339761.4e-221100Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
        VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP

Query:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
        ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
Subjt:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW

Query:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
        AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
Subjt:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC

Query:  YSPTIRRTLVSSISSKSSPF
        YSPTIRRTLVSSISSKSSPF
Subjt:  YSPTIRRTLVSSISSKSSPF

A0A6J1GSS3 U-box domain-containing protein 7-like3.7e-16677.18Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MP SL  YSY+K+R + RVR+FLRSKSSRKRFR+ SDPS++S P +R R+ES+IR+YD A SVLQRTVKSLHFGDGEEK+RAAKEIERLIKESAK+RKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILN-----N
        VDLGV+PALVAMADSDQLAVRALIELAN + +++T+MVEEGILSKLP N +FA MDS+S EFAELL SLS LANT++F+ASTEP + YLLTILN     N
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILN-----N

Query:  SKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMIL
        +  SP+TK FCLA LFNISTVLENAETLISNGV+PTLLRFSS+KE SEKALPTLANLAV+S+GKQALESNS F EIL+E++TWEEKPKCQELSA IIMIL
Subjt:  SKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMIL

Query:  GHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRAN
         H SWA RERL +S I PALL LALLG+ LAQ+RALKLLQWFKDER+ARVGPHSGPQ G  VA GS  +++E+EKGKR+MRSLVKQSL KNMEIITRRAN
Subjt:  GHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRAN

Query:  AAGECYSPTIRRTLVSSISSKSSPF
          GEC SPTIRRTLVSS SSKS PF
Subjt:  AAGECYSPTIRRTLVSSISSKSSPF

A0A6J1JVQ5 U-box domain-containing protein 7-like2.9e-16677.41Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MP SL  YSY+K+RF+ RVR+FLRSKSSRKRFR+ SDPS++S P +R R+ES+I +YD A SVLQRTVKSLHFGDGEEK+RAAKEIERLIKESAK+RKL+
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILN-----N
        VDLGV+PALV MADSDQLAVRALIELAN + +++T+MVEEGILSKLPKN +F  MDS+S EFAELL SLS LANT+LF+ASTEP + YLLTILN     N
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILN-----N

Query:  SKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMIL
        +  SP+TK FCLA LFNISTVLENAETLISNGV+PTLLRFSSVKE SEKALPTLANLAV+S+GKQALESNS F EIL+E++TWEEKPKCQELSA IIMIL
Subjt:  SKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMIL

Query:  GHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRAN
         HQSWA RERL +S I PALL LALLG+ LAQ+RALKLLQWFKDER+ARVGPHSGPQ G  VA GS  +++E+EKGKR MRSLVKQSL KNMEIITRRAN
Subjt:  GHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRAN

Query:  AAGECYSPTIRRTLVSSISSKSSPF
          GEC SPTIRRTLVSS SSKS PF
Subjt:  AAGECYSPTIRRTLVSSISSKSSPF

A0A6J1KNH4 uncharacterized protein LOC1114957751.4e-21095Show/hide
Query:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
        MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPS+ISRPEIR+REE+ IRQYD ASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM
Subjt:  MPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLM

Query:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP
        VDLGVIPALVAMADSDQLAVRALIELANDTLLNK VMVEEGILSKLPKNTQFATMDSSSFEF ELL SLSCLANTQLFLASTEPV+SYLLTILN+SKSSP
Subjt:  VDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSP

Query:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW
        ET+AFCLATLFNISTVLENAETLISNGVVPTLLRFSSV+E SEKALPTLANLAVTSK KQALESNS FAEILVEILTWEEKPKCQELSADIIMILGHQSW
Subjt:  ETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSW

Query:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
        AQRERLGESCIAPALLGLALLGSSLAQ+RALKLLQWFKDEREARVGPHSGPQR GIVAVGSGLS++EVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC
Subjt:  AQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGEC

Query:  YSPTIRRTLVSSISSKSSPF
        YSPT+RR LVSSISSKSSPF
Subjt:  YSPTIRRTLVSSISSKSSPF

SwissProt top hitse value%identityAlignment
O80674 Transcription factor bHLH1062.3e-2742.13Show/hide
Query:  VSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNG--VFVPLDTDEVNVEPCGVGAN-GHMTLKATIC
        +++++ALAAL+NH EAERRRRERINSHL+ LR+++ C  K DKATLLA+VV++V+ELK++  E S+     +P +TDE++V   G  +N GH+  KA++C
Subjt:  VSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNG--VFVPLDTDEVNVEPCGVGAN-GHMTLKATIC

Query:  CEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEAS-----RLLASSVHRAISLVLEKAS
        CE + +LL DL + L SL++K +++E+ T+G R +S+       A+   H   S       L S + R+   ++E++S
Subjt:  CEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEAS-----RLLASSVHRAISLVLEKAS

Q9LET0 Putative transcription factor bHLH1071.6e-2541.04Show/hide
Query:  VSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNGVFVPLDTDEV---NVEPCGVGANGHMTLKATIC
        V E+KALA+L+NH EAER+RR RINSHL+ LR L+ C  K DK+TLLA+VV++VKELK++  E ++   +P +TDE+   N+E C  G +  +  K + C
Subjt:  VSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNGVFVPLDTDEV---NVEPCGVGANGHMTLKATIC

Query:  CEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASSVHRAISLVLEKAS
        CE +PELL DL + L SL ++ + ++++T+G R +++       A+   H   S    + +  A+  +LE++S
Subjt:  CEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASSVHRAISLVLEKAS

Q9LS08 Transcription factor AIG15.1e-2738.1Show/hide
Query:  SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNGVFVPLDTDEVNVEPCGVGANGHMTLKATI
        S + V + KALAA K+HSEAERRRRERIN+HL+ LR ++P   K DKA+LLAEV++ +KELK++ ++ ++   VP + D++ V+       G++ ++A+ 
Subjt:  SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNGVFVPLDTDEVNVEPCGVGANGHMTLKATI

Query:  CCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHP--------------EASRLL---ASSVHRAISLVLEK
        CC+ + +L+ D+  AL SL L+ +K+EI+T+G RVK+I F +    +   H               +  R++    SS+  A+  V+EK
Subjt:  CCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHP--------------EASRLL---ASSVHRAISLVLEK

Q9S7Y1 Transcription factor bHLH307.1e-2937.13Show/hide
Query:  HESTSWTTVFNLRFGELVKASTQP----------SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAA
        H       + +   G +V+A + P          + + + + KALAA K+HSEAERRRRERIN+HL+ LR ++P   K DKA+LLAEV++ VKELK++ +
Subjt:  HESTSWTTVFNLRFGELVKASTQP----------SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAA

Query:  EASNGVFVPLDTDEVNV-----EPCGVGANGHMTLKATICCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASS
          S    VP ++DE+ V     E  G   +G   +KA++CCE + +LL D+ + L ++ LK +K+EI+T+G RVK++ F T    E++           +
Subjt:  EASNGVFVPLDTDEVNV-----EPCGVGANGHMTLKATICCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASS

Query:  VHRAISLVLEKASSVEYSPRTTTTLPRKRRRLSSFHT
        +  A+  V+EK S+VE S  +      KR+R+SS +T
Subjt:  VHRAISLVLEKASSVEYSPRTTTTLPRKRRRLSSFHT

Q9XEF0 Transcription factor bHLH519.2e-2134.53Show/hide
Query:  FNLRFGELVKASTQPSKKRVSE-EKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNGVFVPLDTDEVNVE
        FNL F         P+   V   EKA +  ++H  AE+RRR+RINSHL+ LR LVP   K DKA LLA V+ QVKELK+KAAE+     +P + DEV V+
Subjt:  FNLRFGELVKASTQPSKKRVSE-EKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNGVFVPLDTDEVNVE

Query:  PCGV----GANGHMTLKATICCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASSVHRAISLVLEKASSVEYSP
        P  +         +  KA+ CCE QPE + ++ + L  L L+ +++EI ++G R++  F           +  AS   A ++ +++   L + +S   + 
Subjt:  PCGV----GANGHMTLKATICCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASSVHRAISLVLEKASSVEYSP

Query:  RTTTTLPRKRRR--LSSFHTSRQ
         +   +  KR+R  LSS ++  +
Subjt:  RTTTTLPRKRRR--LSSFHTSRQ

Arabidopsis top hitse value%identityAlignment
AT1G68810.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein5.0e-3037.13Show/hide
Query:  HESTSWTTVFNLRFGELVKASTQP----------SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAA
        H       + +   G +V+A + P          + + + + KALAA K+HSEAERRRRERIN+HL+ LR ++P   K DKA+LLAEV++ VKELK++ +
Subjt:  HESTSWTTVFNLRFGELVKASTQP----------SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAA

Query:  EASNGVFVPLDTDEVNV-----EPCGVGANGHMTLKATICCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASS
          S    VP ++DE+ V     E  G   +G   +KA++CCE + +LL D+ + L ++ LK +K+EI+T+G RVK++ F T    E++           +
Subjt:  EASNGVFVPLDTDEVNV-----EPCGVGANGHMTLKATICCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEASRLLASS

Query:  VHRAISLVLEKASSVEYSPRTTTTLPRKRRRLSSFHT
        +  A+  V+EK S+VE S  +      KR+R+SS +T
Subjt:  VHRAISLVLEKASSVEYSPRTTTTLPRKRRRLSSFHT

AT2G27430.1 ARM repeat superfamily protein2.1e-8946.45Show/hide
Query:  SLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSE-----------------ISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEI
        S++   Y+K+ F T++R  L+SK+S ++    + P +                 +S+P   + EE           VLQ+TVK +HFG  EEK++AA EI
Subjt:  SLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPSDPSE-----------------ISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEI

Query:  ERLIKESAKVRKLMVDLGVIPALVAMADSD-----QLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLAS
        E+L +E  K RKLM +LGVI  LV+M  SD     + AV ALI+L++ T  NK +MV   I SKLPKN +     S+   FAELLLSLS L NTQL +AS
Subjt:  ERLIKESAKVRKLMVDLGVIPALVAMADSD-----QLAVRALIELANDTLLNKTVMVEEGILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLAS

Query:  TEPVVSYLLTILNNSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEK
        ++ ++ +L+  +N+  +  +TK  CLAT+ N+  VLENA  L+ NG V TLL   S K+ SEKAL +L  L VT  GK+A+E     ++ L+EILTWE+ 
Subjt:  TEPVVSYLLTILNNSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEK

Query:  PKCQELSADIIMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGI-VAVGSGLSEEEVEKGKRIMRSLVK
        PKCQE +A I+M+L HQSW+QRE++ ++ I P LL ++LLGS L Q+RA+KLLQWFKDER  R+GPHSGPQ G +   +GS +S    E+G+++M++LVK
Subjt:  PKCQELSADIIMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGPQRGGI-VAVGSGLSEEEVEKGKRIMRSLVK

Query:  QSLYKNMEIITRRANAAGECYSPTIRRTLVSSISSKS
        QSLYKNME+ITRR N   E  S  + ++L+ S SSKS
Subjt:  QSLYKNMEIITRRANAAGECYSPTIRRTLVSSISSKS

AT2G41130.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.6e-2842.13Show/hide
Query:  VSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNG--VFVPLDTDEVNVEPCGVGAN-GHMTLKATIC
        +++++ALAAL+NH EAERRRRERINSHL+ LR+++ C  K DKATLLA+VV++V+ELK++  E S+     +P +TDE++V   G  +N GH+  KA++C
Subjt:  VSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPIKRDKATLLAEVVRQVKELKKKAAEASNG--VFVPLDTDEVNVEPCGVGAN-GHMTLKATIC

Query:  CEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEAS-----RLLASSVHRAISLVLEKAS
        CE + +LL DL + L SL++K +++E+ T+G R +S+       A+   H   S       L S + R+   ++E++S
Subjt:  CEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAENAAHPEAS-----RLLASSVHRAISLVLEKAS

AT4G31890.1 ARM repeat superfamily protein1.0e-3029.29Show/hide
Query:  DPASSVLQRTVKSLHF-----------GDGEEKQRAAKEIERLIKESAKVRKLMVDLGVIPALVAMADSDQL------AVRALIELANDTLLNKTVMVEE
        + A  VL+R V+ L             GD  +K  AA E+  L KE ++ R  +  LG IP LV+M D  ++      ++ AL+ L      NK  +V+ 
Subjt:  DPASSVLQRTVKSLHF-----------GDGEEKQRAAKEIERLIKESAKVRKLMVDLGVIPALVAMADSDQL------AVRALIELANDTLLNKTVMVEE

Query:  GILSKLPKNTQFATMDSSSFEFAELL----LSLSCLANTQLFLASTEPVVSYLLTILN-NSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRF
        G + K+ K  +  + ++   E AE +    L LS L + +  + S+  ++  + T+ N +  SS + +   L  L+N+S    N   ++   ++  LL  
Subjt:  GILSKLPKNTQFATMDSSSFEFAELL----LSLSCLANTQLFLASTEPVVSYLLTILN-NSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRF

Query:  SSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQ
            E SE+ L  L+NL    +G++A+        +LV++L W + P CQE +  I+M++ H+ +  R+ + E+ I  ALL L LLGS+LAQ+RA ++L+
Subjt:  SSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQ

Query:  WFKDEREARV-------GPHSGPQRGGIVAVGSGLSEEE----VEKGKRIMRSLVKQSLYKNMEIITRRANAAGECYSPTIRRTLVSSISSKSSPF
          + ++  +V       G  S P  G      +GL  EE    + + ++ ++ LV+QSL  NM+ I +RAN   +       ++L  S +SKS PF
Subjt:  WFKDEREARV-------GPHSGPQRGGIVAVGSGLSEEE----VEKGKRIMRSLVKQSLYKNMEIITRRANAAGECYSPTIRRTLVSSISSKSSPF

AT4G31890.2 ARM repeat superfamily protein1.0e-3029.29Show/hide
Query:  DPASSVLQRTVKSLHF-----------GDGEEKQRAAKEIERLIKESAKVRKLMVDLGVIPALVAMADSDQL------AVRALIELANDTLLNKTVMVEE
        + A  VL+R V+ L             GD  +K  AA E+  L KE ++ R  +  LG IP LV+M D  ++      ++ AL+ L      NK  +V+ 
Subjt:  DPASSVLQRTVKSLHF-----------GDGEEKQRAAKEIERLIKESAKVRKLMVDLGVIPALVAMADSDQL------AVRALIELANDTLLNKTVMVEE

Query:  GILSKLPKNTQFATMDSSSFEFAELL----LSLSCLANTQLFLASTEPVVSYLLTILN-NSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRF
        G + K+ K  +  + ++   E AE +    L LS L + +  + S+  ++  + T+ N +  SS + +   L  L+N+S    N   ++   ++  LL  
Subjt:  GILSKLPKNTQFATMDSSSFEFAELL----LSLSCLANTQLFLASTEPVVSYLLTILN-NSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRF

Query:  SSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQ
            E SE+ L  L+NL    +G++A+        +LV++L W + P CQE +  I+M++ H+ +  R+ + E+ I  ALL L LLGS+LAQ+RA ++L+
Subjt:  SSVKESSEKALPTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQ

Query:  WFKDEREARV-------GPHSGPQRGGIVAVGSGLSEEE----VEKGKRIMRSLVKQSLYKNMEIITRRANAAGECYSPTIRRTLVSSISSKSSPF
          + ++  +V       G  S P  G      +GL  EE    + + ++ ++ LV+QSL  NM+ I +RAN   +       ++L  S +SKS PF
Subjt:  WFKDEREARV-------GPHSGPQRGGIVAVGSGLSEEE----VEKGKRIMRSLVKQSLYKNMEIITRRANAAGECYSPTIRRTLVSSISSKSSPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGTTATGGAAGGTATGAAAAAAACCAGAGTTTCCAGCAAGATTTTGAGCAACAGAGTGTGTGTACAACGAAGCTTCACTCTCCACAAGCTTTCCTTTTG
GTTTGCCCCCACGAATCGACCTCCTGGACTACTGTCTTCAACCTTCGGTTCGGAGAGCTCGTCAAAGCCTCGACTCAGCCGTCGAAGAAAAGGGTGTCGGAGGAG
AAAGCACTTGCGGCGCTAAAGAATCACAGCGAGGCGGAGAGGCGGAGGCGAGAGAGAATCAATTCCCATCTCTCGACTCTGCGTGACCTTGTTCCTTGCCCCATT
AAGAGGGACAAAGCAACACTGCTCGCCGAAGTTGTCAGACAAGTGAAGGAATTGAAGAAGAAGGCAGCAGAAGCCAGCAATGGTGTTTTCGTTCCATTGGACACC
GACGAAGTCAACGTCGAACCTTGTGGGGTGGGAGCAAACGGGCATATGACCCTCAAGGCAACAATTTGTTGCGAATATCAACCGGAGCTTCTGTGTGATCTAAAA
CAAGCCCTTGATTCCCTTCACCTGAAGTTAGTAAAGTCAGAGATATCCACTTTGGGAAACAGAGTGAAGAGCATATTCTTTTTCACCAGCCCCATAGCAGAGAAT
GCTGCGCATCCCGAGGCTTCCCGACTTCTCGCATCATCGGTTCACCGGGCAATAAGTCTAGTCCTTGAGAAAGCTTCATCCGTAGAATACTCGCCAAGAACAACA
ACAACGCTCCCAAGGAAAAGGCGAAGGCTGTCTAGTTTCCATACGTCTAGACAGCCAATAATTCCAAATTCACTTCCTCTCATTCCATCATCAATCAAAATCATG
CCTCCTTCTCTATTCCCTTATTCGTACATCAAAATTCGTTTCCTCACTCGCGTCCGCCAATTCCTTCGCTCCAAATCATCTCGTAAGCGATTCCGTTCACCGTCT
GATCCGTCGGAAATTTCGAGGCCAGAAATTCGGGACAGAGAAGAGAGTTATATCCGGCAGTATGATCCGGCGTCGTCCGTGTTGCAGAGGACAGTGAAGAGCCTC
CACTTCGGCGACGGGGAGGAAAAACAGAGAGCGGCGAAGGAGATTGAGAGGTTGATTAAAGAGAGCGCCAAGGTTAGAAAATTGATGGTGGATCTCGGAGTTATA
CCTGCTTTGGTGGCGATGGCGGATTCCGATCAGTTGGCAGTTAGGGCATTGATTGAACTTGCTAACGATACTCTCCTGAACAAGACAGTAATGGTGGAGGAAGGG
ATCTTATCAAAGCTACCGAAGAACACCCAGTTCGCAACAATGGATTCATCCAGCTTTGAATTTGCAGAGCTTTTATTGTCACTTTCATGTCTAGCAAACACCCAG
TTGTTTCTTGCTTCAACCGAACCAGTTGTTTCATATCTCTTAACCATACTCAACAATTCAAAATCGAGCCCTGAAACCAAAGCATTTTGTTTGGCAACTTTATTC
AACATTTCCACTGTCCTAGAAAATGCAGAGACCTTAATCTCCAATGGTGTGGTTCCAACGCTACTCAGATTCTCCAGTGTCAAAGAGTCTTCCGAGAAAGCCTTA
CCGACGCTAGCAAACTTAGCAGTGACTTCAAAAGGGAAGCAAGCTCTGGAAAGCAACTCAAGATTCGCAGAGATTTTGGTAGAGATTTTGACATGGGAAGAGAAA
CCCAAATGCCAAGAACTTTCAGCGGACATCATAATGATTCTGGGCCATCAAAGCTGGGCGCAAAGGGAGAGATTGGGCGAGTCCTGCATCGCCCCTGCGCTGCTG
GGATTGGCGCTGTTAGGAAGCTCATTAGCTCAAAGGCGAGCTCTGAAACTGCTGCAATGGTTTAAAGATGAGAGGGAGGCGAGAGTGGGGCCGCATTCTGGACCT
CAAAGGGGTGGGATAGTTGCAGTAGGCTCAGGATTGAGTGAGGAGGAGGTTGAGAAAGGGAAGAGGATAATGAGAAGCCTGGTGAAGCAGAGCCTGTATAAGAAT
ATGGAGATAATAACTCGACGAGCCAATGCTGCTGGGGAATGTTATAGTCCAACCATTAGGAGGACCTTGGTTTCCAGCATTAGTTCCAAGAGTTCGCCTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGTTATGGAAGGTATGAAAAAAACCAGAGTTTCCAGCAAGATTTTGAGCAACAGAGTGTGTGTACAACGAAGCTTCACTCTCCACAAGCTTTCCTTTTG
GTTTGCCCCCACGAATCGACCTCCTGGACTACTGTCTTCAACCTTCGGTTCGGAGAGCTCGTCAAAGCCTCGACTCAGCCGTCGAAGAAAAGGGTGTCGGAGGAG
AAAGCACTTGCGGCGCTAAAGAATCACAGCGAGGCGGAGAGGCGGAGGCGAGAGAGAATCAATTCCCATCTCTCGACTCTGCGTGACCTTGTTCCTTGCCCCATT
AAGAGGGACAAAGCAACACTGCTCGCCGAAGTTGTCAGACAAGTGAAGGAATTGAAGAAGAAGGCAGCAGAAGCCAGCAATGGTGTTTTCGTTCCATTGGACACC
GACGAAGTCAACGTCGAACCTTGTGGGGTGGGAGCAAACGGGCATATGACCCTCAAGGCAACAATTTGTTGCGAATATCAACCGGAGCTTCTGTGTGATCTAAAA
CAAGCCCTTGATTCCCTTCACCTGAAGTTAGTAAAGTCAGAGATATCCACTTTGGGAAACAGAGTGAAGAGCATATTCTTTTTCACCAGCCCCATAGCAGAGAAT
GCTGCGCATCCCGAGGCTTCCCGACTTCTCGCATCATCGGTTCACCGGGCAATAAGTCTAGTCCTTGAGAAAGCTTCATCCGTAGAATACTCGCCAAGAACAACA
ACAACGCTCCCAAGGAAAAGGCGAAGGCTGTCTAGTTTCCATACGTCTAGACAGCCAATAATTCCAAATTCACTTCCTCTCATTCCATCATCAATCAAAATCATG
CCTCCTTCTCTATTCCCTTATTCGTACATCAAAATTCGTTTCCTCACTCGCGTCCGCCAATTCCTTCGCTCCAAATCATCTCGTAAGCGATTCCGTTCACCGTCT
GATCCGTCGGAAATTTCGAGGCCAGAAATTCGGGACAGAGAAGAGAGTTATATCCGGCAGTATGATCCGGCGTCGTCCGTGTTGCAGAGGACAGTGAAGAGCCTC
CACTTCGGCGACGGGGAGGAAAAACAGAGAGCGGCGAAGGAGATTGAGAGGTTGATTAAAGAGAGCGCCAAGGTTAGAAAATTGATGGTGGATCTCGGAGTTATA
CCTGCTTTGGTGGCGATGGCGGATTCCGATCAGTTGGCAGTTAGGGCATTGATTGAACTTGCTAACGATACTCTCCTGAACAAGACAGTAATGGTGGAGGAAGGG
ATCTTATCAAAGCTACCGAAGAACACCCAGTTCGCAACAATGGATTCATCCAGCTTTGAATTTGCAGAGCTTTTATTGTCACTTTCATGTCTAGCAAACACCCAG
TTGTTTCTTGCTTCAACCGAACCAGTTGTTTCATATCTCTTAACCATACTCAACAATTCAAAATCGAGCCCTGAAACCAAAGCATTTTGTTTGGCAACTTTATTC
AACATTTCCACTGTCCTAGAAAATGCAGAGACCTTAATCTCCAATGGTGTGGTTCCAACGCTACTCAGATTCTCCAGTGTCAAAGAGTCTTCCGAGAAAGCCTTA
CCGACGCTAGCAAACTTAGCAGTGACTTCAAAAGGGAAGCAAGCTCTGGAAAGCAACTCAAGATTCGCAGAGATTTTGGTAGAGATTTTGACATGGGAAGAGAAA
CCCAAATGCCAAGAACTTTCAGCGGACATCATAATGATTCTGGGCCATCAAAGCTGGGCGCAAAGGGAGAGATTGGGCGAGTCCTGCATCGCCCCTGCGCTGCTG
GGATTGGCGCTGTTAGGAAGCTCATTAGCTCAAAGGCGAGCTCTGAAACTGCTGCAATGGTTTAAAGATGAGAGGGAGGCGAGAGTGGGGCCGCATTCTGGACCT
CAAAGGGGTGGGATAGTTGCAGTAGGCTCAGGATTGAGTGAGGAGGAGGTTGAGAAAGGGAAGAGGATAATGAGAAGCCTGGTGAAGCAGAGCCTGTATAAGAAT
ATGGAGATAATAACTCGACGAGCCAATGCTGCTGGGGAATGTTATAGTCCAACCATTAGGAGGACCTTGGTTTCCAGCATTAGTTCCAAGAGTTCGCCTTTTTGA
AACACCTCCAAATCCATTACGCATTTGTAAACTTTCATCGCCCTGCCCCTCTGTATTCTATCGTCTTCTGCAGCTCAATCAAAGCCAAACGAAGTCAACAATGGT
TTATATTGTCCAGTTTGCAAGCCTT
Protein sequenceShow/hide protein sequence
MNRYGRYEKNQSFQQDFEQQSVCTTKLHSPQAFLLVCPHESTSWTTVFNLRFGELVKASTQPSKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRDLVPCPI
KRDKATLLAEVVRQVKELKKKAAEASNGVFVPLDTDEVNVEPCGVGANGHMTLKATICCEYQPELLCDLKQALDSLHLKLVKSEISTLGNRVKSIFFFTSPIAEN
AAHPEASRLLASSVHRAISLVLEKASSVEYSPRTTTTLPRKRRRLSSFHTSRQPIIPNSLPLIPSSIKIMPPSLFPYSYIKIRFLTRVRQFLRSKSSRKRFRSPS
DPSEISRPEIRDREESYIRQYDPASSVLQRTVKSLHFGDGEEKQRAAKEIERLIKESAKVRKLMVDLGVIPALVAMADSDQLAVRALIELANDTLLNKTVMVEEG
ILSKLPKNTQFATMDSSSFEFAELLLSLSCLANTQLFLASTEPVVSYLLTILNNSKSSPETKAFCLATLFNISTVLENAETLISNGVVPTLLRFSSVKESSEKAL
PTLANLAVTSKGKQALESNSRFAEILVEILTWEEKPKCQELSADIIMILGHQSWAQRERLGESCIAPALLGLALLGSSLAQRRALKLLQWFKDEREARVGPHSGP
QRGGIVAVGSGLSEEEVEKGKRIMRSLVKQSLYKNMEIITRRANAAGECYSPTIRRTLVSSISSKSSPF