; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG03G007780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG03G007780
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionU-box domain-containing protein 7
Genome locationCG_Chr03:8895281..8903899
RNA-Seq ExpressionClCG03G007780
SyntenyClCG03G007780
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149064.2 uncharacterized protein LOC101207857 [Cucumis sativus]1.7e-16277.99Show/hide
Query:  LRFLSRVRRFLLSKSSRNRFRLPSDDPSD--IPR-EQSIIIRNYD---AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV
        LRFLS +R FL SKSSR RFR PS   SD   P  EQS I+R YD      SS+ LQRTVK LHFGDGDEK+RAAKEIE LIK+   ++KVR++IVDLGV
Subjt:  LRFLSRVRRFLLSKSSRNRFRLPSDDPSD--IPR-EQSIIIRNYD---AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD FAV+ALIQLANHTFLNKTL++EEGIL+KLP K        DSS+HEFPELL SLSCLANTQLFLASTEP++SYLLT+LN+ ES  ++K
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
        TFCLAT+F+ISTILEN ETLISN V+PTLL+FS IKEFSEKALPTLANLAVTSKGK ALE NS F EILIEILTWEEKPKCQELSAYIIM++AHQSW QR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS
        E+LAK+SIIV ALLGLALLGSPLAQ RALKLLQW KDER+ARV AHSGPQ+  GIVEVGSGFSEKEIEKGKR+MRSLVKQSL+KNMEIITRRAN GECSS
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS

Query:  PRIRRTLVSTTSSKSLPF
          IRRTLVS+ SSKSLPF
Subjt:  PRIRRTLVSTTSSKSLPF

XP_022927021.1 uncharacterized protein LOC111433976 [Cucurbita moschata]1.3e-16278.95Show/hide
Query:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV
        S+  +RFL+RVR+FL SKSSR RFR PS DPS+I R      +   IR YD A  S+VLQRTVKSLHFGDG+EKQRAAKEIE LIKESAKVRKL+VDLGV
Subjt:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD  AVRALI+LAN T LNKT++VEEGILSKLP   N  F  MDSSS EF ELL SLSCLANTQLFLASTEPVVSYLLT+LNNS+S P+TK
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
         FCLATLF+IST+LENAETLISNGVVPTLLRFS +KE SEKALPTLANLAVTSKGKQALE+NS F EIL+EILTWEEKPKCQELSA IIMI+ HQSWAQR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS
        ERL + S I  ALLGLALLGS LAQ+RALKLLQWFKDER+ARVG HSGPQ  GIV VGSG SE+E+EKGKR+MRSLVKQSL+KNMEIITRRAN AGEC S
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS

Query:  PRIRRTLVSTTSSKSLPF
        P IRRTLVS+ SSKS PF
Subjt:  PRIRRTLVSTTSSKSLPF

XP_023001724.1 uncharacterized protein LOC111495775 [Cucurbita maxima]2.3e-16278.47Show/hide
Query:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV
        S+  +RFL+RVR+FL SKSSR RFR PS DPSDI R      +   IR YD A  S+VLQRTVKSLHFGDG+EKQRAAKEIE LIKESAKVRKL+VDLGV
Subjt:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD  AVRALI+LAN T LNK ++VEEGILSKLP   N  F  MDSSS EF ELLFSLSCLANTQLFLASTEPV+SYLLT+LN+S+S P+T+
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
         FCLATLF+IST+LENAETLISNGVVPTLLRFS ++EFSEKALPTLANLAVTSK KQALE+NSTF EIL+EILTWEEKPKCQELSA IIMI+ HQSWAQR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS
        ERL + S I  ALLGLALLGS LAQKRALKLLQWFKDER+ARVG HSGPQ AGIV VGSG S+KE+EKGKR+MRSLVKQSL+KNMEIITRRAN AGEC S
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS

Query:  PRIRRTLVSTTSSKSLPF
        P +RR LVS+ SSKS PF
Subjt:  PRIRRTLVSTTSSKSLPF

XP_023519234.1 uncharacterized protein LOC111782668 [Cucurbita pepo subsp. pepo]1.0e-16279.19Show/hide
Query:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV
        S+  +RFL+RVR+FL SKSSR RFR PS DPS+I R      +   IR YD A  S+VLQRTVKSLHFGDG+EKQRAAKEIE LIKESAKVRKL+VDLGV
Subjt:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD  AVRALI+LAN T LNKT++VEEGILSKLP   N  F  MDSSS EF ELL SLSCLANTQLFLASTEPV+SYLLTVLNNS+S  KTK
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
         FCL TLF+IST+L+NAETLISNGVVPTLLRFS ++EFSEKALPTLANLAVTSKGKQALE+NST P ILIEILTWEEKPKCQELSA IIMI+ HQSWAQR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS
        ERL + S I  ALLGLALLGS LAQKRALKLLQWFKDER+ARVG HSGPQ  GIV VGSG SEKE+EKGKR+MRSLVKQSL+KNMEIITRRAN AGEC S
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS

Query:  PRIRRTLVSTTSSKSLPF
        P IRRTLVS+ SSKS PF
Subjt:  PRIRRTLVSTTSSKSLPF

XP_038894080.1 U-box domain-containing protein 6-like [Benincasa hispida]1.3e-18686.87Show/hide
Query:  SSFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR------EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDL
        S    LRF+SRVRRFL S+SSR RFRLPS DPSDIPR      EQS IIR YD     AVLQRTVKSLHFGDGDEK+RAAKEIE  IKESAKVRKLIVDL
Subjt:  SSFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR------EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDL

Query:  GVIPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPK
        GVIPALVAMADSDP AVRALIQLANHT+LNKTL+VEEGIL+KLP  NNP FTKMDSSSHEFPELL SLSCLANTQLFLASTEPV+SYLLT+LNNSES PK
Subjt:  GVIPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPK

Query:  TKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWA
        TK FCLATLF+IST+LENAETLISNGVVPTLLRFS IKEFSEKALPTLANLAVTSKGKQALE+NSTFPEILIEILTWEEKP CQELS YIIMI+AHQSWA
Subjt:  TKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWA

Query:  QRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECS
        QRERLAKSS+IV ALLGLALLGSPLAQKRALKLLQWFK+ERQA+VG HSGPQ+AGIVEVGSGFSEKEIEKGKRMMRSLVKQSL+KNMEIITRRAN GEC 
Subjt:  QRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECS

Query:  SPRIRRTLVSTTSSKSLPF
        SPRIRRTLV +TSSKSLPF
Subjt:  SPRIRRTLVSTTSSKSLPF

TrEMBL top hitse value%identityAlignment
A0A0A0LUN8 Uncharacterized protein8.3e-16377.99Show/hide
Query:  LRFLSRVRRFLLSKSSRNRFRLPSDDPSD--IPR-EQSIIIRNYD---AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV
        LRFLS +R FL SKSSR RFR PS   SD   P  EQS I+R YD      SS+ LQRTVK LHFGDGDEK+RAAKEIE LIK+   ++KVR++IVDLGV
Subjt:  LRFLSRVRRFLLSKSSRNRFRLPSDDPSD--IPR-EQSIIIRNYD---AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD FAV+ALIQLANHTFLNKTL++EEGIL+KLP K        DSS+HEFPELL SLSCLANTQLFLASTEP++SYLLT+LN+ ES  ++K
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
        TFCLAT+F+ISTILEN ETLISN V+PTLL+FS IKEFSEKALPTLANLAVTSKGK ALE NS F EILIEILTWEEKPKCQELSAYIIM++AHQSW QR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS
        E+LAK+SIIV ALLGLALLGSPLAQ RALKLLQW KDER+ARV AHSGPQ+  GIVEVGSGFSEKEIEKGKR+MRSLVKQSL+KNMEIITRRAN GECSS
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS

Query:  PRIRRTLVSTTSSKSLPF
          IRRTLVS+ SSKSLPF
Subjt:  PRIRRTLVSTTSSKSLPF

A0A1S3B802 uncharacterized protein LOC1034867965.4e-16278.95Show/hide
Query:  LRFLSRVRRFL--LSKSSRNRFRLPSDDPSDI--PREQSIIIRNYD--AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV
        LRFLS +RRFL   SKSSR R R PS   SDI  P  +  I+ + D  AA+SS+VLQRTVKSLHFGDGDEK+RAAKEIE LIK+   ++KVRKLIVDLGV
Subjt:  LRFLSRVRRFL--LSKSSRNRFRLPSDDPSDI--PREQSIIIRNYD--AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD FAV+ALIQLANHTFLNKTL++E GIL+KLP K        DSSSHEFPELL SLSCLANTQLFLASTEP++SYLL +LNN ES  K+K
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
        T CLAT+F+ISTILEN ETLISN V+PTLL+FS IKEFSEKALPTLANLAVTSKGKQALE NS F EILIEILTWEEKPKCQELSAYIIMI+AHQSWAQR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS
        ERL K+SIIV ALLGLALLGSPLAQ RALKLLQW KDER+A V AHSGPQ+  GIVEVGSGFSEKEIEKGKR+MRSLVKQSL+KNMEIITRRAN GECSS
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS

Query:  PRIRRTLVSTTSSKSLPF
          IRRTLVS+ SSKSLPF
Subjt:  PRIRRTLVSTTSSKSLPF

A0A5A7UPJ5 U-box domain-containing protein 75.4e-16278.95Show/hide
Query:  LRFLSRVRRFL--LSKSSRNRFRLPSDDPSDI--PREQSIIIRNYD--AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV
        LRFLS +RRFL   SKSSR R R PS   SDI  P  +  I+ + D  AA+SS+VLQRTVKSLHFGDGDEK+RAAKEIE LIK+   ++KVRKLIVDLGV
Subjt:  LRFLSRVRRFL--LSKSSRNRFRLPSDDPSDI--PREQSIIIRNYD--AAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKE---SAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD FAV+ALIQLANHTFLNKTL++E GIL+KLP K        DSSSHEFPELL SLSCLANTQLFLASTEP++SYLL +LNN ES  K+K
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
        T CLAT+F+ISTILEN ETLISN V+PTLL+FS IKEFSEKALPTLANLAVTSKGKQALE NS F EILIEILTWEEKPKCQELSAYIIMI+AHQSWAQR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS
        ERL K+SIIV ALLGLALLGSPLAQ RALKLLQW KDER+A V AHSGPQ+  GIVEVGSGFSEKEIEKGKR+MRSLVKQSL+KNMEIITRRAN GECSS
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMA-GIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSS

Query:  PRIRRTLVSTTSSKSLPF
          IRRTLVS+ SSKSLPF
Subjt:  PRIRRTLVSTTSSKSLPF

A0A6J1EGI7 uncharacterized protein LOC1114339766.4e-16378.95Show/hide
Query:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV
        S+  +RFL+RVR+FL SKSSR RFR PS DPS+I R      +   IR YD A  S+VLQRTVKSLHFGDG+EKQRAAKEIE LIKESAKVRKL+VDLGV
Subjt:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD  AVRALI+LAN T LNKT++VEEGILSKLP   N  F  MDSSS EF ELL SLSCLANTQLFLASTEPVVSYLLT+LNNS+S P+TK
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
         FCLATLF+IST+LENAETLISNGVVPTLLRFS +KE SEKALPTLANLAVTSKGKQALE+NS F EIL+EILTWEEKPKCQELSA IIMI+ HQSWAQR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS
        ERL + S I  ALLGLALLGS LAQ+RALKLLQWFKDER+ARVG HSGPQ  GIV VGSG SE+E+EKGKR+MRSLVKQSL+KNMEIITRRAN AGEC S
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS

Query:  PRIRRTLVSTTSSKSLPF
        P IRRTLVS+ SSKS PF
Subjt:  PRIRRTLVSTTSSKSLPF

A0A6J1KNH4 uncharacterized protein LOC1114957751.1e-16278.47Show/hide
Query:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV
        S+  +RFL+RVR+FL SKSSR RFR PS DPSDI R      +   IR YD A  S+VLQRTVKSLHFGDG+EKQRAAKEIE LIKESAKVRKL+VDLGV
Subjt:  SFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPR-----EQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGV

Query:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK
        IPALVAMADSD  AVRALI+LAN T LNK ++VEEGILSKLP   N  F  MDSSS EF ELLFSLSCLANTQLFLASTEPV+SYLLT+LN+S+S P+T+
Subjt:  IPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSESIPKTK

Query:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR
         FCLATLF+IST+LENAETLISNGVVPTLLRFS ++EFSEKALPTLANLAVTSK KQALE+NSTF EIL+EILTWEEKPKCQELSA IIMI+ HQSWAQR
Subjt:  TFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQR

Query:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS
        ERL + S I  ALLGLALLGS LAQKRALKLLQWFKDER+ARVG HSGPQ AGIV VGSG S+KE+EKGKR+MRSLVKQSL+KNMEIITRRAN AGEC S
Subjt:  ERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN-AGECSS

Query:  PRIRRTLVSTTSSKSLPF
        P +RR LVS+ SSKS PF
Subjt:  PRIRRTLVSTTSSKSLPF

SwissProt top hitse value%identityAlignment
O80674 Transcription factor bHLH1068.1e-2238.73Show/hide
Query:  VSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPC---------------PVKELKKKAAEASNG--IFVPMDTDEVNVEPCGVGAN-GDMSFKATLC
        +++++ALAAL+NH EAERRRRERINSHL+ LR ++ C                V+ELK++  E S+     +P +TDE++V   G  +N G + FKA+LC
Subjt:  VSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPC---------------PVKELKKKAAEASNG--IFVPMDTDEVNVEPCGVGAN-GDMSFKATLC

Query:  CEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKAS
        CE R +LL DL + L SL++K ++AE+ T+G R +++       AD   H   S H    ++ A+  +L+++S
Subjt:  CEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKAS

Q9LET0 Putative transcription factor bHLH1073.1e-2138.15Show/hide
Query:  VSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPC---------------PVKELKKKAAEASNGIFVPMDTDEV---NVEPCGVGANGDMSFKATLC
        V E+KALA+L+NH EAER+RR RINSHL+ LR L+ C                VKELK++  E ++   +P +TDE+   N+E C  G +  + FK + C
Subjt:  VSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPC---------------PVKELKKKAAEASNGIFVPMDTDEV---NVEPCGVGANGDMSFKATLC

Query:  CEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKAS
        CE RPELL DL + L SL ++ + A+++T+G R +N+       AD   H   S    + ++ A+  +L+++S
Subjt:  CEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKAS

Q9LS08 Transcription factor AIG12.8e-2234.92Show/hide
Query:  SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPCP---------------VKELKKKAAEASNGIFVPMDTDEVNVEPCGVGANGDMSFKATL
        S + V + KALAA K+HSEAERRRRERIN+HL+ LR ++P                 +KELK++ ++ ++   VP + D++ V+       G++  +A+ 
Subjt:  SKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPCP---------------VKELKKKAAEASNGIFVPMDTDEVNVEPCGVGANGDMSFKATL

Query:  CCEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHL-----------------ASSVRQAISFVLDK
        CC+ R +L+ D+  AL SL L+ +KAEI+T+G RVKNI F +    D  DH    ++                   SS+ +A+  V++K
Subjt:  CCEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHL-----------------ASSVRQAISFVLDK

Q9S7Y1 Transcription factor bHLH302.3e-2436.65Show/hide
Query:  GEKGELVKAPIQPSKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPCP---------------VKELKKKAAEASNGIFVPMDTDEVNVEPCG
        G + EL K   Q     + + KALAA K+HSEAERRRRERIN+HL+ LR ++P                 VKELK++ +  S    VP ++DE+ V    
Subjt:  GEKGELVKAPIQPSKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPCP---------------VKELKKKAAEASNGIFVPMDTDEVNVEPCG

Query:  VGANGDMSF--KATLCCEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKASSPEYSPRTTLP
            GD  F  KA+LCCE R +LL D+ + L ++ LK +KAEI+T+G RVKN+ F T     +G+  E  ++   ++ +A+  V++K++  E S  ++  
Subjt:  VGANGDMSF--KATLCCEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKASSPEYSPRTTLP

Query:  MKRRRLSSFDTLRFLSRVRRF
         KR+R+SS +T+  + + +++
Subjt:  MKRRRLSSFDTLRFLSRVRRF

Q9XEF0 Transcription factor bHLH511.1e-1532.09Show/hide
Query:  EKALAALKNHSEAERRRRERINSHLSTLRGLVPC---------------PVKELKKKAAEASNGIFVPMDTDEVNVEPCGV----GANGDMSFKATLCCE
        EKA +  ++H  AE+RRR+RINSHL+ LR LVP                 VKELK+KAAE+     +P + DEV V+P  +         + FKA+ CCE
Subjt:  EKALAALKNHSEAERRRRERINSHLSTLRGLVPC---------------PVKELKKKAAEASNGIFVPMDTDEVNVEPCGV----GANGDMSFKATLCCE

Query:  YRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKASSPEYSPRTTLPMKRRR
         +PE +S++ + L  L L+ ++AEI ++G R++  F    +  +   +  AS   A +++Q++   L++ +S   +  +   ++ +R
Subjt:  YRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKASSPEYSPRTTLPMKRRR

Arabidopsis top hitse value%identityAlignment
AT1G68810.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.6e-2536.65Show/hide
Query:  GEKGELVKAPIQPSKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPCP---------------VKELKKKAAEASNGIFVPMDTDEVNVEPCG
        G + EL K   Q     + + KALAA K+HSEAERRRRERIN+HL+ LR ++P                 VKELK++ +  S    VP ++DE+ V    
Subjt:  GEKGELVKAPIQPSKKRVSEEKALAALKNHSEAERRRRERINSHLSTLRGLVPCP---------------VKELKKKAAEASNGIFVPMDTDEVNVEPCG

Query:  VGANGDMSF--KATLCCEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKASSPEYSPRTTLP
            GD  F  KA+LCCE R +LL D+ + L ++ LK +KAEI+T+G RVKN+ F T     +G+  E  ++   ++ +A+  V++K++  E S  ++  
Subjt:  VGANGDMSF--KATLCCEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIADNGDHSEASQHLASSVRQAISFVLDKASSPEYSPRTTLP

Query:  MKRRRLSSFDTLRFLSRVRRF
         KR+R+SS +T+  + + +++
Subjt:  MKRRRLSSFDTLRFLSRVRRF

AT2G25130.1 ARM repeat superfamily protein5.0e-2728.69Show/hide
Query:  VLQRTVKSL------HFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGVIPALVAMADSDP-------FAVRALIQLANHTFLNKTLIVEEGILSKLPNK
        +L+R VK L           ++K  AA E+  L K+  + R  +  LG IP LV+M D +         ++ AL+ L     +NK  IV+ G++ K+   
Subjt:  VLQRTVKSL------HFGDGDEKQRAAKEIETLIKESAKVRKLIVDLGVIPALVAMADSDP-------FAVRALIQLANHTFLNKTLIVEEGILSKLPNK

Query:  NNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSE-SIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKAL
                 + +         LS L + +  + S+  ++  + T+ N  E S  + +   L  L+++S   +N   ++   ++P LL      E SE+ L
Subjt:  NNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNSE-SIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKAL

Query:  PTLANLAVTSKGKQAL-ENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQAR
          L N+    +G++A+ E    FP IL+++L W +  KCQE + YI+M+MAH+ +  R  + ++  I S+LL L L+GSPLAQKRA ++L+     R   
Subjt:  PTLANLAVTSKGKQAL-ENNSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQAR

Query:  VGAHSGPQMAGIVEVG-SGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN
         G      + G   +G     +  +   ++ ++ LV+QSL  NM+ I +RAN
Subjt:  VGAHSGPQMAGIVEVG-SGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRAN

AT2G27430.1 ARM repeat superfamily protein3.7e-8646.82Show/hide
Query:  LRFLSRVRRFLLSKSSRNRFRLPS-----------DDPSDIPREQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDL
        L F +++R  L SK+S  +  L +           +  + +P E  I+ +  +      VLQ+TVK +HFG  +EK++AA EIE L +E  K RKL+ +L
Subjt:  LRFLSRVRRFLLSKSSRNRFRLPS-----------DDPSDIPREQSIIIRNYDAAASSAVLQRTVKSLHFGDGDEKQRAAKEIETLIKESAKVRKLIVDL

Query:  GVIPALVAMADSD-----PFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNS
        GVI  LV+M  SD       AV ALIQL++ T+ NK L+V   I SKLP KN     +  S+ H F ELL SLS L NTQL +AS++ ++ +L+  +N+ 
Subjt:  GVIPALVAMADSD-----PFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTEPVVSYLLTVLNNS

Query:  ESIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMA
         +  KTK  CLAT+ ++  +LENA  L+ NG V TLL     K+ SEKAL +L  L VT  GK+A+E+     + LIEILTWE+ PKCQE +AYI+M++A
Subjt:  ESIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIMIMA

Query:  HQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVE-VGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRA
        HQSW+QRE++AK+  IV  LL ++LLGSPL QKRA+KLLQWFKDER  R+G HSGPQ   +   +GS  S +  E+G++MM++LVKQSL+KNME+ITRR 
Subjt:  HQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVE-VGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRA

Query:  NAGECSSPRIRRTLVSTTSSKSLPF
        N    S     ++L+ +TSSKSL +
Subjt:  NAGECSSPRIRRTLVSTTSSKSLPF

AT4G31890.1 ARM repeat superfamily protein6.3e-3029.78Show/hide
Query:  GDGDEKQRAAKEIETLIKESAKVRKLIVDLGVIPALVAMADSDPF------AVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPEL
        GD  +K  AA E+  L KE ++ R  +  LG IP LV+M D          ++ AL+ L      NK  IV+ G + K+        T     +      
Subjt:  GDGDEKQRAAKEIETLIKESAKVRKLIVDLGVIPALVAMADSDPF------AVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPEL

Query:  LFSLSCLANTQLFLASTEPVVSYLLTVLNNSE-SIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALEN
           LS L + +  + S+  ++  + T+ N  E S  + +   L  L+++S    N   ++   ++  LL      E SE+ L  L+NL    +G++A+  
Subjt:  LFSLSCLANTQLFLASTEPVVSYLLTVLNNSE-SIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALEN

Query:  NSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARV-------GAHSGPQMAGI
              +L+++L W + P CQE + YI+M+MAH+ +  R+ + ++  I SALL L LLGS LAQKRA ++L+  + ++  +V       GA S P + G 
Subjt:  NSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARV-------GAHSGPQMAGI

Query:  VEVGSGFSEKEI--EKGKRMMRSLVKQSLHKNMEIITRRANAGECSSPRIR-RTLVSTTSSKSLPF
         + G    E ++   + ++ ++ LV+QSL  NM+ I +RAN  +   P    ++L  +++SKSLPF
Subjt:  VEVGSGFSEKEI--EKGKRMMRSLVKQSLHKNMEIITRRANAGECSSPRIR-RTLVSTTSSKSLPF

AT4G31890.2 ARM repeat superfamily protein6.3e-3029.78Show/hide
Query:  GDGDEKQRAAKEIETLIKESAKVRKLIVDLGVIPALVAMADSDPF------AVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPEL
        GD  +K  AA E+  L KE ++ R  +  LG IP LV+M D          ++ AL+ L      NK  IV+ G + K+        T     +      
Subjt:  GDGDEKQRAAKEIETLIKESAKVRKLIVDLGVIPALVAMADSDPF------AVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPEL

Query:  LFSLSCLANTQLFLASTEPVVSYLLTVLNNSE-SIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALEN
           LS L + +  + S+  ++  + T+ N  E S  + +   L  L+++S    N   ++   ++  LL      E SE+ L  L+NL    +G++A+  
Subjt:  LFSLSCLANTQLFLASTEPVVSYLLTVLNNSE-SIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALEN

Query:  NSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARV-------GAHSGPQMAGI
              +L+++L W + P CQE + YI+M+MAH+ +  R+ + ++  I SALL L LLGS LAQKRA ++L+  + ++  +V       GA S P + G 
Subjt:  NSTFPEILIEILTWEEKPKCQELSAYIIMIMAHQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARV-------GAHSGPQMAGI

Query:  VEVGSGFSEKEI--EKGKRMMRSLVKQSLHKNMEIITRRANAGECSSPRIR-RTLVSTTSSKSLPF
         + G    E ++   + ++ ++ LV+QSL  NM+ I +RAN  +   P    ++L  +++SKSLPF
Subjt:  VEVGSGFSEKEI--EKGKRMMRSLVKQSLHKNMEIITRRANAGECSSPRIR-RTLVSTTSSKSLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAAACCAAAGTCATCAGTATTGTCTCAACAGTTACCAGCAAGATTTTCTGCAACAGAGTCTGTGTAAAACAAAGCTTCACTCTCCATAAGCTCTCCTTTTGGTT
TGCTCCGACGAATCGACCTCCGGGACTACTATCTTCAACGTTCGGTTCCTTAGAGAACTTTAATCGTGGATTTGTGCGTAGTGGGTCAATATTATCTCAGTCTTTAGTGT
TGAATGGTGAGAAGGGAGAGCTTGTAAAAGCCCCAATTCAGCCATCGAAGAAAAGGGTTTCGGAGGAAAAAGCACTTGCGGCGTTGAAGAATCACAGCGAGGCGGAGAGG
CGGAGGAGAGAAAGAATCAATTCCCATCTCTCGACTCTGCGTGGCCTTGTTCCTTGCCCCGTTAAGGAATTGAAGAAGAAGGCTGCAGAAGCCAGCAATGGCATTTTCGT
TCCAATGGATACCGACGAAGTTAATGTCGAACCTTGTGGAGTGGGAGCAAATGGGGATATGTCCTTCAAGGCAACCCTGTGTTGCGAATATCGACCTGAGCTTCTGTCTG
ATCTTAAACAAGCCCTTGATTCCCTTCACCTGAAGTTGGTGAAGGCAGAAATATCAACCTTGGGAAACAGGGTGAAGAACATATTCTTTTTCACCAGTGCCATTGCAGAC
AATGGTGACCATTCTGAGGCTTCCCAACATCTCGCATCATCGGTTCGTCAGGCAATAAGTTTTGTTCTTGACAAAGCTTCATCCCCAGAATACTCGCCCCGAACAACGCT
CCCAATGAAAAGGCGACGGCTGTCTAGTTTCGATACATTACGCTTCCTCAGTCGCGTCCGCCGGTTCCTTCTCTCCAAATCATCTCGCAACCGATTCCGTTTGCCGTCCG
ATGATCCATCGGATATTCCAAGGGAACAGAGTATAATTATCCGGAACTATGATGCGGCCGCCTCCTCCGCCGTGTTGCAGAGGACAGTGAAGAGCCTCCACTTCGGCGAC
GGAGACGAAAAACAGAGAGCCGCCAAGGAAATTGAGACGTTAATTAAAGAAAGCGCCAAGGTTAGAAAGCTGATTGTGGATCTAGGAGTTATACCTGCTTTGGTGGCAAT
GGCGGATTCCGATCCCTTCGCCGTCAGGGCCTTGATTCAACTTGCTAATCATACTTTCCTGAACAAAACACTAATAGTGGAGGAAGGAATCTTATCAAAGCTACCAAACA
AAAACAACCCCCACTTCACAAAAATGGACTCATCCAGCCATGAATTTCCAGAGCTTTTATTTTCACTATCTTGTCTAGCAAACACCCAGTTGTTTCTAGCTTCAACAGAA
CCAGTAGTTTCATATCTCTTAACCGTACTCAATAATTCAGAATCCATCCCCAAAACCAAAACATTTTGTTTAGCAACTTTATTCAGCATTTCCACCATCTTAGAAAATGC
AGAAACCTTAATCTCCAATGGTGTGGTTCCAACGCTACTCAGATTCTCCTGCATCAAAGAATTTTCAGAGAAAGCCCTACCAACATTAGCAAACTTGGCAGTGACTTCAA
AAGGAAAACAAGCTTTGGAAAACAACTCAACATTTCCAGAGATTTTGATAGAGATTTTGACATGGGAAGAGAAACCCAAATGCCAAGAACTTTCAGCTTATATCATAATG
ATTATGGCACATCAAAGCTGGGCTCAAAGAGAGAGATTGGCCAAGTCCAGCATTATTGTCTCTGCGCTGCTGGGATTGGCTTTGTTAGGAAGTCCATTAGCTCAAAAGAG
AGCTTTGAAATTGCTGCAATGGTTTAAAGATGAGAGGCAAGCGAGAGTGGGGGCTCATTCTGGACCTCAGATGGCTGGGATAGTTGAAGTAGGCTCAGGATTCAGTGAGA
AGGAGATTGAGAAAGGGAAGAGGATGATGAGAAGCTTGGTGAAGCAGAGTTTGCATAAGAATATGGAGATAATAACTAGGCGAGCCAATGCTGGGGAATGTTCAAGTCCA
AGGATTAGGAGGACTTTGGTTTCCACCACTAGTTCCAAGAGTTTGCCTTTTTGA
mRNA sequenceShow/hide mRNA sequence
GTATGAAAAAAACCAAAGTCATCAGTATTGTCTCAACAGTTACCAGCAAGATTTTCTGCAACAGAGTCTGTGTAAAACAAAGCTTCACTCTCCATAAGCTCTCCTTTTGG
TTTGCTCCGACGAATCGACCTCCGGGACTACTATCTTCAACGTTCGGTTCCTTAGAGAACTTTAATCGTGGATTTGTGCGTAGTGGGTCAATATTATCTCAGTCTTTAGT
GTTGAATGGTGAGAAGGGAGAGCTTGTAAAAGCCCCAATTCAGCCATCGAAGAAAAGGGTTTCGGAGGAAAAAGCACTTGCGGCGTTGAAGAATCACAGCGAGGCGGAGA
GGCGGAGGAGAGAAAGAATCAATTCCCATCTCTCGACTCTGCGTGGCCTTGTTCCTTGCCCCGTTAAGGAATTGAAGAAGAAGGCTGCAGAAGCCAGCAATGGCATTTTC
GTTCCAATGGATACCGACGAAGTTAATGTCGAACCTTGTGGAGTGGGAGCAAATGGGGATATGTCCTTCAAGGCAACCCTGTGTTGCGAATATCGACCTGAGCTTCTGTC
TGATCTTAAACAAGCCCTTGATTCCCTTCACCTGAAGTTGGTGAAGGCAGAAATATCAACCTTGGGAAACAGGGTGAAGAACATATTCTTTTTCACCAGTGCCATTGCAG
ACAATGGTGACCATTCTGAGGCTTCCCAACATCTCGCATCATCGGTTCGTCAGGCAATAAGTTTTGTTCTTGACAAAGCTTCATCCCCAGAATACTCGCCCCGAACAACG
CTCCCAATGAAAAGGCGACGGCTGTCTAGTTTCGATACATTACGCTTCCTCAGTCGCGTCCGCCGGTTCCTTCTCTCCAAATCATCTCGCAACCGATTCCGTTTGCCGTC
CGATGATCCATCGGATATTCCAAGGGAACAGAGTATAATTATCCGGAACTATGATGCGGCCGCCTCCTCCGCCGTGTTGCAGAGGACAGTGAAGAGCCTCCACTTCGGCG
ACGGAGACGAAAAACAGAGAGCCGCCAAGGAAATTGAGACGTTAATTAAAGAAAGCGCCAAGGTTAGAAAGCTGATTGTGGATCTAGGAGTTATACCTGCTTTGGTGGCA
ATGGCGGATTCCGATCCCTTCGCCGTCAGGGCCTTGATTCAACTTGCTAATCATACTTTCCTGAACAAAACACTAATAGTGGAGGAAGGAATCTTATCAAAGCTACCAAA
CAAAAACAACCCCCACTTCACAAAAATGGACTCATCCAGCCATGAATTTCCAGAGCTTTTATTTTCACTATCTTGTCTAGCAAACACCCAGTTGTTTCTAGCTTCAACAG
AACCAGTAGTTTCATATCTCTTAACCGTACTCAATAATTCAGAATCCATCCCCAAAACCAAAACATTTTGTTTAGCAACTTTATTCAGCATTTCCACCATCTTAGAAAAT
GCAGAAACCTTAATCTCCAATGGTGTGGTTCCAACGCTACTCAGATTCTCCTGCATCAAAGAATTTTCAGAGAAAGCCCTACCAACATTAGCAAACTTGGCAGTGACTTC
AAAAGGAAAACAAGCTTTGGAAAACAACTCAACATTTCCAGAGATTTTGATAGAGATTTTGACATGGGAAGAGAAACCCAAATGCCAAGAACTTTCAGCTTATATCATAA
TGATTATGGCACATCAAAGCTGGGCTCAAAGAGAGAGATTGGCCAAGTCCAGCATTATTGTCTCTGCGCTGCTGGGATTGGCTTTGTTAGGAAGTCCATTAGCTCAAAAG
AGAGCTTTGAAATTGCTGCAATGGTTTAAAGATGAGAGGCAAGCGAGAGTGGGGGCTCATTCTGGACCTCAGATGGCTGGGATAGTTGAAGTAGGCTCAGGATTCAGTGA
GAAGGAGATTGAGAAAGGGAAGAGGATGATGAGAAGCTTGGTGAAGCAGAGTTTGCATAAGAATATGGAGATAATAACTAGGCGAGCCAATGCTGGGGAATGTTCAAGTC
CAAGGATTAGGAGGACTTTGGTTTCCACCACTAGTTCCAAGAGTTTGCCTTTTTGAAAGATTTCCTAATCAATATCACATTCATGAACTTTCATCATCTGCCTCTGTGTA
TTGTATTGTCATCTTCTGCTTCTCAATTGAAATCTAGAAGATGAATGAGTAGAGAGGATTATCTATGTTATCTTATAAAGATGGTGAATTAATGGGAGATCTTCAAGATG
CCTAACAAAATGACGCCTCTTTAGGTTCGTCACTTTTGGATCC
Protein sequenceShow/hide protein sequence
MKKTKVISIVSTVTSKIFCNRVCVKQSFTLHKLSFWFAPTNRPPGLLSSTFGSLENFNRGFVRSGSILSQSLVLNGEKGELVKAPIQPSKKRVSEEKALAALKNHSEAER
RRRERINSHLSTLRGLVPCPVKELKKKAAEASNGIFVPMDTDEVNVEPCGVGANGDMSFKATLCCEYRPELLSDLKQALDSLHLKLVKAEISTLGNRVKNIFFFTSAIAD
NGDHSEASQHLASSVRQAISFVLDKASSPEYSPRTTLPMKRRRLSSFDTLRFLSRVRRFLLSKSSRNRFRLPSDDPSDIPREQSIIIRNYDAAASSAVLQRTVKSLHFGD
GDEKQRAAKEIETLIKESAKVRKLIVDLGVIPALVAMADSDPFAVRALIQLANHTFLNKTLIVEEGILSKLPNKNNPHFTKMDSSSHEFPELLFSLSCLANTQLFLASTE
PVVSYLLTVLNNSESIPKTKTFCLATLFSISTILENAETLISNGVVPTLLRFSCIKEFSEKALPTLANLAVTSKGKQALENNSTFPEILIEILTWEEKPKCQELSAYIIM
IMAHQSWAQRERLAKSSIIVSALLGLALLGSPLAQKRALKLLQWFKDERQARVGAHSGPQMAGIVEVGSGFSEKEIEKGKRMMRSLVKQSLHKNMEIITRRANAGECSSP
RIRRTLVSTTSSKSLPF