; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G002860 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G002860
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionBZIP transcription factor family protein
Genome locationCmo_Chr14:1318275..1322359
RNA-Seq ExpressionCmoCh14G002860
SyntenyCmoCh14G002860
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580686.1 hypothetical protein SDJN03_20688, partial [Cucurbita argyrosperma subsp. sororia]1.0e-27698.63Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISH PSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
Subjt:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSA EKNEAHDLNEA 
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL
        SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

KAG7017441.1 hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma]7.6e-28099.22Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISH PSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
        LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

XP_022934487.1 uncharacterized protein LOC111441650 isoform X1 [Cucurbita moschata]2.4e-28199.8Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
Subjt:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL
        SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

XP_022934488.1 uncharacterized protein LOC111441650 isoform X2 [Cucurbita moschata]9.6e-283100Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
        LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

XP_023528186.1 uncharacterized protein LOC111791175 isoform X2 [Cucurbita pepo subsp. pepo]3.3e-27597.85Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEALADLAV AVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDR VISH PSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLH VAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSH+QENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCP GNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEA+DLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
        LK+HTQNTVGVVVDRFE DTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

TrEMBL top hitse value%identityAlignment
A0A0A0LBD4 BZIP domain-containing protein5.3e-18671.99Show/hide
Query:  ASSSKCSEATSCSGLSSSSTRSFSSSSME---------ADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSL
        ASSSKCS+ T+ SGLSSSS+ S SSSS           ADQMVKVEIEAAEALA LAVLAVR++G QP +TKW IK  KGKRARKEVKTESPTS F DSL
Subjt:  ASSSKCSEATSCSGLSSSSTRSFSSSSME---------ADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSL

Query:  PSRADLDLRI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKL------SHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQ
        P+RADLDLRI QDRGV+ H PSEKEC   S PE ETT E+ K +KEAES K+      S+  FGCRRSRR LTEAEKEERRIRR+LANRESARQTIRRRQ
Subjt:  PSRADLDLRI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKL------SHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQ

Query:  ALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH
        ALCE+LT+KA+DLAWENENLKREKE+ALKEYQSLE TNKELKEQ+A A +PK+EEIPGN+RSSHVQ PPLPTN PLFLFSR P   YFWPSVVQ +S YH
Subjt:  ALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH

Query:  DLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIV-PCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSL
        +L NV VVP S+  P+NN   VS SS  QENFTN TG R P CI+ P SWLLPHHD RNQQS Q   PAGN QE +YS SQNSA TSK  VRAESRHSSL
Subjt:  DLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIV-PCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSL

Query:  PSAEEKNEAHDLNEAPSL------KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSC
        PSAEE+NEA DLNEAPSL      K+ TQNTVGV V+ F+ + R  VRKVLSPVRLECIEP+S    D  +EDD G+SSRTCDDLC+ AE++HEPE+V C
Subjt:  PSAEEKNEAHDLNEAPSL------KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSC

Query:  KKTIDAMAATEARRRRKELTKLKNLHTRPCRM
        KKT+DAMAATEARRRRKELTKLKNL+ R CRM
Subjt:  KKTIDAMAATEARRRRKELTKLKNLHTRPCRM

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X11.1e-28199.8Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
Subjt:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL
        SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

A0A6J1F7T1 uncharacterized protein LOC111441650 isoform X24.7e-283100Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
        LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

A0A6J1J476 uncharacterized protein LOC111481617 isoform X21.5e-27397.06Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEAL DLAVLAVRDSGV+PSETKWRIK KKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISH PSEKECADHSHPEWETTKEMIKAEKE ESPKLSHPLFGCRR RRNLTEAEKEERRIRRVLANRESARQTIRRRQ LCEDLTKKASDLAW
Subjt:  RIQDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLF FSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQE IYSNSQNSAYTSKVVVRAESR SSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK
        LK+HTQNTVGVVVDRFEADTRD+VRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

A0A6J1J5U4 uncharacterized protein LOC111481617 isoform X13.7e-27296.88Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEAL DLAVLAVRDSGV+PSETKWRIK KKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISH PSEKECADHSHPEWETTKEMIKAEKE ESPKLSHPLFGCRR RRNLTEAEKEERRIRRVLANRESARQTIRRRQ LCEDLTKKASDLA
Subjt:  RI-QDRGVISHHPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLF FSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQE IYSNSQNSAYTSKVVVRAESR SSLPSAEEKNEAHDLNEAP
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL
        SLK+HTQNTVGVVVDRFEADTRD+VRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

SwissProt top hitse value%identityAlignment
A0A3B6KF13 bZIP transcription factor 1-A1.5e-0437.5Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSH
        E+E +R +R  +NR+SAR++  R+QA CE+L ++A  L  EN +LK E     KEY  L   N  LK+ +   + K +E   +N+  H
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSH

A0A3B6MPP5 bZIP transcription factor 1-D1.5e-0437.5Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSH
        E+E +R +R  +NR+SAR++  R+QA CE+L ++A  L  EN +LK E     KEY  L   N  LK+ +   + K +E   +N+  H
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSH

B6E107 bZIP transcription factor 1-B1.5e-0437.5Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSH
        E+E +R +R  +NR+SAR++  R+QA CE+L ++A  L  EN +LK E     KEY  L   N  LK+ +   + K +E   +N+  H
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPGNNRSSH

P23922 Transcription factor HBP-1a2.6e-0437Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQI--------AHAEPKMEEIPGNNRSSHVQTP
        E+E ++ +R L+NRESAR++  R+QA CE+L ++A  L  EN +L+ E +   KEY+ L   N  LK ++        + A P M E    N  SH + P
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQI--------AHAEPKMEEIPGNNRSSHVQTP

P25032 DNA-binding protein EMBP-19.8e-0437.8Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPG
        E+E +R RR  +NRESAR++  R+Q  CE+L +K S+L   N  L+ E +   K+ +++E  NK+L  +I   + KM++  G
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAHAEPKMEEIPG

Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein4.3e-5537.79Show/hide
Query:  LSSSSTRSFSSSSMEADQMV--KVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL----RIQDRGVIS
        +SSS   S SSSS E +       E+EAAEALADLA LA+    V  S   W     KGKR RK VKTESP S   DSL    D D      + +  ++ 
Subjt:  LSSSSTRSFSSSSMEADQMV--KVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL----RIQDRGVIS

Query:  HHPSEKECADHSHPEWETTKEMIKAEKEAESPK--LSHPLF------GCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWE
            E+E    +    E TK  +K+E   E+PK  L+  L       GC RSR+NL+EAE+EERRIRR+LANRESARQTIRRRQA+CE+L+KKA+DL +E
Subjt:  HHPSEKECADHSHPEWETTKEMIKAEKEAESPK--LSHPLF------GCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWE

Query:  NENLKREKELALKEYQSLEITNKELKEQIAHA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        NENL+REK+ ALKE+QSLE  NK LKEQ+  + +P  +E   + + S V+     T  P + +++ PY  + WP V Q S+P        ++ P     S
Subjt:  NENLKREKELALKEYQSLEITNKELKEQIAHA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAG--NIQEYIYSNSQN-SAYTSKVVVRAESRHSSLPS--AEEKNEAHDL
              + ++   EN  +  G +T F +VPC W LP  DH       N  P G  + Q   +SN  +    +++ +   E+  S LP+   EE + + + 
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAG--NIQEYIYSNSQN-SAYTSKVVVRAESRHSSLPS--AEEKNEAHDL

Query:  NEAPSLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDD--LCHLAEKKHEPEIVSCKKTIDAMAATEARRRR
             L E     +    D F             PV     +   ++K +  SE   G++        L  L EKKH            ++AA EAR+RR
Subjt:  NEAPSLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDD--LCHLAEKKHEPEIVSCKKTIDAMAATEARRRR

Query:  KELTKLKNLHTRPCRM
        KELT+LKNLH R CRM
Subjt:  KELTKLKNLHTRPCRM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCCAAGTGCTCCGAGGCGACCAGTTGTTCGGGTTTGAGTTCTTCTTCTACACGTTCGTTTTCTTCCTCCTCCATGGAGGCGGATCAGATGGTCAAGGT
TGAGATTGAGGCGGCGGAGGCTCTTGCAGATTTGGCGGTTTTGGCGGTCAGAGATAGTGGAGTTCAACCGTCGGAAACCAAATGGCGGATTAAAGAGAAGAAAGGGAAAC
GGGCCAGGAAGGAGGTTAAGACCGAGTCGCCGACTTCTGCCTTCGTCGACTCTTTACCTAGTCGCGCGGATCTGGACCTTCGGATTCAGGATAGAGGGGTGATAAGTCAT
CATCCATCAGAAAAGGAATGCGCTGATCACTCCCATCCTGAGTGGGAAACAACCAAAGAGATGATTAAGGCGGAGAAGGAGGCCGAATCACCTAAACTAAGCCACCCATT
ATTTGGCTGCCGGAGGTCAAGGCGTAATCTGACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGGGTTTTAGCAAACAGAGAATCAGCCCGGCAGACAATTCGTCGTA
GGCAGGCTCTGTGCGAGGACTTGACCAAAAAGGCTTCTGATTTAGCATGGGAGAACGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTACCAATCTCTGGAG
ATTACTAACAAGGAATTGAAGGAACAGATTGCTCATGCAGAGCCCAAAATGGAGGAAATCCCAGGAAACAATAGATCATCTCATGTTCAGACTCCTCCTTTACCTACCAA
CTACCCTCTTTTCTTGTTTAGTCGCCCTCCATATGCATCGTATTTCTGGCCGTCAGTGGTTCAACCTTCAAGTCCCTATCATGACCTACACAATGTTGCAGTCGTCCCGC
CAAGTGTTCGTTCGCCTTCTAATAATACTGTTTATGTATCCGACTCTTCCCATGTACAAGAAAACTTTACAAATGTCACTGGCTTGAGAACACCCTTTTGTATCGTACCT
TGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACAGAGTTCTCAAAACTCGTGTCCAGCGGGAAATATTCAAGAGTATATTTATTCAAATTCCCAGAACAGTGC
TTATACTTCAAAGGTTGTTGTGCGTGCAGAAAGCAGACATTCTTCTTTGCCTTCAGCTGAAGAGAAAAACGAAGCTCATGACTTGAATGAAGCCCCGAGTCTAAAGGAGC
ATACTCAGAACACAGTTGGAGTAGTTGTGGATCGATTTGAAGCCGACACAAGAGATCAAGTTAGGAAAGTGCTTTCTCCTGTGAGACTTGAATGTATTGAACCCACTTCC
ACTGTCAAACAAGATAAACCGAGCGAAGATGATCGCGGTCTGTCATCAAGAACGTGTGATGACTTATGTCATTTGGCAGAAAAAAAGCATGAACCAGAGATAGTCTCATG
TAAGAAAACTATAGATGCAATGGCTGCAACTGAGGCAAGGAGGAGAAGAAAAGAACTAACAAAGTTAAAGAATCTTCACACTCGTCCTTGCCGTATGCACTTCTGA
mRNA sequenceShow/hide mRNA sequence
GTCTTCTTCTTCTTCTTCTTCTTCTTCTCTGGTTCTTATTATCCTTTGATTTTTTTTTTTTCATGGCTTCTTCTTCCAAGTGCTCCGAGGCGACCAGTTGTTCGGGTTTG
AGTTCTTCTTCTACACGTTCGTTTTCTTCCTCCTCCATGGAGGCGGATCAGATGGTCAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCAGATTTGGCGGTTTTGGCGGT
CAGAGATAGTGGAGTTCAACCGTCGGAAACCAAATGGCGGATTAAAGAGAAGAAAGGGAAACGGGCCAGGAAGGAGGTTAAGACCGAGTCGCCGACTTCTGCCTTCGTCG
ACTCTTTACCTAGTCGCGCGGATCTGGACCTTCGGATTCAGGATAGAGGGGTGATAAGTCATCATCCATCAGAAAAGGAATGCGCTGATCACTCCCATCCTGAGTGGGAA
ACAACCAAAGAGATGATTAAGGCGGAGAAGGAGGCCGAATCACCTAAACTAAGCCACCCATTATTTGGCTGCCGGAGGTCAAGGCGTAATCTGACTGAGGCTGAAAAGGA
AGAAAGGAGAATACGAAGGGTTTTAGCAAACAGAGAATCAGCCCGGCAGACAATTCGTCGTAGGCAGGCTCTGTGCGAGGACTTGACCAAAAAGGCTTCTGATTTAGCAT
GGGAGAACGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTACCAATCTCTGGAGATTACTAACAAGGAATTGAAGGAACAGATTGCTCATGCAGAGCCCAAA
ATGGAGGAAATCCCAGGAAACAATAGATCATCTCATGTTCAGACTCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCCTCCATATGCATCGTATTTCTG
GCCGTCAGTGGTTCAACCTTCAAGTCCCTATCATGACCTACACAATGTTGCAGTCGTCCCGCCAAGTGTTCGTTCGCCTTCTAATAATACTGTTTATGTATCCGACTCTT
CCCATGTACAAGAAAACTTTACAAATGTCACTGGCTTGAGAACACCCTTTTGTATCGTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACAGAGTTCT
CAAAACTCGTGTCCAGCGGGAAATATTCAAGAGTATATTTATTCAAATTCCCAGAACAGTGCTTATACTTCAAAGGTTGTTGTGCGTGCAGAAAGCAGACATTCTTCTTT
GCCTTCAGCTGAAGAGAAAAACGAAGCTCATGACTTGAATGAAGCCCCGAGTCTAAAGGAGCATACTCAGAACACAGTTGGAGTAGTTGTGGATCGATTTGAAGCCGACA
CAAGAGATCAAGTTAGGAAAGTGCTTTCTCCTGTGAGACTTGAATGTATTGAACCCACTTCCACTGTCAAACAAGATAAACCGAGCGAAGATGATCGCGGTCTGTCATCA
AGAACGTGTGATGACTTATGTCATTTGGCAGAAAAAAAGCATGAACCAGAGATAGTCTCATGTAAGAAAACTATAGATGCAATGGCTGCAACTGAGGCAAGGAGGAGAAG
AAAAGAACTAACAAAGTTAAAGAATCTTCACACTCGTCCTTGCCGTATGCACTTCTGATCTATATAGCTGGGGAGTTCAACAACTGTTTGTTGTCAACAATCTTATCTGT
GTGAAGTCCTGTATTCACTGGTTTTTGTTGGCAGAGGCTAGCACAGAGTATTGAAACATCACCATAGTTCAGGCTTTCTTTTTGAGGCGTTGTCGGTGGTTTTGTTCCGA
AGCTCGGAGAGCGACATCATGCTGCTAGTTGTAGCCAGCAGTGATTGGGAAGAAAATTTACATTTCGGGGCATTTTTTTCGCGTCTGTTCCGATGAGTTCTCTAACAAAT
GAAATGAGATGGAATGAAATGAGATGGATGGATACTGATGACTGAGTTGGATTTGGAAGGGCAGCAAAGAGGAGGAGGAGAGTGTAACAATTCTGTATTTAAATCTTCTT
ATGGCTACTTCAAAGACACCCTTTATGGGGTTGGCTACAGTTCAATAAATTGAAAAGTTGTTGTCCTATGACTTGGTGCCCATCTTTCTATATTGGTTTTTAGAGGTCAA
TCAGAACTCAATTATAAATAAATATGAACGAATGAACCGAGACCTTAAACACCAAAATTGGGCTCCCAACGTTCTGCTTAGAATCATCCCATTCAAGTGAAGCTGTCTAA
GGAACCTTCTCAGTCCTCATCCGCCGGCATCATACGGTTGTTTTTCGTTCGGCCGATCGGACTTCAACTCTCTGGCGAGACTCTCACCGGCAGACCGGCCGGCGACTTGA
CATTAGCCTTGTACATCTACGGACTGGAGCTGACATGTACATGGAGGGATAGTTGAGGGCGTCGGAGCTGGCTGGCGTACGTTTATTGGAGAAGAGGAAAGAGAGGTCGT
ATATGAGGCATAGATGCGCTGCCGTTCTCTTCTAATTTTTCTCCGATCATCGATGGCATTGCTGCAGAATTCACCGACACAAACTCAAAAAGGCATTGGATTGTCCTTAT
TAGATGAACACGATTCTACACCATAGGTCCCAGCCAATGAAGAGAGAATTCTTTTATTGTATGATATTGTCTCCTTACTTTGCTTTGGGCTTCCCCAAAAGGTCTCATCC
AATTAAGAGAGAATTCTTTTACTATATACCCATCCATGAACATTCCCTAAATTAGCTCAGTCAACGGAGAGTATTTTTTTTTTATGGAATGAGAGGTGTGACTCCC
Protein sequenceShow/hide protein sequence
MASSSKCSEATSCSGLSSSSTRSFSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQDRGVISH
HPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLE
ITNKELKEQIAHAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVP
CSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPSLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTS
TVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMHF