; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg03687 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg03687
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionBZIP transcription factor family protein
Genome locationCarg_Chr14:1277130..1280941
RNA-Seq ExpressionCarg03687
SyntenyCarg03687
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580686.1 hypothetical protein SDJN03_20688, partial [Cucurbita argyrosperma subsp. sororia]9.3e-27899.41Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
Subjt:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSA EKNEAHDLNEA 
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL
        SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

KAG7017441.1 hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma]6.9e-281100Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK
        LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

XP_022934487.1 uncharacterized protein LOC111441650 isoform X1 [Cucurbita moschata]1.6e-27799.02Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISH PSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
Subjt:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL
        SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

XP_022934488.1 uncharacterized protein LOC111441650 isoform X2 [Cucurbita moschata]6.5e-27999.22Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISH PSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK
        LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

XP_023528186.1 uncharacterized protein LOC111791175 isoform X2 [Cucurbita pepo subsp. pepo]5.1e-27698.24Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAV AVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDR VISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLH VAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSH+QENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCP GNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEA+DLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK
        LK+HTQNTVGVVVDRFE DTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

TrEMBL top hitse value%identityAlignment
A0A0A0LBD4 BZIP domain-containing protein1.1e-18672.56Show/hide
Query:  ASSSKCSEATSCSGLSSSSTRSSSSSSME---------ADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSL
        ASSSKCS+ T+ SGLSSSS+ SSSSSS           ADQMVKVEIEAAEALA LAVLAVR++G QP +TKW IK  KGKRARKEVKTESPTS F DSL
Subjt:  ASSSKCSEATSCSGLSSSSTRSSSSSSME---------ADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSL

Query:  PSRADLDLRI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKL------SHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQ
        P+RADLDLRI QDRGV+ HQPSEKEC   S PE ETT E+ K +KEAES K+      S+  FGCRRSRR LTEAEKEERRIRR+LANRESARQTIRRRQ
Subjt:  PSRADLDLRI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKL------SHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQ

Query:  ALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH
        ALCE+LT+KA+DLAWENENLKREKE+ALKEYQSLE TNKELKEQ+A+A +PK+EEIPGN+RSSHVQ PPLPTN PLFLFSR P   YFWPSVVQ +S YH
Subjt:  ALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYH

Query:  DLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIV-PCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSL
        +L NV VVP S+  P+NN   VS SS  QENFTN TG R P CI+ P SWLLPHHD RNQQS Q   PAGN QE +YS SQNSA TSK  VRAESRHSSL
Subjt:  DLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIV-PCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSL

Query:  PSAEEKNEAHDLNEAPSL------KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSC
        PSAEE+NEA DLNEAPSL      K+ TQNTVGV V+ F+ + R  VRKVLSPVRLECIEP+S    D  +EDD G+SSRTCDDLC+ AE++HEPEVV C
Subjt:  PSAEEKNEAHDLNEAPSL------KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSC

Query:  KKTIDAMAATEARRRRKELTKLKNLHTRPCRM
        KKT+DAMAATEARRRRKELTKLKNL+ R CRM
Subjt:  KKTIDAMAATEARRRRKELTKLKNLHTRPCRM

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X17.7e-27899.02Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISH PSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
Subjt:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL
        SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

A0A6J1F7T1 uncharacterized protein LOC111441650 isoform X23.1e-27999.22Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRS SSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISH PSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
Subjt:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIA AEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK
        LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

A0A6J1J476 uncharacterized protein LOC111481617 isoform X22.3e-27497.65Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEAL DLAVLAVRDSGV+PSETKWRIK KKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW
        RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKE ESPKLSHPLFGCRR RRNLTEAEKEERRIRRVLANRESARQTIRRRQ LCEDLTKKASDLAW
Subjt:  RIQDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAW

Query:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLF FSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
Subjt:  ENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS
        NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQE IYSNSQNSAYTSKVVVRAESR SSLPSAEEKNEAHDLNEAPS
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPS

Query:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK
        LK+HTQNTVGVVVDRFEADTRD+VRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKLK
Subjt:  LKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLK

Query:  NLHTRPCRMHF
        NLHTRPCRMHF
Subjt:  NLHTRPCRMHF

A0A6J1J5U4 uncharacterized protein LOC111481617 isoform X15.7e-27397.46Show/hide
Query:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL
        MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEAL DLAVLAVRDSGV+PSETKWRIK KKGKRARKEVKTESPTSAFVDSLPSRADLDL
Subjt:  MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL

Query:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA
        RI QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKE ESPKLSHPLFGCRR RRNLTEAEKEERRIRRVLANRESARQTIRRRQ LCEDLTKKASDLA
Subjt:  RI-QDRGVISHQPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLA

Query:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
        WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLF FSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP
Subjt:  WENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSP

Query:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP
        SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQE IYSNSQNSAYTSKVVVRAESR SSLPSAEEKNEAHDLNEAP
Subjt:  SNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAP

Query:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL
        SLK+HTQNTVGVVVDRFEADTRD+VRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPE+VSCKKTIDAMAATEARRRRKELTKL
Subjt:  SLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKL

Query:  KNLHTRPCRMHF
        KNLHTRPCRMHF
Subjt:  KNLHTRPCRMHF

SwissProt top hitse value%identityAlignment
A0A3B6KF13 bZIP transcription factor 1-A1.5e-0437.5Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSH
        E+E +R +R  +NR+SAR++  R+QA CE+L ++A  L  EN +LK E     KEY  L   N  LK+ +   + K +E   +N+  H
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSH

A0A3B6MPP5 bZIP transcription factor 1-D1.5e-0437.5Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSH
        E+E +R +R  +NR+SAR++  R+QA CE+L ++A  L  EN +LK E     KEY  L   N  LK+ +   + K +E   +N+  H
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSH

B6E107 bZIP transcription factor 1-B1.5e-0437.5Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSH
        E+E +R +R  +NR+SAR++  R+QA CE+L ++A  L  EN +LK E     KEY  L   N  LK+ +   + K +E   +N+  H
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPGNNRSSH

P23922 Transcription factor HBP-1a2.6e-0437Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQ--------AEPKMEEIPGNNRSSHVQTP
        E+E ++ +R L+NRESAR++  R+QA CE+L ++A  L  EN +L+ E +   KEY+ L   N  LK ++ +        A P M E    N  SH + P
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQ--------AEPKMEEIPGNNRSSHVQTP

P25032 DNA-binding protein EMBP-19.8e-0437.8Show/hide
Query:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPG
        E+E +R RR  +NRESAR++  R+Q  CE+L +K S+L   N  L+ E +   K+ +++E  NK+L  +I   + KM++  G
Subjt:  EKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNKELKEQIAQAEPKMEEIPG

Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein1.5e-5537.98Show/hide
Query:  LSSSSTRSSSSSSMEADQMV--KVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL----RIQDRGVIS
        +SSS   SSSSSS E +       E+EAAEALADLA LA+    V  S   W     KGKR RK VKTESP S   DSL    D D      + +  ++ 
Subjt:  LSSSSTRSSSSSSMEADQMV--KVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDL----RIQDRGVIS

Query:  HQPSEKECADHSHPEWETTKEMIKAEKEAESPK--LSHPLF------GCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWE
         +  E+E    +    E TK  +K+E   E+PK  L+  L       GC RSR+NL+EAE+EERRIRR+LANRESARQTIRRRQA+CE+L+KKA+DL +E
Subjt:  HQPSEKECADHSHPEWETTKEMIKAEKEAESPK--LSHPLF------GCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWE

Query:  NENLKREKELALKEYQSLEITNKELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS
        NENL+REK+ ALKE+QSLE  NK LKEQ+ ++ +P  +E   + + S V+     T  P + +++ PY  + WP V Q S+P        ++ P     S
Subjt:  NENLKREKELALKEYQSLEITNKELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPS

Query:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAG--NIQEYIYSNSQN-SAYTSKVVVRAESRHSSLPS--AEEKNEAHDL
              + ++   EN  +  G +T F +VPC W LP  DH       N  P G  + Q   +SN  +    +++ +   E+  S LP+   EE + + + 
Subjt:  NNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQQSSQNSCPAG--NIQEYIYSNSQN-SAYTSKVVVRAESRHSSLPS--AEEKNEAHDL

Query:  NEAPSLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDD--LCHLAEKKHEPEVVSCKKTIDAMAATEARRRR
             L E     +    D F             PV     +   ++K +  SE   G++        L  L EKKH            ++AA EAR+RR
Subjt:  NEAPSLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSRTCDD--LCHLAEKKHEPEVVSCKKTIDAMAATEARRRR

Query:  KELTKLKNLHTRPCRM
        KELT+LKNLH R CRM
Subjt:  KELTKLKNLHTRPCRM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCCAAGTGCTCCGAGGCGACCAGTTGTTCGGGTTTGAGTTCTTCTTCTACACGTTCGTCTTCTTCCTCCTCCATGGAGGCGGATCAGATGGTCAAGGT
TGAGATTGAGGCGGCGGAGGCTCTTGCAGATTTGGCGGTTTTGGCGGTCAGAGATAGTGGAGTTCAACCGTCGGAAACCAAATGGCGGATTAAAGAGAAGAAAGGGAAAC
GGGCCAGGAAGGAGGTTAAGACCGAGTCGCCGACTTCTGCCTTCGTCGACTCTTTACCTAGTCGCGCGGATCTGGACCTTCGGATTCAGGATAGAGGGGTGATAAGTCAT
CAGCCATCAGAAAAGGAATGCGCTGATCACTCCCATCCTGAGTGGGAAACAACCAAAGAGATGATTAAGGCGGAGAAGGAGGCCGAATCACCTAAACTAAGCCACCCATT
ATTTGGCTGCCGGAGGTCAAGGCGTAATCTGACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGGGTTTTAGCAAACAGAGAATCAGCCCGGCAGACAATTCGTCGTA
GGCAGGCTCTGTGCGAGGACTTGACCAAAAAGGCTTCTGATTTAGCATGGGAGAACGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTACCAATCTCTGGAG
ATTACTAACAAGGAATTGAAGGAACAGATTGCTCAAGCAGAGCCCAAAATGGAGGAAATCCCAGGAAACAATAGATCATCTCATGTTCAGACTCCTCCTTTACCTACCAA
CTACCCTCTTTTCTTGTTTAGTCGCCCTCCATATGCATCGTATTTCTGGCCGTCAGTGGTTCAACCTTCAAGTCCCTATCATGACCTACACAATGTTGCAGTCGTCCCTC
CAAGTGTTCGTTCGCCTTCTAATAATACTGTTTATGTATCCGACTCTTCCCATGTACAAGAAAACTTTACAAATGTCACTGGCTTGAGAACACCCTTTTGTATCGTACCT
TGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACAGAGTTCTCAAAACTCGTGTCCAGCAGGAAATATTCAAGAGTATATTTATTCAAATTCCCAGAACAGTGC
TTATACTTCAAAGGTTGTTGTGCGTGCAGAAAGCAGACATTCTTCTTTGCCTTCAGCTGAAGAGAAAAACGAAGCTCATGACTTGAATGAAGCCCCGAGTCTAAAGGAGC
ATACTCAGAACACAGTTGGAGTAGTTGTGGATCGATTTGAAGCCGACACAAGAGATCAAGTTAGGAAAGTGCTTTCTCCTGTGAGACTTGAATGTATTGAACCCACTTCC
ACTGTCAAACAAGATAAACCGAGCGAAGATGATCGCGGTCTGTCATCAAGAACGTGTGATGACTTATGTCATTTGGCAGAAAAAAAGCATGAACCAGAGGTAGTCTCATG
TAAGAAAACCATAGATGCAATGGCTGCAACTGAGGCAAGGAGGAGAAGAAAAGAACTAACAAAGTTAAAGAATCTTCACACTCGTCCTTGCCGTATGCACTTCTGA
mRNA sequenceShow/hide mRNA sequence
CTTCAGTCTTCATCTTCTTCTTCTTCTTCTTCTCTGGTTCTTATTATCCTTTGATTTTTTCTTTTTCATGGCTTCTTCTTCCAAGTGCTCCGAGGCGACCAGTTGTTCGG
GTTTGAGTTCTTCTTCTACACGTTCGTCTTCTTCCTCCTCCATGGAGGCGGATCAGATGGTCAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCAGATTTGGCGGTTTTG
GCGGTCAGAGATAGTGGAGTTCAACCGTCGGAAACCAAATGGCGGATTAAAGAGAAGAAAGGGAAACGGGCCAGGAAGGAGGTTAAGACCGAGTCGCCGACTTCTGCCTT
CGTCGACTCTTTACCTAGTCGCGCGGATCTGGACCTTCGGATTCAGGATAGAGGGGTGATAAGTCATCAGCCATCAGAAAAGGAATGCGCTGATCACTCCCATCCTGAGT
GGGAAACAACCAAAGAGATGATTAAGGCGGAGAAGGAGGCCGAATCACCTAAACTAAGCCACCCATTATTTGGCTGCCGGAGGTCAAGGCGTAATCTGACTGAGGCTGAA
AAGGAAGAAAGGAGAATACGAAGGGTTTTAGCAAACAGAGAATCAGCCCGGCAGACAATTCGTCGTAGGCAGGCTCTGTGCGAGGACTTGACCAAAAAGGCTTCTGATTT
AGCATGGGAGAACGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAAGAGTACCAATCTCTGGAGATTACTAACAAGGAATTGAAGGAACAGATTGCTCAAGCAGAGC
CCAAAATGGAGGAAATCCCAGGAAACAATAGATCATCTCATGTTCAGACTCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCCTCCATATGCATCGTAT
TTCTGGCCGTCAGTGGTTCAACCTTCAAGTCCCTATCATGACCTACACAATGTTGCAGTCGTCCCTCCAAGTGTTCGTTCGCCTTCTAATAATACTGTTTATGTATCCGA
CTCTTCCCATGTACAAGAAAACTTTACAAATGTCACTGGCTTGAGAACACCCTTTTGTATCGTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACAGA
GTTCTCAAAACTCGTGTCCAGCAGGAAATATTCAAGAGTATATTTATTCAAATTCCCAGAACAGTGCTTATACTTCAAAGGTTGTTGTGCGTGCAGAAAGCAGACATTCT
TCTTTGCCTTCAGCTGAAGAGAAAAACGAAGCTCATGACTTGAATGAAGCCCCGAGTCTAAAGGAGCATACTCAGAACACAGTTGGAGTAGTTGTGGATCGATTTGAAGC
CGACACAAGAGATCAAGTTAGGAAAGTGCTTTCTCCTGTGAGACTTGAATGTATTGAACCCACTTCCACTGTCAAACAAGATAAACCGAGCGAAGATGATCGCGGTCTGT
CATCAAGAACGTGTGATGACTTATGTCATTTGGCAGAAAAAAAGCATGAACCAGAGGTAGTCTCATGTAAGAAAACCATAGATGCAATGGCTGCAACTGAGGCAAGGAGG
AGAAGAAAAGAACTAACAAAGTTAAAGAATCTTCACACTCGTCCTTGCCGTATGCACTTCTGATCTATATAGCTGGGGAGTTCAACAACTGTTTGTTGTCAACAATCTTA
TCTGTGTGAAGTCCTGTATTCACTGGTTTTTGTTGGCAGAGGCAAGCACAGAGTATTGAAACATCACCATAGTTCAGGCTTTCTTTTTGAGGCGTTCTCGGTGGTTTTGT
TCCGAAGCTCGGAGAGCGCCGACATCATGCTGCTGCTAGTTGTAGCCAGCAGTGATGGGGAAGAAAATTTACATTTCGGGGCATTTTTTTCGCGTCTGTTCCGATGAGTT
CTCTAACAAATGAAATGAGATGGATGGATACTGATGACTGAGTTGGATTTGGAAGGGCAGCAAAGAGGAGGAGGAGGAGGAGAGTTTAACAATTCTGTATTTAAATCTTC
TTATGGCTACTTCAAAGACACCCTTTATGGGGTTGGCTACAGTTCTTCAATAAATTGAAAAGTTGTTGTCCTAGGACTTGGTGCCCATCTTTCTACATTGGTTTTTAGAG
GTCAATCAGAACTCAATTATAAATAAATATGAACGAATGAACTGAGACCTTAAACACCAAAATTGGGCTCCCAACGTTCTGTTTAGAATCATCCCATTCAAGTGAAGCTG
TCTAAAGAACCTTCTCAGTCCTCATCTGCCGGCATCAAACGGTCGTTTTTCGTTCGGCCGATCGGGACTTCAACTCTCTGGCGAGACTCTCACGGCAGACCCGCCGGTGA
CTTGACATTAGCCTTGTACATCGACGGACTGGAGCTGATGTGTACATGGAGGGATAGTTGAGGGCGTCGGAGCCGGCGTACGTTTATTGGAGAAGAGGAAAGAGAGGTCG
TATATGAGGTATAGATGCGCTGCTATTCTCTTCTAATTTTTCTCCGATCATCGA
Protein sequenceShow/hide protein sequence
MASSSKCSEATSCSGLSSSSTRSSSSSSMEADQMVKVEIEAAEALADLAVLAVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQDRGVISH
QPSEKECADHSHPEWETTKEMIKAEKEAESPKLSHPLFGCRRSRRNLTEAEKEERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLE
ITNKELKEQIAQAEPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPYHDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVP
CSWLLPHHDHRNQQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPSLKEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTS
TVKQDKPSEDDRGLSSRTCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMHF