CuGenDBv2

Tan0003586 (gene) of Snake gourd v1 genome

Gene ID: Tan0003586
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BZIP domain-containing protein
Genome location: LG04:261545..263634
RNA-Seq Expression: Tan0003586
Synteny: Tan0003586
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
  IPR004827 - Basic-leucine zipper domain
  IPR044759 - RF2-like transcription factor, bZIP domain
  IPR044797 - Uncharacterized protein At4g06598-like
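
The genome location above is written as sequence:start..end. As a minimal sketch, the snippet below splits that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
from __future__ import annotations

import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("LG04:261545..263634")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2090 bases on LG04.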


Homology
GenBank top hits | e value | %identity | Alignment
KAG6601617.1 hypothetical protein SDJN03_06850, partial [Cucurbita argyrosperma subsp. sororia] | e value: 9.8e-168 | %identity: 86.86
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRN++YSGKH LLPPKSPF SGSSSY ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NV N+NYTQDDSQCKNM LPSWAS DFD RKDP QASF MKAS IKQKNR REL PTT TT   +R SAKSSILLESSR +ST QE NGFSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EKQDSAET + D KSSE++D PH+KPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDSGRSSPNLV
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGP+S    P L+
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDSGRSSPNLV

KAG7032375.1 hypothetical protein SDJN02_06420 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.2e-165 | %identity: 87.95
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRN++YSGKH LLPPKSPF SGSSSY ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NV N+NYTQDDSQCKNM LPSWAS DFD RKDP QASF MKAS IKQKNR REL PTT TT   +  SAKSSILLESSR +ST QE NGFSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EKQDSAET + D KSSE++D PH+KPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGP+S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

XP_022956708.1 uncharacterized protein At4g06598-like [Cucurbita moschata] | e value: 4.6e-165 | %identity: 87.67
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRN++YSGKH LLPPKSPF SGSS Y ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NV N+NYTQDDSQCKNM LPSWAS DFD RKDP QASF MKAS IKQKNR REL PTT TT   +  SAKSSILLESSR +ST QE NGFSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EKQDSAET + D KSSE++D PH+KPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGP+S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

XP_022998117.1 uncharacterized protein At4g06598-like [Cucurbita maxima] | e value: 1.7e-167 | %identity: 88.49
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRN++YSGKH LLPPKSPF SGSSSY ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NV N+NYTQDDSQCKNM LPSWAS DFD RKDP QASF MKAS IKQKNR REL PTT TTN  +R SAKSSILLESSR +ST QE NGFSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EKQDSAET + D KSSE++D PH+KPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGP+S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

XP_023524171.1 uncharacterized protein At4g06598-like [Cucurbita pepo subsp. pepo] | e value: 5.0e-164 | %identity: 87.7
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSN RN++YSGKH LLPPKSPF SGSS Y ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NV N+NYTQDDSQCKNM LPSWAS DFD RKDP QASF MKAS IKQKNR  EL P T TTN  +R SAKSSILLESSR +ST QE NGFSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAET-GLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESL
        T EKQDSAET  + D KSSERID PH+KPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE+L
Subjt:  TIEKQDSAET-GLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESL

Query:  SQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        SQEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGP+S
Subjt:  SQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

TrEMBL top hits | e value | %identity | Alignment
A0A1S3BEP8 uncharacterized protein At4g06598 isoform X1 | e value: 8.4e-157 | %identity: 84.38
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRNM+ SGKH LLPPKSP  SGSS+YS +Y PNPI+GSRAVQNPR GNV HHRTSSES LMEEQPSWLDDLL+EPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NVSN+NYTQDDSQCKNM LPSWAS DFDS     QAS YMK SW KQKNRTREL PTT TTNP  R SAK+SILLES R +ST+QE N FSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EK DSAET L D K SER+DS HVKP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLS+QNLILGMENKALKQRLESLS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD RSG +S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

A0A1S3BEV4 uncharacterized protein At4g06598 isoform X2 | e value: 8.4e-157 | %identity: 84.38
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRNM+ SGKH LLPPKSP  SGSS+YS +Y PNPI+GSRAVQNPR GNV HHRTSSES LMEEQPSWLDDLL+EPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NVSN+NYTQDDSQCKNM LPSWAS DFDS     QAS YMK SW KQKNRTREL PTT TTNP  R SAK+SILLES R +ST+QE N FSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EK DSAET L D K SER+DS HVKP P DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQA GSEVSAELEFLS+QNLILGMENKALKQRLESLS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLE+EIGRLRMLYQQQQQPQPPPS+LKRTKSRDLETQF+KLSLRQKD RSG +S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

A0A6J1DEI2 uncharacterized protein At4g06598 | e value: 4.2e-156 | %identity: 83.56
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSK L NMRN+MYSGKH LLPPKSPF SGSSS   DYFPNPIIGSRAVQNPREGNV HHRTSS S LMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDAVNVSN  YTQDDS+CKN+SLPSWAS DF+ RKD  Q SF+M+ASWIKQKNRTREL PTT   N     SAKSSILLESSRP+ST  E NG SST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EKQDSAETGLQD K ++RI+SP+VKP   DTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLIL MENKALKQRLESLS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLEREIGRLR+LYQ+QQQPQ PPSSLKR+KSRDLETQF  LSLRQ DG +G +S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

A0A6J1GXW7 uncharacterized protein At4g06598-like | e value: 2.2e-165 | %identity: 87.67
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRN++YSGKH LLPPKSPF SGSS Y ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NV N+NYTQDDSQCKNM LPSWAS DFD RKDP QASF MKAS IKQKNR REL PTT TT   +  SAKSSILLESSR +ST QE NGFSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EKQDSAET + D KSSE++D PH+KPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGP+S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

A0A6J1K701 uncharacterized protein At4g06598-like | e value: 8.1e-168 | %identity: 88.49
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MENSKVLSNMRN++YSGKH LLPPKSPF SGSSSY ADYFP+PIIGSRAVQNPREGNV HHRTSSES LME+QPSWL+DLLDEPETPVQRGGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
        FAYLDA NV N+NYTQDDSQCKNM LPSWAS DFD RKDP QASF MKAS IKQKNR REL PTT TTN  +R SAKSSILLESSR +ST QE NGFSST
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
        T EKQDSAET + D KSSE++D PH+KPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLE+LS
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        QEQLIKY EHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGP+S
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

SwissProt top hits | e value | %identity | Alignment
F4IN23 Basic leucine zipper 34 | e value: 3.8e-21 | %identity: 34.67
Query:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFY-----MKASWIKQKNRTRELLPTTF
        + PSW+D+ LD   +  +RG HRRS SDS A+L+A  VS ++                  H FD   D Q  S +     + ++     N+   + PT  
Subjt:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFY-----MKASWIKQKNRTRELLPTTF

Query:  TTNPCARTSAKSSILLESSRPISTSQE------TNGF-----SSTTIEKQD--SAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQY
        ++N    TS  S+   + ++ +  S         N +     S   +E +D  ++     DS  +  +D   VK          A +Q AQRSRVRKLQY
Subjt:  TTNPCARTSAKSSILLESSRPISTSQE------TNGF-----SSTTIEKQD--SAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQY

Query:  IAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ
        I+ELER+V +LQAE S +S  + FL  Q L+L ++N ALKQR+ +LSQ++L K    E L+REI RLR +Y QQ
Subjt:  IAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ

Q5JMK6 Basic leucine zipper 6 | e value: 1.5e-14 | %identity: 54.44
Query:  AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ
        A +Q AQRSRVRKLQYI+ELER+V  LQ E S +S  + FL QQ  IL + N  LKQR+ +L+Q+++ K    E L +EI RLR +YQQQ
Subjt:  AKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ

Q5QNI5 Basic leucine zipper 2 | e value: 1.5e-17 | %identity: 32.01
Query:  PREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWI
        P  G  Q      +    + QPSW+D+ LD   T  +RG HRRS SDS A+LD V+  N                   +HDFD   D Q  S +      
Subjt:  PREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWI

Query:  KQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSSTTIEKQDSAETGLQDSKSSERIDSPHVKPA-PADTDNKRAK-----QQFAQRS
             + +L P      P A              P +++   +  +S   EKQD  ET   D   SE   +   +PA PA  D KR K     +Q AQRS
Subjt:  KQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSSTTIEKQDSAETGLQDSKSSERIDSPHVKPA-PADTDNKRAK-----QQFAQRS

Query:  RVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLE
        RVRKLQYI+ELER+V +LQ E S +S  + FL  Q  +L + N  LKQR+ +L+Q+++ K    E  +RE        + Q++  P        + R   
Subjt:  RVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLE

Query:  TQFAKLSLRQKDGRSGPDSGRSSPNLVL
                RQ+ GR     GR+ P LV+
Subjt:  TQFAKLSLRQKDGRSGPDSGRSSPNLVL

Q8W3M7 Uncharacterized protein At4g06598 | e value: 1.2e-46 | %identity: 49.09
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        M +SK   N RN+  +GK  LLPPKSPF +G  ++SAD+ P+ +IGS+AVQ   EGN  HHRTSSES L+EEQPSWLDDLL+EPETPV++GGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKD----PQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNG
        FAY+D     + +YT  D    N +   +++H    ++      Q   FY  A   KQK R  + LP     +  AR ++ S  L  SS  I+ S  +  
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKD----PQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNG

Query:  FSSTTIEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ
           T  EK  SA    +D     +KSS E+ D+P  K A ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ LQ
Subjt:  FSSTTIEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQ

Q9M2K4 Basic leucine zipper 61 | e value: 7.4e-17 | %identity: 31.62
Query:  EEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLD--AVNVSNKNYTQ-DDSQCKNM--------SLPSWASHDFDSRKDPQQASFYMKA----SWIKQK
        ++ PSW+D+ LD   T  +RG HRRS SDS A+L+  +  V N ++ + DD Q  +M        +      H  +    P ++S         + +   
Subjt:  EEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLD--AVNVSNKNYTQ-DDSQCKNM--------SLPSWASHDFDSRKDPQQASFYMKA----SWIKQK

Query:  NRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSSTTIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIA
        +  +E  P+    +     + +++    +    + S E      T  +   SA      S  +   D   VK          A +Q AQRSRVRKLQYI+
Subjt:  NRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSSTTIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIA

Query:  ELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ
        ELER+V +LQ E S +S  + FL  Q L+L ++N A+KQR+ +L+Q+++ K    E L+REI RLR +Y QQ
Subjt:  ELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ

Arabidopsis top hits | e value | %identity | Alignment
AT1G35490.1 bZIP family transcription factor | e value: 1.3e-29 | %identity: 31.87
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        MEN + LSN  N  + G+     P+    +  S       PN              N+ HH  S +    E+QP+WLD+LL EP +P    GHRRS+SD+
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST
         AYL++  + +K      S  +  +   W S+ ++                  Q N+      T   TN   + +     L  SS+P             
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSST

Query:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS
         IEK  S         +S + D P  K     TD+KR K Q A R+R+R+L+YI++LER +Q LQ EG E+S+ + +L QQ L+L MEN+ALKQR++SL+
Subjt:  TIEKQDSAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLS

Query:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DLETQFAKLSL
        + Q +K+ E ++LEREIG L+   + QQQPQ     ++  ++R          + + QFA L++
Subjt:  QEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSR----------DLETQFAKLSL

AT1G58110.1 Basic-leucine zipper (bZIP) transcription factor family protein | e value: 4.8e-80 | %identity: 53.48
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD
        M +SK   ++RN+MY GKH LLPPK PF S S+SYS +Y P  +IGSR  Q        H RTSSESHL+EE P WLDDLL+E PE+P ++ GHRRSSSD
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD

Query:  SFAYLDAVNVSNKNYT-QDDSQCKNMSLPSWAS-HDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTN---PCARTSAKSSILLESSRPISTSQET
        S+AYLD  N +N + T Q+D   +N  L +     + D  K+ Q A+FY  AS++KQK+R R+ L  T       P AR +     L      +  SQ+ 
Subjt:  SFAYLDAVNVSNKNYT-QDDSQCKNMSLPSWAS-HDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTN---PCARTSAKSSILLESSRPISTSQET

Query:  NGFSSTTIEKQDSAETGLQDSK--SSERIDSPHVKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKA
           SS   E+++ AE    D K  SSE  +S +  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQAEGS+VSAEL+FL+Q+NLIL MENKA
Subjt:  NGFSSTTIEKQDSAETGLQDSK--SSERIDSPHVKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKA

Query:  LKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        LK+RLES++QE+LIK  E EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD     DS
Subjt:  LKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

AT1G58110.2 Basic-leucine zipper (bZIP) transcription factor family protein | e value: 4.8e-80 | %identity: 53.48
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD
        M +SK   ++RN+MY GKH LLPPK PF S S+SYS +Y P  +IGSR  Q        H RTSSESHL+EE P WLDDLL+E PE+P ++ GHRRSSSD
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDE-PETPVQRGGHRRSSSD

Query:  SFAYLDAVNVSNKNYT-QDDSQCKNMSLPSWAS-HDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTN---PCARTSAKSSILLESSRPISTSQET
        S+AYLD  N +N + T Q+D   +N  L +     + D  K+ Q A+FY  AS++KQK+R R+ L  T       P AR +     L      +  SQ+ 
Subjt:  SFAYLDAVNVSNKNYT-QDDSQCKNMSLPSWAS-HDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTN---PCARTSAKSSILLESSRPISTSQET

Query:  NGFSSTTIEKQDSAETGLQDSK--SSERIDSPHVKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKA
           SS   E+++ AE    D K  SSE  +S +  P   + DN KRAKQQFAQRSRVRKLQYI+ELERNVQ LQAEGS+VSAEL+FL+Q+NLIL MENKA
Subjt:  NGFSSTTIEKQDSAETGLQDSK--SSERIDSPHVKPAPADTDN-KRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKA

Query:  LKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS
        LK+RLES++QE+LIK  E EVLE+EIGRLR LYQQQQQ Q P +S  R  S+DL++QF+ LSL  KD     DS
Subjt:  LKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDS

AT2G42380.2 Basic-leucine zipper (bZIP) transcription factor family protein | e value: 2.7e-22 | %identity: 34.67
Query:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFY-----MKASWIKQKNRTRELLPTTF
        + PSW+D+ LD   +  +RG HRRS SDS A+L+A  VS ++                  H FD   D Q  S +     + ++     N+   + PT  
Subjt:  EQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFY-----MKASWIKQKNRTRELLPTTF

Query:  TTNPCARTSAKSSILLESSRPISTSQE------TNGF-----SSTTIEKQD--SAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQY
        ++N    TS  S+   + ++ +  S         N +     S   +E +D  ++     DS  +  +D   VK          A +Q AQRSRVRKLQY
Subjt:  TTNPCARTSAKSSILLESSRPISTSQE------TNGF-----SSTTIEKQD--SAETGLQDSKSSERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQY

Query:  IAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ
        I+ELER+V +LQAE S +S  + FL  Q L+L ++N ALKQR+ +LSQ++L K    E L+REI RLR +Y QQ
Subjt:  IAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQ

AT4G06598.1 BEST Arabidopsis thaliana protein match is: Basic-leucine zipper (bZIP) transcription factor family protein (TAIR:AT1G58110.2) | e value: 6.1e-67 | %identity: 49.2
Query:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS
        M +SK   N RN+  +GK  LLPPKSPF +G  ++SAD+ P+ +IGS+AVQ   EGN  HHRTSSES L+EEQPSWLDDLL+EPETPV++GGHRRSSSDS
Subjt:  MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDS

Query:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKD----PQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNG
        FAY+D     + +YT  D    N +   +++H    ++      Q   FY  A   KQK R  + LP     +  AR ++ S  L  SS  I+ S  +  
Subjt:  FAYLDAVNVSNKNYTQDDSQCKNMSLPSWASHDFDSRKD----PQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNG

Query:  FSSTTIEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENK
           T  EK  SA    +D     +KSS E+ D+P  K A ++ D KRA+QQFAQRSRVRK+QYIAELERNVQ L                       ENK
Subjt:  FSSTTIEKQDSAETGLQD-----SKSS-ERIDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENK

Query:  ALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR
        +LK RLESL+QEQLIKY EH+VLE+EI RLR LYQ QQQ +P           SS +R+KSRDLETQF  LSLR
Subjt:  ALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQPQPP---------PSSLKRTKSRDLETQFAKLSLR


Sequences
CDS sequence
ATGGAGAATTCAAAGGTGCTGTCAAACATGAGAAATATGATGTACTCTGGAAAGCATCCTCTACTTCCTCCCAAGAGTCCATTTCATAGTGGTTCCTCTTCATATTCTGC
TGATTATTTCCCCAATCCCATTATTGGTTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAATGTGCAACATCATCGAACATCATCTGAAAGTCATCTGATGGAGGAGCAAC
CATCTTGGCTCGATGATCTCCTTGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCAAGTGATTCCTTTGCTTACTTAGACGCCGTGAATGTTTCA
AATAAAAATTATACACAAGATGACTCCCAATGTAAAAATATGTCTTTACCTTCCTGGGCATCACACGACTTTGATTCCCGCAAAGATCCACAACAAGCTTCATTTTATAT
GAAAGCAAGCTGGATCAAACAGAAGAACAGGACACGGGAATTGCTTCCAACTACATTTACTACTAACCCATGTGCCCGCACTTCTGCTAAGAGTAGCATTCTTCTTGAAA
GCTCAAGGCCGATAAGTACATCACAGGAAACAAATGGGTTTTCATCAACAACTATTGAAAAGCAGGATTCAGCAGAAACTGGTCTGCAGGATTCAAAGTCATCTGAGAGA
ATTGATAGTCCCCATGTTAAGCCAGCTCCAGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCTCGTGTGCGGAAACTTCAGTACATTGCAGAGCT
AGAAAGAAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCGCTTAAAC
AGCGATTAGAAAGTTTATCTCAAGAGCAGCTTATAAAATACTTTGAGCATGAAGTACTGGAGAGGGAAATTGGAAGACTACGAATGCTGTATCAGCAGCAGCAGCAGCCA
CAGCCACCACCTTCCAGCCTAAAACGCACCAAGAGCAGAGACCTTGAGACGCAGTTTGCAAAGCTCTCATTGAGACAGAAGGATGGGCGTTCAGGTCCCGACTCTGGCCG
CTCCAGTCCAAATCTAGTTTTGTAA
mRNA sequence
ATGGAGAATTCAAAGGTGCTGTCAAACATGAGAAATATGATGTACTCTGGAAAGCATCCTCTACTTCCTCCCAAGAGTCCATTTCATAGTGGTTCCTCTTCATATTCTGC
TGATTATTTCCCCAATCCCATTATTGGTTCAAGAGCAGTGCAGAATCCCAGAGAGGGAAATGTGCAACATCATCGAACATCATCTGAAAGTCATCTGATGGAGGAGCAAC
CATCTTGGCTCGATGATCTCCTTGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCAAGTGATTCCTTTGCTTACTTAGACGCCGTGAATGTTTCA
AATAAAAATTATACACAAGATGACTCCCAATGTAAAAATATGTCTTTACCTTCCTGGGCATCACACGACTTTGATTCCCGCAAAGATCCACAACAAGCTTCATTTTATAT
GAAAGCAAGCTGGATCAAACAGAAGAACAGGACACGGGAATTGCTTCCAACTACATTTACTACTAACCCATGTGCCCGCACTTCTGCTAAGAGTAGCATTCTTCTTGAAA
GCTCAAGGCCGATAAGTACATCACAGGAAACAAATGGGTTTTCATCAACAACTATTGAAAAGCAGGATTCAGCAGAAACTGGTCTGCAGGATTCAAAGTCATCTGAGAGA
ATTGATAGTCCCCATGTTAAGCCAGCTCCAGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCTCGTGTGCGGAAACTTCAGTACATTGCAGAGCT
AGAAAGAAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTCAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCGCTTAAAC
AGCGATTAGAAAGTTTATCTCAAGAGCAGCTTATAAAATACTTTGAGCATGAAGTACTGGAGAGGGAAATTGGAAGACTACGAATGCTGTATCAGCAGCAGCAGCAGCCA
CAGCCACCACCTTCCAGCCTAAAACGCACCAAGAGCAGAGACCTTGAGACGCAGTTTGCAAAGCTCTCATTGAGACAGAAGGATGGGCGTTCAGGTCCCGACTCTGGCCG
CTCCAGTCCAAATCTAGTTTTGTAA
Protein sequence
MENSKVLSNMRNMMYSGKHPLLPPKSPFHSGSSSYSADYFPNPIIGSRAVQNPREGNVQHHRTSSESHLMEEQPSWLDDLLDEPETPVQRGGHRRSSSDSFAYLDAVNVS
NKNYTQDDSQCKNMSLPSWASHDFDSRKDPQQASFYMKASWIKQKNRTRELLPTTFTTNPCARTSAKSSILLESSRPISTSQETNGFSSTTIEKQDSAETGLQDSKSSER
IDSPHVKPAPADTDNKRAKQQFAQRSRVRKLQYIAELERNVQALQAEGSEVSAELEFLSQQNLILGMENKALKQRLESLSQEQLIKYFEHEVLEREIGRLRMLYQQQQQP
QPPPSSLKRTKSRDLETQFAKLSLRQKDGRSGPDSGRSSPNLVL
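
The protein sequence above should be the translation of the CDS. As a minimal sketch of that check, the snippet below translates the first and last few codons of the CDS with the standard genetic code. It assumes the standard nuclear code applies and that the CDS starts in frame at its first base; `translate` and `CODON_TABLE` are names introduced here for illustration only.

```python
from itertools import product

# Standard genetic code in the usual compact TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 10 codons of the CDS listed above.
cds_start = "ATGGAGAATTCAAAGGTGCTGTCAAACATGAGAAATATG"
cds_end = "GGCCGCTCCAGTCCAAATCTAGTTTTGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Both fragments agree with the start (MENSKVLSNMRNM) and end (GRSSPNLVL) of the listed protein sequence. Pasting the full CDS into `translate` would extend the same check to the whole sequence.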