; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013972 (gene) of Snake gourd v1 genome

Gene IDTan0013972
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBZIP transcription factor family protein
Genome locationLG08:69238977..69243437
RNA-Seq ExpressionTan0013972
SyntenyTan0013972
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017441.1 hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma]1.6e-21177.13Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR
        MASSSKCSE TSCS LSSSS+ S     SSSSSM ADQMVKVE EAAEALADLA LAVR+SG +PSETKW  ++ KGKR RKEVK E P  AFVDSLPSR
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR

Query:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC
         DLDLRIQDRGV+SHQP EKEC + SHP+WE+T++M+KA+K EA+S K+      S+PLFGCRRSRRNLTEAEKEERRIRR+LANRESARQTIRRRQALC
Subjt:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC

Query:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH
        E+LTK+A+DLAWENENL+REKELAL+EYQSLE TNKELKEQ+A+A +PK+EEIPGNNRS HVQ PP PTNYPLFLFSRPPYASYFWPS+VQPSSPYH+LH
Subjt:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH

Query:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE
        NV VVP S+  P NN V VSDSSHVQENFTNV GLRTPFCI+PCSWLLPHHDHRN+ S   SCP GN QE IY NSQN  YTSKVVV +ESRHSSLPSAE
Subjt:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE

Query:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI
        EK EA DLNEAP+L      K+ TQNTVGVVV+ F+AD R QVRK+LSPVRLECIEPTS VKQD  SE+D GLSSRT DD C  AEKKHEPE+  CKKTI
Subjt:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI

Query:  DAMAAAEARKRRKELTKLKNLHARQCRMH
        DAMAA EAR+RRKELTKLKNLH R CRMH
Subjt:  DAMAAAEARKRRKELTKLKNLHARQCRMH

XP_022934488.1 uncharacterized protein LOC111441650 isoform X2 [Cucurbita moschata]3.9e-21076.94Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR
        MASSSKCSE TSCS LSSSS+ S      SSSSM ADQMVKVE EAAEALADLA LAVR+SG +PSETKW  ++ KGKR RKEVK E P  AFVDSLPSR
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR

Query:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC
         DLDLRIQDRGV+SH P EKEC + SHP+WE+T++M+KA+K EA+S K+      S+PLFGCRRSRRNLTEAEKEERRIRR+LANRESARQTIRRRQALC
Subjt:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC

Query:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH
        E+LTK+A+DLAWENENL+REKELAL+EYQSLE TNKELKEQ+A A +PK+EEIPGNNRS HVQ PP PTNYPLFLFSRPPYASYFWPS+VQPSSPYH+LH
Subjt:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH

Query:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE
        NV VVP S+  P NN V VSDSSHVQENFTNV GLRTPFCI+PCSWLLPHHDHRN+ S   SCP GN QE IY NSQN  YTSKVVV +ESRHSSLPSAE
Subjt:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE

Query:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI
        EK EA DLNEAP+L      K+ TQNTVGVVV+ F+AD R QVRK+LSPVRLECIEPTS VKQD  SE+D GLSSRT DD C  AEKKHEPEI  CKKTI
Subjt:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI

Query:  DAMAAAEARKRRKELTKLKNLHARQCRMH
        DAMAA EAR+RRKELTKLKNLH R CRMH
Subjt:  DAMAAAEARKRRKELTKLKNLHARQCRMH

XP_022982915.1 uncharacterized protein LOC111481617 isoform X2 [Cucurbita maxima]4.3e-20976.56Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKG-KGKRVRKEVKAELPICAFVDSLPSR
        MASSSKCSE TSCS LSSSS+ S     SSSSSM ADQMVKVE EAAEAL DLA LAVR+SG EPSETKW  KG KGKR RKEVK E P  AFVDSLPSR
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKG-KGKRVRKEVKAELPICAFVDSLPSR

Query:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC
         DLDLRIQDRGV+SHQP EKEC + SHP+WE+T++M+KA+K E +S K+      S+PLFGCRR RRNLTEAEKEERRIRR+LANRESARQTIRRRQ LC
Subjt:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC

Query:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH
        E+LTK+A+DLAWENENL+REKELAL+EYQSLE TNKELKEQ+A+A +PK+EEIPGNNRS HVQ PP PTNYPLF FSRPPYASYFWPS+VQPSSPYH+LH
Subjt:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH

Query:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE
        NV VVP S+  P NN V VSDSSHVQENFTNV GLRTPFCI+PCSWLLPHHDHRN+ S   SCP GN QE IY NSQN  YTSKVVV +ESR SSLPSAE
Subjt:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE

Query:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI
        EK EA DLNEAP+L      KD TQNTVGVVV+ F+AD R +VRK+LSPVRLECIEPTS VKQD  SE+D GLSSRT DD C  AEKKHEPE+  CKKTI
Subjt:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI

Query:  DAMAAAEARKRRKELTKLKNLHARQCRMH
        DAMAA EAR+RRKELTKLKNLH R CRMH
Subjt:  DAMAAAEARKRRKELTKLKNLHARQCRMH

XP_038904850.1 uncharacterized protein LOC120091090 isoform X1 [Benincasa hispida]1.1e-20977.26Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLPSRV
        MASSSKCS GTSCS LSSSSSSS        SS AADQMVKVE EAAEALA LA LAVRE+G +P ETKWG KGKGKR RKEVK ELP   F DSLPS  
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLPSRV

Query:  DLDLRI--QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQAL
        DLDLRI  QDRGVV HQP EKECTNQSHP+WE+T +++KADK EA+S KVSP CTTSY LFGCRRSRRNLTEAEKEERR+RRILANRESARQTIRRRQAL
Subjt:  DLDLRI--QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQAL

Query:  CEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHEL
        CEELTK+AADLAWENENL+REKELAL+EYQ+LETTN ELKEQ+AEAVKPKV EIPGNNRS HVQMPP PTNYPLFL SR P   YFWPS+VQP++PYH+L
Subjt:  CEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHEL

Query:  HNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCIL-PCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPS
         NVVVVPSSI++P NNNVSVS SSHVQENF +V G RTP CIL PCSWLLPHHD RN+ +P +  P GN+QE IY  SQ+   TSK VVH+ESR  SLPS
Subjt:  HNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCIL-PCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPS

Query:  AEEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKK
        AEE+ EAPDLNEAPNLNKAS PKD TQNTVGV V+GFD + RAQVRK+LSPVRLECIEP+ AVKQDN SE+DH L S+T DD CDFAE++HEPEI  CKK
Subjt:  AEEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKK

Query:  TIDAMAAAEARKRRKELTKLKNLHARQCRMHS
        TIDAMAA EAR+RRKELTKLKNL+ RQCRM S
Subjt:  TIDAMAAAEARKRRKELTKLKNLHARQCRMHS

XP_038904851.1 uncharacterized protein LOC120091090 isoform X2 [Benincasa hispida]8.6e-21077.4Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLPSRV
        MASSSKCS GTSCS LSSSSSSS        SS AADQMVKVE EAAEALA LA LAVRE+G +P ETKWG KGKGKR RKEVK ELP   F DSLPS  
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLPSRV

Query:  DLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC
        DLDLRI QDRGVV HQP EKECTNQSHP+WE+T +++KADK EA+S KVSP CTTSY LFGCRRSRRNLTEAEKEERR+RRILANRESARQTIRRRQALC
Subjt:  DLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC

Query:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH
        EELTK+AADLAWENENL+REKELAL+EYQ+LETTN ELKEQ+AEAVKPKV EIPGNNRS HVQMPP PTNYPLFL SR P   YFWPS+VQP++PYH+L 
Subjt:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH

Query:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCIL-PCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSA
        NVVVVPSSI++P NNNVSVS SSHVQENF +V G RTP CIL PCSWLLPHHD RN+ +P +  P GN+QE IY  SQ+   TSK VVH+ESR  SLPSA
Subjt:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCIL-PCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSA

Query:  EEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKT
        EE+ EAPDLNEAPNLNKAS PKD TQNTVGV V+GFD + RAQVRK+LSPVRLECIEP+ AVKQDN SE+DH L S+T DD CDFAE++HEPEI  CKKT
Subjt:  EEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKT

Query:  IDAMAAAEARKRRKELTKLKNLHARQCRMHS
        IDAMAA EAR+RRKELTKLKNL+ RQCRM S
Subjt:  IDAMAAAEARKRRKELTKLKNLHARQCRMHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBD4 BZIP domain-containing protein4.6e-20174.72Show/hide
Query:  ASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSS----MAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLP
        ASSSKCS+GT+ S LSSSSSSSS+ S SSS S     AADQMVKVE EAAEALA LA LAVRE+G +P +TKWG KGKGKR RKEVK E P   F DSLP
Subjt:  ASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSS----MAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLP

Query:  SRVDLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQ
        +R DLDLRI QDRGVV HQP EKECT QS P+ E+T ++ K DK EA+S KVSP CTTSY  FGCRRSRR LTEAEKEERRIRRILANRESARQTIRRRQ
Subjt:  SRVDLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQ

Query:  ALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYH
        ALCEELT++AADLAWENENL+REKE+AL+EYQSLETTNKELKEQ+AEAVKPKVEEIPGN+RS HVQMPP PTN PLFLFSR P   YFWPS+VQ +S YH
Subjt:  ALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYH

Query:  ELHNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCIL-PCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSL
        EL NVVVVPSSI+ P NNN SVS SS  QENFTN  G R P CIL P SWLLPHHD RN+ SP +  P GNDQEG+Y  SQN   TSK  V +ESRHSSL
Subjt:  ELHNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCIL-PCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSL

Query:  PSAEEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPC
        PSAEE+ EAPDLNEAP+L+++S+PKD TQNTVGV VEGFD +ARA VRK+LSPVRLECIEP+SA   DN +E+DHG+SSRT DD C FAE++HEPE+ PC
Subjt:  PSAEEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPC

Query:  KKTIDAMAAAEARKRRKELTKLKNLHARQCRMHS
        KKT+DAMAA EAR+RRKELTKLKNL+ARQCRM S
Subjt:  KKTIDAMAAAEARKRRKELTKLKNLHARQCRMHS

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X14.6e-20976.79Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR
        MASSSKCSE TSCS LSSSS+ S      SSSSM ADQMVKVE EAAEALADLA LAVR+SG +PSETKW  ++ KGKR RKEVK E P  AFVDSLPSR
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR

Query:  VDLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQAL
         DLDLRI QDRGV+SH P EKEC + SHP+WE+T++M+KA+K EA+S K+      S+PLFGCRRSRRNLTEAEKEERRIRR+LANRESARQTIRRRQAL
Subjt:  VDLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQAL

Query:  CEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHEL
        CE+LTK+A+DLAWENENL+REKELAL+EYQSLE TNKELKEQ+A A +PK+EEIPGNNRS HVQ PP PTNYPLFLFSRPPYASYFWPS+VQPSSPYH+L
Subjt:  CEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHEL

Query:  HNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSA
        HNV VVP S+  P NN V VSDSSHVQENFTNV GLRTPFCI+PCSWLLPHHDHRN+ S   SCP GN QE IY NSQN  YTSKVVV +ESRHSSLPSA
Subjt:  HNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSA

Query:  EEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKT
        EEK EA DLNEAP+L      K+ TQNTVGVVV+ F+AD R QVRK+LSPVRLECIEPTS VKQD  SE+D GLSSRT DD C  AEKKHEPEI  CKKT
Subjt:  EEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKT

Query:  IDAMAAAEARKRRKELTKLKNLHARQCRMH
        IDAMAA EAR+RRKELTKLKNLH R CRMH
Subjt:  IDAMAAAEARKRRKELTKLKNLHARQCRMH

A0A6J1F7T1 uncharacterized protein LOC111441650 isoform X21.9e-21076.94Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR
        MASSSKCSE TSCS LSSSS+ S      SSSSM ADQMVKVE EAAEALADLA LAVR+SG +PSETKW  ++ KGKR RKEVK E P  AFVDSLPSR
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWG-RKGKGKRVRKEVKAELPICAFVDSLPSR

Query:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC
         DLDLRIQDRGV+SH P EKEC + SHP+WE+T++M+KA+K EA+S K+      S+PLFGCRRSRRNLTEAEKEERRIRR+LANRESARQTIRRRQALC
Subjt:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC

Query:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH
        E+LTK+A+DLAWENENL+REKELAL+EYQSLE TNKELKEQ+A A +PK+EEIPGNNRS HVQ PP PTNYPLFLFSRPPYASYFWPS+VQPSSPYH+LH
Subjt:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH

Query:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE
        NV VVP S+  P NN V VSDSSHVQENFTNV GLRTPFCI+PCSWLLPHHDHRN+ S   SCP GN QE IY NSQN  YTSKVVV +ESRHSSLPSAE
Subjt:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE

Query:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI
        EK EA DLNEAP+L      K+ TQNTVGVVV+ F+AD R QVRK+LSPVRLECIEPTS VKQD  SE+D GLSSRT DD C  AEKKHEPEI  CKKTI
Subjt:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI

Query:  DAMAAAEARKRRKELTKLKNLHARQCRMH
        DAMAA EAR+RRKELTKLKNLH R CRMH
Subjt:  DAMAAAEARKRRKELTKLKNLHARQCRMH

A0A6J1J476 uncharacterized protein LOC111481617 isoform X22.1e-20976.56Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKG-KGKRVRKEVKAELPICAFVDSLPSR
        MASSSKCSE TSCS LSSSS+ S     SSSSSM ADQMVKVE EAAEAL DLA LAVR+SG EPSETKW  KG KGKR RKEVK E P  AFVDSLPSR
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKG-KGKRVRKEVKAELPICAFVDSLPSR

Query:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC
         DLDLRIQDRGV+SHQP EKEC + SHP+WE+T++M+KA+K E +S K+      S+PLFGCRR RRNLTEAEKEERRIRR+LANRESARQTIRRRQ LC
Subjt:  VDLDLRIQDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALC

Query:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH
        E+LTK+A+DLAWENENL+REKELAL+EYQSLE TNKELKEQ+A+A +PK+EEIPGNNRS HVQ PP PTNYPLF FSRPPYASYFWPS+VQPSSPYH+LH
Subjt:  EELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELH

Query:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE
        NV VVP S+  P NN V VSDSSHVQENFTNV GLRTPFCI+PCSWLLPHHDHRN+ S   SCP GN QE IY NSQN  YTSKVVV +ESR SSLPSAE
Subjt:  NVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAE

Query:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI
        EK EA DLNEAP+L      KD TQNTVGVVV+ F+AD R +VRK+LSPVRLECIEPTS VKQD  SE+D GLSSRT DD C  AEKKHEPE+  CKKTI
Subjt:  EKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTI

Query:  DAMAAAEARKRRKELTKLKNLHARQCRMH
        DAMAA EAR+RRKELTKLKNLH R CRMH
Subjt:  DAMAAAEARKRRKELTKLKNLHARQCRMH

A0A6J1J5U4 uncharacterized protein LOC111481617 isoform X15.1e-20876.42Show/hide
Query:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKG-KGKRVRKEVKAELPICAFVDSLPSR
        MASSSKCSE TSCS LSSSS+ S     SSSSSM ADQMVKVE EAAEAL DLA LAVR+SG EPSETKW  KG KGKR RKEVK E P  AFVDSLPSR
Subjt:  MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKG-KGKRVRKEVKAELPICAFVDSLPSR

Query:  VDLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQAL
         DLDLRI QDRGV+SHQP EKEC + SHP+WE+T++M+KA+K E +S K+      S+PLFGCRR RRNLTEAEKEERRIRR+LANRESARQTIRRRQ L
Subjt:  VDLDLRI-QDRGVVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQAL

Query:  CEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHEL
        CE+LTK+A+DLAWENENL+REKELAL+EYQSLE TNKELKEQ+A+A +PK+EEIPGNNRS HVQ PP PTNYPLF FSRPPYASYFWPS+VQPSSPYH+L
Subjt:  CEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHEL

Query:  HNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSA
        HNV VVP S+  P NN V VSDSSHVQENFTNV GLRTPFCI+PCSWLLPHHDHRN+ S   SCP GN QE IY NSQN  YTSKVVV +ESR SSLPSA
Subjt:  HNVVVVPSSIHMPPNNNVSVSDSSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSA

Query:  EEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKT
        EEK EA DLNEAP+L      KD TQNTVGVVV+ F+AD R +VRK+LSPVRLECIEPTS VKQD  SE+D GLSSRT DD C  AEKKHEPE+  CKKT
Subjt:  EEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKT

Query:  IDAMAAAEARKRRKELTKLKNLHARQCRMH
        IDAMAA EAR+RRKELTKLKNLH R CRMH
Subjt:  IDAMAAAEARKRRKELTKLKNLHARQCRMH

SwissProt top hitse value%identityAlignment
P23922 Transcription factor HBP-1a5.9e-0443.84Show/hide
Query:  EKEERRIRRILANRESARQTIRRRQALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEA
        E+E ++ +R L+NRESAR++  R+QA CEEL +RA  L  EN +LR E +   +EY+ L + N  LK ++ E+
Subjt:  EKEERRIRRILANRESARQTIRRRQALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEA

Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein1.3e-5939.2Show/hide
Query:  SSSTCSFSSSSS----MAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLPSRVDLD-LRIQDRGVVSHQP
        SSS CS SSSSS     AA  M   E EAAEALADLA LA+       S   WG   KGKRVRK VK E P     DSL    D D L   D   ++ + 
Subjt:  SSSTCSFSSSSS----MAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLPSRVDLD-LRIQDRGVVSHQP

Query:  LEKECTNQSHPKWESTRKMLKAD-KEEAKSHKVSP-------TCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTKRAAD
        L KE   +   +   T+++ KA  K E       P        C+ S    GC RSR+NL+EAE+EERRIRRILANRESARQTIRRRQA+CEEL+K+AAD
Subjt:  LEKECTNQSHPKWESTRKMLKAD-KEEAKSHKVSP-------TCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTKRAAD

Query:  LAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELHNVVVVPSSI
        L +ENENLRREK+ AL+E+QSLET NK LKEQ+ ++VKP  +E   + +   V+M    T  P + +++ PY  + WP + Q S+P         + S +
Subjt:  LAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELHNVVVVPSSI

Query:  HMPPNNNVSVSD-SSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTS-KVVVHSESRHSSLPS--AEEKTEA
          P +   S    ++   EN  + NG +T F ++PC W LP  DH N     V     + Q G + N  + D +S + +  +E+  S LP+   EE + +
Subjt:  HMPPNNNVSVSD-SSHVQENFTNVNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTS-KVVVHSESRHSSLPS--AEEKTEA

Query:  PDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTIDAMAA
        P+     +LN         ++   V+ EG D                   +   ++K ++ SE  +G++              H   I+  +K   ++AA
Subjt:  PDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARAQVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTIDAMAA

Query:  AEARKRRKELTKLKNLHARQCRM
        AEARKRRKELT+LKNLH RQCRM
Subjt:  AEARKRRKELTKLKNLHARQCRM

AT1G32150.1 basic region/leucine zipper transcription factor 684.7e-0442.47Show/hide
Query:  EKEERRIRRILANRESARQTIRRRQALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEA
        E+E +R RR  +NRESAR++  R+QA C+EL +RA  L  EN +LR E      +Y+ L   N  LK + + A
Subjt:  EKEERRIRRILANRESARQTIRRRQALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEA

AT2G35530.1 basic region/leucine zipper transcription factor 162.7e-0441.18Show/hide
Query:  EKEERRIRRILANRESARQTIRRRQALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNN
        ++E +R RR  +NRESAR++  R+QA C+EL +RA  L  EN NLR E      + + L T N  LK+Q+  ++ P +E I  +N
Subjt:  EKEERRIRRILANRESARQTIRRRQALCEELTKRAADLAWENENLRREKELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCCAAGTGTTCCGAGGGAACCAGTTGTTCGGATTTGAGTTCTTCTTCTTCTTCTTCTTCTACTTGTTCGTTTTCTTCTTCCTCATCCATGGCGGCGGA
TCAGATGGTCAAGGTTGAGTTTGAGGCGGCGGAGGCTCTTGCGGATTTGGCCGCTTTGGCGGTGAGGGAGAGTGGACGTGAGCCCTCGGAAACCAAATGGGGGCGTAAAG
GGAAGGGAAAACGGGTCAGGAAGGAGGTTAAGGCCGAGTTGCCGATTTGTGCCTTTGTCGACTCTTTACCTAGTCGAGTGGATCTGGACCTTCGGATTCAGGATAGAGGA
GTGGTAAGTCATCAACCATTAGAAAAAGAATGTACAAATCAATCCCATCCCAAGTGGGAATCGACCAGAAAGATGTTAAAGGCGGACAAGGAGGAGGCCAAATCACATAA
AGTGAGTCCTACATGCACCACAAGCTACCCATTATTTGGCTGCAGGAGGTCAAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGGATTTTAGCAA
ATAGAGAGTCAGCCAGGCAGACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTGACCAAAAGGGCTGCTGATTTAGCATGGGAAAATGAAAATTTAAGGAGGGAAAAG
GAGTTGGCCCTGGAAGAGTACCAATCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGATGGCTGAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCAGGAAACAATAG
ATCATTTCATGTTCAGATGCCTCCTTTTCCTACCAACTACCCTCTTTTCCTGTTTAGTCGCCCTCCATATGCATCGTATTTCTGGCCATCTATGGTTCAACCTTCAAGTC
CTTATCATGAACTACACAATGTTGTCGTCGTCCCTTCAAGTATTCATATGCCTCCAAATAATAATGTTTCTGTGTCCGACTCTTCCCATGTACAAGAAAACTTTACGAAC
GTCAATGGCCTGAGAACACCCTTTTGTATACTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCGACATAGTCCTGCAGTCTCATGTCCCACGGGAAATGA
TCAAGAGGGTATTTATTTGAATTCCCAAAACAGGGATTATACTTCCAAGGTGGTTGTGCATTCAGAAAGCAGACATTCTTCTTTGCCTTCAGCTGAAGAAAAAACTGAAG
CGCCTGACTTGAATGAAGCTCCTAACTTGAACAAAGCTTCGGATCCAAAGGATTGTACTCAGAACACAGTTGGAGTAGTTGTGGAGGGATTTGATGCCGACGCGAGAGCT
CAAGTTAGGAAACTGCTTTCTCCTGTAAGACTTGAATGTATCGAACCGACATCCGCTGTCAAACAAGATAACCGGAGTGAAAACGATCATGGTCTGTCATCAAGAACTTC
TGATGACTTCTGTGATTTTGCAGAAAAAAAGCATGAACCAGAGATTGCCCCCTGTAAGAAAACCATAGATGCAATGGCTGCAGCCGAGGCAAGGAAGAGAAGAAAAGAAC
TAACGAAGTTGAAGAACCTTCATGCTCGACAATGCCGTATGCATTCTTGA
mRNA sequenceShow/hide mRNA sequence
TTCTTCTTCTCCTCTGTTTCTTTCTTATCTCTGATTTTTTTTTCCCCTTTTCTTTTATGGTTTTTCTCTTGCTCTCATGGCTTCTTCTTCCAAGTGTTCCGAGGGAACCA
GTTGTTCGGATTTGAGTTCTTCTTCTTCTTCTTCTTCTACTTGTTCGTTTTCTTCTTCCTCATCCATGGCGGCGGATCAGATGGTCAAGGTTGAGTTTGAGGCGGCGGAG
GCTCTTGCGGATTTGGCCGCTTTGGCGGTGAGGGAGAGTGGACGTGAGCCCTCGGAAACCAAATGGGGGCGTAAAGGGAAGGGAAAACGGGTCAGGAAGGAGGTTAAGGC
CGAGTTGCCGATTTGTGCCTTTGTCGACTCTTTACCTAGTCGAGTGGATCTGGACCTTCGGATTCAGGATAGAGGAGTGGTAAGTCATCAACCATTAGAAAAAGAATGTA
CAAATCAATCCCATCCCAAGTGGGAATCGACCAGAAAGATGTTAAAGGCGGACAAGGAGGAGGCCAAATCACATAAAGTGAGTCCTACATGCACCACAAGCTACCCATTA
TTTGGCTGCAGGAGGTCAAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGGATTTTAGCAAATAGAGAGTCAGCCAGGCAGACAATTCGGCGTAG
GCAGGCTCTGTGCGAGGAGTTGACCAAAAGGGCTGCTGATTTAGCATGGGAAAATGAAAATTTAAGGAGGGAAAAGGAGTTGGCCCTGGAAGAGTACCAATCTCTGGAGA
CTACTAACAAGGAATTAAAGGAACAGATGGCTGAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCAGGAAACAATAGATCATTTCATGTTCAGATGCCTCCTTTTCCTACC
AACTACCCTCTTTTCCTGTTTAGTCGCCCTCCATATGCATCGTATTTCTGGCCATCTATGGTTCAACCTTCAAGTCCTTATCATGAACTACACAATGTTGTCGTCGTCCC
TTCAAGTATTCATATGCCTCCAAATAATAATGTTTCTGTGTCCGACTCTTCCCATGTACAAGAAAACTTTACGAACGTCAATGGCCTGAGAACACCCTTTTGTATACTAC
CTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCGACATAGTCCTGCAGTCTCATGTCCCACGGGAAATGATCAAGAGGGTATTTATTTGAATTCCCAAAACAGG
GATTATACTTCCAAGGTGGTTGTGCATTCAGAAAGCAGACATTCTTCTTTGCCTTCAGCTGAAGAAAAAACTGAAGCGCCTGACTTGAATGAAGCTCCTAACTTGAACAA
AGCTTCGGATCCAAAGGATTGTACTCAGAACACAGTTGGAGTAGTTGTGGAGGGATTTGATGCCGACGCGAGAGCTCAAGTTAGGAAACTGCTTTCTCCTGTAAGACTTG
AATGTATCGAACCGACATCCGCTGTCAAACAAGATAACCGGAGTGAAAACGATCATGGTCTGTCATCAAGAACTTCTGATGACTTCTGTGATTTTGCAGAAAAAAAGCAT
GAACCAGAGATTGCCCCCTGTAAGAAAACCATAGATGCAATGGCTGCAGCCGAGGCAAGGAAGAGAAGAAAAGAACTAACGAAGTTGAAGAACCTTCATGCTCGACAATG
CCGTATGCATTCTTGATCTATACGGCCGGGGACTTCGGCGTTTGTTCGTCATTGGCAATCTCATCTGTGTAAAGTCTTGTACTTCACTGGTTTTTGTTGCTAGAGGCAAG
CACACAGAGCATTGATACGTAACCAAAGTTCTGGCTTTCCTTTTGAGGCATTCCCTTTGCTTTTGTTGCCAAGCTCAGTCCACGTCGAGATCTTGCTGCTAGTTATGGCA
GTGATGGGGAAGAAAACTCACGTATCGGGGCGTTTTTTTCGTGTCTGTTCCGATGAATTTACTAACAAATGAAATAAGAGAGGTACTGAGGACTAAGATGCTGATTACTT
CCAAGAGCTATGAGTTGGATTTGGAAGGGCAGCAAAGAGGAGGAGAATGTAAGAGTCTGTATTTAAAATCTTTTATGGCTACTTCAAAGACCCTTTGTTGGGTTGGCTTC
AGTTCAATAAAATGAAGCATTAGAACACATAAAAGATGAGGAAAGCCAAATTGAGATTTAACTTGCTGTTTTTCTAAAAACTTGAATAGAAATGCTTCATACAAACTAGA
GTTTTGGTTCTAGAAATTGTGCAAAATCCTTTTAAATTGGTTTCTTTTTTAAACTTTAATTTCAATTCACTTAGTCTAGATATGAGTAATACCAAGTAGCATACACATTT
TTTTTTTGTTTATTTTCTGATCATTTATTTATTTTTTAAACAAACAAAACACATTTTTATTAGCATAGAATTTATACTTTTCTTTAATGAGTTGAAGAAGAGGGAGGAGT
GGG
Protein sequenceShow/hide protein sequence
MASSSKCSEGTSCSDLSSSSSSSSTCSFSSSSSMAADQMVKVEFEAAEALADLAALAVRESGREPSETKWGRKGKGKRVRKEVKAELPICAFVDSLPSRVDLDLRIQDRG
VVSHQPLEKECTNQSHPKWESTRKMLKADKEEAKSHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTKRAADLAWENENLRREK
ELALEEYQSLETTNKELKEQMAEAVKPKVEEIPGNNRSFHVQMPPFPTNYPLFLFSRPPYASYFWPSMVQPSSPYHELHNVVVVPSSIHMPPNNNVSVSDSSHVQENFTN
VNGLRTPFCILPCSWLLPHHDHRNRHSPAVSCPTGNDQEGIYLNSQNRDYTSKVVVHSESRHSSLPSAEEKTEAPDLNEAPNLNKASDPKDCTQNTVGVVVEGFDADARA
QVRKLLSPVRLECIEPTSAVKQDNRSENDHGLSSRTSDDFCDFAEKKHEPEIAPCKKTIDAMAAAEARKRRKELTKLKNLHARQCRMHS