CuGenDBv2

IVF0015906 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0015906
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: BZIP transcription factor family protein
Genome location: chr04:29961660..29966381
RNA-Seq Expression: IVF0015906
Synteny: IVF0015906
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009414 - response to water deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology
GenBank top hits | e value | % identity | Alignment
XP_008442819.2 PREDICTED: uncharacterized protein LOC103486593 isoform X1 [Cucumis melo] | 0.0 | 99.63
Query:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
        MGSPPLTLMAASSSKCSDGTTSSSSSSSS SSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
Subjt:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP

Query:  ISGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRES
        ISGFADSLPARADLDLRIEQ DRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRES
Subjt:  ISGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQ

Query:  STSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAES
        STSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAES
Subjt:  STSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAES

Query:  RHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVP
        RHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVP
Subjt:  RHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVP

Query:  CKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        CKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
Subjt:  CKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

XP_008442820.2 PREDICTED: uncharacterized protein LOC103486593 isoform X2 [Cucumis melo] | 0.0 | 99.81
Query:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
        MGSPPLTLMAASSSKCSDGTTSSSSSSSS SSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
Subjt:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP

Query:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
        ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
Subjt:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA

Query:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
        RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
Subjt:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS

Query:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
        TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
Subjt:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR

Query:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
        HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
Subjt:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC

Query:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
Subjt:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

XP_008442821.2 PREDICTED: uncharacterized protein LOC103486593 isoform X3 [Cucumis melo] | 0.0 | 99.63
Query:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
        MGSPPLTLMAASSSKCSDGTTSSSSSSSS SSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
Subjt:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP

Query:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
        ISGFADSLPARADLDLRIE DRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
Subjt:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA

Query:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
        RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
Subjt:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS

Query:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
        TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
Subjt:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR

Query:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
        HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
Subjt:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC

Query:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
Subjt:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

XP_011652004.1 uncharacterized protein LOC101210630 isoform X1 [Cucumis sativus] | 0.0 | 90.39
Query:  HPHALSSLFTPSLLSSFFFSSVMGSPPLTLMAASSSKCSDGTTSSS-SSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP
        H  +LS   +P     F+FSS MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP
Subjt:  HPHALSSLFTPSLLSSFFFSSVMGSPPLTLMAASSSKCSDGTTSSS-SSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP

Query:  SQTKWGIKGKGKRARKEFKTESPISGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRS
         QTKWGIKGKGKRARKE KTESP SGFADSLPARADLDLRIEQ DRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRS
Subjt:  SQTKWGIKGKGKRARKEFKTESPISGFADSLPARADLDLRIEQ-DRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRS

Query:  RRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMP
        RR LTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMP
Subjt:  RRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMP

Query:  PLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGN
        PLP+NCPLFLFSR PYFWPSVVQSTSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGN
Subjt:  PLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGN

Query:  DQEDIYSKSQDSAITSKVVHAESRHS----AEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNE
        DQE +YSKSQ+SAITSK V AESRHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNE
Subjt:  DQEDIYSKSQDSAITSKVVHAESRHS----AEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNE

Query:  DDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        DDHGVSSRTCDDLCYFAERRHEPE+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  DDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

XP_011652005.1 uncharacterized protein LOC101210630 isoform X2 [Cucumis sativus] | 0.0 | 90.55
Query:  HPHALSSLFTPSLLSSFFFSSVMGSPPLTLMAASSSKCSDGTTSSS-SSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP
        H  +LS   +P     F+FSS MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP
Subjt:  HPHALSSLFTPSLLSSFFFSSVMGSPPLTLMAASSSKCSDGTTSSS-SSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP

Query:  SQTKWGIKGKGKRARKEFKTESPISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSR
         QTKWGIKGKGKRARKE KTESP SGFADSLPARADLDLRIEQDRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRSR
Subjt:  SQTKWGIKGKGKRARKEFKTESPISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSR

Query:  RNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPP
        R LTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMPP
Subjt:  RNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPP

Query:  LPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGND
        LP+NCPLFLFSR PYFWPSVVQSTSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGND
Subjt:  LPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGND

Query:  QEDIYSKSQDSAITSKVVHAESRHS----AEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNED
        QE +YSKSQ+SAITSK V AESRHS    AEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNED
Subjt:  QEDIYSKSQDSAITSKVVHAESRHS----AEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNED

Query:  DHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        DHGVSSRTCDDLCYFAERRHEPE+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  DHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LBD4 BZIP domain-containing protein | 4.6e-266 | 92
Query:  SLLSSFFFSSVMGSPPLTLMAASSSKCSDGTTSSS-SSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKG
        SL   F+FSS MGS PLTLMAASSSKCSDGTTSS  SSSSSSSSSSSSSS M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQP QTKWGIKGKG
Subjt:  SLLSSFFFSSVMGSPPLTLMAASSSKCSDGTTSSS-SSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKG

Query:  KRARKEFKTESPISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEER
        KRARKE KTESP SGFADSLPARADLDLRIEQDRGVVKHQPSEKECT QSQ EPETT EVTKMDKEAESSKVSPA TTSYQ FGCRRSRR LTEAEKEER
Subjt:  KRARKEFKTESPISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEER

Query:  RIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFS
        RIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNH SSHVQMPPLP+NCPLFLFS
Subjt:  RIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFS

Query:  RFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDS
        R PYFWPSVVQSTSSYHELPNV+VVPSSIN PANNNASVSGSSQTQENFTN TGSRAP C+LPP SWLLPHHDFRNQQSPQIWFPAGNDQE +YSKSQ+S
Subjt:  RFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDS

Query:  AITSKVVHAESRH----SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDD
        AITSK V AESRH    SAEEEN+APDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAA LDNWNEDDHGVSSRTCDD
Subjt:  AITSKVVHAESRH----SAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDD

Query:  LCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        LCYFAERRHEPE+VPCKK++DAMAATEARRRRKELTK+KNLYARQCRMQS
Subjt:  LCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

A0A1S3B649 uncharacterized protein LOC103486593 isoform X3 | 2.6e-285 | 99.63
Query:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
        MGSPPLTLMAASSSKCSDGTT SSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
Subjt:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP

Query:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
        ISGFADSLPARADLDLRIE DRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
Subjt:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA

Query:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
        RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
Subjt:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS

Query:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
        TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
Subjt:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR

Query:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
        HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
Subjt:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC

Query:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
Subjt:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

A0A1S3B7B6 uncharacterized protein LOC103486593 isoform X1 | 8.9e-286 | 99.63
Query:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
        MGSPPLTLMAASSSKCSDGTT SSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
Subjt:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP

Query:  ISGFADSLPARADLDLRIE-QDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRES
        ISGFADSLPARADLDLRIE QDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRES
Subjt:  ISGFADSLPARADLDLRIE-QDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRES

Query:  ARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQ
        ARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQ
Subjt:  ARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQ

Query:  STSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAES
        STSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAES
Subjt:  STSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAES

Query:  RHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVP
        RHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVP
Subjt:  RHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVP

Query:  CKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        CKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
Subjt:  CKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

A0A1S3B7D8 uncharacterized protein LOC103486593 isoform X2 | 3.6e-287 | 99.81
Query:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
        MGSPPLTLMAASSSKCSDGTT SSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP
Subjt:  MGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESP

Query:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
        ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA
Subjt:  ISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESA

Query:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
        RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS
Subjt:  RQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQS

Query:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
        TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR
Subjt:  TSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESR

Query:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
        HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC
Subjt:  HSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPC

Query:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
        KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS
Subjt:  KKSIDAMAATEARRRRKELTKVKNLYARQCRMQS

A0A5A7TM13 BZIP transcription factor family protein | 2.7e-210 | 99.74
Query:  MDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELK
        MDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELK
Subjt:  MDKEAESSKVSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELK

Query:  EQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCML
        EQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCML
Subjt:  EQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCML

Query:  PPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESRHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKV
        PPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESRHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKV
Subjt:  PPCSWLLPHHDFRNQQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESRHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKV

Query:  LSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCR
        LSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQC+
Subjt:  LSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCR

SwissProt top hits | e value | % identity | Alignment
P23922 Transcription factor HBP-1a | 6.7e-04 | 43.84
Query:  EKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEA
        E+E ++ +R L+NRESAR++  R+QA CEEL ++A  L  EN +L+ E +   KEY+ L + N  LK +L E+
Subjt:  EKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEA

Arabidopsis top hits | e value | % identity | Alignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein | 5.2e-52 | 37.76
Query:  SSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESPISGFADSLPARADLDLRIEQDR
        SSS  SSSSSSS   +   ++M          E+EAAEALA LA LA+       S   WG   KGKR RK  KTESP S   DSL    D D     D 
Subjt:  SSSSSSSSSSSSSSSSFMPSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESPISGFADSLPARADLDLRIEQDR

Query:  GVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLF------GCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR
          +  +   KE   + + EP  T+E+TK   ++E +  +P    +  L       GC RSR+NL+EAE+EERRIRRILANRESARQTIRRRQA+CEEL++
Subjt:  GVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSKVSPASTTSYQLF------GCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTR

Query:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPY---FWPSVVQSTSSYHELPNVIVV
        KAADL +ENENL+REK+ ALKE+QSLET NK LKEQ+ ++VKP  +E   +   S V+M    S+ P + +++ PY    WP V QS+       N ++ 
Subjt:  KAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHISSHVQMPPLPSNCPLFLFSRFPY---FWPSVVQSTSSYHELPNVIVV

Query:  PSSINLPANNNASVSG-SSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRN------QQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESRHSAEEE
        P  +  P +  AS    ++Q  EN  +  G +  F ++ PC W LP  D  N      Q + +  F  G+  +D  ++  D   T +  H  +R   EE+
Subjt:  PSSINLPANNNASVSG-SSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRN------QQSPQIWFPAGNDQEDIYSKSQDSAITSKVVHAESRHSAEEE

Query:  NDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDA
        + +P+      L+ES+         +    +GF    +A   K                         H   S T + +       H    +P KK   +
Subjt:  NDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFAERRHEPEIVPCKKSIDA

Query:  MAATEARRRRKELTKVKNLYARQCRMQ
        +AA EAR+RRKELT++KNL+ RQCRMQ
Subjt:  MAATEARRRRKELTKVKNLYARQCRMQ


Sequences
CDS sequence
ATGGTCCGTAGCCTACAATTCTTGTACGGCGGAACTGCCACGTGTAACAATTATACGAGCGCCGATTCGATTACGTGGCTTTCACTGATTCATGGAGACGCGCAAGTTTT
CACTCATCATCCTCACGCGCTCTCTTCCCTCTTCACTCCCTCCTTACTCTCTTCTTTCTTCTTCTCCTCTGTTATGGGTTCTCCTCCTCTCACTCTCATGGCGGCTTCTT
CTTCCAAGTGCTCCGATGGGACCACTTCTTCTTCCTCTTCTTCTTCTTCCTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTATGCCCTCTTCTATGGCTAAGGCGGCGGAT
CAGATGGTTAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCTGGTTTGGCGGTTTTGGCGGTTCGAGAAACTGGAACTCAACCTTCCCAAACTAAATGGGGGATTAAAGG
GAAAGGGAAACGAGCTAGGAAGGAGTTTAAGACCGAATCGCCCATTTCTGGCTTTGCCGACTCTTTGCCTGCTCGTGCGGATCTGGACCTTCGGATTGAGCAGGATAGAG
GGGTGGTAAAACATCAACCCTCAGAGAAAGAGTGTACTAATCAGTCCCAGCATGAGCCGGAAACAACCAGAGAGGTGACAAAGATGGACAAGGAGGCTGAATCATCTAAA
GTGAGTCCTGCAAGCACTACGAGCTACCAGTTGTTTGGCTGCAGAAGGTCAAGACGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGGATTTTGGCGAA
CAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTATGCGAGGAATTAACCAGAAAGGCTGCTGATCTAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGG
AGTTGGCCCTGAAAGAGTACCAATCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGTTGGCTGAAGCAGTAAAGCCGAAGGTGGAGGAGATCCCAGGAAACCATATA
TCATCTCATGTTCAGATGCCTCCTTTACCCTCCAACTGCCCTCTTTTCTTGTTTAGCCGCTTTCCATATTTCTGGCCATCTGTGGTTCAATCTACAAGTTCCTATCATGA
ACTACCCAATGTCATCGTTGTTCCGTCAAGTATTAATCTGCCTGCTAACAATAATGCTTCTGTATCTGGCTCTTCCCAGACACAAGAAAACTTTACGAATGTCACCGGCT
CCAGAGCACCATTTTGCATGTTACCACCTTGTTCTTGGTTGTTACCTCATCATGATTTTAGGAACCAACAGAGTCCTCAAATCTGGTTTCCCGCTGGAAATGATCAAGAG
GATATTTATTCGAAATCCCAAGATAGTGCTATTACTTCAAAGGTTGTCCATGCAGAAAGCAGACATTCAGCCGAAGAAGAAAACGATGCTCCTGACTTGAATGAAGCTCC
CAGTTTAGATGAATCTTCAAATCCAAAGGATGATACTCAGAACACAGTTGGAGTAGCTGTGGAGGGATTCGATACCAATGCAAGAGCTCCAGTTAGAAAAGTGCTTTCTC
CTGTAAGACTTGAATGTATTGAACCCAGTTCCGCTGCCAAACTAGATAACTGGAATGAAGATGATCATGGTGTGTCATCAAGAACGTGTGATGACTTGTGTTATTTTGCA
GAAAGAAGGCATGAACCGGAGATAGTCCCTTGTAAGAAATCCATAGATGCAATGGCCGCAACTGAGGCCAGGAGGAGGAGAAAAGAACTGACAAAGGTAAAGAACCTTTA
CGCCCGTCAATGCCGTATGCAATCCTGA
mRNA sequence
ATGGTCCGTAGCCTACAATTCTTGTACGGCGGAACTGCCACGTGTAACAATTATACGAGCGCCGATTCGATTACGTGGCTTTCACTGATTCATGGAGACGCGCAAGTTTT
CACTCATCATCCTCACGCGCTCTCTTCCCTCTTCACTCCCTCCTTACTCTCTTCTTTCTTCTTCTCCTCTGTTATGGGTTCTCCTCCTCTCACTCTCATGGCGGCTTCTT
CTTCCAAGTGCTCCGATGGGACCACTTCTTCTTCCTCTTCTTCTTCTTCCTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTATGCCCTCTTCTATGGCTAAGGCGGCGGAT
CAGATGGTTAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCTGGTTTGGCGGTTTTGGCGGTTCGAGAAACTGGAACTCAACCTTCCCAAACTAAATGGGGGATTAAAGG
GAAAGGGAAACGAGCTAGGAAGGAGTTTAAGACCGAATCGCCCATTTCTGGCTTTGCCGACTCTTTGCCTGCTCGTGCGGATCTGGACCTTCGGATTGAGCAGGATAGAG
GGGTGGTAAAACATCAACCCTCAGAGAAAGAGTGTACTAATCAGTCCCAGCATGAGCCGGAAACAACCAGAGAGGTGACAAAGATGGACAAGGAGGCTGAATCATCTAAA
GTGAGTCCTGCAAGCACTACGAGCTACCAGTTGTTTGGCTGCAGAAGGTCAAGACGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGGATTTTGGCGAA
CAGAGAGTCAGCCCGGCAGACAATTCGGCGTAGGCAGGCTCTATGCGAGGAATTAACCAGAAAGGCTGCTGATCTAGCATGGGAAAATGAAAATTTAAAGAGGGAAAAGG
AGTTGGCCCTGAAAGAGTACCAATCTCTGGAGACTACTAACAAGGAATTAAAGGAACAGTTGGCTGAAGCAGTAAAGCCGAAGGTGGAGGAGATCCCAGGAAACCATATA
TCATCTCATGTTCAGATGCCTCCTTTACCCTCCAACTGCCCTCTTTTCTTGTTTAGCCGCTTTCCATATTTCTGGCCATCTGTGGTTCAATCTACAAGTTCCTATCATGA
ACTACCCAATGTCATCGTTGTTCCGTCAAGTATTAATCTGCCTGCTAACAATAATGCTTCTGTATCTGGCTCTTCCCAGACACAAGAAAACTTTACGAATGTCACCGGCT
CCAGAGCACCATTTTGCATGTTACCACCTTGTTCTTGGTTGTTACCTCATCATGATTTTAGGAACCAACAGAGTCCTCAAATCTGGTTTCCCGCTGGAAATGATCAAGAG
GATATTTATTCGAAATCCCAAGATAGTGCTATTACTTCAAAGGTTGTCCATGCAGAAAGCAGACATTCAGCCGAAGAAGAAAACGATGCTCCTGACTTGAATGAAGCTCC
CAGTTTAGATGAATCTTCAAATCCAAAGGATGATACTCAGAACACAGTTGGAGTAGCTGTGGAGGGATTCGATACCAATGCAAGAGCTCCAGTTAGAAAAGTGCTTTCTC
CTGTAAGACTTGAATGTATTGAACCCAGTTCCGCTGCCAAACTAGATAACTGGAATGAAGATGATCATGGTGTGTCATCAAGAACGTGTGATGACTTGTGTTATTTTGCA
GAAAGAAGGCATGAACCGGAGATAGTCCCTTGTAAGAAATCCATAGATGCAATGGCCGCAACTGAGGCCAGGAGGAGGAGAAAAGAACTGACAAAGGTAAAGAACCTTTA
CGCCCGTCAATGCCGTATGCAATCCTGATCCACATGGCCAGTTAGTTTGGCAACAGTGTTTGTTGTCGACACAGCAATCTTATGTGTTAAAGTTCTGTGTTGCATTGGCT
TTTGTTGCCAGAGGCAGGCCACAGAGCTATGAGACGTAGCCAGAGTTCTGACTTCACTTCAGCTTTCTTGTCATGGCGTACGTTGTGCATTCCAGAGCTCAGAGCACATA
AAGATCTTGGTGCTGGAGATAGGTAGTGACGGGAAATAAGATTTAGGCACAATTTGCTAACAAATTAAATGAGAGAGGTACTAGGACTAAAATGCCGATTAGTTCAAAGA
GCTTTGAATTTGATTTGGTAGGGCAGCAAAGAGGAGGAGGTGAGTGTAAGAATTTTGTATTTAAATCTTTCTATGGTTACTTCGAAGACCCTTGTTATTGGGTTGGCTTA
CACTTCTTCAATGAAGTGAAGCATAAGAATGGAGTGCATGGAGATCTGTTAGTTACTTTATCAAGA
Protein sequence
MVRSLQFLYGGTATCNNYTSADSITWLSLIHGDAQVFTHHPHALSSLFTPSLLSSFFFSSVMGSPPLTLMAASSSKCSDGTTSSSSSSSSSSSSSSSSSFMPSSMAKAAD
QMVKVEIEAAEALAGLAVLAVRETGTQPSQTKWGIKGKGKRARKEFKTESPISGFADSLPARADLDLRIEQDRGVVKHQPSEKECTNQSQHEPETTREVTKMDKEAESSK
VSPASTTSYQLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHI
SSHVQMPPLPSNCPLFLFSRFPYFWPSVVQSTSSYHELPNVIVVPSSINLPANNNASVSGSSQTQENFTNVTGSRAPFCMLPPCSWLLPHHDFRNQQSPQIWFPAGNDQE
DIYSKSQDSAITSKVVHAESRHSAEEENDAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAAKLDNWNEDDHGVSSRTCDDLCYFA
ERRHEPEIVPCKKSIDAMAATEARRRRKELTKVKNLYARQCRMQS