; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G019130 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G019130
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionBZIP domain-containing protein
Genome locationchr03:30480300..30485004
RNA-Seq ExpressionLsi03G019130
SyntenyLsi03G019130
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR044827 - G-box-binding factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652004.1 uncharacterized protein LOC101210630 isoform X1 [Cucumis sativus]1.0e-22368.76Show/hide
Query:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
        TR L L     SL+L +            LPL    ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
Subjt:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET

Query:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE
        G QP +TKWGIKGKGKRARKEVK ESPTS FADSLP+RADLDLRIE                                                      
Subjt:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE

Query:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR
                      +D GVV+HQPSEKECT QS PE ETTGE+ K DKEAES KVSPACTTSYQ FGCRRSRR LTEAEKEERR+RRILANRESARQTIR
Subjt:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR

Query:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR
        RRQALCEELTRKAADLAWENENLKR                                    EKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R
Subjt:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR

Query:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ
        +SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELPNV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ
Subjt:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ

Query:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV
        +WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAEEE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA 
Subjt:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV

Query:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
          +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS

XP_011652005.1 uncharacterized protein LOC101210630 isoform X2 [Cucumis sativus]7.7e-22468.76Show/hide
Query:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
        TR L L     SL+L +            LPL    ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
Subjt:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET

Query:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE
        G QP +TKWGIKGKGKRARKEVK ESPTS FADSLP+RADLDLRIE                                                      
Subjt:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE

Query:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR
                      +D GVV+HQPSEKECT QS PE ETTGE+ K DKEAES KVSPACTTSYQ FGCRRSRR LTEAEKEERR+RRILANRESARQTIR
Subjt:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR

Query:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR
        RRQALCEELTRKAADLAWENENLKR                                    EKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R
Subjt:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR

Query:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ
        +SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELPNV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ
Subjt:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ

Query:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV
        +WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAEEE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA 
Subjt:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV

Query:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
          +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS

XP_011652006.1 uncharacterized protein LOC101210630 isoform X3 [Cucumis sativus]2.3e-22368.76Show/hide
Query:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
        TR L L     SL+L +            LPL    ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
Subjt:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET

Query:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE
        G QP +TKWGIKGKGKRARKEVK ESPTS FADSLP+RADLDLRI                                                       
Subjt:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE

Query:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR
                      ED GVV+HQPSEKECT QS PE ETTGE+ K DKEAES KVSPACTTSYQ FGCRRSRR LTEAEKEERR+RRILANRESARQTIR
Subjt:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR

Query:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR
        RRQALCEELTRKAADLAWENENLKR                                    EKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R
Subjt:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR

Query:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ
        +SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELPNV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ
Subjt:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ

Query:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV
        +WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAEEE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA 
Subjt:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV

Query:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
          +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS

XP_038904850.1 uncharacterized protein LOC120091090 isoform X1 [Benincasa hispida]1.0e-23674Show/hide
Query:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD
        MASSSKCS+  +CS LSSSSSSS +SS   KAADQMVKVEIEAAEALAGLAVLAVRETG QP ETKWGIKGKGKRARKEVK E PTS FADSLPS ADLD
Subjt:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD

Query:  LRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGE
        LRIE                                                                    +D GVVRHQPSEKECTNQSHPEWETTGE
Subjt:  LRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGE

Query:  MIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPP
        +IKADKEAES KVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT+KAADLAWENENLKR                 
Subjt:  MIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPP

Query:  TPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVL
                           EKELALKEYQ+LETTN ELKEQLAEAVKPKV EIPGNNR+SHVQMPPLPTNYPLFL SR+PYFWPSVVQPT+PYH+LPNV+
Subjt:  TPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVL

Query:  VVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEH
        VVPSSINLPANNNVSVSGSSHVQENF +VTGPRTPLCI+PPCSWLLPHHDFRNQQ+PQ+WFPAGNN EDI SKSQ+SAN+SKVVHAESR  SLPSAEEE+
Subjt:  VVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEH

Query:  EAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAM
        EAPDLNE+PNLN+AS PKDHT+NTVGV VDGFDTNTRA VRKVLSPVRLE IEPS AV+Q+N SEDDH L S+TCDDLCDFAERRHEPEIV CKKTIDAM
Subjt:  EAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAM

Query:  AATEARRRRKELTKLKNLYTRQFRMHS
        AATEARRRRKELTKLKNLYTRQ RM S
Subjt:  AATEARRRRKELTKLKNLYTRQFRMHS

XP_038904851.1 uncharacterized protein LOC120091090 isoform X2 [Benincasa hispida]8.0e-23774Show/hide
Query:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD
        MASSSKCS+  +CS LSSSSSSS +SS   KAADQMVKVEIEAAEALAGLAVLAVRETG QP ETKWGIKGKGKRARKEVK E PTS FADSLPS ADLD
Subjt:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLD

Query:  LRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGE
        LRIE                                                                    +D GVVRHQPSEKECTNQSHPEWETTGE
Subjt:  LRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGE

Query:  MIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPP
        +IKADKEAES KVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELT+KAADLAWENENLKR                 
Subjt:  MIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPP

Query:  TPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVL
                           EKELALKEYQ+LETTN ELKEQLAEAVKPKV EIPGNNR+SHVQMPPLPTNYPLFL SR+PYFWPSVVQPT+PYH+LPNV+
Subjt:  TPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVL

Query:  VVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEH
        VVPSSINLPANNNVSVSGSSHVQENF +VTGPRTPLCI+PPCSWLLPHHDFRNQQ+PQ+WFPAGNN EDI SKSQ+SAN+SKVVHAESR  SLPSAEEE+
Subjt:  VVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEH

Query:  EAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAM
        EAPDLNE+PNLN+AS PKDHT+NTVGV VDGFDTNTRA VRKVLSPVRLE IEPS AV+Q+N SEDDH L S+TCDDLCDFAERRHEPEIV CKKTIDAM
Subjt:  EAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAM

Query:  AATEARRRRKELTKLKNLYTRQFRMHS
        AATEARRRRKELTKLKNLYTRQ RM S
Subjt:  AATEARRRRKELTKLKNLYTRQFRMHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBD4 BZIP domain-containing protein3.7e-22468.76Show/hide
Query:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
        TR L L     SL+L +            LPL    ASSSKCSD  T SGL       SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET
Subjt:  TRALFLF---HSLTLFILLLLLLCYGFSLLPL---MASSSKCSDVATCSGL-------SSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRET

Query:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE
        G QP +TKWGIKGKGKRARKEVK ESPTS FADSLP+RADLDLRIE                                                      
Subjt:  GPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVE

Query:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR
                      +D GVV+HQPSEKECT QS PE ETTGE+ K DKEAES KVSPACTTSYQ FGCRRSRR LTEAEKEERR+RRILANRESARQTIR
Subjt:  IFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIR

Query:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR
        RRQALCEELTRKAADLAWENENLKR                                    EKE+ALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+R
Subjt:  RRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNR

Query:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ
        +SHVQMPPLPTN PLFLFSR+PYFWPSVVQ TS YHELPNV+VVPSSIN PANNN SVSGSS  QENFTN TG R PLCI+PP SWLLPHHDFRNQQSPQ
Subjt:  TSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQ

Query:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV
        +WFPAGN+ E + SKSQNSA +SK V AESRHSSLPSAEEE+EAPDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA 
Subjt:  VWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAV

Query:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
          +N +EDDHG+SSRTCDDLC FAERRHEPE+VPCKKT+DAMAATEARRRRKELTKLKNLY RQ RM S
Subjt:  QQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYTRQFRMHS

A0A1S3B649 uncharacterized protein LOC103486593 isoform X38.6e-22170.14Show/hide
Query:  LMASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP
        + ASSSKCSD  T S  SSSSSSSS     M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE K ESP S FADSLP
Subjt:  LMASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP

Query:  SRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPE
        +RADLDLRI                                                                     ED GVV+HQPSEKECTNQS  E
Subjt:  SRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPE

Query:  WETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFF
         ETT E+ K DKEAES KVSPA TTSYQLFGCRRSRRNLTEAEKEERR+RRILANRESARQTIRRRQALCEELTRKAADLAWENENLKR           
Subjt:  WETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFF

Query:  FDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYH
                                 EKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ +SHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YH
Subjt:  FDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYH

Query:  ELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLP
        ELPNV+VVPSSINLPANNN SVSGSS  QENFTNVTG R P C++PPCSWLLPHHDFRNQQSPQ+WFPAGN+ EDI SKSQ+SA +SKVVHAESRH    
Subjt:  ELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLP

Query:  SAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCK
        SAEEE++APDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA + +N +EDDHG+SSRTCDDLC FAERRHEPEIVPCK
Subjt:  SAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCK

Query:  KTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
        K+IDAMAATEARRRRKELTK+KNLY RQ RM S
Subjt:  KTIDAMAATEARRRRKELTKLKNLYTRQFRMHS

A0A1S3B7B6 uncharacterized protein LOC103486593 isoform X13.0e-22170.14Show/hide
Query:  LMASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP
        + ASSSKCSD  T S  SSSSSSSS     M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE K ESP S FADSLP
Subjt:  LMASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP

Query:  SRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPE
        +RADLDLRIE                                                                    +D GVV+HQPSEKECTNQS  E
Subjt:  SRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPE

Query:  WETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFF
         ETT E+ K DKEAES KVSPA TTSYQLFGCRRSRRNLTEAEKEERR+RRILANRESARQTIRRRQALCEELTRKAADLAWENENLKR           
Subjt:  WETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFF

Query:  FDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYH
                                 EKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ +SHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YH
Subjt:  FDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYH

Query:  ELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLP
        ELPNV+VVPSSINLPANNN SVSGSS  QENFTNVTG R P C++PPCSWLLPHHDFRNQQSPQ+WFPAGN+ EDI SKSQ+SA +SKVVHAESRH    
Subjt:  ELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLP

Query:  SAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCK
        SAEEE++APDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA + +N +EDDHG+SSRTCDDLC FAERRHEPEIVPCK
Subjt:  SAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCK

Query:  KTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
        K+IDAMAATEARRRRKELTK+KNLY RQ RM S
Subjt:  KTIDAMAATEARRRRKELTKLKNLYTRQFRMHS

A0A1S3B7D8 uncharacterized protein LOC103486593 isoform X23.0e-22170.14Show/hide
Query:  LMASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP
        + ASSSKCSD  T S  SSSSSSSS     M SSMAKAADQMVKVEIEAAEALAGLAVLAVRETG QPS+TKWGIKGKGKRARKE K ESP S FADSLP
Subjt:  LMASSSKCSDVATCSGLSSSSSSSS-----MSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLP

Query:  SRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPE
        +RADLDLRIE                                                                    +D GVV+HQPSEKECTNQS  E
Subjt:  SRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPE

Query:  WETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFF
         ETT E+ K DKEAES KVSPA TTSYQLFGCRRSRRNLTEAEKEERR+RRILANRESARQTIRRRQALCEELTRKAADLAWENENLKR           
Subjt:  WETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFF

Query:  FDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYH
                                 EKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGN+ +SHVQMPPLP+N PLFLFSR PYFWPSVVQ TS YH
Subjt:  FDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYH

Query:  ELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLP
        ELPNV+VVPSSINLPANNN SVSGSS  QENFTNVTG R P C++PPCSWLLPHHDFRNQQSPQ+WFPAGN+ EDI SKSQ+SA +SKVVHAESRH    
Subjt:  ELPNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLP

Query:  SAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCK
        SAEEE++APDLNE+P+L+E+SNPKD T+NTVGVAV+GFDTN RAPVRKVLSPVRLE IEPSSA + +N +EDDHG+SSRTCDDLC FAERRHEPEIVPCK
Subjt:  SAEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCK

Query:  KTIDAMAATEARRRRKELTKLKNLYTRQFRMHS
        K+IDAMAATEARRRRKELTK+KNLY RQ RM S
Subjt:  KTIDAMAATEARRRRKELTKLKNLYTRQFRMHS

A0A6J1F2W5 uncharacterized protein LOC111441650 isoform X11.1e-18864.03Show/hide
Query:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIK-GKGKRARKEVKNESPTSAFADSLPSRADL
        MASSSKCS+  +CSGLSSSS+ S  SSSM   ADQMVKVEIEAAEALA LAVLAVR++G QPSETKW IK  KGKRARKEVK ESPTSAF DSLPSRADL
Subjt:  MASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIK-GKGKRARKEVKNESPTSAFADSLPSRADL

Query:  DLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTG
        DLRI+                                                                    +D GV+ H PSEKEC + SHPEWETT 
Subjt:  DLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTG

Query:  EMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTP
        EMIKA+KEAES K+      S+ LFGCRRSRRNLTEAEKEERR+RR+LANRESARQTIRRRQALCE+LT+KA+DLAWENENLKR                
Subjt:  EMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTP

Query:  PTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVP---YFWPSVVQPTSPYHEL
                            EKELALKEYQSLE TNKELKEQ+A A +PK+EEIPGNNR+SHVQ PPLPTNYPLFLFSR P   YFWPSVVQP+SPYH+L
Subjt:  PTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVP---YFWPSVVQPTSPYHEL

Query:  PNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSK-VVHAESRHSSLPS
         NV VVP S+  P+NN V VS SSHVQENFTNVTG RTP CIV PCSWLLPHHD RNQQS Q   PAGN  E I S SQNSA +SK VV AESRHSSLPS
Subjt:  PNVLVVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSK-VVHAESRHSSLPS

Query:  AEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKK
        AEE++EA DLNE+P+L      K+HT+NTVGV VD F+ +TR  VRKVLSPVRLE IEP+S V+Q+  SEDD GLSSRTCDDLC  AE++HEPEIV CKK
Subjt:  AEEEHEAPDLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKK

Query:  TIDAMAATEARRRRKELTKLKNLYTRQFRMH
        TIDAMAATEARRRRKELTKLKNL+TR  RMH
Subjt:  TIDAMAATEARRRRKELTKLKNLYTRQFRMH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19490.1 Basic-leucine zipper (bZIP) transcription factor family protein3.2e-4231.35Show/hide
Query:  LSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILL
        +SSS  SSS SSS  +        E+EAAEALA LA LA+       S   WG   KGKR RK VK ESP S   DSL                      
Subjt:  LSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIKGKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILL

Query:  KFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSP
                                      L P              DS T   P L +  +V+ +  E+E    +    E T   +K++   E+ K   
Subjt:  KFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRHQPSEKECTNQSHPEWETTGEMIKADKEAESFKVSP

Query:  ACT--TSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPF
        A T     +  GC RSR+NL+EAE+EERR+RRILANRESARQTIRRRQA+CEEL++KAADL +ENENL+R                              
Subjt:  ACT--TSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPENDSIFFFDPTPPTPPHPKHQKKDPF

Query:  IFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPY---FWPSVVQPTSPYHELPNVLVVPSSINLPA
              EK+ ALKE+QSLET NK LKEQ+ ++VKP  +E   + + S V+M    T  P + +++ PY    WP V Q ++P         + S +  P 
Subjt:  IFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPY---FWPSVVQPTSPYHELPNVLVVPSSINLPA

Query:  NNNVSVSG-SSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRN------QQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAP
        +   S    ++   EN  +  G +T   +V PC W LP  D  N      Q + +  F  G++++D      +SA    V      H      EE+  +P
Subjt:  NNNVSVSG-SSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRN------QQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAP

Query:  DLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAAT
        +     +LNE++         +    DGF    +A                  +++  + SE  +G++              H   I   +K   ++AA 
Subjt:  DLNESPNLNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAAT

Query:  EARRRRKELTKLKNLYTRQFRM
        EAR+RRKELT+LKNL+ RQ RM
Subjt:  EARRRRKELTKLKNLYTRQFRM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACGTGCGCGTAAGTTTTCACCATCATCCACACGCGCTCTCTTTCTCTTTCACTCTCTTACTCTCTTCATTCTTCTTCTTCTTCTCCTCTGTTATGGGTTCTCTCT
TCTTCCTCTCATGGCTTCTTCTTCCAAGTGCTCCGACGTGGCCACTTGTTCTGGTTTGAGTTCTTCTTCTTCTTCCTCCTCCATGTCCTCTTCTATGGCCAAGGCGGCGG
ATCAGATGGTCAAGGTTGAGATTGAGGCGGCAGAGGCTCTTGCTGGTTTGGCGGTTTTGGCGGTCAGAGAAACTGGACCTCAACCGTCCGAAACCAAATGGGGAATTAAA
GGGAAAGGGAAACGAGCTAGGAAGGAGGTTAAGAACGAGTCGCCGACTTCTGCCTTCGCCGACTCTTTACCTAGTCGAGCGGATCTGGACCTTCGGATTGAGATTTTGAT
TGCATTTTGTCCTTTGATTTTGCTGAAATTCAAAATGAGGGGTGGATTTCTGATTTTGCGTTTTTTAGATGGAGAACTAAATTTTGCACTCGGACTAGGATTCAATTTGG
TGATCCTTATCCCAGAAGTTTGGAGTAGAATTAAGAATGTGGAGATTTTTGTCTTGGATTCTAAAACAAGTAGTGGGCCATTGCTAGAGGATGGAGGGGTGGTAAGACAT
CAGCCATCAGAAAAAGAATGTACTAATCAGTCCCACCCTGAGTGGGAAACAACCGGAGAGATGATAAAGGCGGACAAGGAGGCCGAATCATTTAAAGTGAGTCCTGCATG
CACTACAAGCTACCAGTTATTTGGCTGCAGGAGATCAAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAGTACGACGGATTTTAGCAAACAGAGAGTCAGCCC
GGCAGACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTGACCAGAAAAGCTGCTGATCTAGCATGGGAAAATGAAAATTTAAAGAGGGTAGGCATCCCCGAAAATGAT
TCTATTTTCTTCTTCGACCCCACCCCACCTACCCCACCGCACCCCAAACACCAGAAGAAGGACCCCTTTATATTCTTTAACTTTTCTGAAAAGGAGTTGGCCCTGAAAGA
GTATCAATCTCTGGAGACTACTAACAAGGAATTAAAAGAACAGTTGGCTGAAGCAGTAAAGCCGAAGGTGGAGGAGATCCCAGGAAACAATAGAACATCTCATGTTCAGA
TGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGTGTTCCGTATTTCTGGCCATCTGTGGTTCAACCTACAAGTCCTTATCATGAACTACCCAATGTCCTC
GTCGTCCCGTCAAGTATTAATTTGCCTGCTAATAATAATGTTTCTGTGTCTGGCTCTTCCCATGTACAAGAAAACTTTACGAACGTCACTGGCCCGAGAACACCCTTGTG
TATAGTACCACCTTGTTCTTGGTTGTTGCCTCATCATGATTTTAGGAACCAACAGAGTCCTCAAGTCTGGTTTCCCGCTGGAAATAATCTAGAGGATATTTGTTCGAAAT
CCCAAAACAGTGCTAATTCTTCAAAGGTTGTGCATGCAGAAAGCAGACATTCTTCTTTGCCCTCAGCTGAAGAAGAACACGAAGCTCCTGACTTGAATGAATCTCCTAAT
TTAAACGAAGCTTCAAATCCAAAGGATCATACTGAGAACACAGTTGGAGTAGCTGTGGACGGATTTGATACCAACACAAGAGCTCCAGTTAGAAAAGTGCTTTCTCCTGT
AAGACTTGAACGTATTGAACCCAGTTCCGCTGTCCAACAAAATAACCGGAGCGAAGATGATCACGGTCTGTCATCAAGAACTTGTGATGACTTGTGTGATTTTGCAGAAA
GAAGGCATGAACCAGAGATAGTCCCCTGTAAGAAAACCATAGATGCAATGGCTGCAACTGAGGCCAGGAGGAGGAGAAAAGAACTAACAAAATTAAAGAACCTTTACACC
CGTCAGTTCCGTATGCATTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCACGTGCGCGTAAGTTTTCACCATCATCCACACGCGCTCTCTTTCTCTTTCACTCTCTTACTCTCTTCATTCTTCTTCTTCTTCTCCTCTGTTATGGGTTCTCTCT
TCTTCCTCTCATGGCTTCTTCTTCCAAGTGCTCCGACGTGGCCACTTGTTCTGGTTTGAGTTCTTCTTCTTCTTCCTCCTCCATGTCCTCTTCTATGGCCAAGGCGGCGG
ATCAGATGGTCAAGGTTGAGATTGAGGCGGCAGAGGCTCTTGCTGGTTTGGCGGTTTTGGCGGTCAGAGAAACTGGACCTCAACCGTCCGAAACCAAATGGGGAATTAAA
GGGAAAGGGAAACGAGCTAGGAAGGAGGTTAAGAACGAGTCGCCGACTTCTGCCTTCGCCGACTCTTTACCTAGTCGAGCGGATCTGGACCTTCGGATTGAGATTTTGAT
TGCATTTTGTCCTTTGATTTTGCTGAAATTCAAAATGAGGGGTGGATTTCTGATTTTGCGTTTTTTAGATGGAGAACTAAATTTTGCACTCGGACTAGGATTCAATTTGG
TGATCCTTATCCCAGAAGTTTGGAGTAGAATTAAGAATGTGGAGATTTTTGTCTTGGATTCTAAAACAAGTAGTGGGCCATTGCTAGAGGATGGAGGGGTGGTAAGACAT
CAGCCATCAGAAAAAGAATGTACTAATCAGTCCCACCCTGAGTGGGAAACAACCGGAGAGATGATAAAGGCGGACAAGGAGGCCGAATCATTTAAAGTGAGTCCTGCATG
CACTACAAGCTACCAGTTATTTGGCTGCAGGAGATCAAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAGTACGACGGATTTTAGCAAACAGAGAGTCAGCCC
GGCAGACAATTCGGCGTAGGCAGGCTCTGTGCGAGGAGTTGACCAGAAAAGCTGCTGATCTAGCATGGGAAAATGAAAATTTAAAGAGGGTAGGCATCCCCGAAAATGAT
TCTATTTTCTTCTTCGACCCCACCCCACCTACCCCACCGCACCCCAAACACCAGAAGAAGGACCCCTTTATATTCTTTAACTTTTCTGAAAAGGAGTTGGCCCTGAAAGA
GTATCAATCTCTGGAGACTACTAACAAGGAATTAAAAGAACAGTTGGCTGAAGCAGTAAAGCCGAAGGTGGAGGAGATCCCAGGAAACAATAGAACATCTCATGTTCAGA
TGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGTGTTCCGTATTTCTGGCCATCTGTGGTTCAACCTACAAGTCCTTATCATGAACTACCCAATGTCCTC
GTCGTCCCGTCAAGTATTAATTTGCCTGCTAATAATAATGTTTCTGTGTCTGGCTCTTCCCATGTACAAGAAAACTTTACGAACGTCACTGGCCCGAGAACACCCTTGTG
TATAGTACCACCTTGTTCTTGGTTGTTGCCTCATCATGATTTTAGGAACCAACAGAGTCCTCAAGTCTGGTTTCCCGCTGGAAATAATCTAGAGGATATTTGTTCGAAAT
CCCAAAACAGTGCTAATTCTTCAAAGGTTGTGCATGCAGAAAGCAGACATTCTTCTTTGCCCTCAGCTGAAGAAGAACACGAAGCTCCTGACTTGAATGAATCTCCTAAT
TTAAACGAAGCTTCAAATCCAAAGGATCATACTGAGAACACAGTTGGAGTAGCTGTGGACGGATTTGATACCAACACAAGAGCTCCAGTTAGAAAAGTGCTTTCTCCTGT
AAGACTTGAACGTATTGAACCCAGTTCCGCTGTCCAACAAAATAACCGGAGCGAAGATGATCACGGTCTGTCATCAAGAACTTGTGATGACTTGTGTGATTTTGCAGAAA
GAAGGCATGAACCAGAGATAGTCCCCTGTAAGAAAACCATAGATGCAATGGCTGCAACTGAGGCCAGGAGGAGGAGAAAAGAACTAACAAAATTAAAGAACCTTTACACC
CGTCAGTTCCGTATGCATTCCTGATCTATATGGCCAGGAAGTTGGGCAACCGTTTGTCGTCATCGTCGACACATCAACCTTATGTGTTAAAGTCCTGTATTTCATTGTCT
TTTGTTGCCAGAGGCAATCACAGAGCAGTGAGACGTAACCAAAGTTCTGAACTTCACTTCAGCTTTCTTGTTGAGGCGTTCCTTGTGCATTTGCAGAGCTCAGAGCGGGT
CGAGATCTTGCTGCTGGGTTTGGCAGTGATGGGGAAGGAAATTTAGGCATCGGAGCATTTTTTTTTCGTGCCTGTTTTGCTGAATGAAATGAGAGAGGTACTGAGGAATA
AGATGTTGATTAGTTGCAAGAGGTTTGAGTTGGATTTGGTAGGGAAGCAAGCAAAGAGGAGGAGAAGGAGGAGATTGTAAGAATTTTGTATTTAAATGTTCTATGGCTAC
TTCAAAACCCTTTTATTGGGTTGGCTACAGTTCTTCAAAAAAGTGAAGCAGAAGAATGCATCTTCTGATGAGTATGGAGATCTGCTAATCTACTTCATTAAGAAGCAGGA
AACAAATAAATGATCTAGAGGTAGCCAAATGAAAATCTTACTGTTTTTTTTTTTTTCTGAAAACTTGAATAGGATTACTTCATACAAATTATAGTTATCCTAAGACTTTT
GCCCATCTTTTCTTTTCCAATTTTATAGCAAATTTGAATTCGGTTTGGTAGCCA
Protein sequenceShow/hide protein sequence
MPRARKFSPSSTRALFLFHSLTLFILLLLLLCYGFSLLPLMASSSKCSDVATCSGLSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLAVRETGPQPSETKWGIK
GKGKRARKEVKNESPTSAFADSLPSRADLDLRIEILIAFCPLILLKFKMRGGFLILRFLDGELNFALGLGFNLVILIPEVWSRIKNVEIFVLDSKTSSGPLLEDGGVVRH
QPSEKECTNQSHPEWETTGEMIKADKEAESFKVSPACTTSYQLFGCRRSRRNLTEAEKEERRVRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKRVGIPEND
SIFFFDPTPPTPPHPKHQKKDPFIFFNFSEKELALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNNRTSHVQMPPLPTNYPLFLFSRVPYFWPSVVQPTSPYHELPNVL
VVPSSINLPANNNVSVSGSSHVQENFTNVTGPRTPLCIVPPCSWLLPHHDFRNQQSPQVWFPAGNNLEDICSKSQNSANSSKVVHAESRHSSLPSAEEEHEAPDLNESPN
LNEASNPKDHTENTVGVAVDGFDTNTRAPVRKVLSPVRLERIEPSSAVQQNNRSEDDHGLSSRTCDDLCDFAERRHEPEIVPCKKTIDAMAATEARRRRKELTKLKNLYT
RQFRMHS