; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G019100 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G019100
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDNA binding protein, putative isoform 1
Genome locationchr03:30463683..30471816
RNA-Seq ExpressionLsi03G019100
SyntenyLsi03G019100
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR017956 - AT hook, DNA-binding motif
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043896.1 DNA binding protein, putative isoform 1 [Cucumis melo var. makuwa]0.0e+0081.08Show/hide
Query:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST
        KGKKKPPA+E KEP+KR KKK   AT          TSVNE Q T RL+   PKV VSEFD CVENHF+AMD IVELC EAE+GDGGIDESDIQRFSSST
Subjt:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST

Query:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG
        IFLREWRFYNYE KTIKFA  S GPEGKDADITINLPQFSSAAVLK G+PPGA+TSLDFRNF MHVGGPVWA+DWCPQVH RT+S IKCEFIAVSAHPPG
Subjt:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG

Query:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA
        SSYHKMGIPLTGRGMVQIWCLVHGTE+YEP DVGEPPSDLSSQPK+PRGRPPGRKK  ASGLPS PKRPRGRPKK+Q+ES DKKGD  QLVQ FS+ENP 
Subjt:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA

Query:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC
        GSS+LLE DGVPKN+E  VLLEN+VERE STLQEVSTCNSEDEVP +KRRVRRK +++N VDDVG SSLTE QED S A NH+A+ENV  EYSGEDNLLC
Subjt:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC

Query:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK
        K+ISEN VLD SSIE +IPESVALPRVVLCLAHNGKVAWDLKWKP NACTDNCKHRMGYLAVLLG+GSLEVWEVPFPHAVK IYSKFNGEGTDPRFVKLK
Subjt:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK

Query:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD
        P+FRCS L+TAN QSIPLTVEWSL PPYDYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSE                      RD
Subjt:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD

Query:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS
        PFRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAA DVP TGQPFTAIKQKGLHTY CSSYAIWSIQVSRQTGMVAYCGADGAVVRFQ  
Subjt:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS

Query:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD
                  LTTKAADKENSRHRTPHY+CEYLTEEESIIT  +P  NVP  LKKLSNKSEHPLSMRAILSDS+QSNEGNHKTATA  LENE+++CSDVD
Subjt:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD

Query:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC
        VGVESGSEDT +S KKKN+TQ KCKKK VEN ELEC+ EP DDA +DADV+AQT        DADVVP SGD FE+LPPKSVAMHRVRWNMN+GSE+WLC
Subjt:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC

Query:  YGGAAGILR
        YGGA+GILR
Subjt:  YGGAAGILR

XP_004149225.3 uncharacterized protein LOC101210135 isoform X1 [Cucumis sativus]0.0e+0078.27Show/hide
Query:  KGKKKPPARE-KEPQKRGKKK--------EAGATTSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIF
        KGKKKPPA+E KEP+KR KKK         A  +T VN+ Q+T RLD   P+V VSEFD CVENHF+AMD IVELC EAEDGDGGIDESDIQRFSSSTIF
Subjt:  KGKKKPPARE-KEPQKRGKKK--------EAGATTSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIF

Query:  LREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSS
        LREWRFYNYEPKTIKFA  SRGPEGKDADITI+LPQFSSAAVLK G+PPGA+TSLDFRNF MHVGGPVWA+DWCPQVHERT+S IKCEFIAVSAHPPGSS
Subjt:  LREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSS

Query:  YHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESND-KKGDGYQLVQAFSIENPAG
        YHKMGIPLTGRGMVQIWCLVHGTESYEP DVGEPPSDLSSQPKRPRGRPPGRK+ GAS LPSQPKRPRGRPKK+Q+ESND KKGD  QLVQ FS+ENP G
Subjt:  YHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESND-KKGDGYQLVQAFSIENPAG

Query:  SSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTC-------------------------------NSEDEVPTQKRRVRRKAETKNHVDDVGTSSLT
        SSNLLE DGVPKN+E  VLLEN+VERE STLQEVSTC                               NSEDEVP +KRRVRRK + +N VDDVG  SL 
Subjt:  SSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTC-------------------------------NSEDEVPTQKRRVRRKAETKNHVDDVGTSSLT

Query:  ENQEDRSNAMNHDANENVIHEYSGEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLE
        E QED S A NH+ANENV  EYSGEDNLLCK+ISEN VLD SSIE +IPESVALPRVVLCLAHNGKVAWDLKWKP NACTDNCKHRMGYLAVLLG+GSLE
Subjt:  ENQEDRSNAMNHDANENVIHEYSGEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLE

Query:  VWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAV
        VWEVPFPHAVKAIYSKFNGEGTDPRF+KLKP+FRCS L+T N QSIPLTVEWS TPPYDYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAV
Subjt:  VWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAV

Query:  AWAPSE----------------------RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSS
        AWAPSE                      RDPFRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAA DVP TG+PFTAIKQKGLHTY CSS
Subjt:  AWAPSE----------------------RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSS

Query:  YAIWSIQVSRQTGMVAYCGADGAVVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAIL
        YAIWSIQVSRQTGMVAYCGADGAVVRFQ            LTTKAADKENSRHRTPHY+CEYLTEEESIIT  +P  NVP  LKKLSNKSEHPLSMRAIL
Subjt:  YAIWSIQVSRQTGMVAYCGADGAVVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAIL

Query:  SDSVQSNEGNHKTATAPALENESTLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGS
        SDSVQSNE   KTATA  LENE+T+CSDVDV VESGSEDT+   KKKN+TQ KC K+ VE  ELECS+EP DDA MDADVDAQT        DAD +P S
Subjt:  SDSVQSNEGNHKTATAPALENESTLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGS

Query:  GDRFESLPPKSVAMHRVRWNMNIGSERWLCYGGAAGILR
        GD FE+LPPKSVAMHRVRWNMNIGSE WLCYGGAAGILR
Subjt:  GDRFESLPPKSVAMHRVRWNMNIGSERWLCYGGAAGILR

XP_008442823.1 PREDICTED: uncharacterized protein LOC103486595 [Cucumis melo]0.0e+0080.75Show/hide
Query:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST
        KGKKKPPA+E KEP+KR KKK   AT          TSVNE Q T RL+   PKV VSEFD CVENHF+AMD IVELC EAE+GDGGIDESDIQRFSSST
Subjt:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST

Query:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG
        IFLREWRFYNYE KTIKFA  S GPEGKDADITINLPQFSSAAVLK G+PPGA+TSLDFRNF MHVGGPVWA+DWCPQVH RT+S IKCEFIAVSAHPPG
Subjt:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG

Query:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA
        SSYHKMGIPLTGRGMVQIWCLVHGTE+YEP DVGEPPSDLSSQPK+PRGRPPGRKK  ASGLPS PKRPRGRPKK+Q+ES DKKGD  QLVQ FS+ENP 
Subjt:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA

Query:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC
        GSS+LLE DGVPKN+E  VLLEN+VERE STLQEVSTCNSEDEVP +KRRVRRK +++N VDDVG SSLTE QED S A NH+A+ENV  EYSGEDNLLC
Subjt:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC

Query:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK
        K+ISEN VLD SSIE +IPESVALPRVVLCLAHNGKVAWDLKWKP NACTDNCKHRMGYLAVLLG+GSLEVWEVPFPHAVK IYSKFNGEGTDPRFVKLK
Subjt:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK

Query:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD
        P+FRCS L+TAN QSIPLTVEWSL PPYDYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSE                      RD
Subjt:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD

Query:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS
        PFRPLWDLHPAPRIIYSLDWLP+PR + LSFDDGTLRLLSLLKAA DVP TGQPFTAIKQKGLHTY CSSYAIWSIQVSRQTGMVAYCGADGAVVRFQ  
Subjt:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS

Query:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD
                  LTTKAADKENSRHRTPHY+CEYLTEEESIIT  +P  NVP  LKKLSNKSEHPLSMRAILSDS+QSNEGNHKTATA  LENE+++CSDVD
Subjt:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD

Query:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC
        VGVESGSEDT +S KKKN+TQ KCKKK VEN ELEC+ EP DDA +DADV+AQT        DADVVP SGD FE+LPPKSVAMHRVRWNMN+GSE+WLC
Subjt:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC

Query:  YGGAAGILR
        YGGA+GILR
Subjt:  YGGAAGILR

XP_023528187.1 uncharacterized protein LOC111791176 [Cucurbita pepo subsp. pepo]0.0e+0077.86Show/hide
Query:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGATTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRF
        MEEL   AE ++GTSCKKGKKK  + E EPQKR KKK AGA TSVNE Q TGRLD  +V VSEFDHCVENHF+A+D I EL GEAE+G+GG+DESD QRF
Subjt:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGATTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSA
        SSST FLREW+FYNYEPKT+KF + SR PEGKDADIT+ LPQFSSAAVLKNG+PPGATTSLDFRNF+MHVGGPVWA+DWCP VHERTDS IKCEFIAVSA
Subjt:  SSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSA

Query:  HPPGSSYHKMGIPLTGRGMVQIWCLVHGTESY--EPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESN-DKKGDGYQLVQA
        HPPGSSYH MGIPL+GRGMVQIWCLVHGTES+  E T   E      SQPKRPRGRPPGRKKNGAS LPSQPKRPRGRPKKKQEE N D K   YQLVQ 
Subjt:  HPPGSSYHKMGIPLTGRGMVQIWCLVHGTESY--EPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESN-DKKGDGYQLVQA

Query:  FSIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYS
         S+E P  SSNLLE D VP NSEK V LENSVER  ST++E+STCNSEDEVP QKRRVRR A+TKNHVDDVGT SL EN+ED  NA NH+ANENV  EYS
Subjt:  FSIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYS

Query:  GEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTD
        GED LLCKNISENA+LDT S   +IPESVALPR+VLCLAHNGKVAWDLKWKPTNA T  CK RMGYLAVLLG+GSLEVWEVPFPH VKAIYSK NGEGTD
Subjt:  GEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTD

Query:  PRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE-----------------
        PRFV+LKP FRCSML++A+ QSIPLTVEWS TPPYDYLLAGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSE                 
Subjt:  PRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE-----------------

Query:  -----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGA
             RDPFRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTY CS +AIWSIQVSRQTGMVAYCGADGA
Subjt:  -----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGA

Query:  VVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENES
        VVRFQ            LTTKA DKENSR+RTPH++CEYLTEE+SIIT+H+PA++VP  LKKL+NKSE PLSMRAILSDS+Q NEGN K+AT  ALENES
Subjt:  VVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENES

Query:  TLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY
         LC D DVGVESGSEDT MSI+ KNQTQSK KKK V NQELE S+EP+         D+QTD DVVPGSGD FE+ PPKSVA+HR+RWNMNIGSERWLCY
Subjt:  TLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY

Query:  GGAAGILR
        GGAAGILR
Subjt:  GGAAGILR

XP_038903194.1 uncharacterized protein LOC120089853 [Benincasa hispida]0.0e+0085.57Show/hide
Query:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGA---TTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDI
        MEELQP  +P+IGTS KKGKKKPPAREK+  ++  + + GA   TTSVN+ Q TGRLDGPKV VSEFDHC+ENHF AMDTIVELC EAE  DGGIDESDI
Subjt:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGA---TTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDI

Query:  QRFSSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIA
        QRF+SSTIFLREWRFYNYEPK IKFA+ SRGPEGKDADITI LPQFSSAAVLKNG+PPGATTSLDFRNF MHVGGPVWALDWCPQVHERTDS IKCEFIA
Subjt:  QRFSSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIA

Query:  VSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQA
        VSAHPPGSSYHKMGIPLTGRGMVQIWC VHGTESYEPT+V EPP+DLSSQPKRPRGRP GRKKNGASGLP QPKRPRGRPKKKQEESNDKKGD   LVQA
Subjt:  VSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQA

Query:  FSIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYS
        FSIENP GSSNLLE DGVPKNSE IVLLENSVERE STLQEVSTCNSEDEVP QKRRVRRK E KNHV DVG  SLTEN+ED SNA++ +ANENV+ EYS
Subjt:  FSIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYS

Query:  GEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTD
        GEDNLLCKNIS NAVLDTSSIE +IPESVALPRVVLCLAHNGKVAWDLKWKPTNA TDNCK RMGYLAVLLG+GSLEVWEVPFPHAVKAIYSKFNGEGTD
Subjt:  GEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTD

Query:  PRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE-----------------
        PRFVKLKP+FRCSML+ AN QSIPLTVEWS TPPYDYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSE                 
Subjt:  PRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE-----------------

Query:  -----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGA
             RDPFRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGA
Subjt:  -----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGA

Query:  VVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENES
        VVRFQ            LTTKAADKENSRHRTPHY+CEYLTEEES IT+H+P  N+PFSLKKLSNKSEHPLSMRAILSDS+QSNEGNHKTATAPALENES
Subjt:  VVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENES

Query:  TLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY
         LCSDVDVGVESG EDT+MSIKKKN+TQSKC KK VENQ+L+CS+EPNDDAQMDADVD QTDA VVPGS D+FESLPPKSVAMHRVRWNMNIGSERWLCY
Subjt:  TLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY

Query:  GGAAGILR
        GGAAGILR
Subjt:  GGAAGILR

TrEMBL top hitse value%identityAlignment
A0A0A0LGM2 Uncharacterized protein0.0e+0080.73Show/hide
Query:  KGKKKPPARE-KEPQKRGKKK--------EAGATTSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIF
        KGKKKPPA+E KE +KR KKK         A  +T VN+ Q+T RLD   P+V VSEFD CVENHF+AMD IVELC EAEDGDGGIDESDIQRFSSSTIF
Subjt:  KGKKKPPARE-KEPQKRGKKK--------EAGATTSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIF

Query:  LREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSS
        LREWRFYNYEPKTIKFA  SRGPEGKDADITI+LPQFSSAAVLK G+PPGA+TSLDFRNF MHVGGPVWA+DWCPQVHERT+S IKCEFIAVSAHPPGSS
Subjt:  LREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSS

Query:  YHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESND-KKGDGYQLVQAFSIENPAG
        YHKMGIPLTGRGMVQIWCLVHGTESYEP DVGEPPSDLSSQPKRPRGRPPGRK+ GAS LPSQPKRPRGRPKK+Q+ESND KKGD  QLVQ FS+ENP G
Subjt:  YHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESND-KKGDGYQLVQAFSIENPAG

Query:  SSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLCK
        SSNLLE DGVPKN+E  VLLEN+VERE STLQEVSTC+SEDEVP +KRRVRRK + +N VDDVG  SL E QED S A NH+ANENV  EYSGEDNLLCK
Subjt:  SSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLCK

Query:  NISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKP
        +ISEN VLD SSIE +IPESVALPRVVLCLAHNGKVAWDLKWKP NACTDNCKHRMGYLAVLLG+GSLEVWEVPFPHAVKAIYSKFNGEGTDPRF+KLKP
Subjt:  NISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKP

Query:  VFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RDP
        +FRCS L+T N QSIPLTVEWS TPPYDYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSE                      RDP
Subjt:  VFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RDP

Query:  FRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVSS
        FRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAA DVP TG+PFTAIKQKGLHTY CSSYAIWSIQVSRQTGMVAYCGADGAVVRFQ   
Subjt:  FRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVSS

Query:  TCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVDV
                 LTTKAADKENSRHRTPHY+CEYLTEEESIIT  +P  NVP  LKKLSNKSEHPLSMRAILSDSVQSNE   KTATA  LENE+T+CSDVDV
Subjt:  TCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVDV

Query:  GVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY
         VESGSEDT+   KKKN+TQ KC K+ VE  ELECS+EP DDA MDADVDAQT        DAD +P SGD FE+LPPKSVAMHRVRWNMNIGSE WLCY
Subjt:  GVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY

Query:  GGAAGILR
        GGAAGILR
Subjt:  GGAAGILR

A0A1S3B6M4 uncharacterized protein LOC1034865950.0e+0080.75Show/hide
Query:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST
        KGKKKPPA+E KEP+KR KKK   AT          TSVNE Q T RL+   PKV VSEFD CVENHF+AMD IVELC EAE+GDGGIDESDIQRFSSST
Subjt:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST

Query:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG
        IFLREWRFYNYE KTIKFA  S GPEGKDADITINLPQFSSAAVLK G+PPGA+TSLDFRNF MHVGGPVWA+DWCPQVH RT+S IKCEFIAVSAHPPG
Subjt:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG

Query:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA
        SSYHKMGIPLTGRGMVQIWCLVHGTE+YEP DVGEPPSDLSSQPK+PRGRPPGRKK  ASGLPS PKRPRGRPKK+Q+ES DKKGD  QLVQ FS+ENP 
Subjt:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA

Query:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC
        GSS+LLE DGVPKN+E  VLLEN+VERE STLQEVSTCNSEDEVP +KRRVRRK +++N VDDVG SSLTE QED S A NH+A+ENV  EYSGEDNLLC
Subjt:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC

Query:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK
        K+ISEN VLD SSIE +IPESVALPRVVLCLAHNGKVAWDLKWKP NACTDNCKHRMGYLAVLLG+GSLEVWEVPFPHAVK IYSKFNGEGTDPRFVKLK
Subjt:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK

Query:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD
        P+FRCS L+TAN QSIPLTVEWSL PPYDYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSE                      RD
Subjt:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD

Query:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS
        PFRPLWDLHPAPRIIYSLDWLP+PR + LSFDDGTLRLLSLLKAA DVP TGQPFTAIKQKGLHTY CSSYAIWSIQVSRQTGMVAYCGADGAVVRFQ  
Subjt:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS

Query:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD
                  LTTKAADKENSRHRTPHY+CEYLTEEESIIT  +P  NVP  LKKLSNKSEHPLSMRAILSDS+QSNEGNHKTATA  LENE+++CSDVD
Subjt:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD

Query:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC
        VGVESGSEDT +S KKKN+TQ KCKKK VEN ELEC+ EP DDA +DADV+AQT        DADVVP SGD FE+LPPKSVAMHRVRWNMN+GSE+WLC
Subjt:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC

Query:  YGGAAGILR
        YGGA+GILR
Subjt:  YGGAAGILR

A0A5D3DPQ1 DNA binding protein, putative isoform 10.0e+0081.08Show/hide
Query:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST
        KGKKKPPA+E KEP+KR KKK   AT          TSVNE Q T RL+   PKV VSEFD CVENHF+AMD IVELC EAE+GDGGIDESDIQRFSSST
Subjt:  KGKKKPPARE-KEPQKRGKKKEAGAT----------TSVNEDQATGRLDG--PKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSST

Query:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG
        IFLREWRFYNYE KTIKFA  S GPEGKDADITINLPQFSSAAVLK G+PPGA+TSLDFRNF MHVGGPVWA+DWCPQVH RT+S IKCEFIAVSAHPPG
Subjt:  IFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPG

Query:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA
        SSYHKMGIPLTGRGMVQIWCLVHGTE+YEP DVGEPPSDLSSQPK+PRGRPPGRKK  ASGLPS PKRPRGRPKK+Q+ES DKKGD  QLVQ FS+ENP 
Subjt:  SSYHKMGIPLTGRGMVQIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPA

Query:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC
        GSS+LLE DGVPKN+E  VLLEN+VERE STLQEVSTCNSEDEVP +KRRVRRK +++N VDDVG SSLTE QED S A NH+A+ENV  EYSGEDNLLC
Subjt:  GSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLC

Query:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK
        K+ISEN VLD SSIE +IPESVALPRVVLCLAHNGKVAWDLKWKP NACTDNCKHRMGYLAVLLG+GSLEVWEVPFPHAVK IYSKFNGEGTDPRFVKLK
Subjt:  KNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLK

Query:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD
        P+FRCS L+TAN QSIPLTVEWSL PPYDYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSE                      RD
Subjt:  PVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RD

Query:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS
        PFRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAA DVP TGQPFTAIKQKGLHTY CSSYAIWSIQVSRQTGMVAYCGADGAVVRFQ  
Subjt:  PFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVS

Query:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD
                  LTTKAADKENSRHRTPHY+CEYLTEEESIIT  +P  NVP  LKKLSNKSEHPLSMRAILSDS+QSNEGNHKTATA  LENE+++CSDVD
Subjt:  STCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVD

Query:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC
        VGVESGSEDT +S KKKN+TQ KCKKK VEN ELEC+ EP DDA +DADV+AQT        DADVVP SGD FE+LPPKSVAMHRVRWNMN+GSE+WLC
Subjt:  VGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQT--------DADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLC

Query:  YGGAAGILR
        YGGA+GILR
Subjt:  YGGAAGILR

A0A6J1F7U5 uncharacterized protein LOC111441649 isoform X10.0e+0076.87Show/hide
Query:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGATTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRF
        MEEL   AE ++GTSCKKGKKK  + E EPQKR KKK  G  TSVNE Q TGRLD  +V VSEFDHCVENHF+A+D I EL GEAE+G+GG+DESD QRF
Subjt:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGATTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSA
        SSST FLREW+FYNYEPKT+KF + SR PEGKDADIT+ LPQFSSAAVLKNG+PPGAT SLDFRNF+MHVGGPVWA+DWCP VHERTDS IKCEFIAVSA
Subjt:  SSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSA

Query:  HPPGSSYHKMGIPLTGRGMVQIWCLVHGTESY--EPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESN-DKKGDGYQLVQA
        HPPGSSYH MGIPL+GRGMVQIWCLVHGTES+  E T   E      SQPKRPRGRPPGRKKNGAS LPSQPKRPRGRPKKKQEE N D K   YQLVQ 
Subjt:  HPPGSSYHKMGIPLTGRGMVQIWCLVHGTESY--EPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESN-DKKGDGYQLVQA

Query:  FSIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYS
         S+E P  SSNLLE D V  NSEK V LENSVER  ST++E+STCNSEDEVP QKRRVRR A+TKNHVDDVGT SL EN+ED SNA NH+ANENV  EYS
Subjt:  FSIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYS

Query:  GEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTD
        GED  LCKNISE A+LDT S   +IPE+VALPR+VLCLAHNGKVAWDLKWKPTNA T  CK RMGYLAVLLG+GSLEVWEVPFPH VKAIYSK NGEGTD
Subjt:  GEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTD

Query:  PRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE-----------------
        PRFVKLKP FRCSML++A+ QSIPLTVEWS TPPYDYLLAGCHDGTVALWKFSA+ST EDTRPLLRFSADTVPIRAVAWAPSE                 
Subjt:  PRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE-----------------

Query:  -----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGA
             RDPFRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTY CS +AIWSIQVSRQTGMVAYCGADGA
Subjt:  -----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGA

Query:  VVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENES
        VVRFQ            LTTKA DKENSR+RTPH++CEYLTEE+SIIT+H+PA++VP  LKKLSNKSE PLSMRAILSDS+Q NEGN K+AT  ALENES
Subjt:  VVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENES

Query:  TLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY
         LC D DV VESGSEDT MSI+ KNQTQSK KKK V NQELE S+EP+         D+QTD DVVPG G+ FE+ PPKSVA+HR+RWNMNIGSERWL Y
Subjt:  TLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCY

Query:  GGAAGILR
        GGAAGILR
Subjt:  GGAAGILR

A0A6J1J0H6 uncharacterized protein LOC1114815740.0e+0074.86Show/hide
Query:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGATTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRF
        MEEL   AE ++GTSCKKGKKK  + E EP KR KKK AGA TSVNE Q TGRLD  +V VSEFDHCVENHF+A+D I EL GEAE+G+GG+DESD QRF
Subjt:  MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGATTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSA
        SSST FLREW+FYNYEPKT+KF + SR PEGKDADIT+ LPQFSSAAVLKNG+PPGATTSLDFRNF+MHVGGPVWA+DWCP VHERTDS IKCEFIAVSA
Subjt:  SSSTIFLREWRFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSA

Query:  HPPGSSYHKMGIPLTGRGMVQIWCLVHGTESY--EPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAF
        HPPGSSYH MGIPL+GRGMVQIWCLVHGTES+  E T+  E  +   SQPKRPRGRPPGRKKNGAS L SQ KRPRGRPKKKQEE ND +   YQLVQ  
Subjt:  HPPGSSYHKMGIPLTGRGMVQIWCLVHGTESY--EPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAF

Query:  SIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSG
        S+E P  SSNLLE D VP NSEK+V LENSVER  ST++E+STCNSEDEVP QKRR RR A+TKNHVDDVGT                            
Subjt:  SIENPAGSSNLLETDGVPKNSEKIVLLENSVEREGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSG

Query:  EDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDP
            LCKNISENA+LDT S   +IPESVALPR+VLCLAHNGKVAWDLKWKPTNA T  CK RMGYLAVLLG+GSLEVWE+PFPH VKAIYS  NGEGTDP
Subjt:  EDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDP

Query:  RFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE------------------
        RFVKLKP FRCSML++A+ QSIPLTVEWS TPPYDYLLAGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSE                  
Subjt:  RFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE------------------

Query:  ----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAV
            RDPFRPLWDLHPAPRIIYSLDWLP+PRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTY CS +AIWSIQVSRQTGMVAYCGADGAV
Subjt:  ----RDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAV

Query:  VRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENEST
        VRFQ            LTTKA DKENSR+RTPH++CEYLTEE+SIIT+H+PA++VP  LKKLSNKSE PLSMRAILSDS+Q NEGN K+AT  ALENES 
Subjt:  VRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENEST

Query:  LCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCYG
        LC D DVGVESGSEDT MSI+ KNQTQSK KKK V NQELE S+EP+         D+QTD DVVPG GD FE+ PPKSVA+HR+RWNMNIGSERWLCYG
Subjt:  LCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCYG

Query:  GAAGILR
        GAAGILR
Subjt:  GAAGILR

SwissProt top hitse value%identityAlignment
Q5RDC3 General transcription factor 3C polypeptide 22.7e-0423.57Show/hide
Query:  WDLKWKPTNAC-------TDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIP-------LTVEWSL
        WDLK+ P+ A              R+G LA+    G + ++ +P P   +A+ ++   +   P   K++ V   + L+  + Q+         L++ W  
Subjt:  WDLKWKPTNAC-------TDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIP-------LTVEWSL

Query:  TPPYDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRAVAW
        T P+ +L AG ++G V  W    NS  +  R         P   F A    +R + W
Subjt:  TPPYDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRAVAW

Q8BL74 General transcription factor 3C polypeptide 22.1e-0423.12Show/hide
Query:  WDLKWKPTNACTD-------NCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIP----------LTVE
        WDLK+ P+ A              R+G LA+    G + ++ +P P A+ A           P       +++   L T    S+           L++ 
Subjt:  WDLKWKPTNACTD-------NCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIP----------LTVE

Query:  WSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRAVAW
        W  T P+ +L AG ++G V  W    NS  +  R         P   F A    +R + W
Subjt:  WSLTPPYDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRAVAW

Q8WUA4 General transcription factor 3C polypeptide 22.7e-0423.57Show/hide
Query:  WDLKWKPTNAC-------TDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIP-------LTVEWSL
        WDLK+ P+ A              R+G LA+    G + ++ +P P   +A+ ++   +   P   K++ V   + L+  + Q+         L++ W  
Subjt:  WDLKWKPTNAC-------TDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIP-------LTVEWSL

Query:  TPPYDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRAVAW
        T P+ +L AG ++G V  W    NS  +  R         P   F A    +R + W
Subjt:  TPPYDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRAVAW

Arabidopsis top hitse value%identityAlignment
AT1G19485.1 Transducin/WD40 repeat-like superfamily protein8.0e-18544.08Show/hide
Query:  LDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIFLREWRFYNYEPKTIKFAT-GSRGPEGKDADITINLPQFSSAAV--LK
        +DG +  +S FD+  E+H KA+++I +LCGEA   +  IDE+DI   SSS  FLREWR YN+EPK+  F     +  + KD + +  LPQFSSA    +K
Subjt:  LDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIFLREWRFYNYEPKTIKFAT-GSRGPEGKDADITINLPQFSSAAV--LK

Query:  NGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT---ESYEPTDVG---------
              +++    ++FVMHVGG VWA++WCP+VH   D+  KCEF+AV+ HPP S  HK+GIPL GRG++QIWC+++ T   +S + +D G         
Subjt:  NGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT---ESYEPTDVG---------

Query:  EPPSDL--SSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQ-EESNDKKGDGYQLVQAFSIENPAGSSNLLETDGVPKNSEKIV----LLENSVER
        +P  +   +++PK+PRGRP   +K+      ++PK+PRGRP+KK   E   +  D    V+A S+  P       E   VP    +I+    + E  V  
Subjt:  EPPSDL--SSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQ-EESNDKKGDGYQLVQAFSIENPAGSSNLLETDGVPKNSEKIV----LLENSVER

Query:  EGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLCKNISENAVLDTSSIELTIPESVALPRV
        EGS  Q +S+ N+  ++P     VRRK +     ++  T  + E  E   N  +  ++                 ISE              + VALPRV
Subjt:  EGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLCKNISENAVLDTSSIELTIPESVALPRV

Query:  VLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPP
        VLCLAHNGKV WD+KW+P+ A     KH MGYLAVLLG+GSLEVW+VP P A  A+Y       TDPRFVKL PVF+CS LK  + +SIPLTVEWS    
Subjt:  VLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPP

Query:  YDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RDPFRPLWDLHPAPRIIYSLDWLPSPRCV
         D+LLAGCHDGTVALWKFS   + EDTRPLL FSADT PIRAVAWAP E                      RDPFRPLWDLHP PR IYSLDWL  P CV
Subjt:  YDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RDPFRPLWDLHPAPRIIYSLDWLPSPRCV

Query:  FLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVSSTCTVDFAMELTTKAADKENSRHRTPH
         LSFDDGTLR+LSL+K AYDVP TG+P+   KQ+GL  Y CS++ IWSIQVSR TG+ AYC ADG++  F            ELTTKA +K+ +R+RTPH
Subjt:  FLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVSSTCTVDFAMELTTKAADKENSRHRTPH

Query:  YICEYLTEEESIITLHTPAANVPFSLKK-LSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKK
        Y+C  LT ++S   +H+P  ++P  LKK +    E    +R++L++S      N       A  +        D G+ES SE T     K    + K   
Subjt:  YICEYLTEEESIITLHTPAANVPFSLKK-LSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKK

Query:  KRVENQE---LECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCYGGAAGILR
        +  E++    L C  E         + + +  A     +G + E  PPK VAMHRVRWNMN GSERWLCYGGAAGI+R
Subjt:  KRVENQE---LECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCYGGAAGILR

AT1G19485.2 Transducin/WD40 repeat-like superfamily protein8.0e-18544.08Show/hide
Query:  LDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIFLREWRFYNYEPKTIKFAT-GSRGPEGKDADITINLPQFSSAAV--LK
        +DG +  +S FD+  E+H KA+++I +LCGEA   +  IDE+DI   SSS  FLREWR YN+EPK+  F     +  + KD + +  LPQFSSA    +K
Subjt:  LDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIFLREWRFYNYEPKTIKFAT-GSRGPEGKDADITINLPQFSSAAV--LK

Query:  NGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT---ESYEPTDVG---------
              +++    ++FVMHVGG VWA++WCP+VH   D+  KCEF+AV+ HPP S  HK+GIPL GRG++QIWC+++ T   +S + +D G         
Subjt:  NGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT---ESYEPTDVG---------

Query:  EPPSDL--SSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQ-EESNDKKGDGYQLVQAFSIENPAGSSNLLETDGVPKNSEKIV----LLENSVER
        +P  +   +++PK+PRGRP   +K+      ++PK+PRGRP+KK   E   +  D    V+A S+  P       E   VP    +I+    + E  V  
Subjt:  EPPSDL--SSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQ-EESNDKKGDGYQLVQAFSIENPAGSSNLLETDGVPKNSEKIV----LLENSVER

Query:  EGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLCKNISENAVLDTSSIELTIPESVALPRV
        EGS  Q +S+ N+  ++P     VRRK +     ++  T  + E  E   N  +  ++                 ISE              + VALPRV
Subjt:  EGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLCKNISENAVLDTSSIELTIPESVALPRV

Query:  VLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPP
        VLCLAHNGKV WD+KW+P+ A     KH MGYLAVLLG+GSLEVW+VP P A  A+Y       TDPRFVKL PVF+CS LK  + +SIPLTVEWS    
Subjt:  VLCLAHNGKVAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPP

Query:  YDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RDPFRPLWDLHPAPRIIYSLDWLPSPRCV
         D+LLAGCHDGTVALWKFS   + EDTRPLL FSADT PIRAVAWAP E                      RDPFRPLWDLHP PR IYSLDWL  P CV
Subjt:  YDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSE----------------------RDPFRPLWDLHPAPRIIYSLDWLPSPRCV

Query:  FLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVSSTCTVDFAMELTTKAADKENSRHRTPH
         LSFDDGTLR+LSL+K AYDVP TG+P+   KQ+GL  Y CS++ IWSIQVSR TG+ AYC ADG++  F            ELTTKA +K+ +R+RTPH
Subjt:  FLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQVSRQTGMVAYCGADGAVVRFQVSSTCTVDFAMELTTKAADKENSRHRTPH

Query:  YICEYLTEEESIITLHTPAANVPFSLKK-LSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKK
        Y+C  LT ++S   +H+P  ++P  LKK +    E    +R++L++S      N       A  +        D G+ES SE T     K    + K   
Subjt:  YICEYLTEEESIITLHTPAANVPFSLKK-LSNKSEHPLSMRAILSDSVQSNEGNHKTATAPALENESTLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKK

Query:  KRVENQE---LECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCYGGAAGILR
        +  E++    L C  E         + + +  A     +G + E  PPK VAMHRVRWNMN GSERWLCYGGAAGI+R
Subjt:  KRVENQE---LECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCYGGAAGILR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAACTTCAACCTCTAGCAGAACCAACCATTGGCACTAGCTGCAAGAAAGGGAAGAAGAAGCCACCGGCTCGGGAGAAGGAACCACAGAAAAGAGGTAAGAAGAA
GGAAGCAGGGGCTACTACTTCAGTCAACGAAGACCAAGCTACTGGTCGATTAGATGGCCCCAAGGTTACGGTTTCAGAGTTTGATCATTGTGTTGAAAATCATTTTAAAG
CCATGGATACAATTGTCGAGCTCTGTGGTGAGGCAGAGGATGGGGATGGCGGAATTGATGAAAGTGACATTCAGCGCTTTTCATCATCCACAATTTTCTTGAGGGAATGG
AGGTTCTACAATTATGAGCCGAAAACTATCAAGTTTGCTACCGGTTCGAGAGGCCCTGAGGGTAAGGATGCTGACATCACAATCAACTTACCACAGTTTTCTTCTGCAGC
TGTTCTAAAGAATGGATCACCGCCTGGAGCCACTACATCTCTGGACTTCCGAAACTTTGTTATGCATGTCGGTGGGCCTGTTTGGGCCTTAGATTGGTGTCCTCAAGTTC
ATGAAAGGACCGACTCCCATATCAAATGTGAGTTTATTGCCGTTTCTGCTCATCCACCTGGTTCTTCTTATCACAAGATGGGTATCCCGCTCACTGGAAGAGGTATGGTG
CAGATATGGTGTTTAGTGCATGGCACTGAAAGCTATGAACCGACCGATGTAGGAGAGCCACCTTCAGATTTGTCATCTCAACCAAAGAGGCCTAGGGGAAGACCACCTGG
GCGCAAGAAAAATGGGGCATCGGGCTTGCCATCTCAACCAAAGAGGCCTAGAGGAAGACCTAAAAAGAAACAAGAAGAATCCAATGATAAGAAGGGTGACGGTTACCAAC
TTGTTCAGGCTTTTTCTATTGAAAACCCAGCTGGTTCATCCAACTTGCTTGAGACTGATGGTGTCCCCAAAAATTCTGAAAAAATTGTATTACTGGAAAACAGTGTTGAA
AGAGAGGGGAGTACCTTACAAGAAGTTTCTACATGCAATTCTGAAGATGAAGTTCCTACGCAGAAGAGAAGAGTGAGAAGAAAAGCTGAGACTAAGAATCATGTTGATGA
CGTGGGAACGTCATCACTTACAGAGAATCAAGAAGATAGATCCAATGCTATGAATCATGATGCAAATGAGAATGTTATACATGAATATTCTGGGGAAGACAATCTATTAT
GTAAGAACATTTCAGAGAATGCTGTTTTAGACACTAGCTCAATTGAACTTACTATTCCCGAGAGTGTTGCTTTGCCAAGGGTCGTACTGTGCTTAGCTCACAATGGAAAG
GTAGCATGGGATTTGAAATGGAAGCCAACTAATGCGTGTACTGACAATTGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAGTGGATCTCTAGAAGTCTGGGA
GGTTCCTTTTCCCCATGCAGTGAAGGCCATCTATTCTAAATTCAATGGGGAGGGTACAGATCCTCGCTTTGTGAAGTTGAAGCCTGTCTTCAGATGCTCGATGTTGAAAA
CTGCAAATGCGCAGAGCATCCCTCTGACAGTGGAATGGTCGCTAACACCTCCTTATGATTATCTTCTCGCTGGATGTCATGATGGAACGGTCGCATTGTGGAAGTTTTCT
GCAAATAGTACCTGTGAAGATACGAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCAATAAGAGCGGTTGCATGGGCACCAAGTGAAAGAGATCCTTTCCGTCCCTT
GTGGGACCTTCATCCGGCACCGAGGATCATATATAGTCTGGATTGGCTTCCTAGTCCTAGATGCGTTTTCTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGC
TAAAGGCTGCATATGATGTTCCAGTAACTGGTCAACCCTTTACAGCGATAAAACAAAAAGGGCTACACACTTACTTTTGTTCATCATATGCCATCTGGAGTATTCAAGTG
TCGAGGCAGACAGGCATGGTTGCATACTGCGGTGCTGATGGAGCTGTTGTACGTTTCCAGGTAAGTTCAACTTGCACAGTAGATTTTGCTATGGAGCTTACTACAAAAGC
AGCGGACAAAGAGAATTCACGCCATCGCACCCCACATTATATATGCGAATACTTAACCGAGGAGGAATCAATTATTACACTCCACACTCCAGCAGCAAATGTGCCATTCT
CTTTGAAGAAGCTGTCCAACAAATCTGAACATCCATTGTCCATGCGAGCTATTTTATCTGATTCGGTACAGTCAAATGAAGGAAATCATAAAACTGCCACAGCTCCAGCA
TTGGAAAATGAATCAACTCTTTGCTCGGATGTCGATGTCGGTGTTGAATCTGGATCTGAGGATACAATGATGTCCATCAAGAAGAAAAACCAAACTCAATCAAAGTGCAA
GAAGAAGAGAGTTGAGAACCAAGAATTGGAATGTAGCAATGAGCCTAATGATGATGCACAGATGGACGCTGACGTAGATGCACAGACGGATGCTGACGTAGTGCCTGGTT
CGGGGGATCGCTTTGAAAGTCTCCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACATAGGGAGTGAAAGATGGTTGTGCTATGGCGGAGCAGCTGGA
ATTCTACGCTTTATAAGACCAGGGAAGCCATTTCAAATTGCAATAGTTGTTATCGCCATCGAGCCACGGCGCCTTAATCCTCGAGGGGCTAAATTATGGGAGCAGGCTAA
AGGAAGAAAAGAGCATGACCTTAGGATAGGATAA
mRNA sequenceShow/hide mRNA sequence
AGAGAGTCTCGTGAAGACAGCAGTCCCGTACAGCCTTACAGATGGAGACTGATGATTAGGGTTCGAATCTCCGTAAACGCACAGTGCTTAATAAATCGAATTGGAAATCG
TCTAGGGTTCGAATCTCCATAAGCTTAACAAAATTTGGAATTTCCAAAGCATTACGAAGGAAGGAGAGTGGTAATGGAAGAACTTCAACCTCTAGCAGAACCAACCATTG
GCACTAGCTGCAAGAAAGGGAAGAAGAAGCCACCGGCTCGGGAGAAGGAACCACAGAAAAGAGGTAAGAAGAAGGAAGCAGGGGCTACTACTTCAGTCAACGAAGACCAA
GCTACTGGTCGATTAGATGGCCCCAAGGTTACGGTTTCAGAGTTTGATCATTGTGTTGAAAATCATTTTAAAGCCATGGATACAATTGTCGAGCTCTGTGGTGAGGCAGA
GGATGGGGATGGCGGAATTGATGAAAGTGACATTCAGCGCTTTTCATCATCCACAATTTTCTTGAGGGAATGGAGGTTCTACAATTATGAGCCGAAAACTATCAAGTTTG
CTACCGGTTCGAGAGGCCCTGAGGGTAAGGATGCTGACATCACAATCAACTTACCACAGTTTTCTTCTGCAGCTGTTCTAAAGAATGGATCACCGCCTGGAGCCACTACA
TCTCTGGACTTCCGAAACTTTGTTATGCATGTCGGTGGGCCTGTTTGGGCCTTAGATTGGTGTCCTCAAGTTCATGAAAGGACCGACTCCCATATCAAATGTGAGTTTAT
TGCCGTTTCTGCTCATCCACCTGGTTCTTCTTATCACAAGATGGGTATCCCGCTCACTGGAAGAGGTATGGTGCAGATATGGTGTTTAGTGCATGGCACTGAAAGCTATG
AACCGACCGATGTAGGAGAGCCACCTTCAGATTTGTCATCTCAACCAAAGAGGCCTAGGGGAAGACCACCTGGGCGCAAGAAAAATGGGGCATCGGGCTTGCCATCTCAA
CCAAAGAGGCCTAGAGGAAGACCTAAAAAGAAACAAGAAGAATCCAATGATAAGAAGGGTGACGGTTACCAACTTGTTCAGGCTTTTTCTATTGAAAACCCAGCTGGTTC
ATCCAACTTGCTTGAGACTGATGGTGTCCCCAAAAATTCTGAAAAAATTGTATTACTGGAAAACAGTGTTGAAAGAGAGGGGAGTACCTTACAAGAAGTTTCTACATGCA
ATTCTGAAGATGAAGTTCCTACGCAGAAGAGAAGAGTGAGAAGAAAAGCTGAGACTAAGAATCATGTTGATGACGTGGGAACGTCATCACTTACAGAGAATCAAGAAGAT
AGATCCAATGCTATGAATCATGATGCAAATGAGAATGTTATACATGAATATTCTGGGGAAGACAATCTATTATGTAAGAACATTTCAGAGAATGCTGTTTTAGACACTAG
CTCAATTGAACTTACTATTCCCGAGAGTGTTGCTTTGCCAAGGGTCGTACTGTGCTTAGCTCACAATGGAAAGGTAGCATGGGATTTGAAATGGAAGCCAACTAATGCGT
GTACTGACAATTGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAGTGGATCTCTAGAAGTCTGGGAGGTTCCTTTTCCCCATGCAGTGAAGGCCATCTATTCT
AAATTCAATGGGGAGGGTACAGATCCTCGCTTTGTGAAGTTGAAGCCTGTCTTCAGATGCTCGATGTTGAAAACTGCAAATGCGCAGAGCATCCCTCTGACAGTGGAATG
GTCGCTAACACCTCCTTATGATTATCTTCTCGCTGGATGTCATGATGGAACGGTCGCATTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGATACGAGGCCTTTACTTC
GTTTTAGTGCAGATACAGTTCCAATAAGAGCGGTTGCATGGGCACCAAGTGAAAGAGATCCTTTCCGTCCCTTGTGGGACCTTCATCCGGCACCGAGGATCATATATAGT
CTGGATTGGCTTCCTAGTCCTAGATGCGTTTTCTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGATGTTCCAGTAACTGGTCAACC
CTTTACAGCGATAAAACAAAAAGGGCTACACACTTACTTTTGTTCATCATATGCCATCTGGAGTATTCAAGTGTCGAGGCAGACAGGCATGGTTGCATACTGCGGTGCTG
ATGGAGCTGTTGTACGTTTCCAGGTAAGTTCAACTTGCACAGTAGATTTTGCTATGGAGCTTACTACAAAAGCAGCGGACAAAGAGAATTCACGCCATCGCACCCCACAT
TATATATGCGAATACTTAACCGAGGAGGAATCAATTATTACACTCCACACTCCAGCAGCAAATGTGCCATTCTCTTTGAAGAAGCTGTCCAACAAATCTGAACATCCATT
GTCCATGCGAGCTATTTTATCTGATTCGGTACAGTCAAATGAAGGAAATCATAAAACTGCCACAGCTCCAGCATTGGAAAATGAATCAACTCTTTGCTCGGATGTCGATG
TCGGTGTTGAATCTGGATCTGAGGATACAATGATGTCCATCAAGAAGAAAAACCAAACTCAATCAAAGTGCAAGAAGAAGAGAGTTGAGAACCAAGAATTGGAATGTAGC
AATGAGCCTAATGATGATGCACAGATGGACGCTGACGTAGATGCACAGACGGATGCTGACGTAGTGCCTGGTTCGGGGGATCGCTTTGAAAGTCTCCCTCCCAAATCAGT
TGCAATGCATAGAGTGAGATGGAACATGAACATAGGGAGTGAAAGATGGTTGTGCTATGGCGGAGCAGCTGGAATTCTACGCTTTATAAGACCAGGGAAGCCATTTCAAA
TTGCAATAGTTGTTATCGCCATCGAGCCACGGCGCCTTAATCCTCGAGGGGCTAAATTATGGGAGCAGGCTAAAGGAAGAAAAGAGCATGACCTTAGGATAGGATAA
Protein sequenceShow/hide protein sequence
MEELQPLAEPTIGTSCKKGKKKPPAREKEPQKRGKKKEAGATTSVNEDQATGRLDGPKVTVSEFDHCVENHFKAMDTIVELCGEAEDGDGGIDESDIQRFSSSTIFLREW
RFYNYEPKTIKFATGSRGPEGKDADITINLPQFSSAAVLKNGSPPGATTSLDFRNFVMHVGGPVWALDWCPQVHERTDSHIKCEFIAVSAHPPGSSYHKMGIPLTGRGMV
QIWCLVHGTESYEPTDVGEPPSDLSSQPKRPRGRPPGRKKNGASGLPSQPKRPRGRPKKKQEESNDKKGDGYQLVQAFSIENPAGSSNLLETDGVPKNSEKIVLLENSVE
REGSTLQEVSTCNSEDEVPTQKRRVRRKAETKNHVDDVGTSSLTENQEDRSNAMNHDANENVIHEYSGEDNLLCKNISENAVLDTSSIELTIPESVALPRVVLCLAHNGK
VAWDLKWKPTNACTDNCKHRMGYLAVLLGSGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFVKLKPVFRCSMLKTANAQSIPLTVEWSLTPPYDYLLAGCHDGTVALWKFS
ANSTCEDTRPLLRFSADTVPIRAVAWAPSERDPFRPLWDLHPAPRIIYSLDWLPSPRCVFLSFDDGTLRLLSLLKAAYDVPVTGQPFTAIKQKGLHTYFCSSYAIWSIQV
SRQTGMVAYCGADGAVVRFQVSSTCTVDFAMELTTKAADKENSRHRTPHYICEYLTEEESIITLHTPAANVPFSLKKLSNKSEHPLSMRAILSDSVQSNEGNHKTATAPA
LENESTLCSDVDVGVESGSEDTMMSIKKKNQTQSKCKKKRVENQELECSNEPNDDAQMDADVDAQTDADVVPGSGDRFESLPPKSVAMHRVRWNMNIGSERWLCYGGAAG
ILRFIRPGKPFQIAIVVIAIEPRRLNPRGAKLWEQAKGRKEHDLRIG