CuGenDBv2

Clc08G14830 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G14830
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: WD_REPEATS_REGION domain-containing protein
Genome location: ClcChr08:25748430..25758227
RNA-Seq Expression: Clc08G14830
Synteny: Clc08G14830
Gene Ontology terms:
GO:0006384 - transcription initiation from RNA polymerase III promoter (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0004402 - histone acetyltransferase activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR024761 - Transcription factor IIIC, 90kDa subunit, N-terminal
IPR036322 - WD40-repeat-containing domain superfamily
IPR044230 - General transcription factor 3C polypeptide 4


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_008444806.1 PREDICTED: uncharacterized protein LOC103488044 isoform X1 [Cucumis melo]  e value: 0.0e+00  % identity: 84.54
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNS--------N
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL S        +
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNS--------N

Query:  NESSLNRSLEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTR
        NESSLN+SLEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT 
Subjt:  NESSLNRSLEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTR

Query:  VLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSG
         LLVGILQAHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSG
Subjt:  VLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSG

Query:  SLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRS
        SLEIRIFNLS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+
Subjt:  SLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRS

Query:  FDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWL
        FD+ESLD+MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL
Subjt:  FDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWL

Query:  SMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FA
        + SYL W+ ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FA
Subjt:  SMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FA

Query:  CAKLRSPSTTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSM
        CAKLRS S TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSM
Subjt:  CAKLRSPSTTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSM

Query:  QVCPATTPLWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        QVCPAT PLWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  QVCPATTPLWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

XP_008444807.1 PREDICTED: uncharacterized protein LOC103488044 isoform X3 [Cucumis melo]  e value: 0.0e+00  % identity: 85.32
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL S+NESSLN+S
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS

Query:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ
        LEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT  LLVGILQ
Subjt:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ

Query:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN
        AHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSGSLEIRIFN
Subjt:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN

Query:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR
        LS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+FD+ESLD+
Subjt:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR

Query:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN
        MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL+ SYL W+
Subjt:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN

Query:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS
         ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FACAKLRS S
Subjt:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS

Query:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP
         TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSMQVCPAT P
Subjt:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP

Query:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        LWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

XP_008444808.1 PREDICTED: uncharacterized protein LOC103488044 isoform X4 [Cucumis melo]  e value: 0.0e+00  % identity: 85.32
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL  NNESSLN+S
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS

Query:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ
        LEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT  LLVGILQ
Subjt:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ

Query:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN
        AHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSGSLEIRIFN
Subjt:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN

Query:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR
        LS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+FD+ESLD+
Subjt:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR

Query:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN
        MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL+ SYL W+
Subjt:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN

Query:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS
         ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FACAKLRS S
Subjt:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS

Query:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP
         TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSMQVCPAT P
Subjt:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP

Query:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        LWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

XP_038885355.1 uncharacterized protein LOC120075765 isoform X1 [Benincasa hispida]  e value: 0.0e+00  % identity: 89.69
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVETYFQAV LVAAPNYPNA+AWSDENLIAVASGPLVTILNP SPFGARGTITIPA+DPLRIGLIER+DLF+DCLLTTCLSRDD PRAQSI+WSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS
        NAGCLLAVCTSEG VKLYRPPFCDFSAEW EIMD+SNKLYDYLESIKYGELDVLS KRSDIP KEG NA G QEHFTK NSKRRKKDELN  NESSLNR+
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS

Query:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ
        LEKSKEKRP+RRTEDSS  PLISAQQYASRSAMLLSLV+AWSPVIKPS  VHSH+NSSVSVLA+GTKSGKVSFWKV VPECYSLAEC VPTRVLLVGILQ
Subjt:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ

Query:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN
        AHNSWINCISWMLFDSDSSNPKVLLATGS DGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEG+PT+LSLYAPNLPVHKLFLA+GRGSGSLEIRIFN
Subjt:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN

Query:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLG--GSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESL
        LSSCEFDNVRLY+AHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISS IPDLG  GSIDL DTFRSCFGI VSPGNLVAAVVR+FD+ESL
Subjt:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLG--GSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESL

Query:  DRMYEARSQKAAVQFFWIGGEEIEVMP-NSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYL
        DRMY+AR+QKAAVQFFWIGGEEIEVMP +SSY YTE  PD+SKKE VHWESS+LWSLNQF+NLNKPMVVWDVVAALL FRQSI EYVDHILLKWLS SYL
Subjt:  DRMYEARSQKAAVQFFWIGGEEIEVMP-NSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYL

Query:  QWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPSTT
        QWN ELSATKIL++VSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERL+DAENE HILWKELLLSSERELRQRLI LC FACAK RS STT
Subjt:  QWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPSTT

Query:  EYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTPLW
        E RPGFWYP GLAEMQQWI  N EHLQESVKVIASKAG +R SKHSAMEQCTYCSA VPFESPE G CQG K NTGV QSHKLVRCSVSMQVCPATTPLW
Subjt:  EYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTPLW

Query:  FCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        FC+CCYR+AFRLAPD+LFQ+SETP+FRSL LS  EIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  FCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

XP_038885356.1 uncharacterized protein LOC120075765 isoform X2 [Benincasa hispida]  e value: 0.0e+00  % identity: 89.34
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVETYFQAV LVAAPNYPNA+AWSDENLIAVASGPLVTILNP SPFGARGTITIPA+DPLRIGLIER+DLF+DCLLTTCLSRDD PRAQSI+WSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS
        NAGCLLAVCTSEG VKLYRPPFCDFSAEW EIMD+SNKLYDYLESIKYGELDVLS KRSDIP KEG NA G QEHFTK NSKRRKKDELN N    LNR+
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS

Query:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ
        LEKSKEKRP+RRTEDSS  PLISAQQYASRSAMLLSLV+AWSPVIKPS  VHSH+NSSVSVLA+GTKSGKVSFWKV VPECYSLAEC VPTRVLLVGILQ
Subjt:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ

Query:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN
        AHNSWINCISWMLFDSDSSNPKVLLATGS DGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEG+PT+LSLYAPNLPVHKLFLA+GRGSGSLEIRIFN
Subjt:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN

Query:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLG--GSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESL
        LSSCEFDNVRLY+AHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISS IPDLG  GSIDL DTFRSCFGI VSPGNLVAAVVR+FD+ESL
Subjt:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLG--GSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESL

Query:  DRMYEARSQKAAVQFFWIGGEEIEVMP-NSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYL
        DRMY+AR+QKAAVQFFWIGGEEIEVMP +SSY YTE  PD+SKKE VHWESS+LWSLNQF+NLNKPMVVWDVVAALL FRQSI EYVDHILLKWLS SYL
Subjt:  DRMYEARSQKAAVQFFWIGGEEIEVMP-NSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYL

Query:  QWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPSTT
        QWN ELSATKIL++VSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERL+DAENE HILWKELLLSSERELRQRLI LC FACAK RS STT
Subjt:  QWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPSTT

Query:  EYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTPLW
        E RPGFWYP GLAEMQQWI  N EHLQESVKVIASKAG +R SKHSAMEQCTYCSA VPFESPE G CQG K NTGV QSHKLVRCSVSMQVCPATTPLW
Subjt:  EYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTPLW

Query:  FCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        FC+CCYR+AFRLAPD+LFQ+SETP+FRSL LS  EIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  FCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A1S3BB76 uncharacterized protein LOC103488044 isoform X1  e value: 0.0e+00  % identity: 84.54
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNS--------N
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL S        +
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNS--------N

Query:  NESSLNRSLEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTR
        NESSLN+SLEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT 
Subjt:  NESSLNRSLEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTR

Query:  VLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSG
         LLVGILQAHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSG
Subjt:  VLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSG

Query:  SLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRS
        SLEIRIFNLS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+
Subjt:  SLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRS

Query:  FDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWL
        FD+ESLD+MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL
Subjt:  FDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWL

Query:  SMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FA
        + SYL W+ ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FA
Subjt:  SMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FA

Query:  CAKLRSPSTTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSM
        CAKLRS S TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSM
Subjt:  CAKLRSPSTTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSM

Query:  QVCPATTPLWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        QVCPAT PLWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  QVCPATTPLWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

A0A1S3BB77 uncharacterized protein LOC103488044 isoform X3  e value: 0.0e+00  % identity: 85.32
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL S+NESSLN+S
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS

Query:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ
        LEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT  LLVGILQ
Subjt:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ

Query:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN
        AHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSGSLEIRIFN
Subjt:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN

Query:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR
        LS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+FD+ESLD+
Subjt:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR

Query:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN
        MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL+ SYL W+
Subjt:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN

Query:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS
         ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FACAKLRS S
Subjt:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS

Query:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP
         TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSMQVCPAT P
Subjt:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP

Query:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        LWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

A0A1S3BBZ6 uncharacterized protein LOC103488044 isoform X4  e value: 0.0e+00  % identity: 85.32
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL  NNESSLN+S
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS

Query:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ
        LEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT  LLVGILQ
Subjt:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ

Query:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN
        AHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSGSLEIRIFN
Subjt:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN

Query:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR
        LS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+FD+ESLD+
Subjt:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR

Query:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN
        MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL+ SYL W+
Subjt:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN

Query:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS
         ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FACAKLRS S
Subjt:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS

Query:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP
         TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSMQVCPAT P
Subjt:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP

Query:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        LWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

A0A1S4DVH0 uncharacterized protein LOC103488044 isoform X2  e value: 0.0e+00  % identity: 84.62
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNS------NNE
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL +      +NE
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNS------NNE

Query:  SSLNRSLEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVL
        SSLN+SLEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT  L
Subjt:  SSLNRSLEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVL

Query:  LVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSL
        LVGILQAHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSGSL
Subjt:  LVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSL

Query:  EIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFD
        EIRIFNLS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+FD
Subjt:  EIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFD

Query:  IESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSM
        +ESLD+MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL+ 
Subjt:  IESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSM

Query:  SYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACA
        SYL W+ ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FACA
Subjt:  SYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACA

Query:  KLRSPSTTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQV
        KLRS S TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSMQV
Subjt:  KLRSPSTTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQV

Query:  CPATTPLWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        CPAT PLWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  CPATTPLWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

A0A5A7VH44 WD_REPEATS_REGION domain-containing protein  e value: 0.0e+00  % identity: 85.32
Query:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP
        MVET+FQAV LVAAPNYPNA+AWSDENLIA+ASGPLVTI+NP SPFGARGTITIPA+DPLRIGL+ERKDLF+DCLLTTCLSRDD PRAQS+AWSPIG+AP
Subjt:  MVETYFQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAP

Query:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS
        NAGCLLAVCTSEG VKLYRPPFCDFSAEWIEI+D+SNKLYDYLESIKYGELDVLSSK SDIPAKE G+AV  QE+FTK NSKRRKKDEL S+NESSLN+S
Subjt:  NAGCLLAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS

Query:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ
        LEKSKEKR RRR+EDSSVPPLISAQQYASRSAMLLSLV+AWSPVIKPS K H HQNSS  VLA+GTKSGKVSFWKVNVPECYSLAEC VPT  LLVGILQ
Subjt:  LEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQ

Query:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN
        AHNSWINCISWMLFDSDSS+ KVL+ATGSTDGSV+IWQC CEELLASSDSNFASFSLLKEVISGEG+PT+LSL  PNL  HKLFLAIGRGSGSLEIRIFN
Subjt:  AHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFN

Query:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR
        LS+ EFDNV LY+AH HVVTGVAWA DGRYLFTCSEDN LRGWSLDESSLREVPISS IP+LGGSIDL DTFRSCFGI +SPGNLV AVVR+FD+ESLD+
Subjt:  LSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDR

Query:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN
        MY+AR+QKAAVQFFWIGGEEIEVMPNSSYFYTENF ++SKKEFV WESS+LWSLNQ KNLNKPMVVW+VVAALL FR SI EYVDHILLKWL+ SYL W+
Subjt:  MYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILEYVDHILLKWLSMSYLQWN

Query:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS
         ELSATKILS++S+NVSTFSTRQLHLLNIICRRVVLSE +QDQVN++LQNL     ERL D ENE HILWK+LLLSSERELRQRLIGLC FACAKLRS S
Subjt:  KELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNL-----ERLSDAENENHILWKELLLSSERELRQRLIGLC-FACAKLRSPS

Query:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP
         TEYRPGFWYPIGL EMQQW+ +NPEHLQES+K +AS+AGK R SKHS+MEQCTYCSA VP ESPEFG+CQG K N GV QSHKL+RCSVSMQVCPAT P
Subjt:  TTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQVCPATTP

Query:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV
        LWFC+CC RSAFRLAPDILFQMSETP+F SL LS+SEIPS+PLCPFCGILLQRRQPDFLLSACPV
Subjt:  LWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV

SwissProt top hits (columns: hit, e value, % identity, alignment)
A6ZYM0 Probable cytosolic iron-sulfur protein assembly protein 1  e value: 3.9e-06  % identity: 30.47
Query:  LLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFNLS---SCEFDNVRLYNAHDHVVT
        +LATGSTD  ++        L++  D +F    +L E    + + ++   + P    H   LA G    ++ I     S   + E D + +   H++ V 
Subjt:  LLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFNLS---SCEFDNVRLYNAHDHVVT

Query:  GVAWAFDGRYLFTCSEDNILRGWSLDES
        GVAW+ DG YL TCS D  +  W  DES
Subjt:  GVAWAFDGRYLFTCSEDNILRGWSLDES

Q05583 Cytosolic iron-sulfur protein assembly protein 1  e value: 4.3e-05  % identity: 29.69
Query:  LLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFNLS---SCEFDNVRLYNAHDHVVT
        +LATGSTD  ++        L++    +F    +L E    + + ++   + P    H   LA G    ++ I     S   + E D + +   H++ V 
Subjt:  LLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFNLS---SCEFDNVRLYNAHDHVVT

Query:  GVAWAFDGRYLFTCSEDNILRGWSLDES
        GVAW+ DG YL TCS D  +  W  DES
Subjt:  GVAWAFDGRYLFTCSEDNILRGWSLDES

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G49400.1 Transducin/WD40 repeat-like superfamily protein  e value: 1.0e-198  % identity: 43.29
Query:  FQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAPNAGCL
        FQ   LV +P+YPNAVAWS ENLIAVA+G LV I+NP  P G RG ITI  ++  +IG +  +DL T  LL + L R+  P  +S++WS IG++PN GCL
Subjt:  FQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAPNAGCL

Query:  LAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS-----
        LAVCT+EG VKLYRPP+ DF AEWIEI+D+S  LY+ L S+ +GE    S+  S     E  +     E  + + +++R+K   N+ N    N +     
Subjt:  LAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS-----

Query:  -----------LEKSKEKRPRRRTEDSSVPPL-------ISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECY
                   LE    K+     +  S+P         IS Q Y SR A+L S  VAWS +++ S +         S+LAIG+KSG VS WKV+ PECY
Subjt:  -----------LEKSKEKRPRRRTEDSSVPPL-------ISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECY

Query:  SLAECTVPTRVLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHK
         +    V   V L  I+Q H+SW++ +SW +F  DSSNP+V+L TGS DGSV+IW    E+L  S +   +SF LLKEV++   +      +  +   + 
Subjt:  SLAECTVPTRVLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHK

Query:  LFLAIGRGSGSLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSP
        + LAIG+GSGS E+    +S+ +F+ +   NAH+ VVTG+AW++DGR L++CS+DN +R W L E+++ EVPI +  P L  + DL D F SC G+ +SP
Subjt:  LFLAIGRGSGSLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSP

Query:  GNLVAAVVRSFDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILE
        GNL  A+VR+F++E L+ MY+ARSQKAAV+F W G ++     +S+   TE     SK EF +WES+ILWSL +F  LNKP+V+WD+VAA+L F+QS+ E
Subjt:  GNLVAAVVRSFDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILE

Query:  YVDHILLKWLSMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIG
        +V+ +L KWLS+SYL ++ ++S   ++  +++  S   +R LH+LN+I RRV+LSEL  +++N  LQ      + E +   LW +LL  SERELR+RL+G
Subjt:  YVDHILLKWLSMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIG

Query:  LCFACAKLRSPSTTEYRPGF--WYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRL----SKHSAMEQ--CTYCSALVPFESPEFGLCQG-------V
        L F+   L   S     P    W P GLA +QQW+  N + +   ++ ++ +   SR     S  +A+E+  C YC+A V F S E   C+         
Subjt:  LCFACAKLRSPSTTEYRPGF--WYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRL----SKHSAMEQ--CTYCSALVPFESPEFGLCQG-------V

Query:  KHNTGVGQSHKLVRCSVSMQVCPATTPLWFCICCYRSAFRLAPDILFQMSETP-DFRSLTLSE-SEIPSRPLCPFCGILLQRRQPDFLLSACPV
        K      +SHKL RC VSMQVCP  TPLWFC CC R    LAP+ LF +   P D +SL  S  S++ S+P C FCG+LLQR+QP+FLLSA PV
Subjt:  KHNTGVGQSHKLVRCSVSMQVCPATTPLWFCICCYRSAFRLAPDILFQMSETP-DFRSLTLSE-SEIPSRPLCPFCGILLQRRQPDFLLSACPV

AT3G49400.2 Transducin/WD40 repeat-like superfamily protein    1.1e-186    42.06
Query:  FQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAPNAGCL
        FQ   LV +P+YPNAVAWS ENLIAVA+G LV I+NP  P G RG ITI  ++  +IG +  +DL T  LL + L R+  P  +S++WS IG++PN GCL
Subjt:  FQAVKLVAAPNYPNAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAPNAGCL

Query:  LAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS-----
        LAVCT+EG VKLYRPP+ DF AEWIEI+D+S  LY+ L S+ +GE    S+  S     E  +     E  + + +++R+K   N+ N    N +     
Subjt:  LAVCTSEGSVKLYRPPFCDFSAEWIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRS-----

Query:  -----------LEKSKEKRPRRRTEDSSVPPL-------ISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECY
                   LE    K+     +  S+P         IS Q Y SR A+L S  VAWS +++ S +         S+LAIG+KSG VS WKV+ PECY
Subjt:  -----------LEKSKEKRPRRRTEDSSVPPL-------ISAQQYASRSAMLLSLVVAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECY

Query:  SLAECTVPTRVLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHK
         +    V   V L  I+Q H+SW++ +SW +F  DSSNP+V+L TGS DGSV+IW    E+L  S +   +SF LLKEV++   +      +  +   + 
Subjt:  SLAECTVPTRVLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASSDSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHK

Query:  LFLAIGRGSGSLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSP
        + LAIG+GSGS E+    +S+ +F+ +   NAH+ V                  DN +R W L E+++ EVPI +  P L  + DL D F SC G+ +SP
Subjt:  LFLAIGRGSGSLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSRIPDLGGSIDLSDTFRSCFGITVSP

Query:  GNLVAAVVRSFDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILE
        GNL  A+VR+F++E L+ MY+ARSQKAAV+F W G ++     +S+   TE     SK EF +WES+ILWSL +F  LNKP+V+WD+VAA+L F+QS+ E
Subjt:  GNLVAAVVRSFDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWDVVAALLGFRQSILE

Query:  YVDHILLKWLSMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIG
        +V+ +L KWLS+SYL ++ ++S   ++  +++  S   +R LH+LN+I RRV+LSEL  +++N  LQ      + E +   LW +LL  SERELR+RL+G
Subjt:  YVDHILLKWLSMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQRLIG

Query:  LCFACAKLRSPSTTEYRPGF--WYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRL----SKHSAMEQ--CTYCSALVPFESPEFGLCQG-------V
        L F+   L   S     P    W P GLA +QQW+  N + +   ++ ++ +   SR     S  +A+E+  C YC+A V F S E   C+         
Subjt:  LCFACAKLRSPSTTEYRPGF--WYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRL----SKHSAMEQ--CTYCSALVPFESPEFGLCQG-------V

Query:  KHNTGVGQSHKLVRCSVSMQVCPATTPLWFCICCYRSAFRLAPDILFQMSETP-DFRSLTLSE-SEIPSRPLCPFCGILLQRRQPDFLLSACPV
        K      +SHKL RC VSMQVCP  TPLWFC CC R    LAP+ LF +   P D +SL  S  S++ S+P C FCG+LLQR+QP+FLLSA PV
Subjt:  KHNTGVGQSHKLVRCSVSMQVCPATTPLWFCICCYRSAFRLAPDILFQMSETP-DFRSLTLSE-SEIPSRPLCPFCGILLQRRQPDFLLSACPV


Sequences
CDS sequence
ATGGGCCGCCGTCACTCTCGAGCGTTGACAATCCCTAACGCCGCCGCTACTAGCTTGCGTCTTCCTCCCCTCCGGCGACTATTCACTGCAGTGCGGTTCCCTCCTGCTCA
GGCAAGGCGAGAGCCATTGTACGCCGCCTCCGCAGCTCTCTTCCGGCTCTTGCTTTCGCCTGGTTTTCGGCCTCCGCCGGCTGCCAAACCACCGGCACACAATCGCAAAT
CGTCAAACAGAGGCCTCCGAACCGTCGATTCACGAACTCTGGAAGAGCGAAGAGCAATGGTGGAAACATATTTTCAGGCCGTCAAGTTGGTCGCTGCCCCAAATTACCCA
AATGCTGTTGCATGGTCCGACGAGAATTTAATCGCCGTTGCCTCAGGGCCCCTTGTCACTATACTGAATCCGACTTCGCCTTTTGGAGCACGAGGCACTATTACAATCCC
TGCAAGTGATCCACTTCGAATAGGGTTGATAGAGAGAAAAGATTTATTTACTGACTGCTTGTTGACAACTTGCTTATCTCGGGATGATCCACCTCGTGCACAGTCCATAG
CATGGTCTCCGATTGGCCTGGCTCCTAATGCAGGGTGCTTGTTGGCTGTTTGCACATCTGAAGGAAGTGTGAAGCTTTACCGTCCACCGTTCTGTGACTTTAGTGCTGAA
TGGATTGAGATTATGGACATGTCAAATAAACTTTATGATTATCTTGAAAGTATTAAATATGGGGAGCTGGATGTTCTTTCTTCCAAGCGTTCTGATATTCCAGCAAAGGA
AGGTGGGAATGCTGTTGGTGGCCAAGAGCATTTCACAAAGGTGAACAGCAAGCGAAGAAAGAAAGATGAACTCAACTCAAACAATGAAAGCAGTTTGAATCGATCATTGG
AGAAATCAAAAGAGAAGCGTCCTAGGAGGAGAACTGAAGATAGCTCCGTGCCTCCGTTGATTAGTGCACAACAATATGCTTCTCGCAGTGCAATGTTGTTGTCTCTTGTT
GTTGCTTGGTCCCCAGTAATAAAGCCATCTCATAAAGTTCATTCGCACCAGAATTCATCTGTCAGTGTTCTTGCAATAGGAACAAAGTCTGGTAAAGTTTCATTTTGGAA
AGTTAATGTACCAGAATGCTACTCCCTTGCTGAGTGCACAGTTCCAACAAGAGTTCTGCTTGTTGGGATTCTTCAGGCACACAATTCATGGATCAACTGTATCAGTTGGA
TGTTGTTTGATTCTGATTCATCAAATCCAAAGGTTCTATTGGCAACTGGGAGCACAGATGGGAGTGTGAGGATCTGGCAATGTTACTGTGAAGAGTTATTAGCATCTTCA
GACTCTAATTTTGCTTCATTCTCCCTATTGAAGGAGGTTATCAGTGGTGAAGGAATGCCAACTCTACTCTCACTCTATGCGCCCAATTTACCTGTGCATAAACTATTTTT
GGCCATTGGCAGAGGATCTGGATCACTTGAAATAAGGATATTTAACCTATCCAGCTGTGAATTTGATAACGTCAGGCTGTACAATGCACATGATCACGTTGTTACAGGTG
TAGCTTGGGCTTTTGATGGACGTTATTTGTTCACCTGCAGTGAGGATAATATTCTGCGAGGTTGGAGTTTAGATGAGAGTTCTCTCCGTGAAGTACCCATTTCATCACGC
ATCCCTGATCTTGGAGGCTCCATTGATCTTTCAGATACATTTCGGTCATGCTTTGGCATCACAGTGTCCCCAGGAAATCTTGTGGCTGCCGTGGTTCGCAGCTTTGATAT
TGAATCACTTGATCGAATGTATGAAGCAAGGTCTCAGAAAGCTGCTGTTCAGTTCTTCTGGATTGGAGGAGAAGAAATAGAAGTCATGCCAAACAGTTCATACTTTTATA
CTGAAAATTTTCCAGACATTTCAAAGAAGGAATTTGTTCATTGGGAATCCAGTATATTGTGGTCTTTAAATCAATTTAAAAATCTGAATAAGCCTATGGTTGTTTGGGAT
GTTGTAGCCGCTTTGCTGGGATTCAGGCAATCCATACTGGAATATGTTGACCACATTCTACTTAAGTGGCTTTCAATGTCATATCTCCAATGGAACAAGGAGCTCTCTGC
TACAAAGATTTTGTCAAATGTATCGAGAAATGTGTCAACATTTTCAACTCGACAACTTCACCTCCTTAACATTATTTGTAGACGTGTAGTTCTATCAGAATTGATACAGG
ACCAGGTGAACAATGATCTGCAGAATTTGGAGAGACTTAGCGATGCTGAAAACGAAAATCATATTTTGTGGAAGGAGTTGCTTTTAAGCAGTGAAAGAGAACTCCGTCAG
AGGCTAATCGGTCTATGTTTTGCTTGTGCAAAGCTTCGTTCACCATCCACCACCGAATATCGACCTGGGTTCTGGTATCCCATTGGATTAGCAGAAATGCAGCAGTGGAT
TAGAAATAATCCTGAACATTTACAGGAATCAGTAAAAGTCATTGCATCAAAAGCGGGAAAAAGCCGTTTGAGTAAACATTCAGCAATGGAGCAGTGCACCTACTGTTCAG
CACTGGTTCCATTTGAGTCTCCAGAATTCGGATTATGCCAGGGTGTTAAGCACAATACCGGTGTCGGTCAGAGCCACAAACTAGTAAGGTGTTCTGTATCAATGCAGGTC
TGCCCTGCTACTACTCCCTTATGGTTTTGCATTTGTTGTTATAGAAGTGCTTTCAGATTGGCTCCAGATATACTTTTTCAGATGTCTGAGACTCCTGACTTTCGGTCTTT
AACACTCTCCGAGTCGGAGATACCCTCAAGACCATTATGTCCATTTTGTGGTATACTGTTACAACGTCGACAGCCCGACTTTTTACTGTCAGCATGCCCGGTGTAG
mRNA sequence
ATGGGCCGCCGTCACTCTCGAGCGTTGACAATCCCTAACGCCGCCGCTACTAGCTTGCGTCTTCCTCCCCTCCGGCGACTATTCACTGCAGTGCGGTTCCCTCCTGCTCA
GGCAAGGCGAGAGCCATTGTACGCCGCCTCCGCAGCTCTCTTCCGGCTCTTGCTTTCGCCTGGTTTTCGGCCTCCGCCGGCTGCCAAACCACCGGCACACAATCGCAAAT
CGTCAAACAGAGGCCTCCGAACCGTCGATTCACGAACTCTGGAAGAGCGAAGAGCAATGGTGGAAACATATTTTCAGGCCGTCAAGTTGGTCGCTGCCCCAAATTACCCA
AATGCTGTTGCATGGTCCGACGAGAATTTAATCGCCGTTGCCTCAGGGCCCCTTGTCACTATACTGAATCCGACTTCGCCTTTTGGAGCACGAGGCACTATTACAATCCC
TGCAAGTGATCCACTTCGAATAGGGTTGATAGAGAGAAAAGATTTATTTACTGACTGCTTGTTGACAACTTGCTTATCTCGGGATGATCCACCTCGTGCACAGTCCATAG
CATGGTCTCCGATTGGCCTGGCTCCTAATGCAGGGTGCTTGTTGGCTGTTTGCACATCTGAAGGAAGTGTGAAGCTTTACCGTCCACCGTTCTGTGACTTTAGTGCTGAA
TGGATTGAGATTATGGACATGTCAAATAAACTTTATGATTATCTTGAAAGTATTAAATATGGGGAGCTGGATGTTCTTTCTTCCAAGCGTTCTGATATTCCAGCAAAGGA
AGGTGGGAATGCTGTTGGTGGCCAAGAGCATTTCACAAAGGTGAACAGCAAGCGAAGAAAGAAAGATGAACTCAACTCAAACAATGAAAGCAGTTTGAATCGATCATTGG
AGAAATCAAAAGAGAAGCGTCCTAGGAGGAGAACTGAAGATAGCTCCGTGCCTCCGTTGATTAGTGCACAACAATATGCTTCTCGCAGTGCAATGTTGTTGTCTCTTGTT
GTTGCTTGGTCCCCAGTAATAAAGCCATCTCATAAAGTTCATTCGCACCAGAATTCATCTGTCAGTGTTCTTGCAATAGGAACAAAGTCTGGTAAAGTTTCATTTTGGAA
AGTTAATGTACCAGAATGCTACTCCCTTGCTGAGTGCACAGTTCCAACAAGAGTTCTGCTTGTTGGGATTCTTCAGGCACACAATTCATGGATCAACTGTATCAGTTGGA
TGTTGTTTGATTCTGATTCATCAAATCCAAAGGTTCTATTGGCAACTGGGAGCACAGATGGGAGTGTGAGGATCTGGCAATGTTACTGTGAAGAGTTATTAGCATCTTCA
GACTCTAATTTTGCTTCATTCTCCCTATTGAAGGAGGTTATCAGTGGTGAAGGAATGCCAACTCTACTCTCACTCTATGCGCCCAATTTACCTGTGCATAAACTATTTTT
GGCCATTGGCAGAGGATCTGGATCACTTGAAATAAGGATATTTAACCTATCCAGCTGTGAATTTGATAACGTCAGGCTGTACAATGCACATGATCACGTTGTTACAGGTG
TAGCTTGGGCTTTTGATGGACGTTATTTGTTCACCTGCAGTGAGGATAATATTCTGCGAGGTTGGAGTTTAGATGAGAGTTCTCTCCGTGAAGTACCCATTTCATCACGC
ATCCCTGATCTTGGAGGCTCCATTGATCTTTCAGATACATTTCGGTCATGCTTTGGCATCACAGTGTCCCCAGGAAATCTTGTGGCTGCCGTGGTTCGCAGCTTTGATAT
TGAATCACTTGATCGAATGTATGAAGCAAGGTCTCAGAAAGCTGCTGTTCAGTTCTTCTGGATTGGAGGAGAAGAAATAGAAGTCATGCCAAACAGTTCATACTTTTATA
CTGAAAATTTTCCAGACATTTCAAAGAAGGAATTTGTTCATTGGGAATCCAGTATATTGTGGTCTTTAAATCAATTTAAAAATCTGAATAAGCCTATGGTTGTTTGGGAT
GTTGTAGCCGCTTTGCTGGGATTCAGGCAATCCATACTGGAATATGTTGACCACATTCTACTTAAGTGGCTTTCAATGTCATATCTCCAATGGAACAAGGAGCTCTCTGC
TACAAAGATTTTGTCAAATGTATCGAGAAATGTGTCAACATTTTCAACTCGACAACTTCACCTCCTTAACATTATTTGTAGACGTGTAGTTCTATCAGAATTGATACAGG
ACCAGGTGAACAATGATCTGCAGAATTTGGAGAGACTTAGCGATGCTGAAAACGAAAATCATATTTTGTGGAAGGAGTTGCTTTTAAGCAGTGAAAGAGAACTCCGTCAG
AGGCTAATCGGTCTATGTTTTGCTTGTGCAAAGCTTCGTTCACCATCCACCACCGAATATCGACCTGGGTTCTGGTATCCCATTGGATTAGCAGAAATGCAGCAGTGGAT
TAGAAATAATCCTGAACATTTACAGGAATCAGTAAAAGTCATTGCATCAAAAGCGGGAAAAAGCCGTTTGAGTAAACATTCAGCAATGGAGCAGTGCACCTACTGTTCAG
CACTGGTTCCATTTGAGTCTCCAGAATTCGGATTATGCCAGGGTGTTAAGCACAATACCGGTGTCGGTCAGAGCCACAAACTAGTAAGGTGTTCTGTATCAATGCAGGTC
TGCCCTGCTACTACTCCCTTATGGTTTTGCATTTGTTGTTATAGAAGTGCTTTCAGATTGGCTCCAGATATACTTTTTCAGATGTCTGAGACTCCTGACTTTCGGTCTTT
AACACTCTCCGAGTCGGAGATACCCTCAAGACCATTATGTCCATTTTGTGGTATACTGTTACAACGTCGACAGCCCGACTTTTTACTGTCAGCATGCCCGGTGTAG
Protein sequence
MGRRHSRALTIPNAAATSLRLPPLRRLFTAVRFPPAQARREPLYAASAALFRLLLSPGFRPPPAAKPPAHNRKSSNRGLRTVDSRTLEERRAMVETYFQAVKLVAAPNYP
NAVAWSDENLIAVASGPLVTILNPTSPFGARGTITIPASDPLRIGLIERKDLFTDCLLTTCLSRDDPPRAQSIAWSPIGLAPNAGCLLAVCTSEGSVKLYRPPFCDFSAE
WIEIMDMSNKLYDYLESIKYGELDVLSSKRSDIPAKEGGNAVGGQEHFTKVNSKRRKKDELNSNNESSLNRSLEKSKEKRPRRRTEDSSVPPLISAQQYASRSAMLLSLV
VAWSPVIKPSHKVHSHQNSSVSVLAIGTKSGKVSFWKVNVPECYSLAECTVPTRVLLVGILQAHNSWINCISWMLFDSDSSNPKVLLATGSTDGSVRIWQCYCEELLASS
DSNFASFSLLKEVISGEGMPTLLSLYAPNLPVHKLFLAIGRGSGSLEIRIFNLSSCEFDNVRLYNAHDHVVTGVAWAFDGRYLFTCSEDNILRGWSLDESSLREVPISSR
IPDLGGSIDLSDTFRSCFGITVSPGNLVAAVVRSFDIESLDRMYEARSQKAAVQFFWIGGEEIEVMPNSSYFYTENFPDISKKEFVHWESSILWSLNQFKNLNKPMVVWD
VVAALLGFRQSILEYVDHILLKWLSMSYLQWNKELSATKILSNVSRNVSTFSTRQLHLLNIICRRVVLSELIQDQVNNDLQNLERLSDAENENHILWKELLLSSERELRQ
RLIGLCFACAKLRSPSTTEYRPGFWYPIGLAEMQQWIRNNPEHLQESVKVIASKAGKSRLSKHSAMEQCTYCSALVPFESPEFGLCQGVKHNTGVGQSHKLVRCSVSMQV
CPATTPLWFCICCYRSAFRLAPDILFQMSETPDFRSLTLSESEIPSRPLCPFCGILLQRRQPDFLLSACPV