; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G14610 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G14610
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF789)
Genome locationClcChr01:27405048..27419648
RNA-Seq ExpressionClc01G14610
SyntenyClc01G14610
Gene Ontology termsGO:0006351 - transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR006591 - RNA polymerase archaeal subunit P/eukaryotic subunit RPABC4
IPR008507 - Protein of unknown function DUF789
IPR029040 - RNA polymerase subunit RPABC4/transcription elongation factor Spt4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572995.1 DNA-directed RNA polymerases II, IV and V subunit 12, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0069.52Show/hide
Query:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRKSILLNLWRWL-TSKSISWLFSMRPAEMLILAIDFLTGWFTFAFRYQHLIPIQNFL
        MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR  I+ + W  L  S+ + W                                     
Subjt:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRKSILLNLWRWL-TSKSISWLFSMRPAEMLILAIDFLTGWFTFAFRYQHLIPIQNFL

Query:  VFFLRVKEEDKSYCPLEQQALLLVIFLRFSPANGNGLQKTMQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLT
                                            +QKTMQCAL +SS+FQK  DKGK+ LE++++E++CSR IKDS+VSSF WRN FDYR AVIS LT
Subjt:  VFFLRVKEEDKSYCPLEQQALLLVIFLRFSPANGNGLQKTMQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLT

Query:  VESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQKGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAI
        +ESDGLWRIVALP Q +DSL VSCLPQMN+F A+RKLV  GPAS+ TYS NSFRCRSLLESN  LLDSKA +SSNK+S KFS RSSCS SAL+  DSSAI
Subjt:  VESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQKGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAI

Query:  SAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQP
        S IPIG AK+QRYGKKN RKKAKK++IECKK SSDFV AETE+SS+DSARGS L EACG+N SDCRDGSVLCS ARETF  D RASKNDF+RD+++IIQP
Subjt:  SAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQP

Query:  PGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEK
         G TDSISS+I +  ASEVP SA KN SG Y    SENQ LIK PGCT   G V+ +ERLF   CNDFC K S DNNSPD       S CD+  LKL E 
Subjt:  PGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEK

Query:  KGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKPNRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRN
        +GFG+DLLE ++SPSREN C  HNS+RDEVDVNA+ EK N GIQGCT SET  +LPGKKTKQNKKL+G+SR NR+GG GSSQR TGKEN  TVWQKVQ+N
Subjt:  KGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKPNRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRN

Query:  NSGGCCEQLDQVS-PVSKQFKGICTPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRK
        NSGGCC QLDQVS PVSKQ KG+C P VGVQ PKVKDKKTGNRKQLK+KF +RLK KNTS Q+K+YRP+++S GSNT+SM H  PNERLDI  M FDI K
Subjt:  NSGGCCEQLDQVS-PVSKQFKGICTPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRK

Query:  SSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISDGLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSR
        SS   R+ FQND+TDKCMTSES ESTQ+CLDG MS KLISDGLN+Q+VEN SS+  R+C+SLNQSNP++ QSPVY+PHLFFQATKGSSLAERSKH++QSR
Subjt:  SSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISDGLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSR

Query:  SPFQNWLPSGGEGSRLTT-LARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFD
        SP QNW+PS  EGSRLTT L RPDFSSLKDA+ QPAEFG SEKSIQE V+CN++DPVS V E IQHSRDG+H PLE ECE Q+ +G+DT ALQD R E D
Subjt:  SPFQNWLPSGGEGSRLTT-LARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFD

Query:  VDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGS
        VDEHFN KS+C DA+ +EQ VN+AC+AQL  +A+       IAEFERFLHLSSPVI+Q P LRS +I  +NS GD IPCS++TA+ISL CLWQWYEKHGS
Subjt:  VDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGS

Query:  YGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQH
        YGLE+KANGHE SNGFG DNS F AYFVPFLSAVQLFKSHKTH G TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTDDA+  + C+QLHSSE+ 
Subjt:  YGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQH

Query:  LASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTY
        LASEK+  S QSV+  LSGESELIFEYFE EQPQQRRPLFDKI QLV+GDG L+GKIYGDPT+L S+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTY
Subjt:  LASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTY

Query:  HSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        HSLGHFV RTSQS+S +T+SC+VCPVVGLQS+NAQNECWF+PRN   TF P    P +++ERLR+LEETASLMARAVVKKG+LNS N HPDYEFFLSRR
Subjt:  HSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

TYJ99070.1 uncharacterized protein E5676_scaffold248G002740 [Cucumis melo var. makuwa]0.0e+0079.55Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ
        MQCALVRSSDFQK LDKGKESL+LRLE+NSCSRGI KD +VSSFAWRN FDYRCAVI FLT+ESDGLWRIVALPPQY+DSL+VSCLPQMN+F A RKLVQ
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ

Query:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KG ASN TYSFNS RCRSLLESN KLLDSKAI+S NKSSGK  C SSCS SALM SDS A S IPI GAK+QRYGKKNPRKKAKKKE+E KKISS+FV A
Subjt:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFLSEACGSNDSD R+ +VLCSIA ETFLP       DFERD++  IQP G  DS+SS+I D H+S+V SSAIKNFSGY++VCGSENQ
Subjt:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN
        AL   PGC HV  G+NSRE L A SCNDFC   SLDNNS DSK  SLNS CD+ NLKLNEKKGFGVDLLEERSSP RENC S NS RDEVD+N +VEK  
Subjt:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG
         GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR+NSGGC EQLDQVSP+SKQFKGIC PV GVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD
        NRKQLKEK  RRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNERLDIR M FDIR+SSG+PRSRFQNDTTDKCM SE+ E  Q+  D L S+KLI D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD

Query:  GLNSQKVENVSSSLPRACNSLNQSNP------------------------------------VEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQN
        GL+SQKVEN SSSLP++CNS NQSNP                                    VEV+SPVYLPHLFFQATKGSSLAERSKH +QSRSP QN
Subjt:  GLNSQKVENVSSSLPRACNSLNQSNP------------------------------------VEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQN

Query:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN
        WLPSG EGSR TTLARPDFSSL+DA+TQPAEFGTSEKSI+ERVNC++++PVS V EGIQH RD  HG LE+ECEVQK+YG+DTT LQ+ + EF+VDEHFN
Subjt:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN

Query:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED S MEQAVNNAC+AQL SEAIQMETG PIAEFERFLHLSSPVI+Q PKLRSSEI PRN PGDVIPCSNET +ISL CLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK
        A  HENSNGFG  NSAFRAYFVPFLSA+QLFKS KTH GTTT P+GFDSCVSDIKVKEPSTCHLPIFS+LFP+P TDD S  RVCN+ HSSEQ LASEK+
Subjt:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPTMLNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR    TFT  LN PR+L+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

XP_004137638.2 uncharacterized protein LOC101212209 [Cucumis sativus]0.0e+0079.5Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ
        MQC LV SSDFQK LDKGKESLELRLE+NSCSRGI  DSKVSSFAWRN FDYR A+IS LT+ESDGLWRIVALPPQY+DSL++SCLPQMN+F A RKLVQ
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ

Query:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KGPASN TYSFNS RCRSLLESN KLLDSKAI+S  +SSGKF C SSCSGSALM SDS AIS IP+ GAK+QRYGKKNPRKKAKKKEIECK ISSDFV A
Subjt:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFLSEACGSNDSD RD SVLCSIA+ETFLP       DFE+D+  +IQP G  DS+SS+I D H+S+V S AIKNFSGYY+VCGSENQ
Subjt:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN
        ALI VPGC HV  G+NSRER  A SCNDFC K  LDN S DSK  SLN  CD+ NLKLNEK+GFGVDLLEERSSPS+      NS RDEVD+NA+VEK N
Subjt:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG
         GI+GCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR++SGGC EQLDQVSP+SKQFKGIC PVVGVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD
        N+KQLKEK PRRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNE+LD+R M FDIR+SSGDPRS FQND+TDKC  SES ES Q+ LD L+S+KLI+D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD

Query:  GLNSQKVENVSSSLPRACNSLNQSNPVEVQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNSQSRSPFQN
        GL+SQKVEN SSSLP++CNS NQSNPVEV+SP                                    VYLPHLFFQATKGSSL ERSKH++QSRSP QN
Subjt:  GLNSQKVENVSSSLPRACNSLNQSNPVEVQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNSQSRSPFQN

Query:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN
        WLPSG EGSR  TLARPDFSSL+DA+TQPAEFGT EKSI+ERVNCN+++PVS V EGIQH RD   GPLE+EC VQKMYGYDTT LQDH+SEFDVDEHFN
Subjt:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN

Query:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED S MEQAVNNACRAQL SEAIQMETG PIAEFERFLHLSSPVI+Q P   SS+I PRN PGDVIPCSNET +ISLGCLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK
        A G ENSNGFG  NSAFRAYFVPFLSAVQLFKS KTH GT T P+GF+SCVSDIKVKEPSTCHLPIFS+LFPKPCTDD S  RVCNQ HSSEQHLASEKK
Subjt:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPT+LNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+     TFT  LN PRIL+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

XP_038894653.1 uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida]0.0e+0085.74Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK
        MQCA + SSDFQK LDK KESLELRLEEN CSRGIKDSKVSSFAWRN F YRCAVISFLTVESDGLWRIVALP QY+DS+DVSCLPQMN+F AERKLVQ+
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK

Query:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPAS  TYSFNSFRCRSLLESN KLLDSKAI+SS+KSSGKFSC SSCS SALM SDSSAIS IP G AK+QRYGKKNPRKKAKKKEIE KKISS+FV AE
Subjt:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TEVSSKDSA GSFLS+ACGSNDSDC D SVLCSIA+E FLPDFRASKN FERD+++IIQP G  DSIS +I DE+ASEV SSAIKN+S YY+VCGS NQA
Subjt:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPNR
        LIKVPGC HV GGVNSRERLFA SC DFC K SLDNNSPDSKC SLNS  DNFNLKL EKKGFGVDLL+ERSSPS+EN    N+VRD VDVNA+VE+ N 
Subjt:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPNR

Query:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTGN
        GI+  TVSET SVLPGKKTKQNKKL GS+RMNRYGG  SSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSP+SKQFKGIC P VGVQMPKVKDK+TGN
Subjt:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTGN

Query:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISDG
        RKQLKEKFPRRLKRKNTSGQEK+Y PTRNSCGSNTSSMVHK PN+ LDIR M FDIR+SS DPRSRFQNDTTDKC TSESFESTQ+CL GL+S+KLIS+G
Subjt:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISDG

Query:  LNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEK
        LNSQKVEN SSS PR+C+SLNQSN VEVQSPVYLPHLFFQATKGSSLAERS HN+Q R P QNWLPSG EG  LTTLARPDFSS+KDAS QP   GTSEK
Subjt:  LNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEK

Query:  SIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIA
        SIQERVNCN+++PVSVV EGIQHSRDG+HGPLE+ECEVQKM+GYDTT LQDH+ EFDVDEHF+ KSS EDAS MEQAVNNACRAQLVSEAIQ+ETG PIA
Subjt:  SIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIA

Query:  EFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTH
        EFERFLHLSSPVINQ PKLR+SEI PRN PGDV+PCSNET +ISLGCLWQWYEKHGSYGLEIKANGHENSNGFG DNSAFRAYFVPFLSA+QLFKS KTH
Subjt:  EFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTH

Query:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI
         GTTT PVGFDSCV+DIKVKEPSTC LPIFSVLFPKPCTDDAS  RVC+Q HSSEQHLASEK+K S QSVN+KLSGESELIFEYFEGEQPQQRRPLFDKI
Subjt:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI

Query:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
        HQLVEGDG  QGKIYGDPTMLNS+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRT QSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
Subjt:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR

Query:  NIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL
        N  PTFTPGLN PRILEERLR+LEETASLMARAVVKKGNLNSEN HPDYEFFLSRRL
Subjt:  NIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL

XP_038894656.1 uncharacterized protein LOC120083142 isoform X2 [Benincasa hispida]0.0e+0082.97Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK
        MQCA + SSDFQK LDK KESLELRLEEN CSRGIKDSKVSSFAWRN F YRCAVISFLTVESDGLWRIVALP QY+DS+DVSCLPQMN+F AERKLVQ+
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK

Query:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPAS  TYSFNSFRCRSLLESN KLLDSKAI+SS+KSSGKFSC SSCS SALM SDSSAIS IP G AK+QRYGKKNPRKKAKKKEIE KKISS+FV AE
Subjt:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TEVSSKDSA GSFLS+ACGSNDSDC D SVLCSIA+E FLPDFRASKN FERD+++IIQP G  DSIS +I DE+ASEV SSAIKN+S YY+VCGS NQA
Subjt:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPNR
        LIKVPGC HV GGVNSRERLFA SC DFC K SLDNNSPDSKC SLNS  DNFNLKL EKKGFGVDLL+ERSSPS+EN    N+VRD VDVNA+VE+ N 
Subjt:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPNR

Query:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTGN
        GI+  TVSET SVLPGKKTKQNKKL GS+RMNRYGG  SSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSP+SKQFKGIC P VGVQMPKVKDK+TGN
Subjt:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTGN

Query:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISDG
        RKQLKEKFPRRLKRKNTSGQEK+Y PTRNSCGSNTSSMVHK PN+ LDIR M FDIR+SS DPRSRFQNDTTDKC TSESFESTQ+CL GL+S+KLIS+G
Subjt:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISDG

Query:  LNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEK
        LNSQKVEN SSS PR+C+SLNQSN VEVQSPVYLPHLFFQATKGSSLAERS HN+Q R P QNWLPSG EG  LTTLARPDFSS+KDAS QP   GTSEK
Subjt:  LNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEK

Query:  SIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIA
        SIQERVNCN+++PVSVV EGIQHSRDG+HGPLE+ECEVQKM+GYDTT LQDH+ EFDVDEHF+ KSS EDAS MEQAVNNACRAQLVSEAIQ+ETG PIA
Subjt:  SIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIA

Query:  EFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTH
        EFERFLHLSSPVINQ PKLR+SEI PRN PGDV+PCSNET +ISLGCLWQWYEKHGSYGLEIKANGHENSNGFG DNSAFRAYFVPFLSA+QLFKS KTH
Subjt:  EFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTH

Query:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI
         GTTT PVGFDSCV+DIKVKEPSTC LPIFSVLFPKPCTDDAS  RVC+Q HSSEQHLASEK+K S QSVN+KLSGESELIFEYFEGEQPQQRRPLFDK 
Subjt:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI

Query:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
                                          YSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRT QSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
Subjt:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR

Query:  NIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL
        N  PTFTPGLN PRILEERLR+LEETASLMARAVVKKGNLNSEN HPDYEFFLSRRL
Subjt:  NIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL

TrEMBL top hitse value%identityAlignment
A0A0A0LT77 Uncharacterized protein0.0e+0079.5Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ
        MQC LV SSDFQK LDKGKESLELRLE+NSCSRGI  DSKVSSFAWRN FDYR A+IS LT+ESDGLWRIVALPPQY+DSL++SCLPQMN+F A RKLVQ
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ

Query:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KGPASN TYSFNS RCRSLLESN KLLDSKAI+S  +SSGKF C SSCSGSALM SDS AIS IP+ GAK+QRYGKKNPRKKAKKKEIECK ISSDFV A
Subjt:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFLSEACGSNDSD RD SVLCSIA+ETFLP       DFE+D+  +IQP G  DS+SS+I D H+S+V S AIKNFSGYY+VCGSENQ
Subjt:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN
        ALI VPGC HV  G+NSRER  A SCNDFC K  LDN S DSK  SLN  CD+ NLKLNEK+GFGVDLLEERSSPS+      NS RDEVD+NA+VEK N
Subjt:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG
         GI+GCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR++SGGC EQLDQVSP+SKQFKGIC PVVGVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD
        N+KQLKEK PRRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNE+LD+R M FDIR+SSGDPRS FQND+TDKC  SES ES Q+ LD L+S+KLI+D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD

Query:  GLNSQKVENVSSSLPRACNSLNQSNPVEVQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNSQSRSPFQN
        GL+SQKVEN SSSLP++CNS NQSNPVEV+SP                                    VYLPHLFFQATKGSSL ERSKH++QSRSP QN
Subjt:  GLNSQKVENVSSSLPRACNSLNQSNPVEVQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNSQSRSPFQN

Query:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN
        WLPSG EGSR  TLARPDFSSL+DA+TQPAEFGT EKSI+ERVNCN+++PVS V EGIQH RD   GPLE+EC VQKMYGYDTT LQDH+SEFDVDEHFN
Subjt:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN

Query:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED S MEQAVNNACRAQL SEAIQMETG PIAEFERFLHLSSPVI+Q P   SS+I PRN PGDVIPCSNET +ISLGCLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK
        A G ENSNGFG  NSAFRAYFVPFLSAVQLFKS KTH GT T P+GF+SCVSDIKVKEPSTCHLPIFS+LFPKPCTDD S  RVCNQ HSSEQHLASEKK
Subjt:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPT+LNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+     TFT  LN PRIL+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A5D3BH03 Uncharacterized protein0.0e+0079.55Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ
        MQCALVRSSDFQK LDKGKESL+LRLE+NSCSRGI KD +VSSFAWRN FDYRCAVI FLT+ESDGLWRIVALPPQY+DSL+VSCLPQMN+F A RKLVQ
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ

Query:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KG ASN TYSFNS RCRSLLESN KLLDSKAI+S NKSSGK  C SSCS SALM SDS A S IPI GAK+QRYGKKNPRKKAKKKE+E KKISS+FV A
Subjt:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFLSEACGSNDSD R+ +VLCSIA ETFLP       DFERD++  IQP G  DS+SS+I D H+S+V SSAIKNFSGY++VCGSENQ
Subjt:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN
        AL   PGC HV  G+NSRE L A SCNDFC   SLDNNS DSK  SLNS CD+ NLKLNEKKGFGVDLLEERSSP RENC S NS RDEVD+N +VEK  
Subjt:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG
         GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR+NSGGC EQLDQVSP+SKQFKGIC PV GVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD
        NRKQLKEK  RRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNERLDIR M FDIR+SSG+PRSRFQNDTTDKCM SE+ E  Q+  D L S+KLI D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD

Query:  GLNSQKVENVSSSLPRACNSLNQSNP------------------------------------VEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQN
        GL+SQKVEN SSSLP++CNS NQSNP                                    VEV+SPVYLPHLFFQATKGSSLAERSKH +QSRSP QN
Subjt:  GLNSQKVENVSSSLPRACNSLNQSNP------------------------------------VEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQN

Query:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN
        WLPSG EGSR TTLARPDFSSL+DA+TQPAEFGTSEKSI+ERVNC++++PVS V EGIQH RD  HG LE+ECEVQK+YG+DTT LQ+ + EF+VDEHFN
Subjt:  WLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFN

Query:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED S MEQAVNNAC+AQL SEAIQMETG PIAEFERFLHLSSPVI+Q PKLRSSEI PRN PGDVIPCSNET +ISL CLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK
        A  HENSNGFG  NSAFRAYFVPFLSA+QLFKS KTH GTTT P+GFDSCVSDIKVKEPSTCHLPIFS+LFP+P TDD S  RVCN+ HSSEQ LASEK+
Subjt:  ANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPTMLNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR    TFT  LN PR+L+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A6J1C5T5 uncharacterized protein LOC1110087180.0e+0074.53Show/hide
Query:  MQCALVRS-SDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ
        MQCAL R  SD QK  DKGKE LE+R +E++CSR IKDS+VSS AWRN FDYRCAV+SFLT+ESDG W+IVA P QY+D L  SCLPQMN+FAAERKLVQ
Subjt:  MQCALVRS-SDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQ

Query:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KGPASN TYS NSFRCRSLLESN KLLDSKAI+S N+ SGKFSCRSSCS SAL+ SDSSAIS IPIGGAK+ RYGKKNPRKKAKKK IECKKIS DFVCA
Subjt:  KGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVSS+DSARGS L EACG+ND +  DGSV CS A+ETFLPD RASKN F+ ++++IIQP G   SISS+  +  AS+V  SA +N SG Y VCGSENQ
Subjt:  ETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKP
         L+KV GC+H  GGV+ RERLF   C DF  KG  DNNS +S+C S NS  D  NLKLNEK+ FGV LLEE++SPSREN C  H SVRDEVDVNA+VE+ 
Subjt:  ALIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKP

Query:  NRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKT
          GIQGCT SET  VLPGKKTKQNKKLTGSS++NR+G  G+SQRRTGKEN HTVWQKVQ+NNSGGCC QLDQVSP+ KQFKG C P VGVQ+PKVKD+KT
Subjt:  NRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKT

Query:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLIS
        GNRKQLK+K  R+L+RKNTS Q+K+YRP ++  G+NTSSMV K PNERLDI  M FDIR+ +   +S+ QND T KC+TSESFESTQ CLDGLMS +L+S
Subjt:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLIS

Query:  DGLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFF----QATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTTLARPDFSSLKDASTQPAE
        DGLNSQ+VEN  SS  R+CNSL+QSN +EV SP+YLPHLFF    Q T+GSSLAE SKHN+ SRSP QNW+PSG EGSRLTTLA PD SSLK  +  PAE
Subjt:  DGLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFF----QATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTTLARPDFSSLKDASTQPAE

Query:  FGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQME
         GTSE+SIQERV C++ DPVSVVTE  + SRDG+HGPLE+ECEVQKM  +D T LQDH  E D+DEHFN KSSCEDAS MEQAVNNACR QL SEA+QME
Subjt:  FGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQME

Query:  TGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLF
        TG PIAEFE FLHLSSPVI+Q PKL+S +I PRN  GD I CS+E  +ISLGCLWQWYEKHGSYGLEIKA G+EN+N F  DNSAF AYFVPFLSAVQLF
Subjt:  TGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLF

Query:  KSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRR
        KSHKTH GTT +P G DSCV +IK+KEPSTCHLPIFSVLFPKP TDDAS   V +Q HSSEQ LASEK K S QSV+LKLSGESEL+FEYFE E PQQRR
Subjt:  KSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRR

Query:  PLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNE
        PLFDKI QLV GDGRLQGKIYGDPTMLNS+TLNDLHA SWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFV RTSQ NS DT+SCLVCPVVGLQSYNAQNE
Subjt:  PLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNE

Query:  CWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        CWFEPRN    F   ++ P ILEERLR+LEETASLMARA+VKKGNLNSEN HPDYEFFLSRR
Subjt:  CWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A6J1GS60 uncharacterized protein LOC1114570060.0e+0073.17Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK
        MQCAL +SS+FQK  DKGK+ LE++++E++CSR IKDS+VSSF WRN FDYR AVIS LT+ESDGLWRIVALP Q +DSL VSCLPQMN+F A+RKLV  
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK

Query:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPASN TYS NSFRCRSLLESN  LLDSKA +SSNK+S KFS RSSCS SAL+  DSSAIS IPIG AK+QRYGKKN RKKAKK++IECKK SSDFV AE
Subjt:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TE+SS+DSARGS L EACG+N SDCRDG VLCS ARETF  D RASKNDF+RD+++IIQP G TDSISS+I +  ASEVP SA KN SG Y    SENQ 
Subjt:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKPN
        LIK PGCT   G V+ +ERLF   CNDFC K S DNNSPD       S CD+  LKL E +GFG+DLLE ++SPSREN C  HNS+RDEVDVNA+ EK N
Subjt:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKPN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVS-PVSKQFKGICTPVVGVQMPKVKDKKT
         GIQGCT SET  +LPGKKTKQNKKL+G+SR NR+GG GSSQR TGKEN  TVWQKVQ+NNSGGCC QLDQVS PVSKQ KG+C P VGVQ PKVKDKKT
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVS-PVSKQFKGICTPVVGVQMPKVKDKKT

Query:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLIS
        GNRKQLK+KF +RLK KNTS Q+K+YRP+++S GSNT+SM H  PNERLDI  M FDI KSSG  R+ FQND+TDKC TSES ESTQ+CLDG MS KLIS
Subjt:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLIS

Query:  DGLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTT-LARPDFSSLKDASTQPAEFGT
        DGLN+Q+VEN SS+   +C+SLNQSNP++ QSPVY+PHLFFQATKGSSLAERSKH++QSRSP QNW+PS  EGSRLTT LARPDFSSLKDA+ QPAEFG 
Subjt:  DGLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTT-LARPDFSSLKDASTQPAEFGT

Query:  SEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGI
        SEKSIQE V+CN++DPVS   E IQHSRD +H PLE ECE Q+ +G+DT ALQD   E DVDEHFN KS+C DA+ +EQ VN+AC+AQL  +A+      
Subjt:  SEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGI

Query:  PIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSH
         IAEFERFLHLSSPVI+Q P LRS +I  +NS GD IPCS+ETA+ISL CLWQWYEKHGSYGLE+KANGHE SNGFG DNS F AYFVPFLSAVQLFKSH
Subjt:  PIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSH

Query:  KTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLF
        KTH G TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTDDA+  + C+QLHSSE+ LASEK+  S QSV+  LSGESELIFEYFE EQPQQRRPLF
Subjt:  KTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLF

Query:  DKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWF
        DKI QLV+GDG L+GKIYGDPT+L S+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFV RTSQS+S +T+SC+VCPVVGLQS+NAQNECWF
Subjt:  DKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWF

Query:  EPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        +PRN    F P    P +++ERLR+LEETASLMARAVVKKGNLN+ N HPDYEFFLSRR
Subjt:  EPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A6J1K4L4 uncharacterized protein LOC1114900280.0e+0073.14Show/hide
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK
        MQCAL +SS+FQK  DKGK+ LE++++E++CSR IKDS+VSSF WRN FDYR AVIS LT+ESDGLWRIVALP Q +DSL VSCLPQMN+F A+RKLV  
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLDVSCLPQMNRFAAERKLVQK

Query:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPASN TYS NSFRCRSLLESN  LLDSKA +SSNK+S KFS RSSCS SAL+  DSSAIS IPIG  K+QRYGKKN RKKAKK++IECKK SSDFV AE
Subjt:  GPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TEVSS+DSAR S L E  G+N SDCRDGSVLCS ARETF  D RASKNDF+RD+++IIQP G TDSISS+I +  ASE+P SA KN  G Y   GSENQ 
Subjt:  TEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKPN
        LIK PGCT   G V+ +ERLF   CNDFC K S DNNSPD       S CD+  LKL E +GFG+DLLE ++SPSREN C  HNSVRD VDVNA+ EK N
Subjt:  LIKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSREN-CYSHNSVRDEVDVNAKVEKPN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG
         GIQGCT SETC +LPGKKTKQNKKL+G+SR NR+GG GSSQR TGKEN  TVWQKVQ+NNSGGCC QLDQVSP+SKQ KGIC P VGVQ PKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD
        NRKQLK+KF +RLK KN+S Q+K+YRP+++S GSNT+SM H  PNERL I  M FD+ KSS   R+ FQND+TDK MTSES ESTQ+CLDG MS KLISD
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISD

Query:  GLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTT-LARPDFSSLKDASTQPAEFGTS
        GLN+Q+VEN SS+   +C+S+NQSNP++ QSPVY+PHLFFQATKGSSLAERSKH++QSRSP QNW+PS  EGSRLTT LARPDFSSLKDA+ QPAEFG S
Subjt:  GLNSQKVENVSSSLPRACNSLNQSNPVEVQSPVYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTT-LARPDFSSLKDASTQPAEFGTS

Query:  EKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIP
        EKSIQE VNCN++DPVS V E IQHSRDG+H PLE ECE Q+ +G+DT ALQDHR E DVDEHFN K++C DA+ +EQ VN+AC+AQL  +A+       
Subjt:  EKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKMYGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIP

Query:  IAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHK
        IAEFERFLHLSSPVI+Q P LRS EI  +NS GDVIPCS+ETA+ISLGCLWQWYEKHGSYGLE+KANGHE SNGFG DNS F AYFVPFLSAVQLFKSHK
Subjt:  IAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHK

Query:  THGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFD
        TH G TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTD+A+  + C+QLHSSE+ LASEK+  S QSV+  LSGESELIFEYFE EQPQQRRPLFD
Subjt:  THGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFD

Query:  KIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFE
        KI QLV+GDG L+GKIYGDPT+L S+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFV RTSQS+S +T+SC+VCPVVGLQS+NAQNECWF+
Subjt:  KIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFE

Query:  PRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        PR    TF P    P +++ERLR+LEETASL+ARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  PRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

SwissProt top hitse value%identityAlignment
P53803 DNA-directed RNA polymerases I, II, and III subunit RPABC44.0e-0961.9Show/hide
Query:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Subjt:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Q3ZBC0 DNA-directed RNA polymerases I, II, and III subunit RPABC44.0e-0961.9Show/hide
Query:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Subjt:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Q63871 DNA-directed RNA polymerases I, II, and III subunit RPABC44.0e-0961.9Show/hide
Query:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Subjt:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Q9C8M4 DNA-directed RNA polymerase subunit 12-like protein3.6e-1073.17Show/hide
Query:  DPQPEP-VSYICGDCGMENTLKQGDVIQCRECGYRILYKKR
        D QPE  V Y+CGDCG EN LK+GDV QCR+CG+RILYKKR
Subjt:  DPQPEP-VSYICGDCGMENTLKQGDVIQCRECGYRILYKKR

Q9FLM8 DNA-directed RNA polymerases II, IV and V subunit 123.0e-1788.64Show/hide
Query:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Subjt:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Arabidopsis top hitse value%identityAlignment
AT1G03610.1 Protein of unknown function (DUF789)6.5e-1526.86Show/hide
Query:  AEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETAD-ISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHK
        +  +RFLH  +P++     L  +EI   N      P   +  +   L  LW  Y++  +YG  +  +         T+  +   Y+VP+LSA+Q+F SH 
Subjt:  AEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETAD-ISLGCLWQWYEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHK

Query:  THGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFD
                     S +   +  E   C           P +D  S   V      SE+ L +         +         L  +YFE   P  R PL D
Subjt:  THGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFD

Query:  KIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSL-GHFVSRTSQSNSPDTNSC------LVCPVVGLQ
        KI++L +   R  G        L SL   DL   SW SVAWYPIY IP G    +L   FLTYH+L   F     + N  D          +     G+ 
Subjt:  KIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSL-GHFVSRTSQSNSPDTNSC------LVCPVVGLQ

Query:  SYNAQNECW
        +Y  Q + W
Subjt:  SYNAQNECW

AT1G15030.1 Protein of unknown function (DUF789)1.4e-1425.21Show/hide
Query:  CEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGH
        C  +   ++   +A     VSEA         +  ERFL   +P +   P    S+   R   G  +   ++     LG +W+ + +  +YG+ +    +
Subjt:  CEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGH

Query:  ENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSV
         N       +  F+ Y+VP LS +Q++                D+  S ++ +         F     +  + + S+S     L  S++ +++   K S+
Subjt:  ENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSV

Query:  QSVNLK---------LSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAF
        +  + +         LS +  LIFEY E + P  R P  DK+  L      L+           +L   DL   SW+SVAWYPIY+IP G    +L A F
Subjt:  QSVNLK---------LSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAF

Query:  LTYHSL-----GHFVSRTSQSNSPDTNSC--LVCPVVGLQSYNAQNECW
        LTYHSL     G  V+  S        S   +  PV GL SY  +   W
Subjt:  LTYHSL-----GHFVSRTSQSNSPDTNSC--LVCPVVGLQSYNAQNECW

AT2G01260.1 Protein of unknown function (DUF789)8.4e-1528.7Show/hide
Query:  YFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIF
        Y+VP LSA+Q++        +       DS  SD +                    + D+ + RV  ++      L  + ++ S       L  +  L+F
Subjt:  YFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIF

Query:  EYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSL-----GHFVSRTSQSNSP
        EY E + P  R P  DK+  L      L            +L   DL   SW+SVAWYPIYRIP G    +L A FLTYHSL     G    ++     P
Subjt:  EYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSL-----GHFVSRTSQSNSP

Query:  DTNSCLVCPVVGLQSYNAQNECW
          +  +  PV GL SY  +   W
Subjt:  DTNSCLVCPVVGLQSYNAQNECW

AT4G16100.1 Protein of unknown function (DUF789)6.3e-1828.65Show/hide
Query:  EQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVIN-QTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGF
        E+   + C       +    TG   +   RFL  ++P+++ Q   L SS+ +    P              L  LW  +E+  +YG+ +        NG 
Subjt:  EQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVIN-QTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGHENSNGF

Query:  GTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKK---SSVQSV
             +   Y+VP+LS +QL++          DP    +C +  +V E S           P+  + D S    C +L  +    + E+K    SS    
Subjt:  GTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASEKKK---SSVQSV

Query:  NLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSLGHFVSR
            +   EL+FEY EG  P  R PL DKI  L                 L +    DL   SW SVAWYPIYRIP G    NL A FLT+HSL      
Subjt:  NLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSLGHFVSR

Query:  TS----QSNSPDTNSC-LVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLE
        TS    QS+S    S  L  P  GL SY  +   W    ++      G    R  EE LR L+
Subjt:  TS----QSNSPDTNSC-LVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLE

AT5G41010.1 DNA directed RNA polymerase, 7 kDa subunit2.1e-1888.64Show/hide
Query:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Subjt:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCT
CTACAAGAAGCGCACCCGTCGCAAATCTATTCTACTCAACCTATGGCGATGGTTGACCAGCAAAAGTATTTCTTGGTTGTTCAGTATGAGGCCCGCTGAAATGCTTATTC
TGGCGATTGACTTTCTAACTGGATGGTTCACTTTTGCATTTCGATATCAGCATTTGATTCCTATTCAAAATTTTCTTGTATTTTTCCTGAGGGTGAAAGAAGAAGATAAA
AGCTATTGTCCACTGGAACAACAAGCTTTGCTCCTTGTAATTTTCCTTCGATTCTCTCCCGCTAATGGAAATGGGTTACAGAAAACGATGCAGTGTGCTCTTGTAAGAAG
TAGTGATTTTCAGAAAGGTCTAGACAAAGGAAAGGAGTCGTTGGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGATTCTAAAGTTTCTTCGTTTGCCT
GGAGGAACCTTTTTGATTACAGATGTGCCGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCGCAATACGTAGATAGCTTGGAT
GTGAGCTGTCTGCCTCAAATGAATCGGTTTGCAGCTGAGAGAAAATTGGTCCAGAAAGGCCCTGCCTCTAATGTTACATATTCATTTAATTCATTCAGATGTAGAAGCTT
GCTGGAGTCGAACAATAAGTTATTGGATAGTAAAGCAATTGAGTCGTCGAATAAATCCTCTGGCAAGTTCTCTTGCAGGAGTTCATGTTCTGGCTCTGCTTTGATGTTAA
GTGACTCTAGTGCAATCTCTGCCATCCCAATTGGTGGAGCTAAATTGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAG
ATATCTTCTGATTTTGTCTGTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCGTGGAAGTTTTTTGTCAGAAGCTTGTGGCAGTAATGATTCAGATTGTAGAGATGG
ATCTGTTTTGTGTTCGATTGCACGAGAAACTTTTCTGCCAGATTTTAGGGCCAGTAAAAATGATTTTGAACGAGATACTAAGAAAATTATTCAGCCACCTGGAATCACAG
ATTCGATATCCTCTAAAATTGATGACGAGCATGCATCTGAGGTTCCATCTTCTGCAATAAAGAATTTTAGTGGGTATTATCAAGTTTGTGGATCCGAAAACCAGGCCCTA
ATCAAAGTACCTGGTTGTACCCATGTCAGTGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCCAGCAGCTGCAATGATTTTTGCCCTAAGGGTTCTTTGGATAATAATTC
CCCAGATTCTAAGTGTTTTAGTTTAAACAGTACCTGCGATAATTTTAACTTGAAATTAAATGAAAAGAAAGGTTTTGGAGTTGATCTGTTGGAAGAACGAAGTTCACCTT
CTAGAGAGAACTGTTATTCTCATAACTCAGTAAGAGATGAAGTAGATGTAAATGCCAAAGTGGAGAAACCTAATCGTGGTATTCAGGGATGTACTGTGAGTGAAACTTGT
TCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTCAAGAATGAATAGATATGGTGGTCCAGGGAGTTCACAAAGGCGTACAGGGAAGGAAAA
CAGACATACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGGATGTTGTGAACAGTTAGACCAAGTAAGTCCTGTCAGCAAACAGTTTAAAGGCATCTGTACTCCTG
TTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACGGGGAACAGAAAACAGCTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACGTCAGGACAAGAG
AAGGTCTATCGTCCTACTAGGAACAGTTGTGGTAGTAATACTAGTTCAATGGTTCACAAACCACCAAATGAAAGGTTGGATATTCGACCAATGGACTTTGACATAAGAAA
ATCAAGTGGCGATCCAAGATCTCGTTTTCAAAATGATACTACTGATAAATGCATGACTTCTGAATCATTTGAAAGTACACAACTCTGTCTAGATGGATTGATGTCAAGCA
AACTTATCTCCGATGGTTTGAATAGTCAAAAAGTAGAGAATGTCTCTAGCTCATTGCCAAGGGCCTGCAACTCCTTAAATCAGTCAAACCCGGTAGAGGTTCAGTCTCCT
GTTTACCTTCCGCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAACGCAGCAAGCATAACAGCCAATCTAGATCACCCTTTCAAAACTGGTTGCCAAGCGG
GGGAGAAGGTTCCAGATTGACCACCTTGGCCAGACCTGATTTTTCATCTCTGAAAGATGCAAGTACGCAACCTGCTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAAC
GAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTCACTGAGGGGATTCAGCATTCTAGAGATGGGAGTCATGGTCCTTTAGAAAATGAATGTGAGGTGCAGAAGATG
TATGGTTACGATACAACTGCACTACAGGATCATAGGTCTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTATAATGGAGCAAGCAGT
GAATAATGCATGTAGGGCACAATTGGTATCTGAAGCTATTCAAATGGAAACTGGTATTCCAATCGCAGAGTTCGAAAGATTCCTTCATTTGTCCTCCCCTGTTATCAACC
AGACACCCAAGCTAAGAAGTAGTGAAATTTACCCAAGAAATTCGCCAGGTGATGTGATTCCGTGTAGCAATGAGACTGCTGACATTTCTTTGGGTTGCCTGTGGCAATGG
TATGAAAAGCATGGCAGCTATGGCTTAGAAATAAAAGCCAATGGTCATGAAAATTCAAATGGTTTTGGCACTGATAACTCTGCATTCCGTGCATATTTTGTTCCATTTCT
TTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGGGGGAACAACTACAGATCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCCACTT
GTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGCTTCGCGGGTTTGTAATCAGCTACATAGTTCAGAGCAACATTTGGCTTCTGAG
AAGAAGAAATCTTCAGTACAATCAGTCAACTTAAAATTATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCATTATTTGA
TAAGATACATCAGCTGGTCGAGGGTGATGGACGTCTACAAGGAAAAATATACGGGGATCCGACCATGCTCAATTCCTTAACTTTGAATGATCTGCATGCTGGATCATGGT
ACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCTAGAACTTCCCAATCTAAC
TCTCCAGATACAAATTCTTGCTTAGTCTGTCCAGTTGTGGGTCTTCAAAGTTATAATGCACAGAATGAATGCTGGTTCGAGCCTAGAAACATTGCGCCCACATTTACCCC
TGGCTTAAATTCTCCTAGAATTCTCGAGGAGCGCCTGAGGTCGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCCGAAAACA
TGCATCCAGATTATGAGTTCTTCCTCTCCCGGCGACTCTAG
mRNA sequenceShow/hide mRNA sequence
GAAAAAGAAAACCCTACTACACGTTCCCTCTCCCCTTTTATTTTGATTGATTTTCTCCTCACCCATCATTGAAGCAGCCACAGCCAAAGACCTATACACGACTTTGATCT
CTCTCTCTGCTCTCGTATCTCCCTCGCAGCGGCAGGAAGAATTTTGAAGTTTGACTCATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGTGGAATGGA
GAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCAAATCTATTCTACTCAACCTATGGCGATGGT
TGACCAGCAAAAGTATTTCTTGGTTGTTCAGTATGAGGCCCGCTGAAATGCTTATTCTGGCGATTGACTTTCTAACTGGATGGTTCACTTTTGCATTTCGATATCAGCAT
TTGATTCCTATTCAAAATTTTCTTGTATTTTTCCTGAGGGTGAAAGAAGAAGATAAAAGCTATTGTCCACTGGAACAACAAGCTTTGCTCCTTGTAATTTTCCTTCGATT
CTCTCCCGCTAATGGAAATGGGTTACAGAAAACGATGCAGTGTGCTCTTGTAAGAAGTAGTGATTTTCAGAAAGGTCTAGACAAAGGAAAGGAGTCGTTGGAATTGAGAC
TCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGATTCTAAAGTTTCTTCGTTTGCCTGGAGGAACCTTTTTGATTACAGATGTGCCGTCATTAGTTTTCTTACAGTCGAA
TCTGATGGACTCTGGAGAATTGTTGCACTACCACCGCAATACGTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCGGTTTGCAGCTGAGAGAAAATTGGTCCA
GAAAGGCCCTGCCTCTAATGTTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCGAACAATAAGTTATTGGATAGTAAAGCAATTGAGTCGTCGAATA
AATCCTCTGGCAAGTTCTCTTGCAGGAGTTCATGTTCTGGCTCTGCTTTGATGTTAAGTGACTCTAGTGCAATCTCTGCCATCCCAATTGGTGGAGCTAAATTGCAGAGA
TATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGATATCTTCTGATTTTGTCTGTGCTGAAACAGAAGTATCATCCAAGGATTCTGC
CCGTGGAAGTTTTTTGTCAGAAGCTTGTGGCAGTAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACGAGAAACTTTTCTGCCAGATTTTAGGGCCA
GTAAAAATGATTTTGAACGAGATACTAAGAAAATTATTCAGCCACCTGGAATCACAGATTCGATATCCTCTAAAATTGATGACGAGCATGCATCTGAGGTTCCATCTTCT
GCAATAAAGAATTTTAGTGGGTATTATCAAGTTTGTGGATCCGAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAGTGGGGGAGTAAATTCAAGAGAGAG
GTTATTTGCCAGCAGCTGCAATGATTTTTGCCCTAAGGGTTCTTTGGATAATAATTCCCCAGATTCTAAGTGTTTTAGTTTAAACAGTACCTGCGATAATTTTAACTTGA
AATTAAATGAAAAGAAAGGTTTTGGAGTTGATCTGTTGGAAGAACGAAGTTCACCTTCTAGAGAGAACTGTTATTCTCATAACTCAGTAAGAGATGAAGTAGATGTAAAT
GCCAAAGTGGAGAAACCTAATCGTGGTATTCAGGGATGTACTGTGAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTC
AAGAATGAATAGATATGGTGGTCCAGGGAGTTCACAAAGGCGTACAGGGAAGGAAAACAGACATACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGGATGTTGTG
AACAGTTAGACCAAGTAAGTCCTGTCAGCAAACAGTTTAAAGGCATCTGTACTCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACGGGGAACAGAAAA
CAGCTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACGTCAGGACAAGAGAAGGTCTATCGTCCTACTAGGAACAGTTGTGGTAGTAATACTAGTTCAATGGT
TCACAAACCACCAAATGAAAGGTTGGATATTCGACCAATGGACTTTGACATAAGAAAATCAAGTGGCGATCCAAGATCTCGTTTTCAAAATGATACTACTGATAAATGCA
TGACTTCTGAATCATTTGAAAGTACACAACTCTGTCTAGATGGATTGATGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTCAAAAAGTAGAGAATGTCTCTAGCTCA
TTGCCAAGGGCCTGCAACTCCTTAAATCAGTCAAACCCGGTAGAGGTTCAGTCTCCTGTTTACCTTCCGCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGA
ACGCAGCAAGCATAACAGCCAATCTAGATCACCCTTTCAAAACTGGTTGCCAAGCGGGGGAGAAGGTTCCAGATTGACCACCTTGGCCAGACCTGATTTTTCATCTCTGA
AAGATGCAAGTACGCAACCTGCTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAACGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTCACTGAGGGGATTCAG
CATTCTAGAGATGGGAGTCATGGTCCTTTAGAAAATGAATGTGAGGTGCAGAAGATGTATGGTTACGATACAACTGCACTACAGGATCATAGGTCTGAGTTTGATGTGGA
TGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTATAATGGAGCAAGCAGTGAATAATGCATGTAGGGCACAATTGGTATCTGAAGCTATTCAAATGGAAACTG
GTATTCCAATCGCAGAGTTCGAAAGATTCCTTCATTTGTCCTCCCCTGTTATCAACCAGACACCCAAGCTAAGAAGTAGTGAAATTTACCCAAGAAATTCGCCAGGTGAT
GTGATTCCGTGTAGCAATGAGACTGCTGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAGCATGGCAGCTATGGCTTAGAAATAAAAGCCAATGGTCATGAAAA
TTCAAATGGTTTTGGCACTGATAACTCTGCATTCCGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGGGGGAACAACTACAG
ATCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCCACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCA
AGTGCTTCGCGGGTTTGTAATCAGCTACATAGTTCAGAGCAACATTTGGCTTCTGAGAAGAAGAAATCTTCAGTACAATCAGTCAACTTAAAATTATCTGGAGAATCAGA
ACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCATTATTTGATAAGATACATCAGCTGGTCGAGGGTGATGGACGTCTACAAGGAAAAATATACG
GGGATCCGACCATGCTCAATTCCTTAACTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCT
GCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCTAGAACTTCCCAATCTAACTCTCCAGATACAAATTCTTGCTTAGTCTGTCCAGTTGTGGGTCTTCAAAGTTA
TAATGCACAGAATGAATGCTGGTTCGAGCCTAGAAACATTGCGCCCACATTTACCCCTGGCTTAAATTCTCCTAGAATTCTCGAGGAGCGCCTGAGGTCGCTGGAAGAGA
CTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCCGAAAACATGCATCCAGATTATGAGTTCTTCCTCTCCCGGCGACTCTAGTTACATAACCAG
GTTTTATGCTTAGAACTTATCAAGGAGATTCTTTCCTTTTGTCAATATTCTTGAGAATTTAGTTTAGGTTAGGACCTAATCTCAAGTAGGGATGCAAGAATAGCTGATCT
TCGTCTGTAATGTATTTACACAATTTTTTTTGTTCCTTTTCTGTCTTTCTATGCTAACTCTTTCAGGCAGTGCAACTTTCATCTGTATATATTGATTTGTACATAAATTA
CTTCTCATGTAAAAATGAAATATCTCCTGGAT
Protein sequenceShow/hide protein sequence
MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRKSILLNLWRWLTSKSISWLFSMRPAEMLILAIDFLTGWFTFAFRYQHLIPIQNFLVFFLRVKEEDK
SYCPLEQQALLLVIFLRFSPANGNGLQKTMQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNLFDYRCAVISFLTVESDGLWRIVALPPQYVDSLD
VSCLPQMNRFAAERKLVQKGPASNVTYSFNSFRCRSLLESNNKLLDSKAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIGGAKLQRYGKKNPRKKAKKKEIECKK
ISSDFVCAETEVSSKDSARGSFLSEACGSNDSDCRDGSVLCSIARETFLPDFRASKNDFERDTKKIIQPPGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQAL
IKVPGCTHVSGGVNSRERLFASSCNDFCPKGSLDNNSPDSKCFSLNSTCDNFNLKLNEKKGFGVDLLEERSSPSRENCYSHNSVRDEVDVNAKVEKPNRGIQGCTVSETC
SVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPVSKQFKGICTPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQE
KVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRKSSGDPRSRFQNDTTDKCMTSESFESTQLCLDGLMSSKLISDGLNSQKVENVSSSLPRACNSLNQSNPVEVQSP
VYLPHLFFQATKGSSLAERSKHNSQSRSPFQNWLPSGGEGSRLTTLARPDFSSLKDASTQPAEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGSHGPLENECEVQKM
YGYDTTALQDHRSEFDVDEHFNSKSSCEDASIMEQAVNNACRAQLVSEAIQMETGIPIAEFERFLHLSSPVINQTPKLRSSEIYPRNSPGDVIPCSNETADISLGCLWQW
YEKHGSYGLEIKANGHENSNGFGTDNSAFRAYFVPFLSAVQLFKSHKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASASRVCNQLHSSEQHLASE
KKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQSN
SPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNSPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL