CuGenDBv2

CcUC01G014490 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC01G014490
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Protein of unknown function (DUF789)
Genome location: CicolChr01:27245749..27260141
RNA-Seq Expression: CcUC01G014490
Synteny: CcUC01G014490
Gene Ontology terms:
GO:0006351 - transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domains:
IPR006591 - RNA polymerase archaeal subunit P/eukaryotic subunit RPABC4
IPR008507 - Protein of unknown function DUF789
IPR029040 - RNA polymerase subunit RPABC4/transcription elongation factor Spt4
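
The genome location above uses a `seqid:start..end` layout. As a minimal sketch of how such a string could be split and the locus span computed, assuming the coordinates are 1-based and inclusive (the page does not state this) and using hypothetical helper names `parse_location` and `span_length`:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("CicolChr01:27245749..27260141")
locus_length = span_length(start, end)
print(seqid, start, end, locus_length)
```

Under that coordinate assumption the locus spans 14393 bp; with a 0-based, half-open convention the figure would be one less.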


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6572995.1 DNA-directed RNA polymerases II, IV and V subunit 12, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 68.67
Query:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRKSILLNLWRWL-TSKSISWLFSMRPAEMLILAIDVLTGWFTFAFRYQHLIPIQNFF
        MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR  I+ + W  L  S+ + W                                     
Subjt:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRKSILLNLWRWL-TSKSISWLFSMRPAEMLILAIDVLTGWFTFAFRYQHLIPIQNFF

Query:  VFFLRVKEEDKSYCPLEKQALLLVIFLRFSPANGNGLQKTMQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLT
                                            +QKTMQCAL +SS+FQK  DKGK+ LE++++E++CSR IKDS+VSSF WRNFFDYR AVIS LT
Subjt:  VFFLRVKEEDKSYCPLEKQALLLVIFLRFSPANGNGLQKTMQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLT

Query:  VESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRKGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAI
        +ESDGLWRIVALP   +DSL VSCLPQMN+F A+RKLV  GPAS+GTYS NSFRCRSLLESN  LLDS+A +SSNK+S KFS RSSCS SAL+  DSSAI
Subjt:  VESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRKGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAI

Query:  SAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQP
        S IPI  AK+QRYGKKN RKKAKK++IECKK SSDFV AETE+SS+DSARGS LLEACG+N SDCRDGSVLCS ARETF  D RASK+DF+RD+++IIQP
Subjt:  SAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQP

Query:  LGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEK
        LG TDSISS+I +  ASEVP SA KN SG Y    SENQ LIK PGCT  +G V+ +ERLF   CNDFC K S DNNSP        S CD+  LKL E 
Subjt:  LGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEK

Query:  KGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKANRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRN
        +GFG+DLLE ++SPSREN  S HNS+RDEVDVNA+ EKAN GIQGCT SET  +LPGKKTKQNKKL+G+SR NR+GG GSSQR TGKEN  TVWQKVQ+N
Subjt:  KGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKANRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRN

Query:  NSGGCCEQLDQVS-PISKQFKGICTPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRR
        NSGGCC QLDQVS P+SKQ KG+C P VGVQ PKVKDKKTGNRKQLK+KF +RLK KNTS Q+K+YRP+++S GSNT+SM H  PNERLDI  M FDI +
Subjt:  NSGGCCEQLDQVS-PISKQFKGICTPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRR

Query:  SSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISDGLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSR
        SS   R+ FQND+TDKC TSESSESTQ CLDG MS KLISDGLN+QRVEN SS+  R+C+SLNQSN ++ QSPVY+PHLFFQATKGSSLAERSKH+NQSR
Subjt:  SSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISDGLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSR

Query:  SPFQNWLPIGGEGSRLTT-LARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFD
        SP QNW+P   EGSRLTT L RPDFSS+KDA+ QPAEF  SEKSIQE V+CN++DPVS   E IQHSRDG+H PLE ECE Q+ +G+ T ALQD R E D
Subjt:  SPFQNWLPIGGEGSRLTT-LARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFD

Query:  VDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGS
        VDEHFN KS+C D++R+EQ VN+AC+AQL  +A+       IA  ERFLHLSSPVI+Q P LRS +I  +N  GD IPCS++TA+ISL CLWQWYEKHGS
Subjt:  VDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGS

Query:  YGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQH
        YGLE+KANG+E SNGFG +NS F AYFVPFLSAVQLFKS KTH G TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTDDA+V + C+QLHSSE+ 
Subjt:  YGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQH

Query:  LASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTY
        LASEK+  S QSV+  LSGESELIFEYFE EQPQQRRPLFDKI QLV+GDG L+GKIYGDPT+L S+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTY
Subjt:  LASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTY

Query:  HSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        HSL HFV RTSQS+S +T+SC+VCPVVGLQS+NAQNECWF+PRN   TF    NPP +++ERLR+LEETASLMARAVVKKG+LNS N HPDYEFFLSRR
Subjt:  HSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

TYJ99070.1 uncharacterized protein E5676_scaffold248G002740 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 78.21
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR
        MQCALVRSSDFQK LDKGKESL+LRLE+NSCSRGI KD +VSSF WRNFFDYRCAVI FLT+ESDGLWRIVALPP Y+DSL+VSCLPQMN+F A RKLV+
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR

Query:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KG ASNGTYSFNS RCRSLLESN KLLDS+AI+S NKSSGK  C SSCS SALM SDS A S IPI GAK+QRYGKKNPRKKAKKKE+E KKISS+FV A
Subjt:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFL EACGSNDSD R+ +VLCSIA ETFLP       DFERD++  IQPLG  DS+SS+I D H+S+V SSAIKNFSGY++VCGSENQ
Subjt:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN
        AL   PGC HV+ G+NSRE L A SCNDFC   SLDNNS   K  SL+S CD+ NLKLNEKKGFGVDLLE+RSSP REN  S NS RDEVD+N +VEK  
Subjt:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG
         GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR+NSGGC EQLDQVSPISKQFKGIC PV GVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD
        NRKQLKEK  RRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNERLDIR M FDIRRSSG+PRSRFQNDTTDKC  SE+ E  Q   D L S+KLI D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD

Query:  GLNSQRVENVSSSLPRACNSLNQ------------------------------------SNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQN
        GL+SQ+VEN SSSLP++CNS NQ                                    SN +E++SPVYLPHLFFQATKGSSLAERSKH  QSRSP QN
Subjt:  GLNSQRVENVSSSLPRACNSLNQ------------------------------------SNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQN

Query:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN
        WLP G EGSR TTLARPDFSS++DA+ QPAEF TSEKSI+ERVNC++++PVS   EGIQH RD  HG LE+ECEVQK+YG+ TT LQ+ + EF+VDEHFN
Subjt:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN

Query:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED SRMEQAVNNAC+AQL SEAIQMETG PIA  ERFLHLSSPVI+Q PKLRSSEI PRNLPGDVIPCSNET +ISL CLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK
        A  +ENSNGFG  NSAFRAYFVPFLSA+QLFKS KTH GTTT P+GFDSCVSDIKVKEPSTCHLPIFS+LFP+P TDD SV RVCN+ HSSEQ LASEK+
Subjt:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPTMLNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL HF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR    TFT  LNPPR+L+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

XP_004137638.2 uncharacterized protein LOC101212209 [Cucumis sativus] | e value: 0.0e+00 | % identity: 78.49
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR
        MQC LV SSDFQK LDKGKESLELRLE+NSCSRGI  DSKVSSF WRNFFDYR A+IS LT+ESDGLWRIVALPP Y+DSL++SCLPQMN+F A RKLV+
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR

Query:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KGPASNGTYSFNS RCRSLLESN KLLDS+AI+S  +SSGKF C SSCSGSALM SDS AIS IP+ GAK+QRYGKKNPRKKAKKKEIECK ISSDFV A
Subjt:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFL EACGSNDSD RD SVLCSIA+ETFLP       DFE+D+  +IQPLG  DS+SS+I D H+S+V S AIKNFSGYY+VCGSENQ
Subjt:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN
        ALI VPGC HV+ G+NSRER  A SCNDFC K  LDN S   K  SL+  CD+ NLKLNEK+GFGVDLLE+RSSPS+      NS RDEVD+NA+VEKAN
Subjt:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG
         GI+GCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR++SGGC EQLDQVSPISKQFKGIC PVVGVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD
        N+KQLKEK PRRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNE+LD+R M FDIRRSSGDPRS FQND+TDKCT SES ES Q  LD L+S+KLI+D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD

Query:  GLNSQRVENVSSSLPRACNSLNQSNLIEIQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNNQSRSPFQN
        GL+SQ+VEN SSSLP++CNS NQSN +E++SP                                    VYLPHLFFQATKGSSL ERSKH+ QSRSP QN
Subjt:  GLNSQRVENVSSSLPRACNSLNQSNLIEIQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNNQSRSPFQN

Query:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN
        WLP G EGSR  TLARPDFSS++DA+ QPAEF T EKSI+ERVNCN+++PVS   EGIQH RD   GPLE+EC VQKMYGY TT LQD +SEFDVDEHFN
Subjt:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN

Query:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED SRMEQAVNNACRAQL SEAIQMETG PIA  ERFLHLSSPVI+Q P   SS+I PRNLPGDVIPCSNET +ISLGCLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK
        A G ENSNGFG  NSAFRAYFVPFLSAVQLFKS KTH GT T P+GF+SCVSDIKVKEPSTCHLPIFS+LFPKPCTDD SV RVCNQ HSSEQHLASEKK
Subjt:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPT+LNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL HF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+     TFT  LNPPRIL+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

XP_038894653.1 uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 84.79
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK
        MQCA + SSDFQK LDK KESLELRLEEN CSRGIKDSKVSSF WRNFF YRCAVISFLTVESDGLWRIVALP  Y+DS+DVSCLPQMN+F AERKLV++
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK

Query:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPAS GTYSFNSFRCRSLLESN KLLDS+AI+SS+KSSGKFSC SSCS SALM SDSSAIS IP   AK+QRYGKKNPRKKAKKKEIE KKISS+FV AE
Subjt:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TEVSSKDSA GSFL +ACGSNDSDC D SVLCSIA+E FLPDFRASK+ FERD+++IIQPLG  DSIS +I DE+ASEV SSAIKN+S YY+VCGS NQA
Subjt:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKANR
        LIKVPGC HV+GGVNSRERLFA SC DFC K SLDNNSP  KC SL+S  DNFNLKL EKKGFGVDLL++RSSPS+ENY   N+VRD VDVNA+VE+AN 
Subjt:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKANR

Query:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTGN
        GI+  TVSET SVLPGKKTKQNKKL GS+RMNRYGG  SSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGIC P VGVQMPKVKDK+TGN
Subjt:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTGN

Query:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISDG
        RKQLKEKFPRRLKRKNTSGQEK+Y PTRNSCGSNTSSMVHK PN+ LDIR M FDIRRSS DPRSRFQNDTTDKCTTSES ESTQ CL GL+S+KLIS+G
Subjt:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISDG

Query:  LNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEK
        LNSQ+VEN SSS PR+C+SLNQSN +E+QSPVYLPHLFFQATKGSSLAERS HNNQ R P QNWLP G EG  LTTLARPDFSS+KDASMQP    TSEK
Subjt:  LNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEK

Query:  SIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIA
        SIQERVNCN+++PVSV  EGIQHSRDG+HGPLE+ECEVQKM+GY TT LQD + EFDVDEHF+ KSS ED+SRMEQAVNNACRAQLVSEAIQ+ETGSPIA
Subjt:  SIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIA

Query:  MVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTH
          ERFLHLSSPVINQ PKLR+SEI PRNLPGDV+PCSNET +ISLGCLWQWYEKHGSYGLEIKANG+ENSNGFG +NSAFRAYFVPFLSA+QLFKS KTH
Subjt:  MVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTH

Query:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI
         GTTT PVGFDSCV+DIKVKEPSTC LPIFSVLFPKPCTDDASV RVC+Q HSSEQHLASEK+K S QSVN+KLSGESELIFEYFEGEQPQQRRPLFDKI
Subjt:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI

Query:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
        HQLVEGDG  QGKIYGDPTMLNS+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL HFVSRT QSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
Subjt:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR

Query:  NIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL
        N  PTFTPGLNPPRILEERLR+LEETASLMARAVVKKGNLNSEN HPDYEFFLSRRL
Subjt:  NIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL

XP_038894656.1 uncharacterized protein LOC120083142 isoform X2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 82.02
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK
        MQCA + SSDFQK LDK KESLELRLEEN CSRGIKDSKVSSF WRNFF YRCAVISFLTVESDGLWRIVALP  Y+DS+DVSCLPQMN+F AERKLV++
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK

Query:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPAS GTYSFNSFRCRSLLESN KLLDS+AI+SS+KSSGKFSC SSCS SALM SDSSAIS IP   AK+QRYGKKNPRKKAKKKEIE KKISS+FV AE
Subjt:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TEVSSKDSA GSFL +ACGSNDSDC D SVLCSIA+E FLPDFRASK+ FERD+++IIQPLG  DSIS +I DE+ASEV SSAIKN+S YY+VCGS NQA
Subjt:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKANR
        LIKVPGC HV+GGVNSRERLFA SC DFC K SLDNNSP  KC SL+S  DNFNLKL EKKGFGVDLL++RSSPS+ENY   N+VRD VDVNA+VE+AN 
Subjt:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKANR

Query:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTGN
        GI+  TVSET SVLPGKKTKQNKKL GS+RMNRYGG  SSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGIC P VGVQMPKVKDK+TGN
Subjt:  GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTGN

Query:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISDG
        RKQLKEKFPRRLKRKNTSGQEK+Y PTRNSCGSNTSSMVHK PN+ LDIR M FDIRRSS DPRSRFQNDTTDKCTTSES ESTQ CL GL+S+KLIS+G
Subjt:  RKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISDG

Query:  LNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEK
        LNSQ+VEN SSS PR+C+SLNQSN +E+QSPVYLPHLFFQATKGSSLAERS HNNQ R P QNWLP G EG  LTTLARPDFSS+KDASMQP    TSEK
Subjt:  LNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEK

Query:  SIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIA
        SIQERVNCN+++PVSV  EGIQHSRDG+HGPLE+ECEVQKM+GY TT LQD + EFDVDEHF+ KSS ED+SRMEQAVNNACRAQLVSEAIQ+ETGSPIA
Subjt:  SIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIA

Query:  MVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTH
          ERFLHLSSPVINQ PKLR+SEI PRNLPGDV+PCSNET +ISLGCLWQWYEKHGSYGLEIKANG+ENSNGFG +NSAFRAYFVPFLSA+QLFKS KTH
Subjt:  MVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTH

Query:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI
         GTTT PVGFDSCV+DIKVKEPSTC LPIFSVLFPKPCTDDASV RVC+Q HSSEQHLASEK+K S QSVN+KLSGESELIFEYFEGEQPQQRRPLFDK 
Subjt:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI

Query:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
                                          YSVAWYPIYRIPDGNLRAAFLTYHSL HFVSRT QSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR
Subjt:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPR

Query:  NIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL
        N  PTFTPGLNPPRILEERLR+LEETASLMARAVVKKGNLNSEN HPDYEFFLSRRL
Subjt:  NIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL
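
In the alignment blocks above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch of how a percent identity could be recomputed from one Query row and its middle row (the helper name `midline_identity` and the toy strings are hypothetical; the values reported on this page cover each full alignment, not a single displayed row):

```python
def midline_identity(query_row: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue."""
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment row")
    # Trailing blanks of the midline are often lost in copied text, so pad it.
    padded = midline.ljust(columns)[:columns]
    # Letters count as identities; '+' (similar) and ' ' (mismatch/gap) do not.
    identical = sum(1 for symbol in padded if symbol not in (" ", "+"))
    return 100.0 * identical / columns


# Toy rows for illustration only, not taken from the hits listed here.
toy_query = "MDPQ-EPV"
toy_midline = "MDP  E+V"
toy_identity = midline_identity(toy_query, toy_midline)
print(toy_identity)
```

Gap columns are counted in the denominator here, which matches how identity is normally reported over the alignment length.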

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LT77 Uncharacterized protein | e value: 0.0e+00 | % identity: 78.49
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR
        MQC LV SSDFQK LDKGKESLELRLE+NSCSRGI  DSKVSSF WRNFFDYR A+IS LT+ESDGLWRIVALPP Y+DSL++SCLPQMN+F A RKLV+
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIK-DSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR

Query:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KGPASNGTYSFNS RCRSLLESN KLLDS+AI+S  +SSGKF C SSCSGSALM SDS AIS IP+ GAK+QRYGKKNPRKKAKKKEIECK ISSDFV A
Subjt:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFL EACGSNDSD RD SVLCSIA+ETFLP       DFE+D+  +IQPLG  DS+SS+I D H+S+V S AIKNFSGYY+VCGSENQ
Subjt:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN
        ALI VPGC HV+ G+NSRER  A SCNDFC K  LDN S   K  SL+  CD+ NLKLNEK+GFGVDLLE+RSSPS+      NS RDEVD+NA+VEKAN
Subjt:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG
         GI+GCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR++SGGC EQLDQVSPISKQFKGIC PVVGVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD
        N+KQLKEK PRRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNE+LD+R M FDIRRSSGDPRS FQND+TDKCT SES ES Q  LD L+S+KLI+D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD

Query:  GLNSQRVENVSSSLPRACNSLNQSNLIEIQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNNQSRSPFQN
        GL+SQ+VEN SSSLP++CNS NQSN +E++SP                                    VYLPHLFFQATKGSSL ERSKH+ QSRSP QN
Subjt:  GLNSQRVENVSSSLPRACNSLNQSNLIEIQSP------------------------------------VYLPHLFFQATKGSSLAERSKHNNQSRSPFQN

Query:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN
        WLP G EGSR  TLARPDFSS++DA+ QPAEF T EKSI+ERVNCN+++PVS   EGIQH RD   GPLE+EC VQKMYGY TT LQD +SEFDVDEHFN
Subjt:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN

Query:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED SRMEQAVNNACRAQL SEAIQMETG PIA  ERFLHLSSPVI+Q P   SS+I PRNLPGDVIPCSNET +ISLGCLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK
        A G ENSNGFG  NSAFRAYFVPFLSAVQLFKS KTH GT T P+GF+SCVSDIKVKEPSTCHLPIFS+LFPKPCTDD SV RVCNQ HSSEQHLASEKK
Subjt:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPT+LNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL HF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+     TFT  LNPPRIL+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNI--APTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A5D3BH03 Uncharacterized protein | e value: 0.0e+00 | % identity: 78.21
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR
        MQCALVRSSDFQK LDKGKESL+LRLE+NSCSRGI KD +VSSF WRNFFDYRCAVI FLT+ESDGLWRIVALPP Y+DSL+VSCLPQMN+F A RKLV+
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGI-KDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALPP-YVDSLDVSCLPQMNRFAAERKLVR

Query:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KG ASNGTYSFNS RCRSLLESN KLLDS+AI+S NKSSGK  C SSCS SALM SDS A S IPI GAK+QRYGKKNPRKKAKKKE+E KKISS+FV A
Subjt:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVS +DSAR SFL EACGSNDSD R+ +VLCSIA ETFLP       DFERD++  IQPLG  DS+SS+I D H+S+V SSAIKNFSGY++VCGSENQ
Subjt:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN
        AL   PGC HV+ G+NSRE L A SCNDFC   SLDNNS   K  SL+S CD+ NLKLNEKKGFGVDLLE+RSSP REN  S NS RDEVD+N +VEK  
Subjt:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRDEVDVNAKVEKAN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG
         GIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGG GSSQRRTGKENRHTVWQKVQR+NSGGC EQLDQVSPISKQFKGIC PV GVQMPKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD
        NRKQLKEK  RRLKRKNTSGQEK+YRPTRNSCGSNTSSMVHKPPNERLDIR M FDIRRSSG+PRSRFQNDTTDKC  SE+ E  Q   D L S+KLI D
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD

Query:  GLNSQRVENVSSSLPRACNSLNQ------------------------------------SNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQN
        GL+SQ+VEN SSSLP++CNS NQ                                    SN +E++SPVYLPHLFFQATKGSSLAERSKH  QSRSP QN
Subjt:  GLNSQRVENVSSSLPRACNSLNQ------------------------------------SNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQN

Query:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN
        WLP G EGSR TTLARPDFSS++DA+ QPAEF TSEKSI+ERVNC++++PVS   EGIQH RD  HG LE+ECEVQK+YG+ TT LQ+ + EF+VDEHFN
Subjt:  WLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFN

Query:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK
         KSSCED SRMEQAVNNAC+AQL SEAIQMETG PIA  ERFLHLSSPVI+Q PKLRSSEI PRNLPGDVIPCSNET +ISL CLWQWYEKHGSYGLEIK
Subjt:  SKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK

Query:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK
        A  +ENSNGFG  NSAFRAYFVPFLSA+QLFKS KTH GTTT P+GFDSCVSDIKVKEPSTCHLPIFS+LFP+P TDD SV RVCN+ HSSEQ LASEK+
Subjt:  ANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKK

Query:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF
        KSS QS +L+LSGESELIFEYFEGEQPQ RRPLFDKIHQLVEGDG LQGKIYGDPTMLNS+TL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL HF
Subjt:  KSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHF

Query:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        VSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR    TFT  LNPPR+L+ERLR+LEETASLMARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  VSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A6J1C5T5 uncharacterized protein LOC111008718 | e value: 0.0e+00 | % identity: 73.67
Query:  MQCALVRS-SDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVR
        MQCAL R  SD QK  DKGKE LE+R +E++CSR IKDS+VSS  WRNFFDYRCAV+SFLT+ESDG W+IVA P  Y+D L  SCLPQMN+FAAERKLV+
Subjt:  MQCALVRS-SDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVR

Query:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA
        KGPASNGTYS NSFRCRSLLESN KLLDS+AI+S N+ SGKFSCRSSCS SAL+ SDSSAIS IPI GAK+ RYGKKNPRKKAKKK IECKKIS DFVCA
Subjt:  KGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCA

Query:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ
        ETEVSS+DSARGS LLEACG+ND +  DGSV CS A+ETFLPD RASK+ F+ ++++IIQPLG   SISS+  +  AS+V  SA +N SG Y VCGSENQ
Subjt:  ETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQ

Query:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKA
         L+KV GC+H +GGV+ RERLF   C DF  KG  DNNS   +C S +S  D  NLKLNEK+ FGV LLE+++SPSRENY S H SVRDEVDVNA+VE+A
Subjt:  ALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKA

Query:  NRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKT
          GIQGCT SET  VLPGKKTKQNKKLTGSS++NR+G  G+SQRRTGKEN HTVWQKVQ+NNSGGCC QLDQVSPI KQFKG C P VGVQ+PKVKD+KT
Subjt:  NRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKT

Query:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLIS
        GNRKQLK+K  R+L+RKNTS Q+K+YRP ++  G+NTSSMV K PNERLDI  M FDIRR +   +S+ QND T KC TSES ESTQ CLDGLMS +L+S
Subjt:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLIS

Query:  DGLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFF----QATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTTLARPDFSSVKDASMQPAE
        DGLNSQRVEN  SS  R+CNSL+QSNL+E+ SP+YLPHLFF    Q T+GSSLAE SKHNN SRSP QNW+P G EGSRLTTLA PD SS+K  +  PAE
Subjt:  DGLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFF----QATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTTLARPDFSSVKDASMQPAE

Query:  FSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQME
          TSE+SIQERV C++ DPVSV TE  + SRDG+HGPLE+ECEVQKM  +  T LQD   E D+DEHFN KSSCED+S+MEQAVNNACR QL SEA+QME
Subjt:  FSTSEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQME

Query:  TGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLF
        TG PIA  E FLHLSSPVI+Q PKL+S +I PRNL GD I CS+E  +ISLGCLWQWYEKHGSYGLEIKA G EN+N F  +NSAF AYFVPFLSAVQLF
Subjt:  TGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLF

Query:  KSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRR
        KS KTH GTT +P G DSCV +IK+KEPSTCHLPIFSVLFPKP TDDAS+  V +Q HSSEQ LASEK K S QSV+LKLSGESEL+FEYFE E PQQRR
Subjt:  KSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRR

Query:  PLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNE
        PLFDKI QLV GDGRLQGKIYGDPTMLNS+TLNDLHA SWYSVAWYPIYRIPDGNLRAAFLTYHSL HFV RTSQ NS DT+SCLVCPVVGLQSYNAQNE
Subjt:  PLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNE

Query:  CWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        CWFEPRN    F   ++PP ILEERLR+LEETASLMARA+VKKGNLNSEN HPDYEFFLSRR
Subjt:  CWFEPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A6J1GS60 uncharacterized protein LOC111457006  0.0e+00  72.39
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK
        MQCAL +SS+FQK  DKGK+ LE++++E++CSR IKDS+VSSF WRNFFDYR AVIS LT+ESDGLWRIVALP   +DSL VSCLPQMN+F A+RKLV  
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK

Query:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPASNGTYS NSFRCRSLLESN  LLDS+A +SSNK+S KFS RSSCS SAL+  DSSAIS IPI  AK+QRYGKKN RKKAKK++IECKK SSDFV AE
Subjt:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TE+SS+DSARGS LLEACG+N SDCRDG VLCS ARETF  D RASK+DF+RD+++IIQPLG TDSISS+I +  ASEVP SA KN SG Y    SENQ 
Subjt:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKAN
        LIK PGCT  +G V+ +ERLF   CNDFC K S DNNSP        S CD+  LKL E +GFG+DLLE ++SPSREN  S HNS+RDEVDVNA+ EKAN
Subjt:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKAN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVS-PISKQFKGICTPVVGVQMPKVKDKKT
         GIQGCT SET  +LPGKKTKQNKKL+G+SR NR+GG GSSQR TGKEN  TVWQKVQ+NNSGGCC QLDQVS P+SKQ KG+C P VGVQ PKVKDKKT
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVS-PISKQFKGICTPVVGVQMPKVKDKKT

Query:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLIS
        GNRKQLK+KF +RLK KNTS Q+K+YRP+++S GSNT+SM H  PNERLDI  M FDI +SSG  R+ FQND+TDKCTTSESSESTQ CLDG MS KLIS
Subjt:  GNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLIS

Query:  DGLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTT-LARPDFSSVKDASMQPAEFST
        DGLN+QRVEN SS+   +C+SLNQSN ++ QSPVY+PHLFFQATKGSSLAERSKH+NQSRSP QNW+P   EGSRLTT LARPDFSS+KDA+ QPAEF  
Subjt:  DGLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTT-LARPDFSSVKDASMQPAEFST

Query:  SEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGS
        SEKSIQE V+CN++DPVS   E IQHSRD +H PLE ECE Q+ +G+ T ALQD   E DVDEHFN KS+C D++++EQ VN+AC+AQL  +A+      
Subjt:  SEKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGS

Query:  PIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSC
         IA  ERFLHLSSPVI+Q P LRS +I  +N  GD IPCS+ETA+ISL CLWQWYEKHGSYGLE+KANG+E SNGFG +NS F AYFVPFLSAVQLFKS 
Subjt:  PIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSC

Query:  KTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLF
        KTH G TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTDDA+V + C+QLHSSE+ LASEK+  S QSV+  LSGESELIFEYFE EQPQQRRPLF
Subjt:  KTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLF

Query:  DKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWF
        DKI QLV+GDG L+GKIYGDPT+L S+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL HFV RTSQS+S +T+SC+VCPVVGLQS+NAQNECWF
Subjt:  DKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWF

Query:  EPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        +PRN    F    NPP +++ERLR+LEETASLMARAVVKKGNLN+ N HPDYEFFLSRR
Subjt:  EPRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

A0A6J1K4L4 uncharacterized protein LOC111490028  0.0e+00  72.28
Query:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK
        MQCAL +SS+FQK  DKGK+ LE++++E++CSR IKDS+VSSF WRNFFDYR AVIS LT+ESDGLWRIVALP   +DSL VSCLPQMN+F A+RKLV  
Subjt:  MQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVALP-PYVDSLDVSCLPQMNRFAAERKLVRK

Query:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE
        GPASNGTYS NSFRCRSLLESN  LLDS+A +SSNK+S KFS RSSCS SAL+  DSSAIS IPI   K+QRYGKKN RKKAKK++IECKK SSDFV AE
Subjt:  GPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKNPRKKAKKKEIECKKISSDFVCAE

Query:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA
        TEVSS+DSAR S LLE  G+N SDCRDGSVLCS ARETF  D RASK+DF+RD+++IIQPLG TDSISS+I +  ASE+P SA KN  G Y   GSENQ 
Subjt:  TEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSSAIKNFSGYYQVCGSENQA

Query:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKAN
        LIK PGCT  +G V+ +ERLF   CNDFC K S DNNSP        S CD+  LKL E +GFG+DLLE ++SPSREN  S HNSVRD VDVNA+ EKAN
Subjt:  LIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYS-HNSVRDEVDVNAKVEKAN

Query:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG
         GIQGCT SETC +LPGKKTKQNKKL+G+SR NR+GG GSSQR TGKEN  TVWQKVQ+NNSGGCC QLDQVSPISKQ KGIC P VGVQ PKVKDKKTG
Subjt:  RGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPKVKDKKTG

Query:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD
        NRKQLK+KF +RLK KN+S Q+K+YRP+++S GSNT+SM H  PNERL I  M FD+ +SS   R+ FQND+TDK  TSESSESTQ CLDG MS KLISD
Subjt:  NRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLISD

Query:  GLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTT-LARPDFSSVKDASMQPAEFSTS
        GLN+QRVEN SS+   +C+S+NQSN ++ QSPVY+PHLFFQATKGSSLAERSKH+NQSRSP QNW+P   EGSRLTT LARPDFSS+KDA+ QPAEF  S
Subjt:  GLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTT-LARPDFSSVKDASMQPAEFSTS

Query:  EKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSP
        EKSIQE VNCN++DPVS   E IQHSRDG+H PLE ECE Q+ +G+ T ALQD R E DVDEHFN K++C D++R+EQ VN+AC+AQL  +A+       
Subjt:  EKSIQERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSP

Query:  IAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCK
        IA  ERFLHLSSPVI+Q P LRS EI  +N  GDVIPCS+ETA+ISLGCLWQWYEKHGSYGLE+KANG+E SNGFG +NS F AYFVPFLSAVQLFKS K
Subjt:  IAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCK

Query:  THGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFD
        TH G TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTD+A+V + C+QLHSSE+ LASEK+  S QSV+  LSGESELIFEYFE EQPQQRRPLFD
Subjt:  THGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFD

Query:  KIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFE
        KI QLV+GDG L+GKIYGDPT+L S+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL HFV RTSQS+S +T+SC+VCPVVGLQS+NAQNECWF+
Subjt:  KIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFE

Query:  PRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR
        PR    TF    NPP +++ERLR+LEETASL+ARAVVKKGNLNS N HPDYEFFLSRR
Subjt:  PRNIAPTFTPGLNPPRILEERLRSLEETASLMARAVVKKGNLNSENMHPDYEFFLSRR

SwissProt top hits  e value  % identity  Alignment
P53803 DNA-directed RNA polymerases I, II, and III subunit RPABC4  4.0e-09  61.9
Query:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Subjt:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Q3ZBC0 DNA-directed RNA polymerases I, II, and III subunit RPABC4  4.0e-09  61.9
Query:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Subjt:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Q63871 DNA-directed RNA polymerases I, II, and III subunit RPABC4  4.0e-09  61.9
Query:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Subjt:  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Q9C8M4 DNA-directed RNA polymerase subunit 12-like protein  3.6e-10  73.17
Query:  DPQPEP-VSYICGDCGMENTLKQGDVIQCRECGYRILYKKR
        D QPE  V Y+CGDCG EN LK+GDV QCR+CG+RILYKKR
Subjt:  DPQPEP-VSYICGDCGMENTLKQGDVIQCRECGYRILYKKR

Q9FLM8 DNA-directed RNA polymerases II, IV and V subunit 12  3.0e-17  88.64
Query:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Subjt:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR

Arabidopsis top hits  e value  % identity  Alignment
AT1G03610.1 Protein of unknown function (DUF789)  1.9e-14  27.33
Query:  VERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETAD-ISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTH
        ++RFLH  +P++     L  +EI  R L     P   +  +   L  LW  Y++  +YG  +  +         TN  +   Y+VP+LSA+Q+F S    
Subjt:  VERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETAD-ISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTH

Query:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI
                   S +   +  E   C           P +D  S   V      SE+ L +         +         L  +YFE   P  R PL DKI
Subjt:  GGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKI

Query:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSLRHFVSRTSQSNSPDTN-----------SCLVCPVVG
        ++L +   R  G        L SL   DL   SW SVAWYPIY IP G    +L   FLTYH+L    S + Q   P+ N             +     G
Subjt:  HQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSLRHFVSRTSQSNSPDTN-----------SCLVCPVVG

Query:  LQSYNAQNECW
        + +Y  Q + W
Subjt:  LQSYNAQNECW

AT1G15030.1 Protein of unknown function (DUF789)  3.8e-15  25.26
Query:  MYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNET
        M G G    Q  R++ DV         C  S   ++   +A     VSEA         + VERFL   +P +   P    S+   R   G  +   ++ 
Subjt:  MYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVINQTPKLRSSEIYPRNLPGDVIPCSNET

Query:  ADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTD
            LG +W+ + +  +YG+ +             N      Y+VP LS +Q++                D+  S ++ +         F     +  + 
Subjt:  ADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTD

Query:  DASVSRVCNQLHSSEQHLASEKKKSSVQSVNLK---------LSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHA
        + S S     L  S++ +++   K S++  + +         LS +  LIFEY E + P  R P  DK+  L      L+           +L   DL  
Subjt:  DASVSRVCNQLHSSEQHLASEKKKSSVQSVNLK---------LSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHA

Query:  GSWYSVAWYPIYRIPDG----NLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLV-----------CPVVGLQSYNAQNECW
         SW+SVAWYPIY+IP G    +L A FLTYHSL        Q     T S  V            PV GL SY  +   W
Subjt:  GSWYSVAWYPIYRIPDG----NLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLV-----------CPVVGLQSYNAQNECW

AT2G01260.1 Protein of unknown function (DUF789)  2.9e-15  27.59
Query:  LGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASV
        LG +W  + +  +YG  +             N      Y+VP LSA+Q++        +       DS  SD +                    +D   V
Subjt:  LGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASV

Query:  SRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYR
        S   + +   +QH     ++ S       L  +  L+FEY E + P  R P  DK+  L      L            +L   DL   SW+SVAWYPIYR
Subjt:  SRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYR

Query:  IPDG----NLRAAFLTYHSLR-HFVSRTSQSN----SPDTNSCLVCPVVGLQSYNAQNECW
        IP G    +L A FLTYHSL   F    S+ +     P  +  +  PV GL SY  +   W
Subjt:  IPDG----NLRAAFLTYHSLR-HFVSRTSQSN----SPDTNSCLVCPVVGLQSYNAQNECW

AT4G16100.1 Protein of unknown function (DUF789)  1.4e-17  28.22
Query:  EQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVIN-QTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK--ANGYENSN
        E+   + C       +    TG+  + + RFL  ++P+++ Q   L SS+ +    P              L  LW  +E+  +YG+ +    NG +   
Subjt:  EQAVNNACRAQLVSEAIQMETGSPIAMVERFLHLSSPVIN-QTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIK--ANGYENSN

Query:  GFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKK---SSVQ
               +   Y+VP+LS +QL++          DP    +C +  +V E S           P+  + D   S  C +L  +    + E+K    SS  
Subjt:  GFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKK---SSVQ

Query:  SVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSL----
              +   EL+FEY EG  P  R PL DKI  L                 L +    DL   SW SVAWYPIYRIP G    NL A FLT+HSL    
Subjt:  SVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGDPTMLNSLTLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSL----

Query:  RHFVSRTSQSNSPDTNSC-LVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLE
        R   +   QS+S    S  L  P  GL SY  +   W    ++      G    R  EE LR L+
Subjt:  RHFVSRTSQSNSPDTNSC-LVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRSLE

AT5G41010.1 DNA directed RNA polymerase, 7 kDa subunit  2.1e-18  88.64
Query:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
        MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Subjt:  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR
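
The % identity values listed with each hit appear to follow the usual BLAST convention: identical positions divided by alignment length. The sketch below is only an illustration of that reading, not the database's own code. It recomputes the value from a Query line and the match line beneath it; the function name `percent_identity` is hypothetical. It assumes the match line repeats the residue letter at identical positions and uses `+` or a space elsewhere.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block from its Query and match lines."""
    # Trailing spaces of the match line are often trimmed in page dumps; pad them back.
    midline = midline.ljust(len(query))
    # A position counts as identical when the match line repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # The alignment length includes gap columns ('-') in the query line.
    return round(100.0 * identical / len(query), 2)


# AT5G41010.1 hit: 39 identical positions over 44 aligned columns.
query = "MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR"
match = "MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR"
print(percent_identity(query, match))  # 88.64
```

For hits that span several blocks, the Query lines and the match lines would each need to be concatenated before calling the function.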


Sequences
CDS sequence
ATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGT
ATTCTCTACAAGAAGCGCACCCGTCGCAAATCTATTCTACTCAACCTATGGCGATGGTTGACCAGCAAAAGTATTTCTTGGTTGTTCAGTATGAGGCCCGCTGAA
ATGCTTATTCTGGCGATTGACGTTCTAACTGGATGGTTCACTTTTGCATTTCGATATCAGCATTTGATTCCTATTCAAAATTTTTTTGTATTTTTCCTGAGGGTG
AAAGAAGAAGATAAAAGCTATTGTCCACTGGAAAAACAAGCTTTGCTCCTTGTAATTTTCCTTCGATTCTCTCCCGCTAATGGAAATGGGTTACAGAAAACGATG
CAGTGTGCTCTTGTAAGAAGTAGTGATTTTCAGAAAGGTCTAGACAAAGGAAAGGAGTCATTGGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAG
GATTCTAAAGTTTCTTCGTTTGGCTGGAGGAACTTTTTTGATTACAGATGTGCCGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCA
CTACCACCATACGTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCGGTTTGCAGCTGAGAGAAAATTGGTCCGGAAAGGCCCTGCCTCTAATGGTACA
TATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCGAACAATAAGTTATTGGATAGTGAAGCAATTGAGTCATCAAATAAATCCTCTGGCAAGTTCTCT
TGCAGGAGTTCATGTTCTGGCTCTGCTTTGATGTTAAGTGACTCTAGTGCAATCTCTGCCATCCCAATTCGTGGAGCTAAATTGCAGAGATATGGGAAGAAAAAT
CCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGATATCTTCTGATTTTGTCTGTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCGTGGAAGT
TTTTTGTTAGAAGCTTGTGGCAGTAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACGAGAAACTTTTCTGCCAGATTTTAGGGCCAGTAAA
AGTGATTTTGAACGAGATACTAAGAAAATTATTCAGCCACTTGGAATCACAGATTCTATATCCTCTAAAATTGATGACGAGCATGCATCTGAGGTTCCATCTTCT
GCAATAAAGAATTTTAGTGGGTATTATCAAGTTTGTGGATCTGAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGA
GAGAGGTTATTTGCCAGCAGCTGCAATGATTTTTGCCCTAAGGGTTCTTTGGATAATAATTCCCCATATCCTAAGTGTTTTAGTTTAAGCAGTACCTGTGATAAT
TTTAACTTGAAATTAAATGAAAAGAAAGGTTTTGGAGTTGATCTGTTGGAAAAACGAAGTTCACCTTCTAGAGAGAACTATTATTCTCATAACTCAGTAAGAGAT
GAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTATTCAGGGATGTACTGTGAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAAC
AAAAAATTGACCGGGAGTTCAAGAATGAATAGATATGGTGGTCCAGGGAGTTCACAAAGGCGTACAGGGAAGGAAAACAGACATACTGTCTGGCAAAAGGTTCAA
AGAAATAATAGTGGTGGATGTTGTGAACAGTTAGACCAAGTAAGTCCTATCAGCAAACAGTTTAAAGGCATCTGTACTCCTGTTGTTGGTGTGCAAATGCCAAAG
GTCAAGGATAAAAAAACCGGGAACAGAAAACAGCTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGGTCTATCGTCCTACT
AGGAACAGTTGTGGTAGTAATACTAGTTCAATGGTTCACAAACCACCAAATGAAAGGTTGGATATTCGACCAATGGACTTTGACATAAGAAGATCAAGTGGCGAT
CCAAGATCTCGTTTTCAAAATGATACTACTGATAAATGCACGACTTCTGAATCATCTGAAAGTACACAACCCTGTCTAGATGGATTGATGTCAAGCAAACTTATC
TCCGATGGTTTGAATAGTCAAAGAGTAGAGAATGTCTCTAGCTCATTGCCAAGGGCCTGCAACTCCTTAAATCAGTCAAACCTGATAGAGATTCAGTCTCCTGTT
TACCTTCCGCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAACGCAGCAAGCATAACAACCAATCTAGATCACCCTTTCAAAACTGGTTGCCAATC
GGGGGAGAAGGTTCCAGATTGACCACCTTGGCTAGACCTGATTTTTCATCTGTGAAAGATGCAAGTATGCAACCTGCTGAGTTTAGCACTTCAGAAAAATCAATT
CAAGAACGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGCCACTGAGGGGATTCAGCATTCTAGAGATGGGAGTCATGGTCCTTTAGAAAATGAATGTGAG
GTGCAGAAGATGTATGGTTATGGTACGACTGCACTACAGGATCTTAGGTCTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATTCGTCT
AGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCACAATTGGTATCTGAAGCTATTCAAATGGAAACTGGTAGTCCAATCGCAATGGTCGAAAGATTCCTTCAT
TTGTCCTCCCCTGTTATCAACCAGACACCCAAGCTAAGAAGTAGTGAAATTTACCCAAGAAATTTGCCAGGTGATGTGATTCCGTGTAGCAATGAGACTGCTGAC
ATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAGCATGGCAGCTATGGCTTAGAAATAAAAGCGAATGGTTATGAAAATTCAAATGGTTTTGGCACTAATAAC
TCTGCATTCCGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCTGTAAAACTCATGGGGGAACAACTACAGATCCTGTGGGATTTGATTCA
TGTGTAAGCGATATAAAAGTGAAGGAGCCATCCACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTTCACGGGTT
TGTAATCAGCTACATAGTTCAGAGCAACATTTGGCCTCTGAGAAGAAGAAATCTTCAGTACAATCAGTCAACTTAAAATTATCTGGAGAATCAGAACTTATTTTT
GAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCATTATTTGATAAGATACATCAGCTGGTCGAGGGTGATGGACGTCTACAAGGAAAAATATACGGGGAT
CCGACCATGCTCAATTCCTTAACTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCT
GCATTTTTGACTTACCACTCACTAAGACATTTTGTTTCTAGAACTTCCCAATCTAACTCTCCAGATACAAATTCTTGCTTAGTCTGTCCAGTTGTGGGTCTTCAA
AGTTATAATGCACAGAATGAATGCTGGTTCGAGCCTAGAAACATTGCGCCCACATTTACCCCTGGCTTAAATCCTCCTAGAATTCTCGAGGAGCGCCTGAGGTCG
CTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCCGAAAACATGCATCCAGATTATGAGTTCTTCCTCTCCCGGCGACTC
TAG
mRNA sequence
GAAAAAGAAAACCCTACTACACGTTCCCTCTCCCCTTTTATTTTGATTGATTTTCTCCTCACCCATCATTGAAGCAGCCACAGCCAAAGACCTATACACGACTTT
GATTTCTCTCTCTGCTCTCGTATCTCCCTCGCAGCGGCAGGAAGAATTTTGAAGTTTGACTCATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATT
GTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCAAATCTATTCTACTCA
ACCTATGGCGATGGTTGACCAGCAAAAGTATTTCTTGGTTGTTCAGTATGAGGCCCGCTGAAATGCTTATTCTGGCGATTGACGTTCTAACTGGATGGTTCACTT
TTGCATTTCGATATCAGCATTTGATTCCTATTCAAAATTTTTTTGTATTTTTCCTGAGGGTGAAAGAAGAAGATAAAAGCTATTGTCCACTGGAAAAACAAGCTT
TGCTCCTTGTAATTTTCCTTCGATTCTCTCCCGCTAATGGAAATGGGTTACAGAAAACGATGCAGTGTGCTCTTGTAAGAAGTAGTGATTTTCAGAAAGGTCTAG
ACAAAGGAAAGGAGTCATTGGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGATTCTAAAGTTTCTTCGTTTGGCTGGAGGAACTTTTTTGATT
ACAGATGTGCCGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCATACGTAGATAGCTTGGATGTGAGCTGTCTGCCTC
AAATGAATCGGTTTGCAGCTGAGAGAAAATTGGTCCGGAAAGGCCCTGCCTCTAATGGTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCGA
ACAATAAGTTATTGGATAGTGAAGCAATTGAGTCATCAAATAAATCCTCTGGCAAGTTCTCTTGCAGGAGTTCATGTTCTGGCTCTGCTTTGATGTTAAGTGACT
CTAGTGCAATCTCTGCCATCCCAATTCGTGGAGCTAAATTGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGA
TATCTTCTGATTTTGTCTGTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCGTGGAAGTTTTTTGTTAGAAGCTTGTGGCAGTAATGATTCAGATTGTAGAG
ATGGATCTGTTTTGTGTTCGATTGCACGAGAAACTTTTCTGCCAGATTTTAGGGCCAGTAAAAGTGATTTTGAACGAGATACTAAGAAAATTATTCAGCCACTTG
GAATCACAGATTCTATATCCTCTAAAATTGATGACGAGCATGCATCTGAGGTTCCATCTTCTGCAATAAAGAATTTTAGTGGGTATTATCAAGTTTGTGGATCTG
AAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCCAGCAGCTGCAATGATTTTTGCCCTAAGG
GTTCTTTGGATAATAATTCCCCATATCCTAAGTGTTTTAGTTTAAGCAGTACCTGTGATAATTTTAACTTGAAATTAAATGAAAAGAAAGGTTTTGGAGTTGATC
TGTTGGAAAAACGAAGTTCACCTTCTAGAGAGAACTATTATTCTCATAACTCAGTAAGAGATGAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTA
TTCAGGGATGTACTGTGAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAACAAAAAATTGACCGGGAGTTCAAGAATGAATAGATATGGTGGTC
CAGGGAGTTCACAAAGGCGTACAGGGAAGGAAAACAGACATACTGTCTGGCAAAAGGTTCAAAGAAATAATAGTGGTGGATGTTGTGAACAGTTAGACCAAGTAA
GTCCTATCAGCAAACAGTTTAAAGGCATCTGTACTCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACCGGGAACAGAAAACAGCTGAAAGAAA
AATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGGTCTATCGTCCTACTAGGAACAGTTGTGGTAGTAATACTAGTTCAATGGTTCACAAAC
CACCAAATGAAAGGTTGGATATTCGACCAATGGACTTTGACATAAGAAGATCAAGTGGCGATCCAAGATCTCGTTTTCAAAATGATACTACTGATAAATGCACGA
CTTCTGAATCATCTGAAAGTACACAACCCTGTCTAGATGGATTGATGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTCAAAGAGTAGAGAATGTCTCTAGCT
CATTGCCAAGGGCCTGCAACTCCTTAAATCAGTCAAACCTGATAGAGATTCAGTCTCCTGTTTACCTTCCGCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTC
TTGCTGAACGCAGCAAGCATAACAACCAATCTAGATCACCCTTTCAAAACTGGTTGCCAATCGGGGGAGAAGGTTCCAGATTGACCACCTTGGCTAGACCTGATT
TTTCATCTGTGAAAGATGCAAGTATGCAACCTGCTGAGTTTAGCACTTCAGAAAAATCAATTCAAGAACGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTG
CCACTGAGGGGATTCAGCATTCTAGAGATGGGAGTCATGGTCCTTTAGAAAATGAATGTGAGGTGCAGAAGATGTATGGTTATGGTACGACTGCACTACAGGATC
TTAGGTCTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATTCGTCTAGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCACAATTGG
TATCTGAAGCTATTCAAATGGAAACTGGTAGTCCAATCGCAATGGTCGAAAGATTCCTTCATTTGTCCTCCCCTGTTATCAACCAGACACCCAAGCTAAGAAGTA
GTGAAATTTACCCAAGAAATTTGCCAGGTGATGTGATTCCGTGTAGCAATGAGACTGCTGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAGCATGGCA
GCTATGGCTTAGAAATAAAAGCGAATGGTTATGAAAATTCAAATGGTTTTGGCACTAATAACTCTGCATTCCGTGCATATTTTGTTCCATTTCTTTCAGCTGTTC
AACTATTCAAGAGCTGTAAAACTCATGGGGGAACAACTACAGATCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCATCCACTTGTCATC
TTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTTCACGGGTTTGTAATCAGCTACATAGTTCAGAGCAACATTTGGCCTCTGAGA
AGAAGAAATCTTCAGTACAATCAGTCAACTTAAAATTATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCATTAT
TTGATAAGATACATCAGCTGGTCGAGGGTGATGGACGTCTACAAGGAAAAATATACGGGGATCCGACCATGCTCAATTCCTTAACTTTGAATGATCTGCATGCTG
GATCATGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAAGACATTTTGTTTCTAGAA
CTTCCCAATCTAACTCTCCAGATACAAATTCTTGCTTAGTCTGTCCAGTTGTGGGTCTTCAAAGTTATAATGCACAGAATGAATGCTGGTTCGAGCCTAGAAACA
TTGCGCCCACATTTACCCCTGGCTTAAATCCTCCTAGAATTCTCGAGGAGCGCCTGAGGTCGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGA
AAGGAAATCTGAACTCCGAAAACATGCATCCAGATTATGAGTTCTTCCTCTCCCGGCGACTCTAGTTACATAACCAGGTTTTATGCTTAGAACTTATCAAGGAGA
TTCTTTCCTTTTGTCAATATTCTTGAGAATTTAGTTTAGGTTAGGACCTAATCTCAAGTAGGGATGCAAGAATAACTGATCTTGGTCTGTAATGTATTTACACAA
TGTTTTTTTTTGTTCCTTTTTTATCTTTCTATGCTAACTCTTTCAGGCATTGCAACTATATTGATTTGTACATAAATTACTTCTCATGTAAAAATGAAATATCTC
CTGGAT
Protein sequence
MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRKSILLNLWRWLTSKSISWLFSMRPAEMLILAIDVLTGWFTFAFRYQHLIPIQNFFVFFLRV
KEEDKSYCPLEKQALLLVIFLRFSPANGNGLQKTMQCALVRSSDFQKGLDKGKESLELRLEENSCSRGIKDSKVSSFGWRNFFDYRCAVISFLTVESDGLWRIVA
LPPYVDSLDVSCLPQMNRFAAERKLVRKGPASNGTYSFNSFRCRSLLESNNKLLDSEAIESSNKSSGKFSCRSSCSGSALMLSDSSAISAIPIRGAKLQRYGKKN
PRKKAKKKEIECKKISSDFVCAETEVSSKDSARGSFLLEACGSNDSDCRDGSVLCSIARETFLPDFRASKSDFERDTKKIIQPLGITDSISSKIDDEHASEVPSS
AIKNFSGYYQVCGSENQALIKVPGCTHVNGGVNSRERLFASSCNDFCPKGSLDNNSPYPKCFSLSSTCDNFNLKLNEKKGFGVDLLEKRSSPSRENYYSHNSVRD
EVDVNAKVEKANRGIQGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGPGSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKGICTPVVGVQMPK
VKDKKTGNRKQLKEKFPRRLKRKNTSGQEKVYRPTRNSCGSNTSSMVHKPPNERLDIRPMDFDIRRSSGDPRSRFQNDTTDKCTTSESSESTQPCLDGLMSSKLI
SDGLNSQRVENVSSSLPRACNSLNQSNLIEIQSPVYLPHLFFQATKGSSLAERSKHNNQSRSPFQNWLPIGGEGSRLTTLARPDFSSVKDASMQPAEFSTSEKSI
QERVNCNIVDPVSVATEGIQHSRDGSHGPLENECEVQKMYGYGTTALQDLRSEFDVDEHFNSKSSCEDSSRMEQAVNNACRAQLVSEAIQMETGSPIAMVERFLH
LSSPVINQTPKLRSSEIYPRNLPGDVIPCSNETADISLGCLWQWYEKHGSYGLEIKANGYENSNGFGTNNSAFRAYFVPFLSAVQLFKSCKTHGGTTTDPVGFDS
CVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVSRVCNQLHSSEQHLASEKKKSSVQSVNLKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRLQGKIYGD
PTMLNSLTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLRHFVSRTSQSNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNIAPTFTPGLNPPRILEERLRS
LEETASLMARAVVKKGNLNSENMHPDYEFFLSRRL
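
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is a minimal illustration, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database. Only the first and last codons of the CDS are used here; the full sequence can be passed the same way once its line breaks are removed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the reading frame
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGATCCTCAACCTGAACCAGTC"))  # MDPQPEPV (start of the protein)
print(translate("CGGCGACTCTAG"))              # RRL (end of the protein, then stop)
```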