; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026164 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026164
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTranscription initiation factor TFIID subunit 8, putative isoform 1
Genome locationtig00153031:2395207..2410448
RNA-Seq ExpressionSgr026164
SyntenySgr026164
Gene Ontology termsGO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR006565 - Bromodomain associated domain
IPR009072 - Histone-fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143440.1 uncharacterized protein LOC101223185 [Cucumis sativus]5.7e-29988.49Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKL+TLGVW+ WLGD +YSIFVPFL+STSTW+TFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS ST NPLSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKL+PNYLQLHGDDVYFTLEN SKDGV QRE  VSSNKASGKIQPK AS+AGPRSRES+IGDSSQRLKNELPETWY+QFIEKYRVKQPYRLSHGNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        +++KRTSEEMSSYLRLLEKHKKRRMVF DD LTNFGNSVSANASSSVFDFSNSVEDD NFFPEIMFTFNCVPESALPPPD+MKDNR+ E+ GVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        + RN AMMERLGVKPDY+S ERG NV+RAKSGSGG RKSLGQEQSFQMSQKV+ARMLMS+GFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF----QQQQQLVLEKMRRRHQA
        AVDLLRMFLKTMGYSNFGPLA++VKDGSRN+VRQS H  HGVQPQLQAQHQ+LLQVPQQVPRQMHPQMQQMV+SQ  AF    QQQQQ VLEKMRRR QA
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF----QQQQQLVLEKMRRRHQA

Query:  STPRAGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDS
        +TPRA ME NKDRPL+QVKVENTELPMDGNALNALN+RHPQLQFRQQQIAA+SNIHASPGNQFRQ+ S+QMPQIQTPNT VVRAPPVKVEGFQELMGGD+
Subjt:  STPRAGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDS

Query:  SSKHDSEEARLTSPSSK
        SSKHDSEEARLTSPSSK
Subjt:  SSKHDSEEARLTSPSSK

XP_008440528.1 PREDICTED: uncharacterized protein LOC103484926 isoform X1 [Cucumis melo]1.3e-30390.05Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGDF+YSIFVPFLSSTSTW+TFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS ST N LSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLEN SKDGV QRE  VSSNKASGKIQPK ASSAGPRSRES+IGDSSQRLKNELPE WY+QFIEKYRVKQPYRLSHGNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        ++DKRTSEEMSSYLRLLEKHKKRR VF DD LTNFGNSVSANASSSVFDFSNSVEDD NFFPEIMFTFNCVPESALPPPD+MKDNRK E+SGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        + RN AMMERLGVKPDY+S ERG NV+RAKSGSGG RKSLGQEQSFQMSQKV+ARMLMS+GFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR
        AVDLLRMFLKTMGYSNFGPLA++VKDGSRN+VRQS H  HGVQPQLQAQHQ+LLQVPQQVPRQMHPQMQQMVH+Q    QQQQQ VLEKMRRR QA+TPR
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR

Query:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH
        A ME NKDRPL+QVKVENTELPMDGNALNALN+RHPQLQFRQQQIAA+SNIHASPGNQFRQM S+QMPQIQTPNT VVRAPPVKVEGFQELMGGDSSSKH
Subjt:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH

Query:  DSEEARLTSPSSK
        DSEEARLTSPSSK
Subjt:  DSEEARLTSPSSK

XP_008440529.1 PREDICTED: uncharacterized protein LOC103484926 isoform X2 [Cucumis melo]4.7e-30189.72Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGDF+YSIFVPFLSSTSTW+TFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS ST N LSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLEN SKDGV QRE  VSSNKASGK  PK ASSAGPRSRES+IGDSSQRLKNELPE WY+QFIEKYRVKQPYRLSHGNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        ++DKRTSEEMSSYLRLLEKHKKRR VF DD LTNFGNSVSANASSSVFDFSNSVEDD NFFPEIMFTFNCVPESALPPPD+MKDNRK E+SGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        + RN AMMERLGVKPDY+S ERG NV+RAKSGSGG RKSLGQEQSFQMSQKV+ARMLMS+GFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR
        AVDLLRMFLKTMGYSNFGPLA++VKDGSRN+VRQS H  HGVQPQLQAQHQ+LLQVPQQVPRQMHPQMQQMVH+Q    QQQQQ VLEKMRRR QA+TPR
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR

Query:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH
        A ME NKDRPL+QVKVENTELPMDGNALNALN+RHPQLQFRQQQIAA+SNIHASPGNQFRQM S+QMPQIQTPNT VVRAPPVKVEGFQELMGGDSSSKH
Subjt:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH

Query:  DSEEARLTSPSSK
        DSEEARLTSPSSK
Subjt:  DSEEARLTSPSSK

XP_022133243.1 uncharacterized protein LOC111005886 isoform X1 [Momordica charantia]6.3e-29889.5Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGD NYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS STANPLSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLENLSKDGV QRE CV SNKASGKIQ K A SAGPRSRESEIGDSS RLKNELPETWYNQFIEKYRVKQPYRLS GNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        +SDKRT+EEMSSYLRLLEKHKKRRMVF DDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPD+MKDNRKLELSGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        M RNP MMERLGVKPD          YRAKSGSG  RK+LGQEQ FQMSQ VIA+MLM+MGFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF-QQQQQLVLEKMRRRHQASTP
        AVDLLRMFLKTMGYSNFGPLAELVKDGSRN  RQ+QHQVHGVQPQLQAQH  LLQV QQ+PRQMHPQMQQMVH QNLAF QQQQQL+LEKMRRR QA+TP
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF-QQQQQLVLEKMRRRHQASTP

Query:  R-AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQ----LQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG
        R AGME+NKDRPLVQVKVENTELPMDGNAL+ALNVR PQ    +QFRQQ IAA+SNIHASP NQFRQMSSLQMPQIQ+PN GVVRAPPVKVEGFQELMGG
Subjt:  R-AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQ----LQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG

Query:  DSSSKHDSEEARLTSPSSK
        DSSSKHDSEEARLTSPSSK
Subjt:  DSSSKHDSEEARLTSPSSK

XP_022133245.1 uncharacterized protein LOC111005886 isoform X2 [Momordica charantia]2.9e-29589.18Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGD NYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS STANPLSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLENLSKDGV QRE CV SNKASGK   K A SAGPRSRESEIGDSS RLKNELPETWYNQFIEKYRVKQPYRLS GNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        +SDKRT+EEMSSYLRLLEKHKKRRMVF DDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPD+MKDNRKLELSGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        M RNP MMERLGVKPD          YRAKSGSG  RK+LGQEQ FQMSQ VIA+MLM+MGFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF-QQQQQLVLEKMRRRHQASTP
        AVDLLRMFLKTMGYSNFGPLAELVKDGSRN  RQ+QHQVHGVQPQLQAQH  LLQV QQ+PRQMHPQMQQMVH QNLAF QQQQQL+LEKMRRR QA+TP
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF-QQQQQLVLEKMRRRHQASTP

Query:  R-AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQ----LQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG
        R AGME+NKDRPLVQVKVENTELPMDGNAL+ALNVR PQ    +QFRQQ IAA+SNIHASP NQFRQMSSLQMPQIQ+PN GVVRAPPVKVEGFQELMGG
Subjt:  R-AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQ----LQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG

Query:  DSSSKHDSEEARLTSPSSK
        DSSSKHDSEEARLTSPSSK
Subjt:  DSSSKHDSEEARLTSPSSK

TrEMBL top hitse value%identityAlignment
A0A0A0KJX7 BTP domain-containing protein2.8e-29988.49Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKL+TLGVW+ WLGD +YSIFVPFL+STSTW+TFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS ST NPLSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKL+PNYLQLHGDDVYFTLEN SKDGV QRE  VSSNKASGKIQPK AS+AGPRSRES+IGDSSQRLKNELPETWY+QFIEKYRVKQPYRLSHGNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        +++KRTSEEMSSYLRLLEKHKKRRMVF DD LTNFGNSVSANASSSVFDFSNSVEDD NFFPEIMFTFNCVPESALPPPD+MKDNR+ E+ GVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        + RN AMMERLGVKPDY+S ERG NV+RAKSGSGG RKSLGQEQSFQMSQKV+ARMLMS+GFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF----QQQQQLVLEKMRRRHQA
        AVDLLRMFLKTMGYSNFGPLA++VKDGSRN+VRQS H  HGVQPQLQAQHQ+LLQVPQQVPRQMHPQMQQMV+SQ  AF    QQQQQ VLEKMRRR QA
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF----QQQQQLVLEKMRRRHQA

Query:  STPRAGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDS
        +TPRA ME NKDRPL+QVKVENTELPMDGNALNALN+RHPQLQFRQQQIAA+SNIHASPGNQFRQ+ S+QMPQIQTPNT VVRAPPVKVEGFQELMGGD+
Subjt:  STPRAGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDS

Query:  SSKHDSEEARLTSPSSK
        SSKHDSEEARLTSPSSK
Subjt:  SSKHDSEEARLTSPSSK

A0A1S3B1B3 uncharacterized protein LOC103484926 isoform X16.4e-30490.05Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGDF+YSIFVPFLSSTSTW+TFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS ST N LSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLEN SKDGV QRE  VSSNKASGKIQPK ASSAGPRSRES+IGDSSQRLKNELPE WY+QFIEKYRVKQPYRLSHGNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        ++DKRTSEEMSSYLRLLEKHKKRR VF DD LTNFGNSVSANASSSVFDFSNSVEDD NFFPEIMFTFNCVPESALPPPD+MKDNRK E+SGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        + RN AMMERLGVKPDY+S ERG NV+RAKSGSGG RKSLGQEQSFQMSQKV+ARMLMS+GFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR
        AVDLLRMFLKTMGYSNFGPLA++VKDGSRN+VRQS H  HGVQPQLQAQHQ+LLQVPQQVPRQMHPQMQQMVH+Q    QQQQQ VLEKMRRR QA+TPR
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR

Query:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH
        A ME NKDRPL+QVKVENTELPMDGNALNALN+RHPQLQFRQQQIAA+SNIHASPGNQFRQM S+QMPQIQTPNT VVRAPPVKVEGFQELMGGDSSSKH
Subjt:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH

Query:  DSEEARLTSPSSK
        DSEEARLTSPSSK
Subjt:  DSEEARLTSPSSK

A0A1S3B1Y3 uncharacterized protein LOC103484926 isoform X22.3e-30189.72Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGDF+YSIFVPFLSSTSTW+TFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS ST N LSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLEN SKDGV QRE  VSSNKASGK  PK ASSAGPRSRES+IGDSSQRLKNELPE WY+QFIEKYRVKQPYRLSHGNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        ++DKRTSEEMSSYLRLLEKHKKRR VF DD LTNFGNSVSANASSSVFDFSNSVEDD NFFPEIMFTFNCVPESALPPPD+MKDNRK E+SGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        + RN AMMERLGVKPDY+S ERG NV+RAKSGSGG RKSLGQEQSFQMSQKV+ARMLMS+GFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR
        AVDLLRMFLKTMGYSNFGPLA++VKDGSRN+VRQS H  HGVQPQLQAQHQ+LLQVPQQVPRQMHPQMQQMVH+Q    QQQQQ VLEKMRRR QA+TPR
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR

Query:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH
        A ME NKDRPL+QVKVENTELPMDGNALNALN+RHPQLQFRQQQIAA+SNIHASPGNQFRQM S+QMPQIQTPNT VVRAPPVKVEGFQELMGGDSSSKH
Subjt:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH

Query:  DSEEARLTSPSSK
        DSEEARLTSPSSK
Subjt:  DSEEARLTSPSSK

A0A5A7SZ36 Transcription initiation factor TFIID subunit 8, putative isoform 16.4e-30490.05Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGDF+YSIFVPFLSSTSTW+TFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS ST N LSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLEN SKDGV QRE  VSSNKASGKIQPK ASSAGPRSRES+IGDSSQRLKNELPE WY+QFIEKYRVKQPYRLSHGNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        ++DKRTSEEMSSYLRLLEKHKKRR VF DD LTNFGNSVSANASSSVFDFSNSVEDD NFFPEIMFTFNCVPESALPPPD+MKDNRK E+SGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        + RN AMMERLGVKPDY+S ERG NV+RAKSGSGG RKSLGQEQSFQMSQKV+ARMLMS+GFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR
        AVDLLRMFLKTMGYSNFGPLA++VKDGSRN+VRQS H  HGVQPQLQAQHQ+LLQVPQQVPRQMHPQMQQMVH+Q    QQQQQ VLEKMRRR QA+TPR
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR

Query:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH
        A ME NKDRPL+QVKVENTELPMDGNALNALN+RHPQLQFRQQQIAA+SNIHASPGNQFRQM S+QMPQIQTPNT VVRAPPVKVEGFQELMGGDSSSKH
Subjt:  AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKH

Query:  DSEEARLTSPSSK
        DSEEARLTSPSSK
Subjt:  DSEEARLTSPSSK

A0A6J1BYK1 uncharacterized protein LOC111005886 isoform X13.1e-29889.5Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS
        MALLGDDGRGYELARKLETLGVWR WLGD NYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR   SPSSPS STANPLSSS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLR---SPSSPSCSTANPLSSS

Query:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN
        SLAISKLNPNYLQLHGDDVYFTLENLSKDGV QRE CV SNKASGKIQ K A SAGPRSRESEIGDSS RLKNELPETWYNQFIEKYRVKQPYRLS GNN
Subjt:  SLAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
        +SDKRT+EEMSSYLRLLEKHKKRRMVF DDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPD+MKDNRKLELSGVID LPQP
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
        M RNP MMERLGVKPD          YRAKSGSG  RK+LGQEQ FQMSQ VIA+MLM+MGFEGATEVPLE+FSQFLSCHICKLGSTLRVLADSYRKQCS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF-QQQQQLVLEKMRRRHQASTP
        AVDLLRMFLKTMGYSNFGPLAELVKDGSRN  RQ+QHQVHGVQPQLQAQH  LLQV QQ+PRQMHPQMQQMVH QNLAF QQQQQL+LEKMRRR QA+TP
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAF-QQQQQLVLEKMRRRHQASTP

Query:  R-AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQ----LQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG
        R AGME+NKDRPLVQVKVENTELPMDGNAL+ALNVR PQ    +QFRQQ IAA+SNIHASP NQFRQMSSLQMPQIQ+PN GVVRAPPVKVEGFQELMGG
Subjt:  R-AGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQ----LQFRQQQIAALSNIHASPGNQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG

Query:  DSSSKHDSEEARLTSPSSK
        DSSSKHDSEEARLTSPSSK
Subjt:  DSSSKHDSEEARLTSPSSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G65540.1 unknown protein2.0e-16455.57Show/hide
Query:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLRSP--SSPSCSTANPLSSSS
        MALLGDDGRG++LARKLE  GVWR WLGD  YS F  +LSS STWE FMR D+SKSRAQIQLQLR RALLFDKA+VSLFLRS   ++ S S+A+    SS
Subjt:  MALLGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLRSP--SSPSCSTANPLSSSS

Query:  LAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLK-NELPETWYNQFIEKYRVKQPYRLSHGNN
        +A+SKLNPNYLQLHGDDVY+TLEN S +   QRE  +  N +  K   K + ++G R  ES+  + SQR +  ELP+TWY QFI +Y  K  Y +S G  
Subjt:  LAISKLNPNYLQLHGDDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLK-NELPETWYNQFIEKYRVKQPYRLSHGNN

Query:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP
         SDKRT E MS+YLR+++ HK++R  F +D+     +  S + SS  FD S S EDD  F PE MF  NCVPE+AL P    +DN K E  GV+D LPQ 
Subjt:  ISDKRTSEEMSSYLRLLEKHKKRRMVFNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQP

Query:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS
          R+  M+ERLG+ P+Y   E  G V R++      +     +Q+  +S+KV+ARML++MGFEGATEVP+++FSQ +S H+ KLG  L++L DSY+K+CS
Subjt:  MIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGCRKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCS

Query:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR
        A+ L++MFL T GYSN G LAE+VKDG+RN    +Q Q   +Q QL  Q Q+ L++PQQ+ RQMHPQMQQMV+ QN  FQQQQQ  LE+MRRR   ++PR
Subjt:  AVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQLQAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPR

Query:  AGMEVNKDRPLVQVKVEN-TELPMDGNALNALNVRHP---QLQFR-QQQIAALSNIHASPG-NQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG
          M++ KDRPLVQVK+EN +E+ +DGNA N +N RH    Q Q R QQQIAA+SN+   PG NQFRQ++S+Q+PQ+QTP  G VRA PVKVEGF++LMGG
Subjt:  AGMEVNKDRPLVQVKVEN-TELPMDGNALNALNVRHP---QLQFR-QQQIAALSNIHASPG-NQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGG

Query:  DSSSKHDSEEARLTSPSSK
        DSS KHDS++ +L SP +K
Subjt:  DSSSKHDSEEARLTSPSSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATTCTTGGCACATCATCTGCGTCCTCATTAGCTACATCTACACCTTGCATCCTCCTCCCCGCACGCCTTCCACTTCCCACCAGATCATCTGCTTTTCCCTTGAT
CTCGTGGAGCTCTTCCATGGCAGTCCCCAAGCTAACGTCTGAATTGGACCCATTTGGAGTTGCACATCTCAAAGCCAGATCAACAAGCCGATCAACCAAGTCCAATGGCC
ACTGTCCAGCCTTCTCGTCTAGAATCTGGACCAGAGCCGCCTTACCCATTGTCAATGCCTTCTCAATCAGCCCTGCCCAGTTCCTCCCAGTTACAAGATGTAGCAGCAGA
GCCCCAAAAGCCCGAATGTCCAATTCAATACCACAAGCTTCATCCAATTCGGCGAGCCCAAAACCACTGACTTCAGCAACAAGGAAGCCCAGGCCCGAGCATACCTCTGA
TGCTATACGGATTCTGTCATGCCACATGAGAGGCCGGCATATTTTCCTGGAGCTTTTGTTGGAAGGAAGTAATATGTCACTTAAGCTACCACTATGCATAACTCAATTCT
CCGGTGAAGAATGTCTCTTTGTCTCCTCAGCTCCTCGATTTCTCTCACCATCTCCTCTCTTGTTTTCGCTGTCTTCTCCAACTGGGCCTCTATACGAGATTTTCCTGCTG
TGGAAATTTGCAGCTTGCTTGATAGCTCAGCTTGGAGTTCGAGAAATTAATGGCCCACTCAGCTTTATCTTGCCTTTCAGCACTGCCCCTAAGTTCATTTTTCATCAACT
GGATAGTCTTGCGAGCTTCCTCAATTTTAGTTCTCAGATGCTCAATTCTTTCCTCGGCATTCTGCCAACATCAGTTAGGCAATCAGTATACTTAGGCAACATAACTGAAA
CTGTGAAGCCTGAATTCAGAGTTTTCGGTGCAAAAGGAATTGGAAAGGTTTTGGGTTTCGGAGGTTTAAGAGTGAAATCTTCAACGAAAAGTAAAGAGAAAGATGCTTTG
GATTTGAACCTTTTTCCGAAACTTTGTTGCGGATTTTCAAGGATCCACACTCAGGGCCTTCTGGAAAGCCTGCCTGGCCTCGTCTTTCATTTCCTTCCACTCGTAACATT
GCCCCAACAAACAGAAGGCTTTCGAGTCGTCTTCCCCCAATCGCACCGCCTCCAACAAGTCTTCCACCGCCGAGTCGAGTCGTCGTTTCCGATTCACGGCGATCTTCAAT
TCGGCTCTCTTAATCAACGCCTTTCCGAGTTCTCTATTCGAGAGCAACTTCAAGCACGGCGGAGAAAATGTGTTCCCTCAAAGAAGTGGCGGCTCGACTGGAGGATCGCG
CGATTCTTAGGGCGGAGCTTACCGAAAGCTCGTTTGGAAAGTTCGCGTACGGCCAACAACGTCGCTGCTGTCAGAATAAAGAGGGCTACTTGTATCATGGCTTCCGGCGA
CATTCTCTCTCTCTTCTCACACTTTCACCGCAATCTGATAGGCGCTGTATCTGGGAGTCGCTGCTGTGGGTTTTGCAACTCTGATGATGTTAAACGGCCACCCACAGAGA
AGATAAGACCTAACACGCGGGCTTTTGCTGCGGCCGCGGCGCCGGACACTTTTGAGGACCACGACGGCTTCTACCCTTCCGGGACGTCTGTTTCTCATGGTATGGCTCTC
CTCGGCGACGATGGTCGGGGCTATGAGCTAGCTCGGAAGCTCGAGACGCTTGGCGTGTGGCGGAATTGGCTTGGAGATTTCAATTACTCCATCTTCGTGCCCTTTCTCTC
TTCGACTTCCACATGGGAAACATTCATGCGAACCGACGATTCAAAATCTAGGGCTCAGATCCAGCTACAACTCAGGGCTCGAGCCCTCCTCTTCGACAAAGCAAGCGTCT
CTCTCTTCCTGCGTTCGCCTTCTTCACCTTCTTGTTCTACTGCAAATCCTTTATCCTCCTCTTCTCTCGCTATTTCGAAGCTCAATCCTAATTACTTGCAGCTCCATGGT
GACGATGTATATTTTACACTAGAAAATCTATCAAAAGACGGGGTTCACCAGCGGGAGAGTTGCGTTTCATCCAATAAGGCTTCCGGCAAGATTCAGCCAAAAGGAGCTTC
AAGTGCTGGGCCAAGATCTAGAGAATCTGAAATTGGTGATTCATCCCAAAGATTAAAAAATGAACTTCCTGAAACATGGTACAATCAGTTCATTGAGAAGTATAGAGTCA
AACAGCCATATCGTTTGTCACATGGGAACAATATATCAGACAAACGAACATCGGAGGAAATGTCTTCTTACCTTAGATTACTTGAGAAACATAAGAAAAGGCGTATGGTC
TTTAACGATGATCAGCTGACAAATTTTGGAAATTCTGTCTCAGCAAATGCTTCAAGTTCTGTTTTTGATTTCAGTAATTCAGTTGAAGACGACACTAATTTTTTCCCTGA
AATTATGTTTACTTTTAACTGTGTGCCAGAGAGTGCACTTCCACCCCCTGATGAAATGAAAGACAATAGAAAACTGGAACTTTCTGGAGTCATTGATATCTTGCCTCAGC
CTATGATTAGGAATCCTGCCATGATGGAGAGGCTTGGTGTCAAGCCTGATTACCTTAGCGCAGAACGAGGGGGTAATGTATATCGTGCGAAAAGTGGATCTGGAGGGTGC
AGGAAAAGTCTTGGTCAAGAGCAATCTTTTCAAATGTCTCAGAAAGTAATAGCCCGAATGCTGATGAGTATGGGGTTTGAAGGTGCCACAGAAGTTCCATTGGAGATTTT
CTCCCAGTTCTTGAGTTGTCATATCTGTAAACTTGGCAGTACTTTGAGAGTACTTGCTGATAGTTACAGAAAGCAGTGTTCAGCAGTGGATTTACTCAGGATGTTCCTTA
AAACGATGGGATATAGTAATTTTGGACCCTTGGCGGAACTTGTCAAGGATGGTTCCAGGAATTTTGTGCGGCAGTCCCAGCACCAAGTTCATGGAGTCCAACCACAATTG
CAGGCACAGCATCAGAGCCTTCTCCAAGTGCCTCAACAAGTGCCTAGACAAATGCATCCACAGATGCAACAGATGGTCCATTCCCAAAATTTGGCATTTCAGCAGCAGCA
GCAATTAGTGCTTGAAAAGATGCGAAGACGCCACCAAGCATCAACGCCACGTGCTGGTATGGAAGTGAACAAGGACAGACCATTGGTGCAAGTCAAGGTTGAAAACACAG
AATTGCCAATGGATGGTAATGCCTTAAATGCTCTTAACGTCAGGCATCCCCAGTTGCAGTTCCGCCAGCAGCAAATTGCTGCTCTGTCCAATATTCATGCTTCACCCGGA
AATCAGTTCAGGCAGATGTCTTCTCTGCAAATGCCCCAAATCCAGACACCGAATACCGGTGTCGTTAGGGCTCCACCGGTGAAGGTCGAAGGCTTCCAGGAACTGATGGG
TGGGGATTCTTCATCAAAACATGATTCAGAGGAAGCTAGATTGACTTCCCCTTCAAGTAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGATTCTTGGCACATCATCTGCGTCCTCATTAGCTACATCTACACCTTGCATCCTCCTCCCCGCACGCCTTCCACTTCCCACCAGATCATCTGCTTTTCCCTTGAT
CTCGTGGAGCTCTTCCATGGCAGTCCCCAAGCTAACGTCTGAATTGGACCCATTTGGAGTTGCACATCTCAAAGCCAGATCAACAAGCCGATCAACCAAGTCCAATGGCC
ACTGTCCAGCCTTCTCGTCTAGAATCTGGACCAGAGCCGCCTTACCCATTGTCAATGCCTTCTCAATCAGCCCTGCCCAGTTCCTCCCAGTTACAAGATGTAGCAGCAGA
GCCCCAAAAGCCCGAATGTCCAATTCAATACCACAAGCTTCATCCAATTCGGCGAGCCCAAAACCACTGACTTCAGCAACAAGGAAGCCCAGGCCCGAGCATACCTCTGA
TGCTATACGGATTCTGTCATGCCACATGAGAGGCCGGCATATTTTCCTGGAGCTTTTGTTGGAAGGAAGTAATATGTCACTTAAGCTACCACTATGCATAACTCAATTCT
CCGGTGAAGAATGTCTCTTTGTCTCCTCAGCTCCTCGATTTCTCTCACCATCTCCTCTCTTGTTTTCGCTGTCTTCTCCAACTGGGCCTCTATACGAGATTTTCCTGCTG
TGGAAATTTGCAGCTTGCTTGATAGCTCAGCTTGGAGTTCGAGAAATTAATGGCCCACTCAGCTTTATCTTGCCTTTCAGCACTGCCCCTAAGTTCATTTTTCATCAACT
GGATAGTCTTGCGAGCTTCCTCAATTTTAGTTCTCAGATGCTCAATTCTTTCCTCGGCATTCTGCCAACATCAGTTAGGCAATCAGTATACTTAGGCAACATAACTGAAA
CTGTGAAGCCTGAATTCAGAGTTTTCGGTGCAAAAGGAATTGGAAAGGTTTTGGGTTTCGGAGGTTTAAGAGTGAAATCTTCAACGAAAAGTAAAGAGAAAGATGCTTTG
GATTTGAACCTTTTTCCGAAACTTTGTTGCGGATTTTCAAGGATCCACACTCAGGGCCTTCTGGAAAGCCTGCCTGGCCTCGTCTTTCATTTCCTTCCACTCGTAACATT
GCCCCAACAAACAGAAGGCTTTCGAGTCGTCTTCCCCCAATCGCACCGCCTCCAACAAGTCTTCCACCGCCGAGTCGAGTCGTCGTTTCCGATTCACGGCGATCTTCAAT
TCGGCTCTCTTAATCAACGCCTTTCCGAGTTCTCTATTCGAGAGCAACTTCAAGCACGGCGGAGAAAATGTGTTCCCTCAAAGAAGTGGCGGCTCGACTGGAGGATCGCG
CGATTCTTAGGGCGGAGCTTACCGAAAGCTCGTTTGGAAAGTTCGCGTACGGCCAACAACGTCGCTGCTGTCAGAATAAAGAGGGCTACTTGTATCATGGCTTCCGGCGA
CATTCTCTCTCTCTTCTCACACTTTCACCGCAATCTGATAGGCGCTGTATCTGGGAGTCGCTGCTGTGGGTTTTGCAACTCTGATGATGTTAAACGGCCACCCACAGAGA
AGATAAGACCTAACACGCGGGCTTTTGCTGCGGCCGCGGCGCCGGACACTTTTGAGGACCACGACGGCTTCTACCCTTCCGGGACGTCTGTTTCTCATGGTATGGCTCTC
CTCGGCGACGATGGTCGGGGCTATGAGCTAGCTCGGAAGCTCGAGACGCTTGGCGTGTGGCGGAATTGGCTTGGAGATTTCAATTACTCCATCTTCGTGCCCTTTCTCTC
TTCGACTTCCACATGGGAAACATTCATGCGAACCGACGATTCAAAATCTAGGGCTCAGATCCAGCTACAACTCAGGGCTCGAGCCCTCCTCTTCGACAAAGCAAGCGTCT
CTCTCTTCCTGCGTTCGCCTTCTTCACCTTCTTGTTCTACTGCAAATCCTTTATCCTCCTCTTCTCTCGCTATTTCGAAGCTCAATCCTAATTACTTGCAGCTCCATGGT
GACGATGTATATTTTACACTAGAAAATCTATCAAAAGACGGGGTTCACCAGCGGGAGAGTTGCGTTTCATCCAATAAGGCTTCCGGCAAGATTCAGCCAAAAGGAGCTTC
AAGTGCTGGGCCAAGATCTAGAGAATCTGAAATTGGTGATTCATCCCAAAGATTAAAAAATGAACTTCCTGAAACATGGTACAATCAGTTCATTGAGAAGTATAGAGTCA
AACAGCCATATCGTTTGTCACATGGGAACAATATATCAGACAAACGAACATCGGAGGAAATGTCTTCTTACCTTAGATTACTTGAGAAACATAAGAAAAGGCGTATGGTC
TTTAACGATGATCAGCTGACAAATTTTGGAAATTCTGTCTCAGCAAATGCTTCAAGTTCTGTTTTTGATTTCAGTAATTCAGTTGAAGACGACACTAATTTTTTCCCTGA
AATTATGTTTACTTTTAACTGTGTGCCAGAGAGTGCACTTCCACCCCCTGATGAAATGAAAGACAATAGAAAACTGGAACTTTCTGGAGTCATTGATATCTTGCCTCAGC
CTATGATTAGGAATCCTGCCATGATGGAGAGGCTTGGTGTCAAGCCTGATTACCTTAGCGCAGAACGAGGGGGTAATGTATATCGTGCGAAAAGTGGATCTGGAGGGTGC
AGGAAAAGTCTTGGTCAAGAGCAATCTTTTCAAATGTCTCAGAAAGTAATAGCCCGAATGCTGATGAGTATGGGGTTTGAAGGTGCCACAGAAGTTCCATTGGAGATTTT
CTCCCAGTTCTTGAGTTGTCATATCTGTAAACTTGGCAGTACTTTGAGAGTACTTGCTGATAGTTACAGAAAGCAGTGTTCAGCAGTGGATTTACTCAGGATGTTCCTTA
AAACGATGGGATATAGTAATTTTGGACCCTTGGCGGAACTTGTCAAGGATGGTTCCAGGAATTTTGTGCGGCAGTCCCAGCACCAAGTTCATGGAGTCCAACCACAATTG
CAGGCACAGCATCAGAGCCTTCTCCAAGTGCCTCAACAAGTGCCTAGACAAATGCATCCACAGATGCAACAGATGGTCCATTCCCAAAATTTGGCATTTCAGCAGCAGCA
GCAATTAGTGCTTGAAAAGATGCGAAGACGCCACCAAGCATCAACGCCACGTGCTGGTATGGAAGTGAACAAGGACAGACCATTGGTGCAAGTCAAGGTTGAAAACACAG
AATTGCCAATGGATGGTAATGCCTTAAATGCTCTTAACGTCAGGCATCCCCAGTTGCAGTTCCGCCAGCAGCAAATTGCTGCTCTGTCCAATATTCATGCTTCACCCGGA
AATCAGTTCAGGCAGATGTCTTCTCTGCAAATGCCCCAAATCCAGACACCGAATACCGGTGTCGTTAGGGCTCCACCGGTGAAGGTCGAAGGCTTCCAGGAACTGATGGG
TGGGGATTCTTCATCAAAACATGATTCAGAGGAAGCTAGATTGACTTCCCCTTCAAGTAAATAG
Protein sequenceShow/hide protein sequence
MKILGTSSASSLATSTPCILLPARLPLPTRSSAFPLISWSSSMAVPKLTSELDPFGVAHLKARSTSRSTKSNGHCPAFSSRIWTRAALPIVNAFSISPAQFLPVTRCSSR
APKARMSNSIPQASSNSASPKPLTSATRKPRPEHTSDAIRILSCHMRGRHIFLELLLEGSNMSLKLPLCITQFSGEECLFVSSAPRFLSPSPLLFSLSSPTGPLYEIFLL
WKFAACLIAQLGVREINGPLSFILPFSTAPKFIFHQLDSLASFLNFSSQMLNSFLGILPTSVRQSVYLGNITETVKPEFRVFGAKGIGKVLGFGGLRVKSSTKSKEKDAL
DLNLFPKLCCGFSRIHTQGLLESLPGLVFHFLPLVTLPQQTEGFRVVFPQSHRLQQVFHRRVESSFPIHGDLQFGSLNQRLSEFSIREQLQARRRKCVPSKKWRLDWRIA
RFLGRSLPKARLESSRTANNVAAVRIKRATCIMASGDILSLFSHFHRNLIGAVSGSRCCGFCNSDDVKRPPTEKIRPNTRAFAAAAAPDTFEDHDGFYPSGTSVSHGMAL
LGDDGRGYELARKLETLGVWRNWLGDFNYSIFVPFLSSTSTWETFMRTDDSKSRAQIQLQLRARALLFDKASVSLFLRSPSSPSCSTANPLSSSSLAISKLNPNYLQLHG
DDVYFTLENLSKDGVHQRESCVSSNKASGKIQPKGASSAGPRSRESEIGDSSQRLKNELPETWYNQFIEKYRVKQPYRLSHGNNISDKRTSEEMSSYLRLLEKHKKRRMV
FNDDQLTNFGNSVSANASSSVFDFSNSVEDDTNFFPEIMFTFNCVPESALPPPDEMKDNRKLELSGVIDILPQPMIRNPAMMERLGVKPDYLSAERGGNVYRAKSGSGGC
RKSLGQEQSFQMSQKVIARMLMSMGFEGATEVPLEIFSQFLSCHICKLGSTLRVLADSYRKQCSAVDLLRMFLKTMGYSNFGPLAELVKDGSRNFVRQSQHQVHGVQPQL
QAQHQSLLQVPQQVPRQMHPQMQQMVHSQNLAFQQQQQLVLEKMRRRHQASTPRAGMEVNKDRPLVQVKVENTELPMDGNALNALNVRHPQLQFRQQQIAALSNIHASPG
NQFRQMSSLQMPQIQTPNTGVVRAPPVKVEGFQELMGGDSSSKHDSEEARLTSPSSK