; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018901 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018901
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationtig00153228:325275..340876
RNA-Seq ExpressionSgr018901
SyntenySgr018901
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR015216 - SANT associated
IPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044617.1 mis18-binding protein 1-like isoform X1 [Cucumis melo var. makuwa]0.0e+0071.44Show/hide
Query:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS
        MADEI+S+  DGFN +  ++E+ QSP +P+DS   I AD   FP IVS +  +CE +IN+ +SAS            ENP   E+SV+KMV+ DSA  SS
Subjt:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS

Query:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL
        ENGG     +V K+QNLD  L KE  KVDAVHDFET D  ED  QEVAVD+V+ KDFARSVLSFD NQ  A +ELVQE QL     A  KEAFART++LL
Subjt:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL

Query:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV
         K  DSE+ LEMKKKLLLEK++AMLVP DE+HL++G NPPSSGGIVD C KTM +DEEKIADQQNDSE M VLRRS+LSLRNSLKIEVIDETALVEP+HV
Subjt:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV

Query:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI
        S IGNG+ IGIVCP  SMQ+++ K  EP+KGGKKAK+SRR+ARE K+ E+HW + NVNE+DKV+GRQ+N E NKIMYSRKDMEALRFVNVAEQ+RLW+AI
Subjt:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI

Query:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY
        CKEL+PVVAREYSSLT    +K GS+SDPR+ L KREEASSIIRE CSESLDGEIEDMEGDNEI NFV S+PSCSL              S+D DDDKYY
Subjt:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY

Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF
        +SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSK+WE+AFLADFS+LRQALS S E  
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF

Query:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT
        M+SDFILHEKID +VPNLI QPSVLPA+D D  QPE+ N +TSAKE  C DYPSLSAISKMN +FRVSSLRKRI S ETQTTLSRADC+WLFALSAAVDT
Subjt:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT

Query:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK
        PLD DT A FRSLLRKCASLRAEK E+DDEVIMLNIL+TISG     RP      RTRS SAS TP S+ FK+DR+  AT  TP TP T STSI+PSSLK
Subjt:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK

Query:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS
        SVFL+DWWLVKA+DGEGLA+GGF SRERAG+R FYSAAISKRHE TILEA DGIII++SGFINR RT ENGFPPKVYNHFLLGFPFNWKDYM SSSD+ S
Subjt:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS

Query:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV
        PF+C KASTSHS+ QG SH LEPDLD+LA+T LRDL L+TYGESS  H L  K+SNSS SPT  F+SGL +TP G ++VNS+Q LKEQ F +EGK ND +
Subjt:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV

Query:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR
        KD +HASQEA +K+DM+ QI   +GVCTRSMT+LKN+RN+SKES + DS+K+++
Subjt:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR

TYK16972.1 mis18-binding protein 1-like isoform X1 [Cucumis melo var. makuwa]0.0e+0071.35Show/hide
Query:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS
        MADEI+S+  DGFN +  ++E+ QSP +P+DS   I AD   FP IVS +  +CE +IN+ +SAS            ENP   E+SV+KMV+ DSA  SS
Subjt:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS

Query:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL
        ENGG     +V K+QNLD  L KE  KVDAVHDFET D  ED  QEVAVD+V+ KDFARSVLSFD NQ  A +ELVQE QL     A  KEAFART++LL
Subjt:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL

Query:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV
         K  DSE+ LEMKKKLLLEK++AMLVP DE+HL++G NPPSSGGIVD C KTM +DEEKIADQQNDSE M VLRRS+LSLRNSLKIEVIDETALVEP+HV
Subjt:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV

Query:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI
        S IGNG+ IGIVCP  SMQ+++ K  EP+KGGKKAK+SRR+ARE K+ E+HW + NVNE+DKV+GRQ+N E NKIMYSRKDMEALRFVNVAEQ+RLW+AI
Subjt:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI

Query:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY
        CKEL+PVVAREYSSLT    +K GS+SDPR+ L KREEASSIIRE CSESLDGEIEDMEGDNEI NFV S+PSCSL              S+D DDDKYY
Subjt:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY

Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF
        +SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSK+WE+AFLADFS+LRQALS S E  
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF

Query:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT
        M+SDFILHEKID +VPNLI QPSVLPA+D D  QPE+ N +TSAKE  C DYPSLSAISKMN +FRVSSLRKRI S ETQTTLSRADC+WLFALSAAVDT
Subjt:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT

Query:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK
        PLD DT A FRSLLRKCASLRAEK E+DDEVIMLNIL+TISG     RP      RTRS SAS TP S+ FK+DR+  AT  TP TP T STSI+PSSLK
Subjt:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK

Query:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS
        SVFL+DWWLVKA+DGEGLA+GGF SRERAG+R FYSAAISKRH  TILEA DGIII++SGFINR RT ENGFPPKVYNHFLLGFPFNWKDYM SSSD+ S
Subjt:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS

Query:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV
        PF+C KASTSHS+ QG SH LEPDLD+LA+T LRDL L+TYGESS  H L  K+SNSS SPT  F+SGL +TP G ++VNS+Q LKEQ F +EGK ND +
Subjt:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV

Query:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR
        KD +HASQEA +K+DM+ QI   +GVCTRSMT+LKN+RN+SKES + DS+K+++
Subjt:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR

XP_022145184.1 uncharacterized protein LOC111014690 isoform X1 [Momordica charantia]0.0e+0081.18Show/hide
Query:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS
        MADEISS   GDGFNRR S AE+ Q P+QPIDS S+I  DE KFP+I S QKQ+ E+MIN  +SAS FS NVFHCAA ENPEITETSVEKMVV DSA   
Subjt:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS

Query:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE
        SENGG   +LVD+V+NLD+ L  EKE  KVD V  FETFDA+EDGNQEVAVD+VEEKDFARS+ S DRNQ FA KELVQEVQLS A+EADGKEAFAR++E
Subjt:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE

Query:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI
        LL K A S +FLEMKKKLLLE+LEAMLV E     EKG NPP+S GIVDNCS    + + KIADQQN SENM VLRRSNLSLRNSLKIEVIDETALVEP+
Subjt:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI

Query:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR
        HVS IGNGEE+GIVCP+ SMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIH KL NVNELDK N RQKN E NKI+YSRKDMEALRFVNVAEQRRLW+
Subjt:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR

Query:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD
        AICKELMPVVAREYSSLT +N QMKIGS+SD ++HLEKR+EASSIIRE CSESLDGEIEDMEGD+EIKNFVSSDPSCSLS SE GNTIL +ECS D DDD
Subjt:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD

Query:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE
        KYY+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSKQWEDAFLADFS+LRQALS SE
Subjt:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE

Query:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAV
        GF+QSDFILHEKID + PNLIDQPSVLPAN+IDSQQPEKP ANT AKES C+D+PS+SAISKM+SVFRVSSLRKRI  VET TTLSRADC WLFALSAA+
Subjt:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAV

Query:  DTPLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG
        DTPLDADT A FRSLLRKCASLRAEK ELDDEVIMLNILA ISG
Subjt:  DTPLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG

XP_022145185.1 uncharacterized protein LOC111014690 isoform X2 [Momordica charantia]1.4e-28680.27Show/hide
Query:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS
        MADEISS   GDGFNRR S AE+ Q P+QPIDS S+I  DE KFP+I S QKQ+ E+MIN  +SAS FS NVFHCAA ENPEITETSVEKMVV DSA   
Subjt:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS

Query:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE
        SENGG   +LVD+V+NLD+ L  EKE  KVD V  FETFDA+EDGNQEVAVD+VEEKDFARS+ S DRNQ FA KELVQEVQLS A+EADGKEAFAR++E
Subjt:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE

Query:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI
        LL K A S +FLEMKKKLLLE+LEAMLV E     EKG NPP+S GIVDNCS    + + KIADQQN SENM VLRRSNLSLRNSLKIEVIDETALVEP+
Subjt:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI

Query:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR
        HVS IGNGEE+GIVCP+ SMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIH KL NVNELDK N RQKN E NKI+YSRKDMEALRFVNVAEQRRLW+
Subjt:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR

Query:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD
        AICKELMPVVAREYSSLT +N QMKIGS+SD ++HLEKR+EASSIIRE CSESLDGEIEDMEGD+EIKNFVSSDPSCSLS SE GNTIL +ECS D DDD
Subjt:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD

Query:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE
        KYY+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSKQWEDAFLADFS+LRQALS SE
Subjt:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE

Query:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSV
        GF+QSDFILHEKID + PNLIDQPSVLPAN+IDSQQPEKP ANT AKES C+D+PS+SAISKM+SVFRVSSLRKRI  +
Subjt:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSV

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]2.5e-28373.58Show/hide
Query:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS
        MADEISS+ GDGFN + S +E+SQS  +PIDS  KI AD+K FP IVS Q Q+ E +INS +SAS            ENP   ETSV K     SA  SS
Subjt:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS

Query:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL
        ENGG     +V K+QNLD  L KEP KVDAVHDFET DA+EDG Q+VA+D+V EKDFARSVLSFD N   + +ELVQEVQL     A  KEAFART+ELL
Subjt:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL

Query:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV
         K  D E+ LE+KKKLLLE+L+AMLVP D++HLEKG+NPPSS G VD+CSKT+ IDEEKIAD+QNDSE M VLRRS+LSLRNSLKIEVIDETALVEP+HV
Subjt:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV

Query:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI
        S IGNGE IGIVCP  SMQ+K+NK  EP++GGKKAKRSRR+AREAK+ E++W LGNVNELDKVNGRQK  E NKI+YSRKDMEALRFVNVAEQRRLW+AI
Subjt:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI

Query:  CKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKY
        CKEL+P VAREYSSLT +N  MKIGS+SDPR+ L KREEASSIIRE CSESLDGEIEDMEGDNE  NFV  +PSCS S SE            D D+DKY
Subjt:  CKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKY

Query:  YNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGF
        Y SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVT+AKVDRSNFKKEQSVYMPVIP IAKCP+HLLPSK+WE+AFLADFS LR+ALS SE F
Subjt:  YNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGF

Query:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT
         QSDFILHEKIDS +P+LI QP VLPA +IDS Q E+ N +TSAKE+ C DYPSLSAISKMNSVFRVSSL+KRI S+ETQTTLS+ DC+WLFALSAAVDT
Subjt:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT

Query:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG
        PLDADT A FRSLLRKCASLRA+K ELDDEVIMLNIL+TISG
Subjt:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG

TrEMBL top hitse value%identityAlignment
A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948755.5e-28172.24Show/hide
Query:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS
        MADEI+S+  DGFN +  ++E+ QSP +P+DS   I AD   FP IVS +  +CE +IN+ +SAS            ENP   E+SV+KMV+ DSA  SS
Subjt:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS

Query:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL
        ENGG     +V K+QNLD  L KE  KVDAVHDFET D  ED  QEVAVD+V+ KDFARSVLSFD NQ  A +ELVQE QL     A  KEAFART++LL
Subjt:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL

Query:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV
         K  DSE+ LEMKKKLLLEK++AMLVP DE+HL++G NPPSSGGIVD C KTM +DEEKIADQQNDSE M VLRRS+LSLRNSLKIEVIDETALVEP+HV
Subjt:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV

Query:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI
        S IGNG+ IGIVCP  SMQ+++ K  EP+KGGKK  +SRR+ARE K+ E+HW + NVNE+DKV+GRQ+N E NKIMYSRKDMEALRFVNVAEQ+RLW+AI
Subjt:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI

Query:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY
        CKEL+PVVAREYSSLT    +K GS+SDPR+ L KREEASSIIRE CSESLDGEIEDMEGDNEI NFV S+PSCSL              S+D DDDKYY
Subjt:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY

Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF
        +SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSK+WE+AFLADFS+LRQALS S E  
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF

Query:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT
        M+SDFILHEKID +VPNLI QPSVLPA+D D  QPE+ N +TSAKE  C DYPSLSAISKMN +FRVSSLRKRI S ETQTTLSRADC+WLFALSAAVDT
Subjt:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT

Query:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG
        PLD DT A FRSLLRKCASLRAEK E+DDEVIMLNIL+TISG
Subjt:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG

A0A5A7TRY3 Mis18-binding protein 1-like isoform X10.0e+0071.44Show/hide
Query:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS
        MADEI+S+  DGFN +  ++E+ QSP +P+DS   I AD   FP IVS +  +CE +IN+ +SAS            ENP   E+SV+KMV+ DSA  SS
Subjt:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS

Query:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL
        ENGG     +V K+QNLD  L KE  KVDAVHDFET D  ED  QEVAVD+V+ KDFARSVLSFD NQ  A +ELVQE QL     A  KEAFART++LL
Subjt:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL

Query:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV
         K  DSE+ LEMKKKLLLEK++AMLVP DE+HL++G NPPSSGGIVD C KTM +DEEKIADQQNDSE M VLRRS+LSLRNSLKIEVIDETALVEP+HV
Subjt:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV

Query:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI
        S IGNG+ IGIVCP  SMQ+++ K  EP+KGGKKAK+SRR+ARE K+ E+HW + NVNE+DKV+GRQ+N E NKIMYSRKDMEALRFVNVAEQ+RLW+AI
Subjt:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI

Query:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY
        CKEL+PVVAREYSSLT    +K GS+SDPR+ L KREEASSIIRE CSESLDGEIEDMEGDNEI NFV S+PSCSL              S+D DDDKYY
Subjt:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY

Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF
        +SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSK+WE+AFLADFS+LRQALS S E  
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF

Query:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT
        M+SDFILHEKID +VPNLI QPSVLPA+D D  QPE+ N +TSAKE  C DYPSLSAISKMN +FRVSSLRKRI S ETQTTLSRADC+WLFALSAAVDT
Subjt:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT

Query:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK
        PLD DT A FRSLLRKCASLRAEK E+DDEVIMLNIL+TISG     RP      RTRS SAS TP S+ FK+DR+  AT  TP TP T STSI+PSSLK
Subjt:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK

Query:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS
        SVFL+DWWLVKA+DGEGLA+GGF SRERAG+R FYSAAISKRHE TILEA DGIII++SGFINR RT ENGFPPKVYNHFLLGFPFNWKDYM SSSD+ S
Subjt:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS

Query:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV
        PF+C KASTSHS+ QG SH LEPDLD+LA+T LRDL L+TYGESS  H L  K+SNSS SPT  F+SGL +TP G ++VNS+Q LKEQ F +EGK ND +
Subjt:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV

Query:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR
        KD +HASQEA +K+DM+ QI   +GVCTRSMT+LKN+RN+SKES + DS+K+++
Subjt:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR

A0A5D3CZJ0 Mis18-binding protein 1-like isoform X10.0e+0071.35Show/hide
Query:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS
        MADEI+S+  DGFN +  ++E+ QSP +P+DS   I AD   FP IVS +  +CE +IN+ +SAS            ENP   E+SV+KMV+ DSA  SS
Subjt:  MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASS

Query:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL
        ENGG     +V K+QNLD  L KE  KVDAVHDFET D  ED  QEVAVD+V+ KDFARSVLSFD NQ  A +ELVQE QL     A  KEAFART++LL
Subjt:  ENGGKN---LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELL

Query:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV
         K  DSE+ LEMKKKLLLEK++AMLVP DE+HL++G NPPSSGGIVD C KTM +DEEKIADQQNDSE M VLRRS+LSLRNSLKIEVIDETALVEP+HV
Subjt:  NKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHV

Query:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI
        S IGNG+ IGIVCP  SMQ+++ K  EP+KGGKKAK+SRR+ARE K+ E+HW + NVNE+DKV+GRQ+N E NKIMYSRKDMEALRFVNVAEQ+RLW+AI
Subjt:  SNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAI

Query:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY
        CKEL+PVVAREYSSLT    +K GS+SDPR+ L KREEASSIIRE CSESLDGEIEDMEGDNEI NFV S+PSCSL              S+D DDDKYY
Subjt:  CKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYY

Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF
        +SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSK+WE+AFLADFS+LRQALS S E  
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCS-EGF

Query:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT
        M+SDFILHEKID +VPNLI QPSVLPA+D D  QPE+ N +TSAKE  C DYPSLSAISKMN +FRVSSLRKRI S ETQTTLSRADC+WLFALSAAVDT
Subjt:  MQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT

Query:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK
        PLD DT A FRSLLRKCASLRAEK E+DDEVIMLNIL+TISG     RP      RTRS SAS TP S+ FK+DR+  AT  TP TP T STSI+PSSLK
Subjt:  PLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSDRQPTATAQTPKTPPTYSTSIIPSSLK

Query:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS
        SVFL+DWWLVKA+DGEGLA+GGF SRERAG+R FYSAAISKRH  TILEA DGIII++SGFINR RT ENGFPPKVYNHFLLGFPFNWKDYM SSSD+ S
Subjt:  SVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTS

Query:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV
        PF+C KASTSHS+ QG SH LEPDLD+LA+T LRDL L+TYGESS  H L  K+SNSS SPT  F+SGL +TP G ++VNS+Q LKEQ F +EGK ND +
Subjt:  PFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESS--HYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGK-NDAV

Query:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR
        KD +HASQEA +K+DM+ QI   +GVCTRSMT+LKN+RN+SKES + DS+K+++
Subjt:  KDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQR

A0A6J1CTR6 uncharacterized protein LOC111014690 isoform X26.8e-28780.27Show/hide
Query:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS
        MADEISS   GDGFNRR S AE+ Q P+QPIDS S+I  DE KFP+I S QKQ+ E+MIN  +SAS FS NVFHCAA ENPEITETSVEKMVV DSA   
Subjt:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS

Query:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE
        SENGG   +LVD+V+NLD+ L  EKE  KVD V  FETFDA+EDGNQEVAVD+VEEKDFARS+ S DRNQ FA KELVQEVQLS A+EADGKEAFAR++E
Subjt:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE

Query:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI
        LL K A S +FLEMKKKLLLE+LEAMLV E     EKG NPP+S GIVDNCS    + + KIADQQN SENM VLRRSNLSLRNSLKIEVIDETALVEP+
Subjt:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI

Query:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR
        HVS IGNGEE+GIVCP+ SMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIH KL NVNELDK N RQKN E NKI+YSRKDMEALRFVNVAEQRRLW+
Subjt:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR

Query:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD
        AICKELMPVVAREYSSLT +N QMKIGS+SD ++HLEKR+EASSIIRE CSESLDGEIEDMEGD+EIKNFVSSDPSCSLS SE GNTIL +ECS D DDD
Subjt:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD

Query:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE
        KYY+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSKQWEDAFLADFS+LRQALS SE
Subjt:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE

Query:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSV
        GF+QSDFILHEKID + PNLIDQPSVLPAN+IDSQQPEKP ANT AKES C+D+PS+SAISKM+SVFRVSSLRKRI  +
Subjt:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSV

A0A6J1CUG9 uncharacterized protein LOC111014690 isoform X10.0e+0081.18Show/hide
Query:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS
        MADEISS   GDGFNRR S AE+ Q P+QPIDS S+I  DE KFP+I S QKQ+ E+MIN  +SAS FS NVFHCAA ENPEITETSVEKMVV DSA   
Subjt:  MADEISS-NNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYAS

Query:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE
        SENGG   +LVD+V+NLD+ L  EKE  KVD V  FETFDA+EDGNQEVAVD+VEEKDFARS+ S DRNQ FA KELVQEVQLS A+EADGKEAFAR++E
Subjt:  SENGGK--NLVDKVQNLDAAL--EKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQE

Query:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI
        LL K A S +FLEMKKKLLLE+LEAMLV E     EKG NPP+S GIVDNCS    + + KIADQQN SENM VLRRSNLSLRNSLKIEVIDETALVEP+
Subjt:  LLNKVADSENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPI

Query:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR
        HVS IGNGEE+GIVCP+ SMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIH KL NVNELDK N RQKN E NKI+YSRKDMEALRFVNVAEQRRLW+
Subjt:  HVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWR

Query:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD
        AICKELMPVVAREYSSLT +N QMKIGS+SD ++HLEKR+EASSIIRE CSESLDGEIEDMEGD+EIKNFVSSDPSCSLS SE GNTIL +ECS D DDD
Subjt:  AICKELMPVVAREYSSLT-TNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDD

Query:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE
        KYY+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSKQWEDAFLADFS+LRQALS SE
Subjt:  KYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSE

Query:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAV
        GF+QSDFILHEKID + PNLIDQPSVLPAN+IDSQQPEKP ANT AKES C+D+PS+SAISKM+SVFRVSSLRKRI  VET TTLSRADC WLFALSAA+
Subjt:  GFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAV

Query:  DTPLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG
        DTPLDADT A FRSLLRKCASLRAEK ELDDEVIMLNILA ISG
Subjt:  DTPLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISG

SwissProt top hitse value%identityAlignment
F4KCE9 Kinetochore-associated protein KNL-2 homolog1.1e-1242.55Show/hide
Query:  KSVFLYDWWLVKAD---DGEGLAVGGF-ASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWK
        K+V L DWWL+K     +G+   V GF  S E   +RVF S+ I+K  +   L A+DGI IT+ GF+N+ R  +NGF P++   F+ GFP  W+
Subjt:  KSVFLYDWWLVKAD---DGEGLAVGGF-ASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWK

O14893 Gem-associated protein 21.1e-1230.29Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      PE   P+ QW+   +A FS +RQ ++      +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQSDF

Query:  ILHEKIDS--VVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDY------PSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAA
           +++DS   +P   D+        +  +         +  ES  IDY      P LS +S+MN    V+S+ + + +   +   +     WL+AL A 
Subjt:  ILHEKIDS--VVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDY------PSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAA

Query:  VDTPLDADTYATFRSLLRKCASLRAEKAELDDE-VIMLNIL
        ++ PL  + ++  R L R+C+ +R      DDE V  LN+L
Subjt:  VDTPLDADTYATFRSLLRKCASLRAEKAELDDE-VIMLNIL

O42260 Gem-associated protein 25.1e-1328.76Show/hide
Query:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQSDFILHEKIDS--VVP
        S PP    EYLRRV+ EA+  P+V +A++D    +K+Q+V +  +      P+   PS +W+   +A FS +RQ+L    G  +S     + +DS   +P
Subjt:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQSDFILHEKIDS--VVP

Query:  NLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDY------PSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDTPLDADTYATF
        +  D+ S       +    +   A  S  +   IDY      P LS +S+M+     S L   +   E +         WL+AL A ++ PL  + ++  
Subjt:  NLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDY------PSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDTPLDADTYATF

Query:  RSLLRKCASLRA-EKAELDDEVIMLNILATISG
        R L R+C+ +RA  + + DD V  LN+   + G
Subjt:  RSLLRKCASLRA-EKAELDDEVIMLNILATISG

Q8RWD7 Protein EMBRYO DEFECTIVE 16744.6e-2240.22Show/hide
Query:  TPDSIPFKSDRQPTATAQTPKTPPT-YSTSIIP--SSLKSVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGF
        T  S   K+  +P ++  T  T PT +  ++I    +LKSV L DWWL K   G+ L + GF S   +G+R+F S  ISKRHE+T LEA DGI I+++GF
Subjt:  TPDSIPFKSDRQPTATAQTPKTPPT-YSTSIIP--SSLKSVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGF

Query:  INRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTSPFQCIKAS----TSHSDHQGMSHNLEPDLDDLAVTTLRDL
        INRSR  ENG   +V N F LGFP++W+DY     +K      I         + D   +   L+  + D  V +LRDL
Subjt:  INRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTSPFQCIKAS----TSHSDHQGMSHNLEPDLDDLAVTTLRDL

Q9CQQ4 Gem-associated protein 25.1e-1329.88Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      PE   P+ QW+   +A FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQSDF

Query:  ILHEKIDS--VVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDY------PSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAA
           +++DS   +P   D+        +  +   +     S +ES  IDY      P LS +S+MN    ++S+ + + +   +   +     W +AL A 
Subjt:  ILHEKIDS--VVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDY------PSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAA

Query:  VDTPLDADTYATFRSLLRKCASLRAEKAELDDE-VIMLNIL
        ++ PL  + ++  R L R+C+ +R      DDE V  LN+L
Subjt:  VDTPLDADTYATFRSLLRKCASLRAEKAELDDE-VIMLNIL

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related2.1e-6236.36Show/hide
Query:  KIEVIDETALVEPIHVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEA
        KIE++D TALV+ +H                            P++ G  A+  +   R+ K      K+G+   + + +G + N +  + +Y+RK +E+
Subjt:  KIEVIDETALVEPIHVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEA

Query:  LRFVNVAEQRRLWRAICKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNT
        +RF ++  Q+ LW  +   ++P V  EY SL      K   S+  R   E   E          E+L  E    EG  + +++                 
Subjt:  LRFVNVAEQRRLWRAICKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNT

Query:  ILEEECSKDIDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFL
                  DD+  YNSI RPAF V+GEP+F +GPPEDGLEYLRRVRWEA  IPNV VAK+D S + KKEQSVYMP+IP I KCPE+LLP K+WED+ L
Subjt:  ILEEECSKDIDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKEQSVYMPVIPAIAKCPEHLLPSKQWEDAFL

Query:  ADFSELRQALSCSEGFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKES-GCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTL
         DF  LRQ L+ S    + + I  + ++ ++  + ++                 + +T   ES G +    ++ I  M+SV RVS L+KRI  VE ++ L
Subjt:  ADFSELRQALSCSEGFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKES-GCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTL

Query:  SRADCVWLFALSAAVDTPLDADTYATFRSLLRKCASLRAEKA-ELDDE--VIMLNILATISG
          +DC W+ AL A+++TPLDADT A  R LLRKCAS+RAE + E+ DE  + M N+L TI+G
Subjt:  SRADCVWLFALSAAVDTPLDADTYATFRSLLRKCASLRAEKA-ELDDE--VIMLNILATISG

AT1G58210.1 kinase interacting family protein3.2e-2340.22Show/hide
Query:  TPDSIPFKSDRQPTATAQTPKTPPT-YSTSIIP--SSLKSVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGF
        T  S   K+  +P ++  T  T PT +  ++I    +LKSV L DWWL K   G+ L + GF S   +G+R+F S  ISKRHE+T LEA DGI I+++GF
Subjt:  TPDSIPFKSDRQPTATAQTPKTPPT-YSTSIIP--SSLKSVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGF

Query:  INRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTSPFQCIKAS----TSHSDHQGMSHNLEPDLDDLAVTTLRDL
        INRSR  ENG   +V N F LGFP++W+DY     +K      I         + D   +   L+  + D  V +LRDL
Subjt:  INRSRTHENGFPPKVYNHFLLGFPFNWKDYMGSSSDKTSPFQCIKAS----TSHSDHQGMSHNLEPDLDDLAVTTLRDL

AT2G42510.1 FUNCTIONS IN: molecular_function unknown3.8e-2468.75Show/hide
Query:  ILEEECSKDIDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQSVYMPVIP
        +++EE ++D DD   YNSI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KEQSVYMP IP
Subjt:  ILEEECSKDIDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown5.7e-3626.59Show/hide
Query:  SQSPIQPIDSISKIFADEK--KFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAEN-----------PEITETSVEKMVVFDSAYASSE---NGGKN
        S  P++  DS+ ++ + +K  K     S+ +++ E+ I+S S    FSE       A N           PE   T +E+  V +    SSE     G  
Subjt:  SQSPIQPIDSISKIFADEK--KFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAEN-----------PEITETSVEKMVVFDSAYASSE---NGGKN

Query:  LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVE-EKDFARSVLSFDRNQHFANKELVQEVQLSG-AVEAD--GKEAFARTQELLNKVAD
        L D V  +   +E + F V      E+ ++ + G        VE EK    + +         NKEL Q  ++S  + E+D  G + +    + +  ++D
Subjt:  LVDKVQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVE-EKDFARSVLSFDRNQHFANKELVQEVQLSG-AVEAD--GKEAFARTQELLNKVAD

Query:  SENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHVSNIG-
         +  L  +KK            + +V +   +   + G  +   S    I++     +    +  +V  + N  +  S+ I+++D+TAL + +     G 
Subjt:  SENFLEMKKKLLLEKLEAMLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHVSNIG-

Query:  NGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAICKEL
        +  +  +         + +K V  EK   K   S    R A       K+ +     ++NG+Q      +IMYSR  ME++R+ ++A Q++LW  +   L
Subjt:  NGEEIGIVCPAGSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAICKEL

Query:  MPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYYNSIQ
        +P +  EY                P   + +R +                                            + +++EE ++D DD   YNSI 
Subjt:  MPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSIIREECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYYNSIQ

Query:  RPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQ--
        RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KEQSVYMP IP      E +          L  F  L    S    F Q  
Subjt:  RPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQ--

Query:  --SDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT
          S F+L +K         D  + L    I +++ +        KESG                                  L  +DC W+ AL A+VDT
Subjt:  --SDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSVFRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDT

Query:  PLDADTYATFRSLLRKCASLRA
        P DADT A  R+L+RKCASLRA
Subjt:  PLDADTYATFRSLLRKCASLRA

AT5G02520.1 CONTAINS InterPro DOMAIN/s: SANT associated (InterPro:IPR015216)8.0e-1442.55Show/hide
Query:  KSVFLYDWWLVKAD---DGEGLAVGGF-ASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWK
        K+V L DWWL+K     +G+   V GF  S E   +RVF S+ I+K  +   L A+DGI IT+ GF+N+ R  +NGF P++   F+ GFP  W+
Subjt:  KSVFLYDWWLVKAD---DGEGLAVGGF-ASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGFPFNWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATGAGATCAGTTCTAACAATGGCGATGGGTTTAATCGAAGACTTTCAACGGCTGAGGATTCCCAGTCTCCTATTCAACCGATTGATTCGATCTCTAAGATCTT
TGCGGATGAGAAGAAATTCCCTGCGATCGTCTCGATTCAAAAGCAGGAGTGTGAATTAATGATAAACAGTACGAGTTCCGCTTCTAGGTTCTCCGAGAATGTTTTTCATT
GCGCTGCAGCAGAGAACCCAGAAATTACAGAAACTTCTGTCGAAAAGATGGTCGTGTTCGATTCGGCTTATGCATCTTCTGAAAACGGAGGAAAAAATCTGGTGGACAAG
GTTCAGAATCTTGATGCGGCGCTCGAAAAAGAACCTTTCAAGGTGGACGCTGTCCACGACTTTGAAACGTTCGATGCCATGGAGGATGGTAATCAAGAAGTTGCGGTGGA
CGACGTAGAAGAGAAAGATTTTGCAAGAAGTGTACTGAGTTTTGATAGAAATCAACATTTTGCAAATAAAGAACTTGTTCAAGAAGTCCAGTTGTCTGGTGCTGTTGAAG
CCGACGGCAAAGAAGCCTTTGCGCGAACCCAGGAGCTGTTGAACAAAGTAGCTGATTCTGAGAACTTTCTTGAAATGAAAAAGAAATTACTTTTGGAAAAACTCGAAGCC
ATGTTGGTTCCTGAAGATGAAGTTCATCTAGAAAAGGGAAGTAATCCCCCTAGCTCAGGAGGGATCGTGGATAATTGCAGCAAAACGATGTTTATTGATGAGGAGAAGAT
TGCTGATCAGCAAAATGATTCCGAGAACATGAAAGTTCTTAGACGAAGTAATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAAC
CGATTCATGTTTCGAACATTGGAAATGGAGAAGAGATTGGTATTGTTTGTCCAGCTGGGTCAATGCAGATCAAGTTGAACAAATATGTTGAACCCGAAAAAGGGGGGAAA
AAGGCTAAACGATCGAGGAGGAGGGCAAGGGAAGCGAAAATTCCTGAGATACATTGGAAGCTGGGGAATGTGAATGAGCTCGATAAAGTCAATGGACGTCAAAAAAATGA
GGAAAGTAACAAGATAATGTATTCGAGGAAGGATATGGAAGCACTGAGGTTTGTGAATGTTGCAGAACAGAGGAGGTTGTGGAGAGCTATATGCAAGGAACTTATGCCCG
TTGTGGCAAGGGAATACAGTAGCTTAACAACTAATTGCCAGATGAAGATAGGCTCCAGCTCTGATCCTAGGAAGCATTTGGAGAAGAGAGAAGAAGCCTCTTCAATTATA
AGGGAGGAATGTTCAGAAAGCTTGGATGGTGAGATAGAGGACATGGAAGGTGATAATGAAATTAAAAATTTTGTATCCTCGGATCCCTCTTGCAGTCTTAGTTTCAGCGA
GGGTGGCAATACAATTTTAGAAGAAGAGTGCAGCAAAGATATTGATGATGATAAATATTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTG
ATTCAGGGCCTCCAGAAGATGGATTAGAATATCTTAGACGTGTCAGGTGGGAAGCTTCCCATATTCCAAATGTGACGGTGGCAAAGGTTGATAGAAGTAATTTTAAGAAA
GAGCAAAGTGTTTATATGCCAGTGATTCCTGCAATTGCCAAATGCCCCGAGCATTTACTGCCTTCAAAACAGTGGGAGGATGCATTTCTTGCTGATTTTTCTGAGCTGCG
ACAGGCTCTGTCATGCTCTGAGGGATTTATGCAGTCTGATTTCATCCTCCATGAAAAGATCGATTCCGTAGTTCCGAACTTGATTGATCAGCCAAGTGTCTTACCTGCCA
ATGACATCGACTCACAGCAACCTGAGAAACCGAATGCCAACACTTCAGCGAAGGAAAGCGGTTGCATCGATTATCCATCTTTATCAGCGATTTCAAAGATGAACTCTGTG
TTTCGCGTTTCGTCGTTGAGGAAACGTATAAAATCGGTGGAAACACAGACTACATTGTCAAGGGCTGATTGTGTCTGGCTGTTTGCATTAAGTGCAGCAGTTGATACTCC
TCTGGATGCAGATACTTATGCCACTTTCAGGAGTCTGCTTCGGAAATGTGCAAGCTTGCGGGCCGAGAAGGCCGAGCTCGACGACGAGGTGATAATGCTCAATATTCTTG
CCACCATTTCTGGGAGTTCTTCGTCTGCCCGCCCGTCGGCCACCACCATGGTCAGAACTCGAAGCAAGTCCGCGTCTGGGACTCCCGACTCGATTCCGTTCAAATCCGAT
CGGCAGCCCACCGCCACCGCCCAGACGCCGAAAACACCGCCTACGTATTCCACATCCATCATTCCTTCCTCACTAAAATCAGTGTTTCTGTACGATTGGTGGCTGGTAAA
GGCTGATGATGGCGAGGGCTTGGCTGTCGGAGGGTTTGCTTCCAGAGAAAGAGCTGGGTTAAGAGTATTTTATTCTGCAGCAATTTCTAAGCGACATGAAGCCACCATTC
TTGAGGCAACAGATGGCATTATCATTACAGTCAGTGGCTTCATAAATAGATCACGTACACATGAAAATGGCTTCCCACCGAAGGTGTACAATCATTTCCTTCTTGGTTTT
CCCTTCAATTGGAAAGACTATATGGGTTCTAGTTCTGATAAAACATCTCCTTTCCAGTGTATTAAAGCAAGTACTTCTCATTCTGACCATCAAGGTATGAGCCATAATTT
AGAACCTGACTTGGATGATCTTGCAGTAACTACGTTGCGAGATCTCTTCTTGGCCACTTATGGAGAATCTAGTCATTACCTCTTTATGAAGAGCTCAAATTCAAGTTGTA
GTCCAACGGCAATTTTTGAATCCGGACTAAGTGAGACCCCCGGAGGTTCTGAAAGAGTGAATTCCTATCAATATCTAAAGGAGCAAGGTTTCTGGGATGAGGGGAAAAAT
GATGCTGTAAAGGATCCAGTGCACGCTAGTCAAGAAGCTAAATCTAAGATGGACATGGAACTCCAGATTCCAAGAAGCAGAGGCGTTTGCACAAGAAGCATGACAAGGTT
GAAAAATTCTAGGAACAAAAGTAAAGAGAGTCGCTTGTCAGACTCTCAGAAGAGGCAGAGGTATTGGGAGAATAACTCTTCATGTGTAAACTCGGGCAAGGGAACTGAAC
AGAGAGGTAAAAGTAGTTCAAATGTTACACCAACAAGTGATTTTTCGATTAGATCTGATAAGAAAGTTGATGCAATCACAGATAAATTGGAGAGTAGAAGATCAAGTCTT
AGGCTAAGGAACCGAAAGCAGAATCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATGAGATCAGTTCTAACAATGGCGATGGGTTTAATCGAAGACTTTCAACGGCTGAGGATTCCCAGTCTCCTATTCAACCGATTGATTCGATCTCTAAGATCTT
TGCGGATGAGAAGAAATTCCCTGCGATCGTCTCGATTCAAAAGCAGGAGTGTGAATTAATGATAAACAGTACGAGTTCCGCTTCTAGGTTCTCCGAGAATGTTTTTCATT
GCGCTGCAGCAGAGAACCCAGAAATTACAGAAACTTCTGTCGAAAAGATGGTCGTGTTCGATTCGGCTTATGCATCTTCTGAAAACGGAGGAAAAAATCTGGTGGACAAG
GTTCAGAATCTTGATGCGGCGCTCGAAAAAGAACCTTTCAAGGTGGACGCTGTCCACGACTTTGAAACGTTCGATGCCATGGAGGATGGTAATCAAGAAGTTGCGGTGGA
CGACGTAGAAGAGAAAGATTTTGCAAGAAGTGTACTGAGTTTTGATAGAAATCAACATTTTGCAAATAAAGAACTTGTTCAAGAAGTCCAGTTGTCTGGTGCTGTTGAAG
CCGACGGCAAAGAAGCCTTTGCGCGAACCCAGGAGCTGTTGAACAAAGTAGCTGATTCTGAGAACTTTCTTGAAATGAAAAAGAAATTACTTTTGGAAAAACTCGAAGCC
ATGTTGGTTCCTGAAGATGAAGTTCATCTAGAAAAGGGAAGTAATCCCCCTAGCTCAGGAGGGATCGTGGATAATTGCAGCAAAACGATGTTTATTGATGAGGAGAAGAT
TGCTGATCAGCAAAATGATTCCGAGAACATGAAAGTTCTTAGACGAAGTAATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAAC
CGATTCATGTTTCGAACATTGGAAATGGAGAAGAGATTGGTATTGTTTGTCCAGCTGGGTCAATGCAGATCAAGTTGAACAAATATGTTGAACCCGAAAAAGGGGGGAAA
AAGGCTAAACGATCGAGGAGGAGGGCAAGGGAAGCGAAAATTCCTGAGATACATTGGAAGCTGGGGAATGTGAATGAGCTCGATAAAGTCAATGGACGTCAAAAAAATGA
GGAAAGTAACAAGATAATGTATTCGAGGAAGGATATGGAAGCACTGAGGTTTGTGAATGTTGCAGAACAGAGGAGGTTGTGGAGAGCTATATGCAAGGAACTTATGCCCG
TTGTGGCAAGGGAATACAGTAGCTTAACAACTAATTGCCAGATGAAGATAGGCTCCAGCTCTGATCCTAGGAAGCATTTGGAGAAGAGAGAAGAAGCCTCTTCAATTATA
AGGGAGGAATGTTCAGAAAGCTTGGATGGTGAGATAGAGGACATGGAAGGTGATAATGAAATTAAAAATTTTGTATCCTCGGATCCCTCTTGCAGTCTTAGTTTCAGCGA
GGGTGGCAATACAATTTTAGAAGAAGAGTGCAGCAAAGATATTGATGATGATAAATATTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTG
ATTCAGGGCCTCCAGAAGATGGATTAGAATATCTTAGACGTGTCAGGTGGGAAGCTTCCCATATTCCAAATGTGACGGTGGCAAAGGTTGATAGAAGTAATTTTAAGAAA
GAGCAAAGTGTTTATATGCCAGTGATTCCTGCAATTGCCAAATGCCCCGAGCATTTACTGCCTTCAAAACAGTGGGAGGATGCATTTCTTGCTGATTTTTCTGAGCTGCG
ACAGGCTCTGTCATGCTCTGAGGGATTTATGCAGTCTGATTTCATCCTCCATGAAAAGATCGATTCCGTAGTTCCGAACTTGATTGATCAGCCAAGTGTCTTACCTGCCA
ATGACATCGACTCACAGCAACCTGAGAAACCGAATGCCAACACTTCAGCGAAGGAAAGCGGTTGCATCGATTATCCATCTTTATCAGCGATTTCAAAGATGAACTCTGTG
TTTCGCGTTTCGTCGTTGAGGAAACGTATAAAATCGGTGGAAACACAGACTACATTGTCAAGGGCTGATTGTGTCTGGCTGTTTGCATTAAGTGCAGCAGTTGATACTCC
TCTGGATGCAGATACTTATGCCACTTTCAGGAGTCTGCTTCGGAAATGTGCAAGCTTGCGGGCCGAGAAGGCCGAGCTCGACGACGAGGTGATAATGCTCAATATTCTTG
CCACCATTTCTGGGAGTTCTTCGTCTGCCCGCCCGTCGGCCACCACCATGGTCAGAACTCGAAGCAAGTCCGCGTCTGGGACTCCCGACTCGATTCCGTTCAAATCCGAT
CGGCAGCCCACCGCCACCGCCCAGACGCCGAAAACACCGCCTACGTATTCCACATCCATCATTCCTTCCTCACTAAAATCAGTGTTTCTGTACGATTGGTGGCTGGTAAA
GGCTGATGATGGCGAGGGCTTGGCTGTCGGAGGGTTTGCTTCCAGAGAAAGAGCTGGGTTAAGAGTATTTTATTCTGCAGCAATTTCTAAGCGACATGAAGCCACCATTC
TTGAGGCAACAGATGGCATTATCATTACAGTCAGTGGCTTCATAAATAGATCACGTACACATGAAAATGGCTTCCCACCGAAGGTGTACAATCATTTCCTTCTTGGTTTT
CCCTTCAATTGGAAAGACTATATGGGTTCTAGTTCTGATAAAACATCTCCTTTCCAGTGTATTAAAGCAAGTACTTCTCATTCTGACCATCAAGGTATGAGCCATAATTT
AGAACCTGACTTGGATGATCTTGCAGTAACTACGTTGCGAGATCTCTTCTTGGCCACTTATGGAGAATCTAGTCATTACCTCTTTATGAAGAGCTCAAATTCAAGTTGTA
GTCCAACGGCAATTTTTGAATCCGGACTAAGTGAGACCCCCGGAGGTTCTGAAAGAGTGAATTCCTATCAATATCTAAAGGAGCAAGGTTTCTGGGATGAGGGGAAAAAT
GATGCTGTAAAGGATCCAGTGCACGCTAGTCAAGAAGCTAAATCTAAGATGGACATGGAACTCCAGATTCCAAGAAGCAGAGGCGTTTGCACAAGAAGCATGACAAGGTT
GAAAAATTCTAGGAACAAAAGTAAAGAGAGTCGCTTGTCAGACTCTCAGAAGAGGCAGAGGTATTGGGAGAATAACTCTTCATGTGTAAACTCGGGCAAGGGAACTGAAC
AGAGAGGTAAAAGTAGTTCAAATGTTACACCAACAAGTGATTTTTCGATTAGATCTGATAAGAAAGTTGATGCAATCACAGATAAATTGGAGAGTAGAAGATCAAGTCTT
AGGCTAAGGAACCGAAAGCAGAATCAATAG
Protein sequenceShow/hide protein sequence
MADEISSNNGDGFNRRLSTAEDSQSPIQPIDSISKIFADEKKFPAIVSIQKQECELMINSTSSASRFSENVFHCAAAENPEITETSVEKMVVFDSAYASSENGGKNLVDK
VQNLDAALEKEPFKVDAVHDFETFDAMEDGNQEVAVDDVEEKDFARSVLSFDRNQHFANKELVQEVQLSGAVEADGKEAFARTQELLNKVADSENFLEMKKKLLLEKLEA
MLVPEDEVHLEKGSNPPSSGGIVDNCSKTMFIDEEKIADQQNDSENMKVLRRSNLSLRNSLKIEVIDETALVEPIHVSNIGNGEEIGIVCPAGSMQIKLNKYVEPEKGGK
KAKRSRRRAREAKIPEIHWKLGNVNELDKVNGRQKNEESNKIMYSRKDMEALRFVNVAEQRRLWRAICKELMPVVAREYSSLTTNCQMKIGSSSDPRKHLEKREEASSII
REECSESLDGEIEDMEGDNEIKNFVSSDPSCSLSFSEGGNTILEEECSKDIDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKK
EQSVYMPVIPAIAKCPEHLLPSKQWEDAFLADFSELRQALSCSEGFMQSDFILHEKIDSVVPNLIDQPSVLPANDIDSQQPEKPNANTSAKESGCIDYPSLSAISKMNSV
FRVSSLRKRIKSVETQTTLSRADCVWLFALSAAVDTPLDADTYATFRSLLRKCASLRAEKAELDDEVIMLNILATISGSSSSARPSATTMVRTRSKSASGTPDSIPFKSD
RQPTATAQTPKTPPTYSTSIIPSSLKSVFLYDWWLVKADDGEGLAVGGFASRERAGLRVFYSAAISKRHEATILEATDGIIITVSGFINRSRTHENGFPPKVYNHFLLGF
PFNWKDYMGSSSDKTSPFQCIKASTSHSDHQGMSHNLEPDLDDLAVTTLRDLFLATYGESSHYLFMKSSNSSCSPTAIFESGLSETPGGSERVNSYQYLKEQGFWDEGKN
DAVKDPVHASQEAKSKMDMELQIPRSRGVCTRSMTRLKNSRNKSKESRLSDSQKRQRYWENNSSCVNSGKGTEQRGKSSSNVTPTSDFSIRSDKKVDAITDKLESRRSSL
RLRNRKQNQ