; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019362 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019362
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSmr domain-containing protein
Genome locationtig00153346:751572..757057
RNA-Seq ExpressionSgr019362
SyntenySgr019362
Gene Ontology termsNA
InterPro domainsIPR002625 - Smr domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148966.1 uncharacterized protein LOC101223137 [Cucumis sativus]7.6e-10554.59Show/hide
Query:  WARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEITDV
        W  GKSPGW  FNLKQ N+GLQDE+D E FPP+ T  S  PP ENLHGV G RSGRSF+S  LPS DSLT PE +G K  I  DSSI+ GKKVVEE TDV
Subjt:  WARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEITDV

Query:  LAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMV------------------------------------------CIKTILHVKKMH------
        LAF KLK +HSWAD SLI DIM+AVNNN DEASTLLK MV                                             T+  V+ +H      
Subjt:  LAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMV------------------------------------------CIKTILHVKKMH------

Query:  -----KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------
             K+      ERN FHN GN K+ALGCS S+PIEPEWEEDDIYLSHRKDAIAMM                                           
Subjt:  -----KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------

Query:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG
               RNSKNGLWKLDLHGLHAAEAV+AL +HLLKIETQNASNRSLSPKKAERKGFQRASSLE LSC++SKLDKESPSSRHR TS+EVITGIG HS+G
Subjt:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG

Query:  KLLYQRLWQGFLAK
        +    +    FL +
Subjt:  KLLYQRLWQGFLAK

XP_008451240.1 PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo]2.1e-10254.81Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        +SW  GKSPGW  FNL+Q NRGLQ E D E FPP+ T     PP ENLHGV G R GRSF+S  LPSADSLT P  +G K  I  DS I+ GKKVVEE T
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC---------IKTI-LH--------------------------------VKKMHKIDR
        DVLAF KLK +HSWAD SLI DIM+AVNNN DEASTLLK MV          I T+ LH                                 + MH+ D 
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC---------IKTI-LH--------------------------------VKKMHKIDR

Query:  R--YSC---------ERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------
             C         ERNFF N GN K+ALGCS S+PIEPEWEEDDIYLSHRKDAIAMM                                         
Subjt:  R--YSC---------ERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------

Query:  ---------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHS
                 RNSKNGLWKLDLHGLHAAEAV+ALQ+HLLKIETQNASNRSLSPKKAERKGFQRASSLE LSC+D+KLDKESPSSRHR TS+EVITGIG HS
Subjt:  ---------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHS

Query:  RGKLLYQRLWQGFLAK
        +G+    +    FL +
Subjt:  RGKLLYQRLWQGFLAK

XP_022143556.1 uncharacterized protein LOC111013424 [Momordica charantia]4.9e-11257.97Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        MSW RGKS GWAA NLKQQ RGLQDEI+ EPFPPISTTLS  PP  NLH V+  RSGR FSS LLPSADSLT PET G KK +LGDSSI+GGKKV+EEI 
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVCIKTILH---------------------------------------VKKMHK------
          LA RKLK LHSWAD  LIED+MEAVNNNI+EAS LLKAMV  +T  +                                       +K +H+      
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVCIKTILH---------------------------------------VKKMHK------

Query:  ------IDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------
              I+  Y CERNFFHNVGNPKL+LGCSNS+PIEPEWEEDDIYLSHRKDAI+M+                                           
Subjt:  ------IDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------

Query:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG
               RN KNGLWKLDLHGLHAAEAV+ALQEHLLKIETQNASNRSLSPKK ERKGF RASSLESL+CIDSKLDKES SSRHR   +EVITGIGNHSRG
Subjt:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG

Query:  KLLYQRLWQGFLAK
        +         FL++
Subjt:  KLLYQRLWQGFLAK

XP_022953352.1 uncharacterized protein LOC111455928 isoform X1 [Cucurbita moschata]4.6e-10254.85Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        +S  RGKSPGW  FNLKQQNRGLQD ID +PFPP+ + LS  PPRENLHGVNG R GRS SS  LPSADSLT PE +   KKILGDSSI+ G+KVVEE T
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----
        DVLAF KLK LH+WAD SLI DIMEAV+NN +EAST L  MV                                             T+  V+ MH    
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----

Query:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------
         K+      ERNFFHNVGNPK+AL CS S PIEPEWEEDDIYLSHRKDAIAMM                                               
Subjt:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------

Query:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL
           RNS+NGLWKLDLHGLHAAEAV+ALQ+HLLKIET+NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE  SP  RHR TS+EVITG+G HSRG+ 
Subjt:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL

Query:  LYQRLWQGFLAK
           +    FL++
Subjt:  LYQRLWQGFLAK

XP_022991450.1 uncharacterized protein LOC111488062 isoform X2 [Cucurbita maxima]4.2e-10355.58Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        MSW RGKSPGWAA NLKQQN GLQDEID +PFPP+ST  S  PP EN+H VNG RSGRS SS  LPSADSLT PE +G  KKILGDS+I+ G+KVVEE T
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----
        DVLAF KLK LH+WAD SLI DIMEAV+NN +EAST L  MV                                             T+  V+ MH    
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----

Query:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------
         K+      ERNFFHNVGNPK+AL CS S PIEPEWEEDDIYLSHRKDAIAMM                                               
Subjt:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------

Query:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL
           RNS+NGLWKLDLHGLHAAEAV+ALQ+HLLKIET NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE  SP  RHR TS+EVITGIG HSRG+ 
Subjt:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL

Query:  LYQRLWQGFLAK
           +    FL++
Subjt:  LYQRLWQGFLAK

TrEMBL top hitse value%identityAlignment
A0A0A0KA90 Smr domain-containing protein3.7e-10554.59Show/hide
Query:  WARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEITDV
        W  GKSPGW  FNLKQ N+GLQDE+D E FPP+ T  S  PP ENLHGV G RSGRSF+S  LPS DSLT PE +G K  I  DSSI+ GKKVVEE TDV
Subjt:  WARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEITDV

Query:  LAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMV------------------------------------------CIKTILHVKKMH------
        LAF KLK +HSWAD SLI DIM+AVNNN DEASTLLK MV                                             T+  V+ +H      
Subjt:  LAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMV------------------------------------------CIKTILHVKKMH------

Query:  -----KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------
             K+      ERN FHN GN K+ALGCS S+PIEPEWEEDDIYLSHRKDAIAMM                                           
Subjt:  -----KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------

Query:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG
               RNSKNGLWKLDLHGLHAAEAV+AL +HLLKIETQNASNRSLSPKKAERKGFQRASSLE LSC++SKLDKESPSSRHR TS+EVITGIG HS+G
Subjt:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG

Query:  KLLYQRLWQGFLAK
        +    +    FL +
Subjt:  KLLYQRLWQGFLAK

A0A1S3BRS7 uncharacterized protein LOC1034925901.0e-10254.81Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        +SW  GKSPGW  FNL+Q NRGLQ E D E FPP+ T     PP ENLHGV G R GRSF+S  LPSADSLT P  +G K  I  DS I+ GKKVVEE T
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC---------IKTI-LH--------------------------------VKKMHKIDR
        DVLAF KLK +HSWAD SLI DIM+AVNNN DEASTLLK MV          I T+ LH                                 + MH+ D 
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC---------IKTI-LH--------------------------------VKKMHKIDR

Query:  R--YSC---------ERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------
             C         ERNFF N GN K+ALGCS S+PIEPEWEEDDIYLSHRKDAIAMM                                         
Subjt:  R--YSC---------ERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------

Query:  ---------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHS
                 RNSKNGLWKLDLHGLHAAEAV+ALQ+HLLKIETQNASNRSLSPKKAERKGFQRASSLE LSC+D+KLDKESPSSRHR TS+EVITGIG HS
Subjt:  ---------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHS

Query:  RGKLLYQRLWQGFLAK
        +G+    +    FL +
Subjt:  RGKLLYQRLWQGFLAK

A0A6J1CQL8 uncharacterized protein LOC1110134242.4e-11257.97Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        MSW RGKS GWAA NLKQQ RGLQDEI+ EPFPPISTTLS  PP  NLH V+  RSGR FSS LLPSADSLT PET G KK +LGDSSI+GGKKV+EEI 
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVCIKTILH---------------------------------------VKKMHK------
          LA RKLK LHSWAD  LIED+MEAVNNNI+EAS LLKAMV  +T  +                                       +K +H+      
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVCIKTILH---------------------------------------VKKMHK------

Query:  ------IDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------
              I+  Y CERNFFHNVGNPKL+LGCSNS+PIEPEWEEDDIYLSHRKDAI+M+                                           
Subjt:  ------IDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-------------------------------------------

Query:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG
               RN KNGLWKLDLHGLHAAEAV+ALQEHLLKIETQNASNRSLSPKK ERKGF RASSLESL+CIDSKLDKES SSRHR   +EVITGIGNHSRG
Subjt:  -------RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRG

Query:  KLLYQRLWQGFLAK
        +         FL++
Subjt:  KLLYQRLWQGFLAK

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X12.2e-10254.85Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        +S  RGKSPGW  FNLKQQNRGLQD ID +PFPP+ + LS  PPRENLHGVNG R GRS SS  LPSADSLT PE +   KKILGDSSI+ G+KVVEE T
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----
        DVLAF KLK LH+WAD SLI DIMEAV+NN +EAST L  MV                                             T+  V+ MH    
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----

Query:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------
         K+      ERNFFHNVGNPK+AL CS S PIEPEWEEDDIYLSHRKDAIAMM                                               
Subjt:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------

Query:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL
           RNS+NGLWKLDLHGLHAAEAV+ALQ+HLLKIET+NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE  SP  RHR TS+EVITG+G HSRG+ 
Subjt:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL

Query:  LYQRLWQGFLAK
           +    FL++
Subjt:  LYQRLWQGFLAK

A0A6J1JSZ2 uncharacterized protein LOC111488062 isoform X22.0e-10355.58Show/hide
Query:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT
        MSW RGKSPGWAA NLKQQN GLQDEID +PFPP+ST  S  PP EN+H VNG RSGRS SS  LPSADSLT PE +G  KKILGDS+I+ G+KVVEE T
Subjt:  MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEIT

Query:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----
        DVLAF KLK LH+WAD SLI DIMEAV+NN +EAST L  MV                                             T+  V+ MH    
Subjt:  DVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVC------------------------------------------IKTILHVKKMH----

Query:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------
         K+      ERNFFHNVGNPK+AL CS S PIEPEWEEDDIYLSHRKDAIAMM                                               
Subjt:  -KIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMM-----------------------------------------------

Query:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL
           RNS+NGLWKLDLHGLHAAEAV+ALQ+HLLKIET NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE  SP  RHR TS+EVITGIG HSRG+ 
Subjt:  ---RNSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKE--SPSSRHRSTSMEVITGIGNHSRGKL

Query:  LYQRLWQGFLAK
           +    FL++
Subjt:  LYQRLWQGFLAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G23520.1 smr (Small MutS Related) domain-containing protein1.7e-4133.91Show/hide
Query:  MSWARGKSPGWAAFNLKQ-QNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGR------RSGRSFSSVLLPSA--DSLTPPETFGVKKKILGDSSIEG
        MSW +GKS GW AF+LKQ Q +GL+ E++ +PFPP+ST+++         GV GR       S +SFSSVLLP +   +LT  +  G +++  G    + 
Subjt:  MSWARGKSPGWAAFNLKQ-QNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGR------RSGRSFSSVLLPSA--DSLTPPETFGVKKKILGDSSIEG

Query:  GKKVVEEITDVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVCI---------------------------KTILHVKKM---------HK
            +   +  LAF KLK ++SWAD +LI D++ +  ++ + A   LK MV                             KT+    KM          K
Subjt:  GKKVVEEITDVLAFRKLKGLHSWADISLIEDIMEAVNNNIDEASTLLKAMVCI---------------------------KTILHVKKM---------HK

Query:  IDRRYSCERNFFHNVG-NPKLALGCS---------NSIPIEPEWEEDDIYLSHRKDAIAMMR--------------------------------------
         D   S   +F  N   N K     S          SIPIEPEWEEDD+YLSHRKDA+ +MR                                      
Subjt:  IDRRYSCERNFFHNVG-NPKLALGCS---------NSIPIEPEWEEDDIYLSHRKDAIAMMR--------------------------------------

Query:  ------------NSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQ-RASSLESLSCIDSK-LDKESPSSRHRSTSMEVITGI
                    N  N +WKLDLHGLHA EAV+ALQE L  IE     NRS+SP +   K    R++S E    +D + +  +  SSR    S++VITGI
Subjt:  ------------NSKNGLWKLDLHGLHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQ-RASSLESLSCIDSK-LDKESPSSRHRSTSMEVITGI

Query:  GNHSRGK
        G HSRG+
Subjt:  GNHSRGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTGGGCAAGGGGTAAATCTCCAGGCTGGGCCGCATTTAACCTTAAGCAACAGAATAGAGGCCTTCAAGATGAAATTGACGCAGAACCATTCCCACCAATATCAAC
CACTCTTTCCTGTCCGCCACCCCGTGAAAACTTGCATGGAGTCAATGGCCGTCGTTCAGGGAGATCTTTCTCATCTGTTCTCCTTCCTTCTGCTGACTCTCTAACTCCGC
CAGAAACTTTTGGCGTAAAAAAGAAAATACTTGGTGATTCTAGCATTGAAGGTGGTAAGAAGGTGGTTGAAGAAATCACTGATGTTTTAGCTTTTAGGAAGCTGAAAGGG
CTCCATTCTTGGGCTGATATTAGCTTGATTGAGGATATAATGGAAGCTGTAAATAATAATATTGATGAGGCATCTACTTTATTGAAAGCAATGGTGTGCATCAAAACTAT
CTTGCATGTGAAGAAGATGCACAAAATTGATAGAAGATATTCTTGTGAAAGAAATTTCTTTCATAATGTTGGAAATCCAAAACTAGCTCTTGGTTGCTCAAACTCCATTC
CTATTGAGCCTGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGAATAGCAAAAATGGGCTCTGGAAGTTGGACTTACATGGT
CTTCATGCAGCCGAAGCTGTTCGAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAAAATGCCTCCAATCGATCATTATCGCCAAAGAAAGCTGAAAGGAAAGGGTT
CCAACGTGCTTCATCACTTGAGTCTCTTAGTTGTATAGACTCCAAGTTGGACAAAGAATCACCATCATCTAGGCATAGGTCAACATCAATGGAAGTCATAACAGGTATAG
GAAATCATAGCAGGGGGAAGCTGCTCTACCAACGGCTGTGGCAAGGTTTCTTAGCGAAAACGG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTGGGCAAGGGGTAAATCTCCAGGCTGGGCCGCATTTAACCTTAAGCAACAGAATAGAGGCCTTCAAGATGAAATTGACGCAGAACCATTCCCACCAATATCAAC
CACTCTTTCCTGTCCGCCACCCCGTGAAAACTTGCATGGAGTCAATGGCCGTCGTTCAGGGAGATCTTTCTCATCTGTTCTCCTTCCTTCTGCTGACTCTCTAACTCCGC
CAGAAACTTTTGGCGTAAAAAAGAAAATACTTGGTGATTCTAGCATTGAAGGTGGTAAGAAGGTGGTTGAAGAAATCACTGATGTTTTAGCTTTTAGGAAGCTGAAAGGG
CTCCATTCTTGGGCTGATATTAGCTTGATTGAGGATATAATGGAAGCTGTAAATAATAATATTGATGAGGCATCTACTTTATTGAAAGCAATGGTGTGCATCAAAACTAT
CTTGCATGTGAAGAAGATGCACAAAATTGATAGAAGATATTCTTGTGAAAGAAATTTCTTTCATAATGTTGGAAATCCAAAACTAGCTCTTGGTTGCTCAAACTCCATTC
CTATTGAGCCTGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGAATAGCAAAAATGGGCTCTGGAAGTTGGACTTACATGGT
CTTCATGCAGCCGAAGCTGTTCGAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAAAATGCCTCCAATCGATCATTATCGCCAAAGAAAGCTGAAAGGAAAGGGTT
CCAACGTGCTTCATCACTTGAGTCTCTTAGTTGTATAGACTCCAAGTTGGACAAAGAATCACCATCATCTAGGCATAGGTCAACATCAATGGAAGTCATAACAGGTATAG
GAAATCATAGCAGGGGGAAGCTGCTCTACCAACGGCTGTGGCAAGGTTTCTTAGCGAAAACGG
Protein sequenceShow/hide protein sequence
MSWARGKSPGWAAFNLKQQNRGLQDEIDAEPFPPISTTLSCPPPRENLHGVNGRRSGRSFSSVLLPSADSLTPPETFGVKKKILGDSSIEGGKKVVEEITDVLAFRKLKG
LHSWADISLIEDIMEAVNNNIDEASTLLKAMVCIKTILHVKKMHKIDRRYSCERNFFHNVGNPKLALGCSNSIPIEPEWEEDDIYLSHRKDAIAMMRNSKNGLWKLDLHG
LHAAEAVRALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLESLSCIDSKLDKESPSSRHRSTSMEVITGIGNHSRGKLLYQRLWQGFLAKTX