CuGenDBv2

MS008503 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS008503
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Tetratricopeptide repeat (TPR)-like superfamily protein
Genome location: scaffold4:1860479..1861248
RNA-Seq Expression: MS008503
Synteny: MS008503
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
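
The Genome location value has the form scaffold:start..end. A minimal Python sketch for splitting such a string into its parts follows. The function name parse_location is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold4:1860479..1861248")

# Span length, ASSUMING 1-based inclusive coordinates (not confirmed by the record).
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the genomic span can be compared with the CDS length listed under Sequences; a span longer than the CDS would be consistent with, but does not prove, the presence of an intron.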


Homology
GenBank top hits (E-value, % identity, alignment)
MBA0869242.1 hypothetical protein [Gossypium schwendimanii]; E-value: 4.0e-34; identity: 44.26%
Query:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVVRTELRDESVM-----EEEECE
        M+ R+AS P + S + H   P  DL    +    RT F +VSC S    IS   + +S R  + +VS ++        ++T  ++  ++     EEEE E
Subjt:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVVRTELRDESVM-----EEEECE

Query:  ---------RFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGD----GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCE
                 R ++  VE +      GG +  GG  GG +   G     GY DS  G  S + YYQKMI+A P ++L+LSNYARFLKEV+GDLV+AEEYC 
Subjt:  ---------RFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGD----GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCE

Query:  RAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        RAILA PNDGN LS+Y DLIWQ H DG RAQTYFDQA+ SAP D
Subjt:  RAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

XP_021277339.1 uncharacterized protein LOC110411484 [Herrania umbratica]; E-value: 1.0e-34; identity: 45.42%
Query:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVV---------RTELRDESVMEE
        ML R+AS P + S + H   P  +     +    R+   +VSC S     S+S+ S+ + S  ++ ++SET   R  VV            L   SV EE
Subjt:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVV---------RTELRDESVMEE

Query:  EECE---------RFATESVE-------AKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLV
        EE E         R A+  VE               GGG G GG DGG   W   GY DS  GNGS + YYQKMIEA P ++L+LSNYARFLKEV+GD V
Subjt:  EECE---------RFATESVE-------AKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLV

Query:  KAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        KAEEYC RAILA PNDGN LS+Y DLIWQ H D  RA+TYFD+A+ ++PHD
Subjt:  KAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

XP_028102716.1 aspartate, glycine, lysine and serine-rich protein-like [Camellia sinensis]; E-value: 4.0e-34; identity: 68.7%
Query:  GGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDR
        GGG G GG  GGG+     GY DS  GN S EEYY KMI+AYP DAL+LSNYA+FLKEV+GD VKAEEYC RAILA P+DGN LSLY DLIWQ H D  R
Subjt:  GGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDR

Query:  AQTYFDQALHSAPHD
        A+TYFDQA+ +AP D
Subjt:  AQTYFDQALHSAPHD

XP_038896674.1 uncharacterized protein LOC120084935 isoform X1 [Benincasa hispida]; E-value: 8.2e-64; identity: 62.4%
Query:  MIPPKMLPRTASPPAVESSIHRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSI------------SETSRFRKRVVRTELR
        MI P MLP T S    +  IHRS PG DL PI+RF  IR   MSVSCGS + R SM    NS +SFS+ + +            SE S+F K V++ E+ 
Subjt:  MIPPKMLPRTASPPAVESSIHRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSI------------SETSRFRKRVVRTELR

Query:  DESVMEEEECERFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERA
         E   E+E C  F  ESV    P+V VGGGVGSGG  GGGENWFGD  GDSGRG+GSM+EYYQKMIEAYP DALILSNYARFLKEVK D VKAEEYCERA
Subjt:  DESVMEEEECERFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERA

Query:  ILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        ILAKPNDGN LSLYGDLIWQ H D DRA+TYF QA++S+P+D
Subjt:  ILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

XP_038896675.1 uncharacterized protein LOC120084935 isoform X2 [Benincasa hispida]; E-value: 8.2e-64; identity: 62.4%
Query:  MIPPKMLPRTASPPAVESSIHRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSI------------SETSRFRKRVVRTELR
        MI P MLP T S    +  IHRS PG DL PI+RF  IR   MSVSCGS + R SM    NS +SFS+ + +            SE S+F K V++ E+ 
Subjt:  MIPPKMLPRTASPPAVESSIHRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSI------------SETSRFRKRVVRTELR

Query:  DESVMEEEECERFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERA
         E   E+E C  F  ESV    P+V VGGGVGSGG  GGGENWFGD  GDSGRG+GSM+EYYQKMIEAYP DALILSNYARFLKEVK D VKAEEYCERA
Subjt:  DESVMEEEECERFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERA

Query:  ILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        ILAKPNDGN LSLYGDLIWQ H D DRA+TYF QA++S+P+D
Subjt:  ILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

TrEMBL top hits (E-value, % identity, alignment)
A0A1R3IEX9 Tetratricopeptide-like helical; E-value: 1.9e-34; identity: 42.42%
Query:  MLPRTASPPAVESSIHRSWPGRDLAPIARFL----TIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVVRTELRDES--------VM
        ML R+AS P + S I  S   ++ +P   FL      R+   +++C S     SMSM S  E S  ++ ++SET      V + +  +++        V 
Subjt:  MLPRTASPPAVESSIHRSWPGRDLAPIARFL----TIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVVRTELRDES--------VM

Query:  EEEECERFATESVEAKAPDVLV---------------------------GGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSN
        EE+E E      VE +  D  +                           GGG G GG DGG   W   GY DS  GN S E YYQKMIEA P + L+LSN
Subjt:  EEEECERFATESVEAKAPDVLV---------------------------GGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSN

Query:  YARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        YARFLKEV+GD VKAEEYC RAILA PNDGN LS+YGDLIW+   D  RA+TYFDQA+ +AP+D
Subjt:  YARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

A0A4S4F256 Uncharacterized protein; E-value: 1.9e-34; identity: 68.7%
Query:  GGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDR
        GGG G GG  GGG+     GY DS  GN S EEYY KMI+AYP DAL+LSNYA+FLKEV+GD VKAEEYC RAILA P+DGN LSLY DLIWQ H D  R
Subjt:  GGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDR

Query:  AQTYFDQALHSAPHD
        A+TYFDQA+ +AP D
Subjt:  AQTYFDQALHSAPHD

A0A6J0ZSR2 uncharacterized protein LOC110411484; E-value: 5.0e-35; identity: 45.42%
Query:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVV---------RTELRDESVMEE
        ML R+AS P + S + H   P  +     +    R+   +VSC S     S+S+ S+ + S  ++ ++SET   R  VV            L   SV EE
Subjt:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVV---------RTELRDESVMEE

Query:  EECE---------RFATESVE-------AKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLV
        EE E         R A+  VE               GGG G GG DGG   W   GY DS  GNGS + YYQKMIEA P ++L+LSNYARFLKEV+GD V
Subjt:  EECE---------RFATESVE-------AKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLV

Query:  KAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        KAEEYC RAILA PNDGN LS+Y DLIWQ H D  RA+TYFD+A+ ++PHD
Subjt:  KAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

A0A7J7GYP1 Uncharacterized protein; E-value: 2.5e-34; identity: 67.21%
Query:  LVGGGVGSGGDDGGGENWFGD-----GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQ
        L GGG G GG  GGG +  GD     GY DS  GN S EEYY KMI+AYP DAL+LSNYA+FLKEV+GD VKAEEYC RAILA P+DGN LSLY DLIWQ
Subjt:  LVGGGVGSGGDDGGGENWFGD-----GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQ

Query:  IHGDGDRAQTYFDQALHSAPHD
         H D  RA+TYFDQA+ +AP D
Subjt:  IHGDGDRAQTYFDQALHSAPHD

A0A7J9MDS8 Uncharacterized protein; E-value: 1.9e-34; identity: 44.26%
Query:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVVRTELRDESVM-----EEEECE
        M+ R+AS P + S + H   P  DL    +    RT F +VSC S    IS   + +S R  + +VS ++        ++T  ++  ++     EEEE E
Subjt:  MLPRTASPPAVESSI-HRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVVRTELRDESVM-----EEEECE

Query:  ---------RFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGD----GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCE
                 R ++  VE +      GG +  GG  GG +   G     GY DS  G  S + YYQKMI+A P ++L+LSNYARFLKEV+GDLV+AEEYC 
Subjt:  ---------RFATESVEAKAPDVLVGGGVGSGGDDGGGENWFGD----GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCE

Query:  RAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        RAILA PNDGN LS+Y DLIWQ H DG RAQTYFDQA+ SAP D
Subjt:  RAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

SwissProt top hits (E-value, % identity, alignment)
No hits found

Arabidopsis top hits (E-value, % identity, alignment)
AT1G04530.1 Tetratricopeptide repeat (TPR)-like superfamily protein; E-value: 4.0e-16; identity: 43.75%
Query:  GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        GY D   G     +YY+ M+E YP   L+L NYA+FL E KGDL  AEEY  +  + +P+DG AL+ YG L+ ++H D  +A +YF++A+ ++P D
Subjt:  GYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

AT1G80130.1 Tetratricopeptide repeat (TPR)-like superfamily protein; E-value: 1.0e-24; identity: 50.41%
Query:  DVLVGGGVGSGGDDGGGENWFGDGYG----DSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIW
        + LV GG G  G  GG     G G G    D GR   + + YY++MI++ P ++L+  NYA+FLKEVKGD+ KAEEYCERAIL   NDGN LSLY DLI 
Subjt:  DVLVGGGVGSGGDDGGGENWFGDGYG----DSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIW

Query:  QIHGDGDRAQTYFDQALHSAPHD
          H D  RA +Y+ QA+  +P D
Subjt:  QIHGDGDRAQTYFDQALHSAPHD

AT4G17940.1 Tetratricopeptide repeat (TPR)-like superfamily protein; E-value: 2.0e-20; identity: 33.62%
Query:  MLPRTASPPAVESSIHRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVS-ISETSRFRKRV------VRTELRDESVMEEEEC
        +L RT S P +++ +      R + PI+R  ++ +   S        +IS+ + +N      +S S +  + R  KRV       R    DE+  EE   
Subjt:  MLPRTASPPAVESSIHRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVS-ISETSRFRKRV------VRTELRDESVMEEEEC

Query:  ERFATESVEAKAP---DVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPND
               +    P       GGGVG G    GG   +G+G G        + +YY++M+ + P ++L+L NY +FL EV+ D   AEEY  RAIL  P D
Subjt:  ERFATESVEAKAP---DVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPND

Query:  GNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        G ALS+YG LIW+   D  RAQ YFDQA++++P+D
Subjt:  GNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD

AT4G32340.1 Tetratricopeptide repeat (TPR)-like superfamily protein; E-value: 2.8e-30; identity: 59.17%
Query:  VLVGGGVGSGGDDGGGENWFGDGY-GDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKP-NDGNALSLYGDLIWQIH
        VL G  +  GG +GG     GDG  G  G G GS++ YY++MI+ YP D L+LSNYARFLKEVKGD  KAEEYCERA+L++   DG  LS+YGDLIW+ H
Subjt:  VLVGGGVGSGGDDGGGENWFGDGY-GDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKP-NDGNALSLYGDLIWQIH

Query:  GDGDRAQTYFDQALHSAPHD
        GDG RAQ+Y+DQA+ S+P D
Subjt:  GDGDRAQTYFDQALHSAPHD

AT5G20190.1 Tetratricopeptide repeat (TPR)-like superfamily protein; E-value: 9.4e-26; identity: 38.43%
Query:  MLPRTASPPAVESSIHRSWPGRDLAPIARFLTI----RTEFMSVSCGSPS---RRISMSMASNSERSFSVSVSISETSRF--RKRVVRTELRDESVMEE-
        ML R+AS P + S +H S P RD +PI    ++    R   +++S  S S     +S+  + +S R    + S S+       K  V   L   ++ME+ 
Subjt:  MLPRTASPPAVESSIHRSWPGRDLAPIARFLTI----RTEFMSVSCGSPS---RRISMSMASNSERSFSVSVSISETSRF--RKRVVRTELRDESVMEE-

Query:  EECERFA---TESVE----AKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERA
        +E   F    T S +    A   D  V GG G G   GGG+   G        G+ + + +Y+KMIEA P + + LSNYA+FLKEV+ D +KAEEYC RA
Subjt:  EECERFA---TESVE----AKAPDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERA

Query:  ILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD
        IL  PNDGN L++Y +L+W+IH D  RA+ YF+QA+ +AP D
Subjt:  ILAKPNDGNALSLYGDLIWQIHGDGDRAQTYFDQALHSAPHD


Sequences
CDS sequence
ATGATTCCGCCTAAAATGCTTCCGCGAACTGCTTCGCCGCCGGCCGTGGAGTCATCCATCCACCGATCATGGCCGGGACGTGATCTGGCTCCGATCGCGCGGTTTCTGAC
GATCAGAACCGAGTTTATGTCCGTCTCATGTGGCTCACCCTCCCGTAGAATCTCCATGAGTATGGCTTCTAATTCTGAAAGATCGTTCTCGGTTTCGGTTTCGATTTCGG
AAACGTCGCGATTCAGGAAGCGTGTGGTGAGAACAGAGCTGAGAGACGAATCGGTGATGGAGGAAGAGGAATGCGAACGTTTCGCGACGGAATCCGTGGAGGCGAAGGCG
CCGGATGTTCTTGTTGGTGGGGGAGTTGGGAGCGGCGGCGACGACGGCGGCGGCGAAAACTGGTTCGGCGATGGATATGGAGATTCAGGTAGAGGAAATGGAAGCATGGA
GGAGTACTATCAGAAGATGATCGAAGCTTATCCTTGCGACGCTCTAATTCTAAGCAATTACGCAAGATTTCTGAAGGAGGTGAAAGGGGATTTAGTGAAAGCAGAAGAGT
ACTGTGAGAGAGCGATTTTGGCGAAACCAAATGACGGCAATGCTCTCTCTCTCTATGGAGATTTGATTTGGCAGATTCATGGAGATGGAGATCGTGCTCAGACCTACTTC
GATCAAGCTCTCCATTCAGCCCCTCATGACAGG
mRNA sequence
ATGATTCCGCCTAAAATGCTTCCGCGAACTGCTTCGCCGCCGGCCGTGGAGTCATCCATCCACCGATCATGGCCGGGACGTGATCTGGCTCCGATCGCGCGGTTTCTGAC
GATCAGAACCGAGTTTATGTCCGTCTCATGTGGCTCACCCTCCCGTAGAATCTCCATGAGTATGGCTTCTAATTCTGAAAGATCGTTCTCGGTTTCGGTTTCGATTTCGG
AAACGTCGCGATTCAGGAAGCGTGTGGTGAGAACAGAGCTGAGAGACGAATCGGTGATGGAGGAAGAGGAATGCGAACGTTTCGCGACGGAATCCGTGGAGGCGAAGGCG
CCGGATGTTCTTGTTGGTGGGGGAGTTGGGAGCGGCGGCGACGACGGCGGCGGCGAAAACTGGTTCGGCGATGGATATGGAGATTCAGGTAGAGGAAATGGAAGCATGGA
GGAGTACTATCAGAAGATGATCGAAGCTTATCCTTGCGACGCTCTAATTCTAAGCAATTACGCAAGATTTCTGAAGGAGGTGAAAGGGGATTTAGTGAAAGCAGAAGAGT
ACTGTGAGAGAGCGATTTTGGCGAAACCAAATGACGGCAATGCTCTCTCTCTCTATGGAGATTTGATTTGGCAGATTCATGGAGATGGAGATCGTGCTCAGACCTACTTC
GATCAAGCTCTCCATTCAGCCCCTCATGACAGG
Protein sequence
MIPPKMLPRTASPPAVESSIHRSWPGRDLAPIARFLTIRTEFMSVSCGSPSRRISMSMASNSERSFSVSVSISETSRFRKRVVRTELRDESVMEEEECERFATESVEAKA
PDVLVGGGVGSGGDDGGGENWFGDGYGDSGRGNGSMEEYYQKMIEAYPCDALILSNYARFLKEVKGDLVKAEEYCERAILAKPNDGNALSLYGDLIWQIHGDGDRAQTYF
DQALHSAPHDR