; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001930 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001930
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionNHL domain protein
Genome locationscaffold30:1104812..1105306
RNA-Seq ExpressionMS001930
SyntenyMS001930
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587481.1 hypothetical protein SDJN03_16046, partial [Cucurbita argyrosperma subsp. sororia]1.3e-7180.86Show/hide
Query:  MTRTDAEEPPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SGA
        M    AEEPPCEADK+N+LLPTTSCGC R+SCFGSRR  A GPSWWERIRAS+VH EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS  + A
Subjt:  MTRTDAEEPPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SGA

Query:  TRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD
         RPGKFQYDPLSYAMNFDEGS +IGELDDDIDDF+GFRNFS+RYASIP  +K+ G  V GKD
Subjt:  TRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD

KAG7021468.1 putative alpha,alpha-trehalose-phosphate synthase [UDP-forming] 10 [Cucurbita argyrosperma subsp. argyrosperma]3.2e-7082.28Show/hide
Query:  AEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SGATRPG
        +EEPP CEADK+N+LLPTTSCGC R+SCFGSRR AA GPSWWERIRAS+VH EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS  + A RPG
Subjt:  AEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SGATRPG

Query:  KFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD
        KFQYDPLSYAMNFDEGS +IGELDDDIDDF+GFRNFS+RYASIP  +K+ G  V GKD
Subjt:  KFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD

XP_022928159.1 uncharacterized protein LOC111435061 [Cucurbita moschata]1.1e-7080.98Show/hide
Query:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG
        M    AEEPP CEADK+N+LLPTTSCGC R+SCFGSRR AA GPSWWERIRAS+VH EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS  + 
Subjt:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG

Query:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD
        A RPGKFQYDPLSYAMNFDEGS +IGELDDDIDDF+GFRNFS+RYASIP  +K+ G  V GKD
Subjt:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD

XP_023003975.1 uncharacterized protein LOC111497423 [Cucurbita maxima]1.1e-7080.98Show/hide
Query:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG
        M    AEEPP CEADK+N+LLPTTSC C R+SCFGSRR AA GPSWWERIRAS+VH EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS  + 
Subjt:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG

Query:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD
        A RPGKFQYDPLSYAMNFDEGS +IGELDDDIDDF+GFRNFS+RYASIP  +K+GG  V GKD
Subjt:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD

XP_038878924.1 uncharacterized protein LOC120071012 [Benincasa hispida]1.4e-7080.12Show/hide
Query:  MTRTDA-EEPPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS---S
        M  TD  EE PCEADK+N+LLPT  CGCFR+SCFGSRR A  GPSWWER+RASQVH EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS    
Subjt:  MTRTDA-EEPPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS---S

Query:  GATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDVS
        G  R GKFQYDPLSYAMNFDEGSR+IGELDDDIDDFNG+RNFS+RYASIPAP+K+GG  +  KDVS
Subjt:  GATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDVS

TrEMBL top hitse value%identityAlignment
A0A0A0LUB2 Uncharacterized protein5.0e-6977.84Show/hide
Query:  MTRTDAEE--PPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSG-
        M  TD EE  P C+ADK+N+LLPT  CGCFR+SCFGSRR A  GPSWWERIRASQVH EGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS G 
Subjt:  MTRTDAEE--PPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSG-

Query:  -----------ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGG
                     R GKFQYDPLSYAMNFDEGSR+IGELDDDIDDFNG+RNFS+RYASIPAP+K+GG
Subjt:  -----------ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGG

A0A5D3BK38 Putative Serine/arginine repetitive matrix protein 22.5e-6876.02Show/hide
Query:  TRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSG---
        T  D EE P CEADK+N+LLPT  CGCFR+SCFGSRR A  GPSWWERIRASQVH EGRWW RGV+V+LKLREWSEIVAGPRWKTFIRRFNRNRS G   
Subjt:  TRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSG---

Query:  ------ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDVS
                R GKFQYDPLSYAMNFDEGSR+IGELDDD+DDFNG+RNFS+RYASIPAP+K+GG     KDVS
Subjt:  ------ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDVS

A0A6J1E188 uncharacterized protein LOC1114298652.9e-6978.18Show/hide
Query:  MTRTDAEE-PPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG
        M   DAEE PPCE D +N+LLPTT CGCFR+SC   RR  A GPSWWERIRASQVH EGRWW RGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+  +G
Subjt:  MTRTDAEE-PPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG

Query:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDVS
        A RPGKFQYDPLSYAMNFDEGSR+IGELDDD DDFNG+RNFS+RYASIPAP+K+GG    GKD+S
Subjt:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDVS

A0A6J1EK25 uncharacterized protein LOC1114350615.3e-7180.98Show/hide
Query:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG
        M    AEEPP CEADK+N+LLPTTSCGC R+SCFGSRR AA GPSWWERIRAS+VH EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS  + 
Subjt:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG

Query:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD
        A RPGKFQYDPLSYAMNFDEGS +IGELDDDIDDF+GFRNFS+RYASIP  +K+ G  V GKD
Subjt:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD

A0A6J1KTA4 uncharacterized protein LOC1114974235.3e-7180.98Show/hide
Query:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG
        M    AEEPP CEADK+N+LLPTTSC C R+SCFGSRR AA GPSWWERIRAS+VH EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS  + 
Subjt:  MTRTDAEEPP-CEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS--SG

Query:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD
        A RPGKFQYDPLSYAMNFDEGS +IGELDDDIDDF+GFRNFS+RYASIP  +K+GG  V GKD
Subjt:  ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)1.1e-2336.11Show/hide
Query:  PPCEADKTN----ALLPTTSCGCFRVSCFGSRRPAAAGPS-WWERI-RASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSG-----
        P  E D T+    AL     C CF + C  S +P+  G S WW+RI    ++  + RWW RG R   ++REWSE+VAGPRWKT+IRRF R+   G     
Subjt:  PPCEADKTN----ALLPTTSCGCFRVSCFGSRRPAAAGPS-WWERI-RASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSG-----

Query:  ----------------ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDV
                        ++  GKF+YD LSY++NFD+G+ + G  DD+      +R++S R+A+   P+ +  +  F  D+
Subjt:  ----------------ATRPGKFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDV

AT3G48020.1 unknown protein1.7e-2443.54Show/hide
Query:  EADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRN--RSSGATRPGKFQYDPL
        E  + N LL  +SC    V             SWW+RI  +  H E RWW   VR  LK+REWSEIVAGPRWKTFIRRFNR+  R        KF+YDP+
Subjt:  EADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRN--RSSGATRPGKFQYDPL

Query:  SYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAV
        SY ++F++  ++    DDD     G R+FS RYAS+P       A +
Subjt:  SYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAV

AT5G14890.1 NHL domain-containing protein2.4e-2338.55Show/hide
Query:  PPCEADKTN----ALLPTTSCGCFRVSCFGSRRPAAA-GPSWWERIR-ASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNR------SS
        P  E D T+    A+     C CF + C GS +P+   G  WW+RIR   ++  + RWW  G    +K+REWSEIVAGP+WKTFIRRF RN         
Subjt:  PPCEADKTN----ALLPTTSCGCFRVSCFGSRRPAAA-GPSWWERIR-ASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNR------SS

Query:  GATRPG--KFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD
        G  RP    F+YD  SY++NFD+G ++ G  +D+      +R++S R+A+   P+ +  +  F  D
Subjt:  GATRPG--KFQYDPLSYAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKD

AT5G25240.1 unknown protein1.2e-0642.42Show/hide
Query:  QVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSGATRPGKFQYDPLSYAMNFDEG
        Q    G W   G   L  L+E SE +AGP+WK FIR F+  R     R   F YD  +Y++NFD+G
Subjt:  QVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSGATRPGKFQYDPLSYAMNFDEG

AT5G62865.1 unknown protein2.8e-2451.15Show/hide
Query:  CGCFRVSCFGSRRPAAAGPSWWERIRA------SQVHG-EGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSGA--TRPGKFQYDPLSYAMNF
        C CF  S   SR   A G S W RIR       S  HG E RWW   +R  LK+REWSEIVAGPRWKTFIRRFNR+   G       KFQYDPLSY++NF
Subjt:  CGCFRVSCFGSRRPAAAGPSWWERIRA------SQVHG-EGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSGA--TRPGKFQYDPLSYAMNF

Query:  DEGSREIGELDDDIDDFNGFRNFSSRYASIP
        D+   E     D+     G R+FS+R+AS+P
Subjt:  DEGSREIGELDDDIDDFNGFRNFSSRYASIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCGTACCGATGCAGAGGAGCCGCCTTGTGAAGCAGATAAAACCAATGCTCTCTTGCCGACCACCAGCTGCGGCTGCTTCCGTGTTTCGTGCTTCGGATCTCGCCG
TCCGGCCGCCGCCGGGCCGTCCTGGTGGGAGCGGATTCGGGCGTCGCAGGTTCACGGCGAGGGGCGGTGGTGGACGCGAGGCGTCAGAGTTCTCTTGAAGCTCCGTGAGT
GGTCGGAGATTGTCGCTGGGCCGAGATGGAAGACTTTCATCCGCCGATTTAATCGCAATCGGAGTTCTGGTGCTACTAGGCCTGGGAAATTCCAGTACGATCCGTTGAGT
TACGCCATGAATTTCGACGAGGGTTCGAGGGAGATTGGGGAATTGGACGACGACATCGATGACTTCAATGGCTTCCGGAATTTCTCCTCTCGGTACGCTTCGATTCCGGC
GCCGATGAAGTCCGGAGGAGCGGCGGTTTTTGGCAAGGATGTTTCGATTTCAACC
mRNA sequenceShow/hide mRNA sequence
ATGACCCGTACCGATGCAGAGGAGCCGCCTTGTGAAGCAGATAAAACCAATGCTCTCTTGCCGACCACCAGCTGCGGCTGCTTCCGTGTTTCGTGCTTCGGATCTCGCCG
TCCGGCCGCCGCCGGGCCGTCCTGGTGGGAGCGGATTCGGGCGTCGCAGGTTCACGGCGAGGGGCGGTGGTGGACGCGAGGCGTCAGAGTTCTCTTGAAGCTCCGTGAGT
GGTCGGAGATTGTCGCTGGGCCGAGATGGAAGACTTTCATCCGCCGATTTAATCGCAATCGGAGTTCTGGTGCTACTAGGCCTGGGAAATTCCAGTACGATCCGTTGAGT
TACGCCATGAATTTCGACGAGGGTTCGAGGGAGATTGGGGAATTGGACGACGACATCGATGACTTCAATGGCTTCCGGAATTTCTCCTCTCGGTACGCTTCGATTCCGGC
GCCGATGAAGTCCGGAGGAGCGGCGGTTTTTGGCAAGGATGTTTCGATTTCAACC
Protein sequenceShow/hide protein sequence
MTRTDAEEPPCEADKTNALLPTTSCGCFRVSCFGSRRPAAAGPSWWERIRASQVHGEGRWWTRGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSSGATRPGKFQYDPLS
YAMNFDEGSREIGELDDDIDDFNGFRNFSSRYASIPAPMKSGGAAVFGKDVSIST