; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019406 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019406
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionVQ domain-containing protein
Genome locationscaffold28:155994..156272
RNA-Seq ExpressionMS019406
SyntenyMS019406
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR008889 - VQ
IPR039607 - VQ motif-containing protein 8/17/18/20/21/25


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043778.1 protein MKS1-like [Cucumis melo var. makuwa]2.0e-2470.59Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV
        M+  AARNQ QLRGPRPPPLTVN +S  I KKST      NRRSP+I+YLRSPKIIHVRPEEFKSFVQRLTGNR SSVAVVAS           SGEEF 
Subjt:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV

Query:  SA
        SA
Subjt:  SA

KAG6596645.1 VQ motif-containing protein 8, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]9.1e-2267.74Show/hide
Query:  SAAARNQ--LQLRGPRPPPLTVNGASVKILKKSTA-------NRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFSGEEFVS
        ++AARNQ  L+LRGPRPPPLTVN +S KI KKST        NRR PVIVYLRSPKIIHVRPEEFKSFVQRLTGN +S   V ASFS    ++
Subjt:  SAAARNQ--LQLRGPRPPPLTVNGASVKILKKSTA-------NRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFSGEEFVS

KAG6690545.1 hypothetical protein I3842_10G021000 [Carya illinoinensis]2.0e-1360.56Show/hide
Query:  QLRGPRPPPLTVNGASVKILKKSTANR-RSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS
        QL+GPRP P+TV+ +S+KI K   + + RSPVI+YL+SPK+IHVRPEEF   VQRLTGN+  SVA  +S+S
Subjt:  QLRGPRPPPLTVNGASVKILKKSTANR-RSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS

KGN59247.1 hypothetical protein Csa_002000 [Cucumis sativus]6.9e-2266.35Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVN-GASVKILKKSTAN----------RRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS-------FSGE
        ++SAAARN  QLRGPRPPPLTVN  +S  I KKST N          RRSP+I+YLRSPK+IHVRPEEFKSFVQRLTGNR SSVAVVAS        + E
Subjt:  MSSAAARNQLQLRGPRPPPLTVN-GASVKILKKSTAN----------RRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS-------FSGE

Query:  EFVS
        EFVS
Subjt:  EFVS

XP_008443599.1 PREDICTED: protein MKS1-like [Cucumis melo]2.0e-2470.59Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV
        M+  AARNQ QLRGPRPPPLTVN +S  I KKST      NRRSP+I+YLRSPKIIHVRPEEFKSFVQRLTGNR SSVAVVAS           SGEEF 
Subjt:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV

Query:  SA
        SA
Subjt:  SA

TrEMBL top hitse value%identityAlignment
A0A0A0LEW5 VQ domain-containing protein3.4e-2266.35Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVN-GASVKILKKSTAN----------RRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS-------FSGE
        ++SAAARN  QLRGPRPPPLTVN  +S  I KKST N          RRSP+I+YLRSPK+IHVRPEEFKSFVQRLTGNR SSVAVVAS        + E
Subjt:  MSSAAARNQLQLRGPRPPPLTVN-GASVKILKKSTAN----------RRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS-------FSGE

Query:  EFVS
        EFVS
Subjt:  EFVS

A0A1S3B975 protein MKS1-like9.4e-2570.59Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV
        M+  AARNQ QLRGPRPPPLTVN +S  I KKST      NRRSP+I+YLRSPKIIHVRPEEFKSFVQRLTGNR SSVAVVAS           SGEEF 
Subjt:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV

Query:  SA
        SA
Subjt:  SA

A0A2N9GB07 VQ domain-containing protein1.3e-1356.1Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKSTAN--RRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS
        M S   +  LQL+GPRP PLTVN +S KI K+   N  R SPVI+YL+SPK+IHVRPEEF   VQ+LTGN+  S     S S
Subjt:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKSTAN--RRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS

A0A5D3DP92 Protein MKS1-like9.4e-2570.59Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV
        M+  AARNQ QLRGPRPPPLTVN +S  I KKST      NRRSP+I+YLRSPKIIHVRPEEFKSFVQRLTGNR SSVAVVAS           SGEEF 
Subjt:  MSSAAARNQLQLRGPRPPPLTVNGASVKILKKST-----ANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS----------FSGEEFV

Query:  SA
        SA
Subjt:  SA

I1NAX0 VQ domain-containing protein1.8e-1255.7Show/hide
Query:  MSSAAARNQLQLRGPRPPPLTVNGASVKILK-KSTANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS
        M+S AA    QL+GP+P  L +N  S KI K K   +  SPVIV+L+SPK+IHVRPEEF S VQ+LTGN  S+ AV A+
Subjt:  MSSAAARNQLQLRGPRPPPLTVNGASVKILK-KSTANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVAS

SwissProt top hitse value%identityAlignment
F4HWF9 Nuclear speckle RNA-binding protein B1.4e-0434.09Show/hide
Query:  GPRPPPLTVNGASVKILKK---------------------STANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS
        GP+P PL V G S KI+KK                      +     PV +Y  +P+IIH  P  F + VQRLTG  ++S    +S S
Subjt:  GPRPPPLTVNGASVKILKK---------------------STANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS

Q8LGD5 Protein MKS12.5e-0638.78Show/hide
Query:  RNQLQLRGPRPPPLTVNGASVKILK-----KSTANR--------RSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFSGEEFVSAGLPS
        + QLQ+ GPRP PL+V+  S KI K         NR        R PV++Y  SPK++H    EF + VQRLTG  +S V + +   G+   +A L S
Subjt:  RNQLQLRGPRPPPLTVNGASVKILK-----KSTANR--------RSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFSGEEFVSAGLPS

Q9CA36 VQ motif-containing protein 8, chloroplastic1.9e-0645.31Show/hide
Query:  QLRGPRPPPLTVNGASVKILKKSTA-------NRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG
        ++ G RP  L + G S  I K S+         R SPVI+Y  SPK+IH R E+F + VQRLTG
Subjt:  QLRGPRPPPLTVNGASVKILKKSTA-------NRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG

Q9LS54 VQ motif-containing protein 202.1e-0545Show/hide
Query:  PRPPPLTVNGASVKILK-------KSTANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG
        P PP L VN  S  I K        S A  R PVI+Y  +P+IIH  P++F + VQ+LTG
Subjt:  PRPPPLTVNGASVKILK-------KSTANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG

Arabidopsis top hitse value%identityAlignment
AT1G21320.1 nucleotide binding;nucleic acid binding9.7e-0634.09Show/hide
Query:  GPRPPPLTVNGASVKILKK---------------------STANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS
        GP+P PL V G S KI+KK                      +     PV +Y  +P+IIH  P  F + VQRLTG  ++S    +S S
Subjt:  GPRPPPLTVNGASVKILKK---------------------STANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS

AT1G21326.1 VQ motif-containing protein5.1e-0738.64Show/hide
Query:  GPRPPPLTVNGASVKILKK---------------------STANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS
        GPRP PL V G S KI+KK                      +     PVI+Y  SP+IIH  P  F + VQRLTG +TS+    +S+S
Subjt:  GPRPPPLTVNGASVKILKK---------------------STANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFS

AT1G68450.1 VQ motif-containing protein1.4e-0745.31Show/hide
Query:  QLRGPRPPPLTVNGASVKILKKSTA-------NRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG
        ++ G RP  L + G S  I K S+         R SPVI+Y  SPK+IH R E+F + VQRLTG
Subjt:  QLRGPRPPPLTVNGASVKILKKSTA-------NRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG

AT3G18360.1 VQ motif-containing protein1.5e-0645Show/hide
Query:  PRPPPLTVNGASVKILK-------KSTANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG
        P PP L VN  S  I K        S A  R PVI+Y  +P+IIH  P++F + VQ+LTG
Subjt:  PRPPPLTVNGASVKILK-------KSTANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTG

AT3G18690.1 MAP kinase substrate 11.8e-0738.78Show/hide
Query:  RNQLQLRGPRPPPLTVNGASVKILK-----KSTANR--------RSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFSGEEFVSAGLPS
        + QLQ+ GPRP PL+V+  S KI K         NR        R PV++Y  SPK++H    EF + VQRLTG  +S V + +   G+   +A L S
Subjt:  RNQLQLRGPRPPPLTVNGASVKILK-----KSTANR--------RSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFSGEEFVSAGLPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAACAATGAGCTCCGCCGCCGCGAGGAACCAGTTGCAATTGCGCGGTCCAAGGCCGCCGCCGTTGACGGTGAACGGAGCCTCCGTCAAGATCCTCAAGAAATCGACGGC
GAATCGCCGGTCTCCGGTCATCGTGTACCTCCGATCGCCGAAGATTATCCACGTCCGGCCTGAGGAGTTCAAGAGCTTCGTTCAACGCCTCACCGGAAACCGAACCTCCT
CCGTCGCCGTCGTCGCATCGTTTTCCGGTGAGGAATTCGTATCCGCCGGCTTGCCGTCG
mRNA sequenceShow/hide mRNA sequence
GAAACAATGAGCTCCGCCGCCGCGAGGAACCAGTTGCAATTGCGCGGTCCAAGGCCGCCGCCGTTGACGGTGAACGGAGCCTCCGTCAAGATCCTCAAGAAATCGACGGC
GAATCGCCGGTCTCCGGTCATCGTGTACCTCCGATCGCCGAAGATTATCCACGTCCGGCCTGAGGAGTTCAAGAGCTTCGTTCAACGCCTCACCGGAAACCGAACCTCCT
CCGTCGCCGTCGTCGCATCGTTTTCCGGTGAGGAATTCGTATCCGCCGGCTTGCCGTCG
Protein sequenceShow/hide protein sequence
ETMSSAAARNQLQLRGPRPPPLTVNGASVKILKKSTANRRSPVIVYLRSPKIIHVRPEEFKSFVQRLTGNRTSSVAVVASFSGEEFVSAGLPS