; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004547 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004547
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold995:172666..172989
RNA-Seq ExpressionMS004547
SyntenyMS004547
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591893.1 Acetyltransferase, partial [Cucurbita argyrosperma subsp. sororia]2.8e-3884.16Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD+ATKFLRT+T+A+NN TLINVCLV+SFGALSARSIKQQREIEALEAEKDSLLNSNK+LKK+MWDWKQQL+++AST+SALVPLARIKAIYGE P+ SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

XP_022140674.1 uncharacterized protein LOC111011272 [Momordica charantia]4.0e-4599.01Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEAS+ESALVPLARIKAIYGEVPISSPT
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

XP_022937146.1 uncharacterized protein LOC111443531 [Cucurbita moschata]2.8e-3884.16Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD+ATKFLRT+T+A+NN TLINVCLV+SFGALSARSIKQQREIEALEAEKDSLLNSNK+LKK+MWDWKQQL+++AST+SALVPLARIKAIYGE P+ SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

XP_022976887.1 uncharacterized protein LOC111477117 [Cucurbita maxima]1.2e-3885.15Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD+A+KFLRTLTNA+NN TLINVCLV+SFGALSARSIKQQREIEALEAEKDSLLNSNKALKK+MWDWKQQL++EAST+SAL+PLARIKAIY E P+ SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

XP_038897052.1 uncharacterized protein LOC120085226 [Benincasa hispida]1.2e-3888.12Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD+A+KFLRT+TNA+NN TLINVCLV+SFGALSARSIKQQREIEALEAEK SLLNSNKALKK+MWDWKQQLFAEAST+SALVPLARIKAIYGE PI SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A0A0L380 Uncharacterized protein1.9e-3781.73Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD A+KFLR+L NA+N  T+INVCLVVSF AL+ARSIKQ+R+IEALE EK+SLLNSNKALKK+MWDWKQQLFAEASTESALVPLARIKAIYGE PI SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  GNLR
        GN+R
Subjt:  GNLR

A0A1S3B9R7 uncharacterized protein LOC1034875835.2e-3581.19Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD+A+KFLR L+N +N  TLINVCLV SF ALSARSIKQ+R+IEALEAEK+SLL+SNKALKK+MWDWKQQLFAEAST+SALVPLARIKAIYGE PI SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

A0A6J1CGB8 uncharacterized protein LOC1110112721.9e-4599.01Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEAS+ESALVPLARIKAIYGEVPISSPT
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

A0A6J1F9J6 uncharacterized protein LOC1114435311.3e-3884.16Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD+ATKFLRT+T+A+NN TLINVCLV+SFGALSARSIKQQREIEALEAEKDSLLNSNK+LKK+MWDWKQQL+++AST+SALVPLARIKAIYGE P+ SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

A0A6J1IGY3 uncharacterized protein LOC1114771176.0e-3985.15Show/hide
Query:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT
        MD+A+KFLRTLTNA+NN TLINVCLV+SFGALSARSIKQQREIEALEAEKDSLLNSNKALKK+MWDWKQQL++EAST+SAL+PLARIKAIY E P+ SP+
Subjt:  MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPT

Query:  G
        G
Subjt:  G

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48200.1 unknown protein1.9e-2156.38Show/hide
Query:  MATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEAST--ESALVPLARIKAIYGEV
        MA K    L+ A NN  +IN CL VSF  L  RS KQQ+ +EAL  +K+SL  SNKA+K +MW+WKQQLFAEA++   +A+VPL+ +KAIYGEV
Subjt:  MATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEAST--ESALVPLARIKAIYGEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGCGACCAAATTTCTACGAACTTTAACGAATGCCTCCAATAACAAAACCTTAATTAACGTTTGCCTCGTCGTTTCGTTTGGGGCACTGAGCGCCAGGTCGAT
CAAGCAGCAGAGGGAAATTGAAGCTCTGGAGGCCGAGAAAGATTCCCTCCTCAATTCCAATAAAGCACTGAAGAAATCCATGTGGGATTGGAAGCAACAGTTGTTCGCTG
AAGCTTCCACCGAGTCCGCTTTGGTCCCTCTCGCCAGGATCAAAGCTATCTATGGCGAAGTTCCGATATCATCCCCTACTGGTAATTTACGCTTTCCTTTTCTC
mRNA sequenceShow/hide mRNA sequence
ATGGACATGGCGACCAAATTTCTACGAACTTTAACGAATGCCTCCAATAACAAAACCTTAATTAACGTTTGCCTCGTCGTTTCGTTTGGGGCACTGAGCGCCAGGTCGAT
CAAGCAGCAGAGGGAAATTGAAGCTCTGGAGGCCGAGAAAGATTCCCTCCTCAATTCCAATAAAGCACTGAAGAAATCCATGTGGGATTGGAAGCAACAGTTGTTCGCTG
AAGCTTCCACCGAGTCCGCTTTGGTCCCTCTCGCCAGGATCAAAGCTATCTATGGCGAAGTTCCGATATCATCCCCTACTGGTAATTTACGCTTTCCTTTTCTC
Protein sequenceShow/hide protein sequence
MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKALKKSMWDWKQQLFAEASTESALVPLARIKAIYGEVPISSPTGNLRFPFL