CuGenDBv2

Moc03g19310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g19310
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:13003583..13006284
RNA-Seq Expression: Moc03g19310
Synteny: Moc03g19310
Gene Ontology terms: NA
InterPro domains: NA
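
The genome location above is written as `chr:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the string can be parsed and the span length computed as follows. `parse_location` is a hypothetical helper for illustration, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:13003583..13006284")
# Assumption: 1-based inclusive coordinates, so the length is end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

If the database instead used 0-based half-open coordinates, the length would be `end - start`, so the convention is worth confirming before relying on the number.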


Homology
GenBank top hits (e value, % identity, alignment)
CAA7054848.1 unnamed protein product [Microthlaspi erraticum]; e value: 1.8e-12; identity: 38.3%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        SL   +  D W++DSGAS+HVC     F+ + P + +T++L N T V I   G IH+T  L+L DVL V +F +NL+SVS L + L  S  F  T C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL
          S   MIG   L  +  V          S + SF  S L+
Subjt:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL

CAA7061254.1 unnamed protein product [Microthlaspi erraticum]; e value: 1.8e-12; identity: 38.3%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        SL   +  D W++DSGAS+HVC     F+ + P + +T++L N T V I   G IH+T  L+L DVL V +F +NL+SVS L + L  S  F  T C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL
          S   MIG   L  +  V          S + SF  S L+
Subjt:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL

KAG7544005.1 Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa]; e value: 2.3e-12; identity: 43.12%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        +L S + H  W++DSGA++HVC    FF   +  SG+T+SL N T V I   G + ++S LVL DVL V SF +NL+SVS+L K    S  F   +C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIG
              MIG
Subjt:  NKSSLKMIG

KAG7551573.1 Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa]; e value: 1.8e-12; identity: 41.28%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        SL S + +  W+++SGA++HVC     F+   P +G+T+SL N T V I  IG +H+++ L+L DVL+VS+F +NLLS+S+L K  + S  F    C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIG
        + +   MIG
Subjt:  NKSSLKMIG

KAG7578814.1 GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa]; e value: 1.0e-12; identity: 42.2%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        SL S + +  W++DSGA++HVC     F+   P +G+T+SL N T V I   G +H+++ L+L DVL+VS+F +NLLSVS+L K  + S  F    C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIG
        + +   MIG
Subjt:  NKSSLKMIG

TrEMBL top hits (e value, % identity, alignment)
A0A2K3PAA5 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.5e-12; identity: 40%
Query:  ASLTSSSSHVAGTGNVSLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNK
        AS +S +    GT +++ LS      W++DSGA+ H CY  + F+       I + L N + V  + IGDIH+T+ LVL +VL++  F YNL+SVS +  
Subjt:  ASLTSSSSHVAGTGNVSLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNK

Query:  GLSFSVLFADTYCVIHNKSSLKMIG
         L+ +  FA   C IHN S  KMIG
Subjt:  GLSFSVLFADTYCVIHNKSSLKMIG

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8; e value: 7.2e-12; identity: 34.23%
Query:  VNTDSVKSTTSGHTSSESLTSLASLTSSSSHVAGTGNVSLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELV
        +N     +TT+  T++ ++T  + + + +SH          +   HD W++DSGAS H+C+ +  F +    + + + L N   ++++ IGDI +   L 
Subjt:  VNTDSVKSTTSGHTSSESLTSLASLTSSSSHVAGTGNVSLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELV

Query:  LKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIHNKSSLKMIGIA
        LKDVLFVS F YNL+SVS L    + S+ F  T C+I + S   MIG A
Subjt:  LKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIHNKSSLKMIGIA

A0A6D2L0S7 Integrase catalytic domain-containing protein; e value: 8.5e-13; identity: 38.3%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        SL   +  D W++DSGAS+HVC     F+ + P + +T++L N T V I   G IH+T  L+L DVL V +F +NL+SVS L + L  S  F  T C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL
          S   MIG   L  +  V          S + SF  S L+
Subjt:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL

A0A6D2LAD7 gag_pre-integrs domain-containing protein; e value: 8.5e-13; identity: 38.3%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        SL   +  D W++DSGAS+HVC     F+ + P + +T++L N T V I   G IH+T  L+L DVL V +F +NL+SVS L + L  S  F  T C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL
          S   MIG   L  +  V          S + SF  S L+
Subjt:  NKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALL

Q9SN55 Putative retrotransposon polyprotein; e value: 1.2e-11; identity: 43.12%
Query:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH
        SL + +  D W++DSGAS+HVC     F  ++  SG+T++L N T VAI   G I +TS L+L +VL V  F +NL+SV  L K LS+S  F    C I 
Subjt:  SLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQTHVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIH

Query:  NKSSLKMIG
          +   MIG
Subjt:  NKSSLKMIG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found
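
The % identity values reported for the hits summarize the pairwise alignments. As a rough sketch, assuming the usual BLAST-style layout in which the middle line repeats a letter where query and subject residues are identical and shows `+` for a conservative substitution, identity can be estimated from a midline. The helper name and the toy strings below are illustrative only and are not taken from this record.

```python
def percent_identity(midline, aligned_length):
    """Estimate % identity from a BLAST-style alignment midline.

    Letters in the midline mark identical residues; '+' and spaces do not count.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example (hypothetical strings): 4 identical positions out of 6 aligned.
toy_query = "SLKMIG"
toy_midline = "S+ MIG"
print(round(percent_identity(toy_midline, len(toy_query)), 2))
```

The values listed by the database may be computed over the full high-scoring segment, including gaps, so a midline copied from this page will not necessarily reproduce them exactly.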

Sequences
CDS sequence
ATGAGACAAATTGGAAGGGCGGCAACAGAACTGTTGCAGAGGAGTAGAGATTCTGGCAACAAGCAAGGGTCTTCTAACCCTAAAGTCAATACTGACTCTGTTAAGTCTAC
TACTTCAGGCCATACTTCATCGGAGTCCTTGACTTCTCTTGCTTCCCTGACCTCTAGTTCATCCCATGTTGCAGGTACTGGTAATGTATCTTTGTTGTCCACTGTCTTAC
ATGATCATTGGGTGGTTGATTCGGGGGCGTCCACTCATGTTTGTTATATGCAAGATTTTTTTACTTCTGTTATGCCAGCTTCTGGAATTACTTTGTCATTGTCGAATCAG
ACTCATGTTGCAATAGAATTTATTGGTGATATACATGTTACCTCAGAACTTGTTCTCAAGGATGTTCTGTTTGTGTCGTCTTTTTGTTATAATTTGCTCTCTGTCAGTGC
TTTAAATAAGGGATTGTCTTTCTCGGTACTTTTTGCTGACACTTATTGTGTTATTCACAACAAGTCTTCTTTGAAAATGATTGGCATAGCTGATCTTTGTGGTTCTACTT
CTGTTGGACAAGGTCAACATGATCATAATATTGGTTCTGATATTGCTTCTTTTGATGCTTCTGCTTTGCTCCTTATTGAACATATTGTTGCACATTCTCCTCCCAGTGAG
AATGTTGTTGTGCCTACTATTGTCCAGTCCCACGACGCAGGAGCGGGATTCTCTAAAAAACCAAGGTCGCGGGTGGCCAAGAGGGCCCACTTGGAGACTTGGGATCTCAG
CAAGAAATGCAAAGGTTGCAACGATGAAAGCCGGTCGACTGATAAGCGAAAGAGAATCTACTAG
mRNA sequence
ATGAGACAAATTGGAAGGGCGGCAACAGAACTGTTGCAGAGGAGTAGAGATTCTGGCAACAAGCAAGGGTCTTCTAACCCTAAAGTCAATACTGACTCTGTTAAGTCTAC
TACTTCAGGCCATACTTCATCGGAGTCCTTGACTTCTCTTGCTTCCCTGACCTCTAGTTCATCCCATGTTGCAGGTACTGGTAATGTATCTTTGTTGTCCACTGTCTTAC
ATGATCATTGGGTGGTTGATTCGGGGGCGTCCACTCATGTTTGTTATATGCAAGATTTTTTTACTTCTGTTATGCCAGCTTCTGGAATTACTTTGTCATTGTCGAATCAG
ACTCATGTTGCAATAGAATTTATTGGTGATATACATGTTACCTCAGAACTTGTTCTCAAGGATGTTCTGTTTGTGTCGTCTTTTTGTTATAATTTGCTCTCTGTCAGTGC
TTTAAATAAGGGATTGTCTTTCTCGGTACTTTTTGCTGACACTTATTGTGTTATTCACAACAAGTCTTCTTTGAAAATGATTGGCATAGCTGATCTTTGTGGTTCTACTT
CTGTTGGACAAGGTCAACATGATCATAATATTGGTTCTGATATTGCTTCTTTTGATGCTTCTGCTTTGCTCCTTATTGAACATATTGTTGCACATTCTCCTCCCAGTGAG
AATGTTGTTGTGCCTACTATTGTCCAGTCCCACGACGCAGGAGCGGGATTCTCTAAAAAACCAAGGTCGCGGGTGGCCAAGAGGGCCCACTTGGAGACTTGGGATCTCAG
CAAGAAATGCAAAGGTTGCAACGATGAAAGCCGGTCGACTGATAAGCGAAAGAGAATCTACTAG
Protein sequence
MRQIGRAATELLQRSRDSGNKQGSSNPKVNTDSVKSTTSGHTSSESLTSLASLTSSSSHVAGTGNVSLLSTVLHDHWVVDSGASTHVCYMQDFFTSVMPASGITLSLSNQ
THVAIEFIGDIHVTSELVLKDVLFVSSFCYNLLSVSALNKGLSFSVLFADTYCVIHNKSSLKMIGIADLCGSTSVGQGQHDHNIGSDIASFDASALLLIEHIVAHSPPSE
NVVVPTIVQSHDAGAGFSKKPRSRVAKRAHLETWDLSKKCKGCNDESRSTDKRKRIY
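
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code and reading frame 1, the function below translates a CDS until the first stop codon. `translate` and the table construction are illustrative and are not the pipeline the database used.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGAGACAAATTGGAAGGGCGGCAACAGAA"
print(translate(cds_start))
```

Joining the CDS lines into one string and passing it to `translate` allows a check against the listed protein sequence. Sequences containing ambiguity codes such as `N` would raise a `KeyError` in this sketch.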