; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015983 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015983
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationscaffold16:248182..248718
RNA-Seq ExpressionMS015983
SyntenyMS015983
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-3046.15Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L  IF+SR LAQ ++ K+KLH ++KG+  LK+YF  + +C+DALA+  KPVS +DHI+Y+L GLGS+++SMISVI+A++   +VQEVM+LLLTQE++NES
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHL-THAVALGKEN---DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY
        K   L S+  LPS ++ T     G E+       N   + S+N    R  GR NRG +   NR + QCQ+C K G++  +C+
Subjt:  KKSLLNSDGTLPSAHL-THAVALGKEN---DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]1.2e-3549.46Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L  ++TSRNLA+V+++KSKL  ++KGN  LKDYF  VK  +D+LAAAGK V++EDHI+++L GL SEFES +SVI+A++  QT+QEV +LLL+ E RNE 
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY
         ++ +N+DGTLPS +LT            KNS  + S +G      NNR +  GN    + WN+  + QCQ+ GKFGHT L+CY
Subjt:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]1.2e-3549.46Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L  ++TSRNLA+V+++KSKL  ++KGN  LKDYF  VK  +D+LAAAGK V++EDHI+++L GL SEFES +SVI+A++  QT+QEV +LLL+ E RNE 
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY
         ++ +N+DGTLPS +LT            KNS  + S +G      NNR +  GN    + WN+  + QCQ+ GKFGHT L+CY
Subjt:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]9.5e-3650.28Show/hide
Query:  IFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNESKKS
        +F SR LA+V+++K KL   +KGN SLKDYF  +K  +D+LA AGK +S EDHI+++L GLG EF+++ISVITA++ PQT+QEV +LLL QE RNE  ++
Subjt:  IFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNESKKS

Query:  LLNSDGTLPSAHLTHAVALGKEN------DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY
        L+NSDG+LPS +LT   +  K N        P  S  S    GTNNR   R N     W    + QCQ+CG+FGHT L+CY
Subjt:  LLNSDGTLPSAHLTHAVALGKEN------DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY

XP_022158089.1 uncharacterized protein LOC111024658 [Momordica charantia]4.1e-3143.98Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L Q+F ++NL +V+++K++L  L+KG  SLK+Y   +K  +D+L AAGK ++ EDHI+++L GLGSE+ES +SVIT K GP T+Q+V ALLL+ + R E 
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNGT------------NNRFRGRGNR-GGKQWNNRGQIQCQLCGKFGHTTLKCY
        + S    D TLPSA    ++ L  +     N   S+S+N              NNR RGR +R GG++WN+R +IQCQ+C +FGHT  + Y
Subjt:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNGT------------NNRFRGRGNR-GGKQWNNRGQIQCQLCGKFGHTTLKCY

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-945.8e-3146.15Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L  IF+SR LAQ ++ K+KLH ++KG+  LK+YF  + +C+DALA+  KPVS +DHI+Y+L GLGS+++SMISVI+A++   +VQEVM+LLLTQE++NES
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHL-THAVALGKEN---DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY
        K   L S+  LPS ++ T     G E+       N   + S+N    R  GR NRG +   NR + QCQ+C K G++  +C+
Subjt:  KKSLLNSDGTLPSAHL-THAVALGKEN---DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY

A0A6J1C6N9 dr1-associated corepressor homolog isoform X16.0e-3649.46Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L  ++TSRNLA+V+++KSKL  ++KGN  LKDYF  VK  +D+LAAAGK V++EDHI+++L GL SEFES +SVI+A++  QT+QEV +LLL+ E RNE 
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY
         ++ +N+DGTLPS +LT            KNS  + S +G      NNR +  GN    + WN+  + QCQ+ GKFGHT L+CY
Subjt:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY

A0A6J1C8R2 dr1-associated corepressor homolog isoform X26.0e-3649.46Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L  ++TSRNLA+V+++KSKL  ++KGN  LKDYF  VK  +D+LAAAGK V++EDHI+++L GL SEFES +SVI+A++  QT+QEV +LLL+ E RNE 
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY
         ++ +N+DGTLPS +LT            KNS  + S +G      NNR +  GN    + WN+  + QCQ+ GKFGHT L+CY
Subjt:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNG-----TNNRFRGRGNRG-GKQWNNRGQIQCQLCGKFGHTTLKCY

A0A6J1DLT9 uncharacterized protein LOC1110217574.6e-3650.28Show/hide
Query:  IFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNESKKS
        +F SR LA+V+++K KL   +KGN SLKDYF  +K  +D+LA AGK +S EDHI+++L GLG EF+++ISVITA++ PQT+QEV +LLL QE RNE  ++
Subjt:  IFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNESKKS

Query:  LLNSDGTLPSAHLTHAVALGKEN------DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY
        L+NSDG+LPS +LT   +  K N        P  S  S    GTNNR   R N     W    + QCQ+CG+FGHT L+CY
Subjt:  LLNSDGTLPSAHLTHAVALGKEN------DAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCY

A0A6J1DYD5 uncharacterized protein LOC1110246582.0e-3143.98Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES
        L Q+F ++NL +V+++K++L  L+KG  SLK+Y   +K  +D+L AAGK ++ EDHI+++L GLGSE+ES +SVIT K GP T+Q+V ALLL+ + R E 
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNES

Query:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNGT------------NNRFRGRGNR-GGKQWNNRGQIQCQLCGKFGHTTLKCY
        + S    D TLPSA    ++ L  +     N   S+S+N              NNR RGR +R GG++WN+R +IQCQ+C +FGHT  + Y
Subjt:  KKSLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNGT------------NNRFRGRGNR-GGKQWNNRGQIQCQLCGKFGHTTLKCY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.9e-0526.32Show/hide
Query:  FTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENR--NESKK
        F +   A+ +++ S+L T   G+  + DY+  +KK  D+L     PV+  + ++Y+L GL  +F+++I+VI  +    +  +   +L  +E+R     K 
Subjt:  FTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENR--NESKK

Query:  SLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNGT---NNRFRGRGNR
        +  + D +  S  L  + A    N   ++    + + G    NN FRGRG R
Subjt:  SLLNSDGTLPSAHLTHAVALGKENDAPKNSAPSLSHNGT---NNRFRGRGNR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.4e-0627.11Show/hide
Query:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENR--N
        L  +F     A+ ++ +++L T    + S+ +Y   +K   D L     P+S    +++LL GL  +++ +++VI  KS   +  E  ++LL +E+R  N
Subjt:  LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENR--N

Query:  ESKKSLLNSDGTLPSAHLTHAVALGKENDAPKNSA--PSLSHNGTNNRFRGRG---NRGGKQWNNR
        +SK SL         +H  H          P+     P   HN  +N  RGR    NRGG   + R
Subjt:  ESKKSLLNSDGTLPSAHLTHAVALGKENDAPKNSA--PSLSHNGTNNRFRGRG---NRGGKQWNNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTTCCACAAATTTTCACATCTAGAAACCTAGCGCAAGTAATTAAGATTAAATCAAAATTACATACTCTACAAAAGGGAAATTCTTCATTGAAAGACTATTTTTCACCAGT
TAAAAAGTGCATTGATGCTCTAGCAGCAGCCGGTAAACCCGTGTCTATAGAGGATCATATTGTATATCTATTATTTGGCCTAGGATCTGAATTTGAGTCGATGATATCTG
TTATTACTGCAAAATCTGGTCCTCAAACAGTGCAGGAAGTAATGGCTCTATTATTGACACAAGAAAATAGGAATGAAAGCAAGAAGAGTCTACTTAATTCTGACGGTACT
CTTCCTTCTGCTCACCTTACTCATGCTGTTGCTTTGGGAAAAGAGAATGATGCTCCCAAAAATTCTGCTCCATCCTTGAGTCACAATGGAACAAACAATCGGTTTAGAGG
TCGGGGTAATAGAGGAGGCAAACAGTGGAATAATAGAGGTCAGATTCAATGTCAACTGTGTGGCAAATTTGGTCATACTACTCTAAAGTGCTATTCT
mRNA sequenceShow/hide mRNA sequence
CTTCCACAAATTTTCACATCTAGAAACCTAGCGCAAGTAATTAAGATTAAATCAAAATTACATACTCTACAAAAGGGAAATTCTTCATTGAAAGACTATTTTTCACCAGT
TAAAAAGTGCATTGATGCTCTAGCAGCAGCCGGTAAACCCGTGTCTATAGAGGATCATATTGTATATCTATTATTTGGCCTAGGATCTGAATTTGAGTCGATGATATCTG
TTATTACTGCAAAATCTGGTCCTCAAACAGTGCAGGAAGTAATGGCTCTATTATTGACACAAGAAAATAGGAATGAAAGCAAGAAGAGTCTACTTAATTCTGACGGTACT
CTTCCTTCTGCTCACCTTACTCATGCTGTTGCTTTGGGAAAAGAGAATGATGCTCCCAAAAATTCTGCTCCATCCTTGAGTCACAATGGAACAAACAATCGGTTTAGAGG
TCGGGGTAATAGAGGAGGCAAACAGTGGAATAATAGAGGTCAGATTCAATGTCAACTGTGTGGCAAATTTGGTCATACTACTCTAAAGTGCTATTCT
Protein sequenceShow/hide protein sequence
LPQIFTSRNLAQVIKIKSKLHTLQKGNSSLKDYFSPVKKCIDALAAAGKPVSIEDHIVYLLFGLGSEFESMISVITAKSGPQTVQEVMALLLTQENRNESKKSLLNSDGT
LPSAHLTHAVALGKENDAPKNSAPSLSHNGTNNRFRGRGNRGGKQWNNRGQIQCQLCGKFGHTTLKCYS