CuGenDBv2

MS003695 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS003695
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE2
Genome location: scaffold185:430515..431291
RNA-Seq Expression: MS003695
Synteny: MS003695
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
XP_021717245.1 uncharacterized protein LOC110685089 [Chenopodium quinoa]; e-value: 3.1e-80; % identity: 59.23
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K
        N+++WS  I +ALSSR+KFGFL+GTIT   P  TKDDW+TI+ ML+ WL NTI PEVRSMLS Y++ K LWDDL ERF +VNG RI QLK  +      K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
        ++ V VY+SKL+V WDELDKHE LI C+CGK  C VGK H  RR  DRL QFL GL    Y  LR+T L +DP PSLNRA+Q +AQ+ERVRGI    S+ 
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWYLKRYGS
        +E P+ V F+VR +N  K RL RAE+AAL CTHCHK GH ++TCF  HG P WY+++YGS
Subjt:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWYLKRYGS

XP_021746757.1 uncharacterized protein LOC110712595 [Chenopodium quinoa]; e-value: 4.9e-78; % identity: 58.85
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEIL--VVVNK
        NF++WS TI +ALSSR++FGFL+GTIT   P  TKDDWVTI+ ML+ WL NTIDPEVR+MLSNY + K LWDDL ERF +VNG RI Q+K  +     +K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEIL--VVVNK

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGL--GLYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
        ++ V VYFSKL+V WDELDKHE LI C+ GK TC VGK H  RR  DRL QFL GL    Y  LR+T L +DP P+LNRAFQ +AQ++RVRGI       
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGL--GLYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWYLKRYGS
        +E P+V GF+VR +   K RL RAE+AAL CT+C+K  H S+TCF  HG P WY+++YGS
Subjt:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWYLKRYGS

XP_021756883.1 uncharacterized protein LOC110721955 [Chenopodium quinoa]; e-value: 2.2e-78; % identity: 59.07
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K
        N+++WS  I +ALSSR+KFGFL+GTIT   P  TKDDWVTI+ ML+  L NTIDPEVRS LSNY++ K LWDDL E F +VNG RI Q+K  +      K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGL--GLYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
        ++ V VYFSKL+V WDELDKHE LI+C+CGK TC + K H  RR  DRL QFL GL    Y  LR+T L +DP PSLNRAFQ +AQ+ERVRGI    ++ 
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGL--GLYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWYLKRYG
        +E P+VVGF+ R +N+ K RL RAEKAAL CT+C + GH +STCF  HG P+WY +RYG
Subjt:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWYLKRYG

XP_021837444.1 uncharacterized protein LOC110777138 [Spinacia oleracea]; e-value: 5.7e-66; % identity: 48.43
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K
        N+D+W+ ++++AL SR+KFGF++GTI +    +  +DW TI+ ML+ W+ +TIDPEV+S +S  +D K+LWD+L  RFSV+NG RI QLK  +V     K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
         + ++ YF K+ V WDEL  HE +I+C+CGK  C +GK+HE+RR+D+RLHQFL GL    Y QLR+  L ++P PSL+RA+QQV Q+ER+RGITR     
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWY
        +  P+VVGF++R + R + R ++ +K+ L C+HCH++GH    CF L G P+W+
Subjt:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWY

XP_030505864.1 uncharacterized protein LOC115720531 [Cannabis sativa]; e-value: 1.4e-69; % identity: 51.57
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVV--VNK
        N+D+W+ ++++AL SR+KFGF+NGTI +  P +T++DW TI+ ML+ WL +TID EV+S +S Y+D K+LWD+L  RFSV NG +I QLK  +V    +K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVV--VNK

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
         + V  YF KL V WD+L  HE +++C CGK TC +GK HEKRR ++RLHQFL GL    Y QLR+T L ++P P L+R +QQV Q+E+VRGITR   N 
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWY
        +  P+VVGFSVR + R K + ++ +K+ L C+HCH+SGH  + CF L G PDW+
Subjt:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWY

TrEMBL top hits (e-value, % identity, alignment)
A0A3Q7IBW4 Uncharacterized protein; e-value: 1.6e-61; % identity: 48.91
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEIL--VVVNK
        NFD WS  ++VALSSR+KFGFL+GTI   V   TK+DW+ ++ ML+ WL NTIDPEV+SMLSNY++ K LWDDL+ERF VVNG RI+QLK  +      K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEIL--VVVNK

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGL--GLYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
         + V +Y+ KL V WD+L   + LI+C CGK +C VGK HEKRR DD L QFL GL    Y Q+R+  L +DP PSLN+A+QQV+Q+E VRG+ R     
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGL--GLYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTDNRQ-----------KSRLDRAEK---------AALTCTHCHKSGHCSSTCFILHGTPDWY
        D+    VGF+VR    Q           K      +K         A   CTHC K GH  S C+ L+G P+ Y
Subjt:  DEKPDVVGFSVRTDNRQ-----------KSRLDRAEK---------AALTCTHCHKSGHCSSTCFILHGTPDWY

A0A438D744 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value: 1.8e-54; % identity: 48.02
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVV--NK
        N+D W+  IQ+AL +R+KF FL GTIT   P +T+ DW T+  ML+ W+TNTIDPEV+S LS + D K LW+ L +R+++VNG RI QLK  +     +K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVV--NK

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
        ++ VT Y+ +LNV W+EL KHE LISC C   +C    LH+ RR   +LH FL GL   LY QLRT  L +DP PSL+RA+Q V QDERVR         
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTD-NRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP
        D+  +V+GF+VRT   R + + +R     L C+HC K+GH +STC+ L   P
Subjt:  DEKPDVVGFSVRTD-NRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP

A0A438FPG9 Uncharacterized protein; e-value: 4.1e-54; % identity: 48.02
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K
        N+D W+  IQ+AL +R+KF FL GTIT   P +T+ DW T+  ML+ W+TNTIDPEV+S LS + D K LW+ L +R+++VNG RI QLK  +      K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
        ++ VT Y+ KLNV W+EL KHE LISC C   +C    LH+ RR   +LH FL GL   LY QLRT  L +DP PSL+RA+Q V QDERVR         
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTD-NRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP
        D+  +V+GF+VRT   R + + +R       C+HC K+GH +STC+ L   P
Subjt:  DEKPDVVGFSVRTD-NRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP

A0A438GLW9 Retrovirus-related Pol polyprotein from transposon RE2; e-value: 3.1e-54; % identity: 47.81
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K
        N+D W+  IQ+AL +R+KF FL GTIT   P +T+ DW T+  ML+ W+TNTIDPEV+S LS + D K LW+ L +R+++VNG RI QLK  +      K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVN--K

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
        ++ VT Y+ KLNV W+E  KHE LISC C   +C    LH+ RR   +LH FL GL   LY QLRT  L +DP PSL+RA+Q V QDERVR         
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP
        D+  +V+GF+VRT     +R  R +     C+HC K+GH +STC+ L   P
Subjt:  DEKPDVVGFSVRTDNRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP

A0A438HDC5 Retrovirus-related Pol polyprotein from transposon RE2; e-value: 1.8e-54; % identity: 48.02
Query:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVV--NK
        N+D W+  IQ+AL +R+KF FL GTIT   P +T+ DW T+  ML+ W+TNTIDPEV+S LS + D K LW+ L +R+++VNG RI QLK  +     +K
Subjt:  NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVV--NK

Query:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN
        ++ VT Y+ +LNV W+EL KHE LISC C   +C    LH+ RR   +LH FL GL   LY QLRT  L +DP PSL+RA+Q V QDERVR         
Subjt:  NILVTVYFSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLG--LYGQLRTT-LCRDPFPSLNRAFQQVAQDERVRGITRTGSNN

Query:  DEKPDVVGFSVRTD-NRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP
        D+  +V+GF+VRT   R + + +R     L C+HC K+GH +STC+ L   P
Subjt:  DEKPDVVGFSVRTD-NRQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTP

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
AATTTTGACCAATGGTCTCCTACCATCCAAGTTGCTCTTTCGTCTCGGCAAAAATTTGGCTTTCTTAATGGAACTATCACTAAGGTTGTCCCTCATTGGACAAAA
GATGATTGGGTCACGATTTATTATATGCTTCTTTTATGGCTTACGAATACAATCGACCCTGAGGTACGTTCTATGCTGTCAAATTATAATGATGTCAAGGTGTTG
TGGGATGATTTGAATGAACGATTCTCAGTTGTGAATGGTCATCGTATTTATCAACTAAAGGAGATATTAGTCGTTGTGAACAAAAATATTCTTGTTACTGTCTAC
TTTAGTAAATTGAATGTATCGTGGGATGAGTTAGATAAACACGAGGTATTGATTTCTTGTGAGTGTGGAAAGCGCACTTGTCAAGTTGGAAAACTGCATGAAAAA
CGTCGTTCCGATGATAGACTTCATCAATTTTTATTCGGTTTGGGTTTATATGGTCAGTTGAGGACTACTCTTTGTCGAGACCCCTTTCCATCCTTGAATCGGGCA
TTTCAACAAGTTGCTCAAGATGAGCGGGTTAGAGGGATTACTCGAACTGGATCGAACAATGATGAGAAACCTGATGTAGTTGGCTTTTCTGTGCGCACAGATAAT
AGACAAAAGTCGCGACTGGATCGAGCAGAAAAGGCAGCCTTAACATGTACTCATTGTCACAAATCAGGACATTGCAGTAGTACATGTTTTATTTTACATGGCACT
CCGGATTGGTACTTGAAGAGATACGGGTCCTCT
mRNA sequence
AATTTTGACCAATGGTCTCCTACCATCCAAGTTGCTCTTTCGTCTCGGCAAAAATTTGGCTTTCTTAATGGAACTATCACTAAGGTTGTCCCTCATTGGACAAAA
GATGATTGGGTCACGATTTATTATATGCTTCTTTTATGGCTTACGAATACAATCGACCCTGAGGTACGTTCTATGCTGTCAAATTATAATGATGTCAAGGTGTTG
TGGGATGATTTGAATGAACGATTCTCAGTTGTGAATGGTCATCGTATTTATCAACTAAAGGAGATATTAGTCGTTGTGAACAAAAATATTCTTGTTACTGTCTAC
TTTAGTAAATTGAATGTATCGTGGGATGAGTTAGATAAACACGAGGTATTGATTTCTTGTGAGTGTGGAAAGCGCACTTGTCAAGTTGGAAAACTGCATGAAAAA
CGTCGTTCCGATGATAGACTTCATCAATTTTTATTCGGTTTGGGTTTATATGGTCAGTTGAGGACTACTCTTTGTCGAGACCCCTTTCCATCCTTGAATCGGGCA
TTTCAACAAGTTGCTCAAGATGAGCGGGTTAGAGGGATTACTCGAACTGGATCGAACAATGATGAGAAACCTGATGTAGTTGGCTTTTCTGTGCGCACAGATAAT
AGACAAAAGTCGCGACTGGATCGAGCAGAAAAGGCAGCCTTAACATGTACTCATTGTCACAAATCAGGACATTGCAGTAGTACATGTTTTATTTTACATGGCACT
CCGGATTGGTACTTGAAGAGATACGGGTCCTCT
Protein sequence
NFDQWSPTIQVALSSRQKFGFLNGTITKVVPHWTKDDWVTIYYMLLLWLTNTIDPEVRSMLSNYNDVKVLWDDLNERFSVVNGHRIYQLKEILVVVNKNILVTVY
FSKLNVSWDELDKHEVLISCECGKRTCQVGKLHEKRRSDDRLHQFLFGLGLYGQLRTTLCRDPFPSLNRAFQQVAQDERVRGITRTGSNNDEKPDVVGFSVRTDN
RQKSRLDRAEKAALTCTHCHKSGHCSSTCFILHGTPDWYLKRYGSS
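
The protein sequence listed above appears to be the in-frame translation of the CDS under the standard genetic code. The following is a minimal Python sketch for checking that relationship on a stretch of the CDS. The function name `translate` and the way the codon table is built are illustrative assumptions, not part of CuGenDB; unknown or ambiguous codons are reported as `X`, and a trailing partial codon is ignored.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
# This compact construction is an illustrative choice, not a CuGenDB method.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0 (hypothetical helper).

    Whitespace and line breaks are removed first, so the wrapped CDS
    lines from the record can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First 30 nucleotides of the CDS listed in this record.
    print(translate("AATTTTGACCAATGGTCTCCTACCATCCAA"))
```

Passing the full CDS (all wrapped lines joined) and comparing the result with the protein sequence is one way to confirm that the two entries are consistent.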