; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g12860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g12860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:9900657..9903838
RNA-Seq ExpressionMoc04g12860
SyntenyMoc04g12860
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON63538.1 hypothetical protein PanWU01x14_130120, partial [Parasponia andersonii]3.6e-2644.67Show/hide
Query:  PSIEISSALASRLDLRGGNQ---FVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPD
        P++ + S+ A    L G      FVASP  V DP+WY+D+GA++H+ +D  N+   S+Y GK+KV +GNG  L ISH+GS  I  NN +  L N++ V  
Subjt:  PSIEISSALASRLDLRGGNQ---FVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPD

Query:  IAKNLINVSKPDRDNVVYLEFHVGCCVVKDISTGKVVLKGALKDELYQLE
        I K+L+ VSK   DN V+ EF+   C++KD+S  KV+L+G LK+ LYQL+
Subjt:  IAKNLINVSKPDRDNVVYLEFHVGCCVVKDISTGKVVLKGALKDELYQLE

TYJ99887.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]6.2e-2641.34Show/hide
Query:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEF
        N F  + +   D NWY +N A NH+ ADY N+ N  EYGGK +V +GNG++L I+ +G+S + +   +L L NV+ VP IAKNL++VSK  RD  V++EF
Subjt:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEF

Query:  HVGCCVVKDISTGKVVLKGALKDELYQLE--------------------PTSITERTPKHRRLPYSASNPYQNWSRKCV
        H   C+VKD  T + +LKG LKD+LY LE                    PT I+   P    LP S S+P  + +R  V
Subjt:  HVGCCVVKDISTGKVVLKGALKDELYQLE--------------------PTSITERTPKHRRLPYSASNPYQNWSRKCV

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.9e-2748.06Show/hide
Query:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEF
        NQF A+   V + NWY+D+GA+NH+  +Y+N+ N SEY G +K+++GNG+ L IS++G++++      L+LKNV+CVPDI KNL++VSK  +DN VY+EF
Subjt:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEF

Query:  HVGCCVVKDISTGKVVLKGALKDELYQLE
        H   C +KD  TG+ +L   +KD LY L+
Subjt:  HVGCCVVKDISTGKVVLKGALKDELYQLE

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]4.3e-2755.45Show/hide
Query:  ASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFHVGC
        A+P  V DPNWY+D+GA+NHV  + +N+ N +EY G +KV +GNGN L IS+VG++ +   +  L LKN++CVPDIAKNLI+VSK  +DN +Y+EFH  C
Subjt:  ASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFHVGC

Query:  CVVKDISTGK
        C +KD STGK
Subjt:  CVVKDISTGK

XP_031282138.1 uncharacterized protein LOC116140680 [Pistacia vera]2.8e-2645.03Show/hide
Query:  QFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPS--NNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLE
        Q VA+P+ V D +WY+D GASNH+  D +N+++ S Y GK KV +GNG+ + I+H G   +PS  ++ +L LKN++CVP IAKNL+++S+  +DN V +E
Subjt:  QFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPS--NNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLE

Query:  FHVGCCVVKDISTGKVVLKGALKDELYQLEPTSITERTPKHRRLPYSASNP
        FH   C+VKD S+  V+L+G++K  LYQLE +S           PY++S P
Subjt:  FHVGCCVVKDISTGKVVLKGALKDELYQLEPTSITERTPKHRRLPYSASNP

TrEMBL top hitse value%identityAlignment
A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-949.3e-2848.06Show/hide
Query:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEF
        NQF A+   V + NWY+D+GA+NH+  +Y+N+ N SEY G +K+++GNG+ L IS++G++++      L+LKNV+CVPDI KNL++VSK  +DN VY+EF
Subjt:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEF

Query:  HVGCCVVKDISTGKVVLKGALKDELYQLE
        H   C +KD  TG+ +L   +KD LY L+
Subjt:  HVGCCVVKDISTGKVVLKGALKDELYQLE

A0A803NU85 Uncharacterized protein3.2e-2847.65Show/hide
Query:  VASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFHVG
        VA+P+++ D +WY D+GASNH+ +D   + N SEYGGK+++ +G+G++L I HVG+ F+ S N  L L N++ VP I+KNLI+VSK   DN V +EF   
Subjt:  VASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFHVG

Query:  CCVVKDISTGKVVLKGALKDELYQLEP--TSITERTPKHRRLPYSASNP
         CVVK+  TG+VVL+G LKD LYQL P  +S + ++  H  L + +S P
Subjt:  CCVVKDISTGKVVLKGALKDELYQLEP--TSITERTPKHRRLPYSASNP

A0A803P4G6 Uncharacterized protein1.2e-2748.87Show/hide
Query:  RGGNQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNG-LLHLKNVICVPDIAKNLINVSKPDRDNVV
        +G N F+A+PK++    W+VD+GASNH+ +  N++   SEYGGK+ + +G+G++L ISH+G+ F+ +N+G LL LK ++ VP IAKNLI+V K   DN V
Subjt:  RGGNQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNG-LLHLKNVICVPDIAKNLINVSKPDRDNVV

Query:  YLEFHVGCCVVKDISTGKVVLKGALKDELYQLE
         +EF+   C+VKD +T KV+L+G LKD LYQ++
Subjt:  YLEFHVGCCVVKDISTGKVVLKGALKDELYQLE

A0A803PPY9 Uncharacterized protein1.0e-2947.52Show/hide
Query:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNG-LLHLKNVICVPDIAKNLINVSKPDRDNVVYLE
        N F+A P++V    W+ D+GASNH+ AD + +  + EYGGK+ V +GNGNEL ISH+GS ++ +N G  L LK ++ VP IAKNLI++SK    N + +E
Subjt:  NQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNG-LLHLKNVICVPDIAKNLINVSKPDRDNVVYLE

Query:  FHVGCCVVKDISTGKVVLKGALKDELYQLEPTSITERTPKH
        F+  CC++KD +TGK +L+GALK+ LYQ+   S     P H
Subjt:  FHVGCCVVKDISTGKVVLKGALKDELYQLEPTSITERTPKH

A0A803QD60 Uncharacterized protein1.6e-2735.82Show/hide
Query:  FVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNG-LLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFH
        FVA+P+L+    W+ D+GASNH+ +D  ++    EYGGK+KV +GNG +L ISH+ +  + +++G  L LK ++ VP+IAKNL++VSK   DN V +EF+
Subjt:  FVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNG-LLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFH

Query:  VGCCVVKDISTGKVVLKGALKDELYQLEPTSITERTPKHRRLPYSASNPYQNWSRKCVGRRSLDSRQEVRRGFVGEDDGWMKKEEEVHGVNIEYILGICS
          CCVVKD  T KV+L+G L+D LYQL+       T    ++  +    + ++S   V  +S  ++ ++    V + D W ++        +  +L  C 
Subjt:  VGCCVVKDISTGKVVLKGALKDELYQLEPTSITERTPKHRRLPYSASNPYQNWSRKCVGRRSLDSRQEVRRGFVGEDDGWMKKEEEVHGVNIEYILGICS

Query:  I
        +
Subjt:  I

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-1834.19Show/hide
Query:  PAPISPVQASPSIEISSALASRLDLRGGNQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLK
        P+P +P Q   ++ + S  +S                    NW +D+GA++H+ +D+NN+  H  Y G D V++ +G+ + ISH GS+ + + +  L+L 
Subjt:  PAPISPVQASPSIEISSALASRLDLRGGNQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLK

Query:  NVICVPDIAKNLINVSKPDRDNVVYLEFHVGCCVVKDISTGKVVLKGALKDELYQ
        N++ VP+I KNLI+V +    N V +EF      VKD++TG  +L+G  KDELY+
Subjt:  NVICVPDIAKNLINVSKPDRDNVVYLEFHVGCCVVKDISTGKVVLKGALKDELYQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.5e-1941.23Show/hide
Query:  NWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFHVGCCVVKDISTG
        NW +D+GA++H+ +D+NN+  H  Y G D V++ +G+ + I+H GS+ +P+++  L L  V+ VP+I KNLI+V +    N V +EF      VKD++TG
Subjt:  NWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPSNNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFHVGCCVVKDISTG

Query:  KVVLKGALKDELYQ
          +L+G  KDELY+
Subjt:  KVVLKGALKDELYQ

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACAAATATGGCGCCAGCACCAGCAACACAAGGCCCCTATTTGTCTGCTCCAGCACCAATTTCACCTGTACAAGCCTCTCCCTCCATTGAAATCTCGTCTGCATT
AGCTAGTCGACTCGACCTCAGGGGAGGTAATCAATTTGTTGCTAGCCCTAAATTGGTTGCTGATCCCAACTGGTATGTTGACAATGGAGCTTCGAATCACGTTATTGCTG
ACTACAACAATGTGATTAATCATTCTGAGTATGGTGGTAAGGACAAAGTTATTCTTGGTAATGGTAATGAGCTATTTATTTCTCATGTTGGATCTTCATTTATACCCTCA
AATAATGGATTATTGCATCTTAAAAATGTTATATGTGTGCCTGATATAGCAAAGAATCTTATTAATGTTTCCAAGCCTGACCGAGATAATGTTGTCTATCTTGAATTTCA
TGTTGGTTGTTGTGTTGTAAAGGACATTTCTACGGGCAAGGTGGTTCTTAAGGGAGCTCTTAAAGACGAACTATATCAGCTCGAACCTACATCTATCACTGAGAGAACGC
CAAAACACCGTCGCCTGCCGTACTCCGCCTCAAACCCCTACCAAAACTGGAGCAGAAAATGTGTCGGCCGCCGTTCGCTGGATAGTCGTCAAGAGGTGCGTCGGGGATTC
GTTGGGGAAGATGATGGTTGGATGAAGAAGGAGGAAGAAGTACATGGAGTTAATATAGAGTATATTTTGGGTATATGTAGCATATACTTTAAGTCTTCTTCAACTTCAAC
AGCACCTGTTCTGCCTCAACGCAATCCTTTGTATGATGATTGGATTGCAAAAGATCAAGCTCTCATGACTTTAATTAATGCTACTTTATCATCAGAGGCTCTTGTTGATG
AGGATCTTCTCATTTATGCCTTAAATGGACTTCGAGTGGAGTACAATACCTTTTGGACTTCAATGTGCACCAGATCTCAGTTGGTGTCATTTGAAGAACTTCATGTTCTT
ATGAAATCTGAAGAGATGACTATTGAAAGGCAATCAAAGCGTGATGATCTCCTATCACAACCCACTGCTATGTTCGTGTCTTCGCAACAATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCACAAATATGGCGCCAGCACCAGCAACACAAGGCCCCTATTTGTCTGCTCCAGCACCAATTTCACCTGTACAAGCCTCTCCCTCCATTGAAATCTCGTCTGCATT
AGCTAGTCGACTCGACCTCAGGGGAGGTAATCAATTTGTTGCTAGCCCTAAATTGGTTGCTGATCCCAACTGGTATGTTGACAATGGAGCTTCGAATCACGTTATTGCTG
ACTACAACAATGTGATTAATCATTCTGAGTATGGTGGTAAGGACAAAGTTATTCTTGGTAATGGTAATGAGCTATTTATTTCTCATGTTGGATCTTCATTTATACCCTCA
AATAATGGATTATTGCATCTTAAAAATGTTATATGTGTGCCTGATATAGCAAAGAATCTTATTAATGTTTCCAAGCCTGACCGAGATAATGTTGTCTATCTTGAATTTCA
TGTTGGTTGTTGTGTTGTAAAGGACATTTCTACGGGCAAGGTGGTTCTTAAGGGAGCTCTTAAAGACGAACTATATCAGCTCGAACCTACATCTATCACTGAGAGAACGC
CAAAACACCGTCGCCTGCCGTACTCCGCCTCAAACCCCTACCAAAACTGGAGCAGAAAATGTGTCGGCCGCCGTTCGCTGGATAGTCGTCAAGAGGTGCGTCGGGGATTC
GTTGGGGAAGATGATGGTTGGATGAAGAAGGAGGAAGAAGTACATGGAGTTAATATAGAGTATATTTTGGGTATATGTAGCATATACTTTAAGTCTTCTTCAACTTCAAC
AGCACCTGTTCTGCCTCAACGCAATCCTTTGTATGATGATTGGATTGCAAAAGATCAAGCTCTCATGACTTTAATTAATGCTACTTTATCATCAGAGGCTCTTGTTGATG
AGGATCTTCTCATTTATGCCTTAAATGGACTTCGAGTGGAGTACAATACCTTTTGGACTTCAATGTGCACCAGATCTCAGTTGGTGTCATTTGAAGAACTTCATGTTCTT
ATGAAATCTGAAGAGATGACTATTGAAAGGCAATCAAAGCGTGATGATCTCCTATCACAACCCACTGCTATGTTCGTGTCTTCGCAACAATCTTAG
Protein sequenceShow/hide protein sequence
MITNMAPAPATQGPYLSAPAPISPVQASPSIEISSALASRLDLRGGNQFVASPKLVADPNWYVDNGASNHVIADYNNVINHSEYGGKDKVILGNGNELFISHVGSSFIPS
NNGLLHLKNVICVPDIAKNLINVSKPDRDNVVYLEFHVGCCVVKDISTGKVVLKGALKDELYQLEPTSITERTPKHRRLPYSASNPYQNWSRKCVGRRSLDSRQEVRRGF
VGEDDGWMKKEEEVHGVNIEYILGICSIYFKSSSTSTAPVLPQRNPLYDDWIAKDQALMTLINATLSSEALVDEDLLIYALNGLRVEYNTFWTSMCTRSQLVSFEELHVL
MKSEEMTIERQSKRDDLLSQPTAMFVSSQQS