; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:13684020..13687549
RNA-Seq ExpressionMoc03g20330
SyntenyMoc03g20330
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN82576.1 hypothetical protein VITISV_031328 [Vitis vinifera]8.5e-1031.64Show/hide
Query:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERVCF
        M+  VSNS GK  LK++ I D+ L+EE RR+         G  +   S L  E +GKG      K+Q +  +       + ++  +  D    A E    
Subjt:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERVCF

Query:  DLWIVDTVVSVHVASNRCWFSSLAAGEWTELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRESKSSCR
        D W++D+  S H   +R    +  AG + +L L+ VR+IP    NL+S G+LDD+G +  FV G WK+ + ++   R
Subjt:  DLWIVDTVVSVHVASNRCWFSSLAAGEWTELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRESKSSCR

KAB5561215.1 hypothetical protein DKX38_006172 [Salix brachista]1.2e-0828.28Show/hide
Query:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVN--YTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERV
        M+TT+SNS GK+ L +  I D+ L EE RR+         G  +   STL  E +G  + N     K+    S  + +  G E LLL  S P        
Subjt:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVN--YTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERV

Query:  CFDLWIVDTVVSVHVASNRCWFSSLAAGEWTELV--------------------------LQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR
          D W++D+  S H   ++    +  AG++  +                           LQ VR++P    NL+S G+LD+ G+S  F  G WK+ +
Subjt:  CFDLWIVDTVVSVHVASNRCWFSSLAAGEWTELV--------------------------LQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR

KAF7130883.1 hypothetical protein RHSIM_Rhsim10G0164600 [Rhododendron simsii]1.9e-0928.5Show/hide
Query:  TVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHR-------YSRGSGSSTGSEVLLLKGSDPAAEARE
        T+SNS     +    + D   +EEARRK         G   + E+ LV +NKG+ +   +G  +++        SRG+     +E+   K +   A   E
Subjt:  TVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHR-------YSRGSGSSTGSEVLLLKGSDPAAEARE

Query:  --RVCFD----------LWIVDTVVSVHVASNRCWFSSLAAGEW--------------------------TELVLQGVRYIPSFIFNLLSAGKLDDDGYS
           VC D           W++D+  S HV S R +F+S   G++                           +L+L+ VR++P    NL+SAGKLDD+GY 
Subjt:  --RVCFD----------LWIVDTVVSVHVASNRCWFSSLAAGEW--------------------------TELVLQGVRYIPSFIFNLLSAGKLDDDGYS

Query:  SEFVRGCWKLKRES
        ++F  G WKL + S
Subjt:  SEFVRGCWKLKRES

PKI48889.1 hypothetical protein CRG98_030737 [Punica granatum]4.7e-0834.69Show/hide
Query:  EVLLLKGSDPAAEARERVCFD-LWIVDTVVSVHVASNRCWFSSLAAGEW-----TELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRES
        ++L++  S+ A   +  +  D  W+ D   S H+  +R +FSS   G++      +L+L+ VR++P    NL+S G+LDD+GY +EF  G WKL + S
Subjt:  EVLLLKGSDPAAEARERVCFD-LWIVDTVVSVHVASNRCWFSSLAAGEW-----TELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRES

RVW76343.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.4e-0727.09Show/hide
Query:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSE----VLLLKGSDPAAEARE
        M+  VSNS GK  LK++ I D+ L+EE RR+         G  +   S L  E +GKG      K   + +    ++  +E     LL     P      
Subjt:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSE----VLLLKGSDPAAEARE

Query:  RVCFDLWIVDTVVSVHVASNRCWFSSLAAGEWTEL--------------------------VLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR
            D W++D+  S H   +R    +  AG++ ++                          +L+ VR+IP    NL+S G+LDD+GY+  FV G WK+ +
Subjt:  RVCFDLWIVDTVVSVHVASNRCWFSSLAAGEWTEL--------------------------VLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR

Query:  ESK
         ++
Subjt:  ESK

TrEMBL top hitse value%identityAlignment
A0A2I0IXZ9 Integrase catalytic domain-containing protein2.3e-0834.69Show/hide
Query:  EVLLLKGSDPAAEARERVCFD-LWIVDTVVSVHVASNRCWFSSLAAGEW-----TELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRES
        ++L++  S+ A   +  +  D  W+ D   S H+  +R +FSS   G++      +L+L+ VR++P    NL+S G+LDD+GY +EF  G WKL + S
Subjt:  EVLLLKGSDPAAEARERVCFD-LWIVDTVVSVHVASNRCWFSSLAAGEW-----TELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRES

A0A2N9HSF0 gag_pre-integrs domain-containing protein6.6e-0828.14Show/hide
Query:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEV---LLLKGSDPAAEARER
        M+  VSNS GK  LK++ I D+ L EE RR+         G  +   S L  E +G+GK          Y+RG   S   EV   LLL    P       
Subjt:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEV---LLLKGSDPAAEARER

Query:  VCFDLWIVDTVVSVHVASNRCWFSSLAAGEWTEL--------------------------VLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR
           + W++D+  S H  ++R    +  A ++ ++                          +LQ VR++P    NL+S G+LD +G++  FV G WK+ +
Subjt:  VCFDLWIVDTVVSVHVASNRCWFSSLAAGEWTEL--------------------------VLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR

A0A371G5R2 Uncharacterized protein (Fragment)8.6e-0830.57Show/hide
Query:  LSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERVCFDLWIVDTVVSVHVASNRCWFSSL
        L+EE RRK       T G+ ++ E     ENKGK      G  + +      ++TG  +++L+  +      +     +WI+D+  ++HV   + +F+S 
Subjt:  LSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERVCFDLWIVDTVVSVHVASNRCWFSSL

Query:  AAGEW----------TELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR
         AG++          T+L L+GV++ P   FNL+S   LDDDGY + F  G WKL +
Subjt:  AAGEW----------TELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR

A0A5N5N166 Uncharacterized protein5.9e-0928.28Show/hide
Query:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVN--YTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERV
        M+TT+SNS GK+ L +  I D+ L EE RR+         G  +   STL  E +G  + N     K+    S  + +  G E LLL  S P        
Subjt:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVN--YTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERV

Query:  CFDLWIVDTVVSVHVASNRCWFSSLAAGEWTELV--------------------------LQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR
          D W++D+  S H   ++    +  AG++  +                           LQ VR++P    NL+S G+LD+ G+S  F  G WK+ +
Subjt:  CFDLWIVDTVVSVHVASNRCWFSSLAAGEWTELV--------------------------LQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKR

A5ATR7 Integrase catalytic domain-containing protein4.1e-1031.64Show/hide
Query:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERVCF
        M+  VSNS GK  LK++ I D+ L+EE RR+         G  +   S L  E +GKG      K+Q +  +       + ++  +  D    A E    
Subjt:  MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERVCF

Query:  DLWIVDTVVSVHVASNRCWFSSLAAGEWTELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRESKSSCR
        D W++D+  S H   +R    +  AG + +L L+ VR+IP    NL+S G+LDD+G +  FV G WK+ + ++   R
Subjt:  DLWIVDTVVSVHVASNRCWFSSLAAGEWTELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRESKSSCR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACCACAGTATCGAATTCGCTAGGGAAAAATAGCTTGAAATTTTCAGCTATTTGTGATGTCGCCTTATCTGAGGAAGCCAGGAGAAAATTAGGAAAAATGTTTGC
ATCTACTCCAGGGGCAGAAAATGAGGTGGAATCAACTTTGGTAGCTGAAAACAAAGGGAAGGGCAAGGTGAACTACACGGGGAAGCAGCAGCATAGATATAGCAGGGGTA
GTGGGAGTTCTACAGGAAGCGAAGTGTTATTACTGAAGGGATCGGATCCCGCAGCGGAAGCACGCGAACGCGTGTGCTTTGATCTTTGGATAGTGGACACTGTAGTATCA
GTGCATGTTGCTTCAAACAGATGTTGGTTCTCATCTCTTGCTGCAGGCGAATGGACCGAGCTGGTGCTACAAGGCGTCAGATATATTCCTAGCTTCATATTTAATTTGTT
ATCCGCAGGGAAGTTAGACGACGATGGCTACAGCAGCGAGTTCGTTAGGGGTTGCTGGAAGCTCAAGAGGGAATCCAAGAGTAGCTGCAGATGGTTCAGGGCGAGACTGG
AAAGAGTTAGCAGCATTGACAACCAATACAGATCAGATGAGTCTGTCATCAATTCAAGTAAACAACTGAGAAGTAGAGGAAAGGGCAACAGACTTGGGTGGGAGTGTCAA
GTCATCAGGGGAATCTTCCTTCAGAGGTCGTTGGGTTCGATCGAGAAAGGAAGCGATGAGGACCACTTAGATCGAGTGGAAGCACGTGGCTGTGTCTCTAACGTCTGGAA
GACAAGATCAGTGGAAGGTAGAAAATTTGTCGTTTTGTCTCTAAGTGGGAGATTTTTGGGATTGGTGAAGTCAAAACGACGTCGGAACAAGCTAGGGATGAGTCGGGATC
GAGTCGAAACACTTCGAGATCGAAGCAGGATGAAAACAGGGGAAAGCGGGACGCTGCAGGCCATAGTGGCCGAGAAGACAGTGCCGCAGTGTCGTTGCACTGCTGCGACG
CTAAGGGATAGCACAGCGGCGCTGTCCTGGTGGGCGCGCGGATGCATTTTTGCGTCTGTAGAGGCATGGCGCCGGGGACAACGCCATGGTGCTGTTTCGATGTTTATAAA
TAGCATTGTTCGGGTTTTAGGGTTTGCCAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGACCACAGTATCGAATTCGCTAGGGAAAAATAGCTTGAAATTTTCAGCTATTTGTGATGTCGCCTTATCTGAGGAAGCCAGGAGAAAATTAGGAAAAATGTTTGC
ATCTACTCCAGGGGCAGAAAATGAGGTGGAATCAACTTTGGTAGCTGAAAACAAAGGGAAGGGCAAGGTGAACTACACGGGGAAGCAGCAGCATAGATATAGCAGGGGTA
GTGGGAGTTCTACAGGAAGCGAAGTGTTATTACTGAAGGGATCGGATCCCGCAGCGGAAGCACGCGAACGCGTGTGCTTTGATCTTTGGATAGTGGACACTGTAGTATCA
GTGCATGTTGCTTCAAACAGATGTTGGTTCTCATCTCTTGCTGCAGGCGAATGGACCGAGCTGGTGCTACAAGGCGTCAGATATATTCCTAGCTTCATATTTAATTTGTT
ATCCGCAGGGAAGTTAGACGACGATGGCTACAGCAGCGAGTTCGTTAGGGGTTGCTGGAAGCTCAAGAGGGAATCCAAGAGTAGCTGCAGATGGTTCAGGGCGAGACTGG
AAAGAGTTAGCAGCATTGACAACCAATACAGATCAGATGAGTCTGTCATCAATTCAAGTAAACAACTGAGAAGTAGAGGAAAGGGCAACAGACTTGGGTGGGAGTGTCAA
GTCATCAGGGGAATCTTCCTTCAGAGGTCGTTGGGTTCGATCGAGAAAGGAAGCGATGAGGACCACTTAGATCGAGTGGAAGCACGTGGCTGTGTCTCTAACGTCTGGAA
GACAAGATCAGTGGAAGGTAGAAAATTTGTCGTTTTGTCTCTAAGTGGGAGATTTTTGGGATTGGTGAAGTCAAAACGACGTCGGAACAAGCTAGGGATGAGTCGGGATC
GAGTCGAAACACTTCGAGATCGAAGCAGGATGAAAACAGGGGAAAGCGGGACGCTGCAGGCCATAGTGGCCGAGAAGACAGTGCCGCAGTGTCGTTGCACTGCTGCGACG
CTAAGGGATAGCACAGCGGCGCTGTCCTGGTGGGCGCGCGGATGCATTTTTGCGTCTGTAGAGGCATGGCGCCGGGGACAACGCCATGGTGCTGTTTCGATGTTTATAAA
TAGCATTGTTCGGGTTTTAGGGTTTGCCAGCTAA
Protein sequenceShow/hide protein sequence
MKTTVSNSLGKNSLKFSAICDVALSEEARRKLGKMFASTPGAENEVESTLVAENKGKGKVNYTGKQQHRYSRGSGSSTGSEVLLLKGSDPAAEARERVCFDLWIVDTVVS
VHVASNRCWFSSLAAGEWTELVLQGVRYIPSFIFNLLSAGKLDDDGYSSEFVRGCWKLKRESKSSCRWFRARLERVSSIDNQYRSDESVINSSKQLRSRGKGNRLGWECQ
VIRGIFLQRSLGSIEKGSDEDHLDRVEARGCVSNVWKTRSVEGRKFVVLSLSGRFLGLVKSKRRRNKLGMSRDRVETLRDRSRMKTGESGTLQAIVAEKTVPQCRCTAAT
LRDSTAALSWWARGCIFASVEAWRRGQRHGAVSMFINSIVRVLGFAS