; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g36710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g36710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:27524744..27527168
RNA-Seq ExpressionMoc04g36710
SyntenyMoc04g36710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-1927.08Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++  ++  PS+ L S  S    AT   N  Y  W  QD LI++WLLGSMS  +L++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM

Query:  LACKTARE-----------------------------------------------------------------------------------------RVY
        L CK+A+E                                                                                          V 
Subjt:  LACKTARE-----------------------------------------------------------------------------------------RVY

Query:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------
        SLLL QE++ E  S +  + +LPSVN+             R  Q++  NN   N+RG     + N  R  N N                           
Subjt:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------

Query:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI
         G+S           ++ P  +      +LN +S W+P+ GA+NH+T+  SNL+IGS+Y G N++   NG+G+
Subjt:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-1927.08Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++  ++  PS+ L S  S    AT   N  Y  W  QD LI++WLLGSMS  +L++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM

Query:  LACKTARE-----------------------------------------------------------------------------------------RVY
        L CK+A+E                                                                                          V 
Subjt:  LACKTARE-----------------------------------------------------------------------------------------RVY

Query:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------
        SLLL QE++ E  S +  + +LPSVN+             R  Q++  NN   N+RG     + N  R  N N                           
Subjt:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------

Query:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI
         G+S           ++ P  +      +LN +S W+P+ GA+NH+T+  SNL+IGS+Y G N++   NG+G+
Subjt:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI

XP_022152240.1 uncharacterized protein LOC111020007 [Momordica charantia]1.3e-2445.78Show/hide
Query:  LLLAQENRIERHSTINPDGSLPSVNLTIHNRVKQ-----SSTVNNDPNRRGKNSGQKFNNRRLWNNN------------GH-----------------SS
        ++L   +RI+ HS+IN DGSLPSVNLT  +   Q     SS  +ND N+ G+N G KF+NRR WNNN            GH                 +S
Subjt:  LLLAQENRIERHSTINPDGSLPSVNLTIHNRVKQ-----SSTVNNDPNRRGKNSGQKFNNRRLWNNN------------GH-----------------SS

Query:  SQ----------PPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAG
        SQ            FN  TLQH+LNKE+QWFP+ G SNHV +D +NL I ++YLGDNKVL+GNGAG
Subjt:  SQ----------PPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAG

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]3.1e-2627.41Show/hide
Query:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPIL--NTEYSHWICQDSLITAWLLGSMSNS
        +SD      +SK  +PG+K++ V+L+++N LLWK QI T L+G GLE Y++ +   P+Q + ++ D  ++  L  N  Y  WI QD LI+AWLLGSM+  
Subjt:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPIL--NTEYSHWICQDSLITAWLLGSMSNS

Query:  LLSEMLACKTARE---------------------------------------------------------------------------------------
        +LS+ML CK+ARE                                                                                       
Subjt:  LLSEMLACKTARE---------------------------------------------------------------------------------------

Query:  --RVYSLLLAQENRIERHSTINPDGSLPSVNLTIH-----NRVKQSSTVN---NDPNRRGKNSGQKFNNRRLWNNN-----------GHSS---------
           V SLLL QE R ER + IN DGSLPSVNLT++     N + QS   N   ++ ++RG+ +  + +NRR W  N           GH++         
Subjt:  --RVYSLLLAQENRIERHSTINPDGSLPSVNLTIH-----NRVKQSSTVN---NDPNRRGKNSGQKFNNRRLWNNN-----------GHSS---------

Query:  -----------------------SQPPFNVFT-----------------------LQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNG
                               + P  N F+                       +  + N++S W+ + G +NHVTN+F N ++GS+Y GD K+ VGNG
Subjt:  -----------------------SQPPFNVFT-----------------------LQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNG

Query:  AGIRS
         G ++
Subjt:  AGIRS

XP_022159146.1 uncharacterized protein LOC111025572 [Momordica charantia]1.0e-2147.79Show/hide
Query:  ETSSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPILNTEYSHWICQDSLITAWLLGSMSNS
        ET S +++   +S + +PGNKI+T+KL++ENFLLW+LQI T L+G+GL  +++ +A +PS+ + S+ ++ + P  N E+ +W  QD LIT+WLLGSMS  
Subjt:  ETSSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPILNTEYSHWICQDSLITAWLLGSMSNS

Query:  LLSEMLACKTARE
        +LS+ML C+TA+E
Subjt:  LLSEMLACKTARE

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1927.08Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++  ++  PS+ L S  S    AT   N  Y  W  QD LI++WLLGSMS  +L++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM

Query:  LACKTARE-----------------------------------------------------------------------------------------RVY
        L CK+A+E                                                                                          V 
Subjt:  LACKTARE-----------------------------------------------------------------------------------------RVY

Query:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------
        SLLL QE++ E  S +  + +LPSVN+             R  Q++  NN   N+RG     + N  R  N N                           
Subjt:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------

Query:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI
         G+S           ++ P  +      +LN +S W+P+ GA+NH+T+  SNL+IGS+Y G N++   NG+G+
Subjt:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1927.08Show/hide
Query:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM
        +SS  +++F  GNKI+ VKL+++ FLLWK QILT L  Y LE+++  ++  PS+ L S  S    AT   N  Y  W  QD LI++WLLGSMS  +L++M
Subjt:  SSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPS--SGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM

Query:  LACKTARE-----------------------------------------------------------------------------------------RVY
        L CK+A+E                                                                                          V 
Subjt:  LACKTARE-----------------------------------------------------------------------------------------RVY

Query:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------
        SLLL QE++ E  S +  + +LPSVN+             R  Q++  NN   N+RG     + N  R  N N                           
Subjt:  SLLLAQENRIERHSTINPDGSLPSVNLTIHN---------RVKQSSTVNNDP-NRRGKNSGQKFNNRRLWNNN---------------------------

Query:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI
         G+S           ++ P  +      +LN +S W+P+ GA+NH+T+  SNL+IGS+Y G N++   NG+G+
Subjt:  -GHS-----------SSQPPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAGI

A0A6J1DDD5 uncharacterized protein LOC1110200076.4e-2545.78Show/hide
Query:  LLLAQENRIERHSTINPDGSLPSVNLTIHNRVKQ-----SSTVNNDPNRRGKNSGQKFNNRRLWNNN------------GH-----------------SS
        ++L   +RI+ HS+IN DGSLPSVNLT  +   Q     SS  +ND N+ G+N G KF+NRR WNNN            GH                 +S
Subjt:  LLLAQENRIERHSTINPDGSLPSVNLTIHNRVKQ-----SSTVNNDPNRRGKNSGQKFNNRRLWNNN------------GH-----------------SS

Query:  SQ----------PPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAG
        SQ            FN  TLQH+LNKE+QWFP+ G SNHV +D +NL I ++YLGDNKVL+GNGAG
Subjt:  SQ----------PPFNVFTLQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNGAG

A0A6J1DLT9 uncharacterized protein LOC1110217571.5e-2627.41Show/hide
Query:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPIL--NTEYSHWICQDSLITAWLLGSMSNS
        +SD      +SK  +PG+K++ V+L+++N LLWK QI T L+G GLE Y++ +   P+Q + ++ D  ++  L  N  Y  WI QD LI+AWLLGSM+  
Subjt:  SSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPIL--NTEYSHWICQDSLITAWLLGSMSNS

Query:  LLSEMLACKTARE---------------------------------------------------------------------------------------
        +LS+ML CK+ARE                                                                                       
Subjt:  LLSEMLACKTARE---------------------------------------------------------------------------------------

Query:  --RVYSLLLAQENRIERHSTINPDGSLPSVNLTIH-----NRVKQSSTVN---NDPNRRGKNSGQKFNNRRLWNNN-----------GHSS---------
           V SLLL QE R ER + IN DGSLPSVNLT++     N + QS   N   ++ ++RG+ +  + +NRR W  N           GH++         
Subjt:  --RVYSLLLAQENRIERHSTINPDGSLPSVNLTIH-----NRVKQSSTVN---NDPNRRGKNSGQKFNNRRLWNNN-----------GHSS---------

Query:  -----------------------SQPPFNVFT-----------------------LQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNG
                               + P  N F+                       +  + N++S W+ + G +NHVTN+F N ++GS+Y GD K+ VGNG
Subjt:  -----------------------SQPPFNVFT-----------------------LQHELNKESQWFPNFGASNHVTNDFSNLTIGSKYLGDNKVLVGNG

Query:  AGIRS
         G ++
Subjt:  AGIRS

A0A6J1E314 uncharacterized protein LOC1110255725.0e-2247.79Show/hide
Query:  ETSSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPILNTEYSHWICQDSLITAWLLGSMSNS
        ET S +++   +S + +PGNKI+T+KL++ENFLLW+LQI T L+G+GL  +++ +A +PS+ + S+ ++ + P  N E+ +W  QD LIT+WLLGSMS  
Subjt:  ETSSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPILNTEYSHWICQDSLITAWLLGSMSNS

Query:  LLSEMLACKTARE
        +LS+ML C+TA+E
Subjt:  LLSEMLACKTARE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCTCGTGAAGAAACTTCTTCAGATTTGATTTCTTCATCGACATCATCCAAACTCTTTCATCCAGGGAATAAAATCACCACAGTAAAACTTGATGAGGAGAATTT
TCTTCTCTGGAAACTTCAAATTCTTACCACTCTCAGAGGCTATGGCTTGGAGGATTATGTCAATCTGGATGCAACTGTTCCATCACAACTCCTCCCTTCTTCGGGTGATA
CAATGGCAACTCCGATTCTAAATACTGAGTACTCTCACTGGATTTGTCAAGATAGTTTGATTACGGCGTGGCTCTTGGGTTCTATGTCCAATTCTCTCCTATCAGAAATG
TTAGCCTGCAAGACTGCTAGGGAGAGAGTTTATTCACTTTTGCTGGCTCAAGAAAATAGAATTGAACGTCACTCTACCATCAATCCCGATGGTTCTCTGCCTTCAGTAAA
CCTTACCATTCACAATCGAGTCAAACAGTCATCTACAGTGAACAATGATCCAAATCGAAGAGGGAAAAATTCGGGACAGAAGTTCAATAATCGACGATTATGGAACAACA
ATGGGCATAGTTCTTCTCAACCTCCATTCAATGTCTTTACTCTACAACATGAATTGAACAAGGAAAGTCAGTGGTTCCCAAATTTTGGTGCGTCGAACCATGTTACGAAT
GATTTTAGCAATTTAACAATTGGATCTAAGTATCTTGGAGATAACAAAGTTTTGGTCGGCAATGGTGCAGGGATTAGAAGTCTTAATTTGGCGTGGATGGTTGCCGGAGA
GGTGGTGGAAGTCGTCGTCGGTGTCAGCAACGGTGAAGGAAACGGCGTCGTTTCCGGAATAGAGGATGAAGGGAAGGGGAGAGTCGTCCGGCCAAAGGAGGTTGCCGGCG
ACAGGGAGAAAGTGGTTGAGATTGAGGGAGAGGGAATGTTTGAGGCTTGGAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTGCTCGTGAAGAAACTTCTTCAGATTTGATTTCTTCATCGACATCATCCAAACTCTTTCATCCAGGGAATAAAATCACCACAGTAAAACTTGATGAGGAGAATTT
TCTTCTCTGGAAACTTCAAATTCTTACCACTCTCAGAGGCTATGGCTTGGAGGATTATGTCAATCTGGATGCAACTGTTCCATCACAACTCCTCCCTTCTTCGGGTGATA
CAATGGCAACTCCGATTCTAAATACTGAGTACTCTCACTGGATTTGTCAAGATAGTTTGATTACGGCGTGGCTCTTGGGTTCTATGTCCAATTCTCTCCTATCAGAAATG
TTAGCCTGCAAGACTGCTAGGGAGAGAGTTTATTCACTTTTGCTGGCTCAAGAAAATAGAATTGAACGTCACTCTACCATCAATCCCGATGGTTCTCTGCCTTCAGTAAA
CCTTACCATTCACAATCGAGTCAAACAGTCATCTACAGTGAACAATGATCCAAATCGAAGAGGGAAAAATTCGGGACAGAAGTTCAATAATCGACGATTATGGAACAACA
ATGGGCATAGTTCTTCTCAACCTCCATTCAATGTCTTTACTCTACAACATGAATTGAACAAGGAAAGTCAGTGGTTCCCAAATTTTGGTGCGTCGAACCATGTTACGAAT
GATTTTAGCAATTTAACAATTGGATCTAAGTATCTTGGAGATAACAAAGTTTTGGTCGGCAATGGTGCAGGGATTAGAAGTCTTAATTTGGCGTGGATGGTTGCCGGAGA
GGTGGTGGAAGTCGTCGTCGGTGTCAGCAACGGTGAAGGAAACGGCGTCGTTTCCGGAATAGAGGATGAAGGGAAGGGGAGAGTCGTCCGGCCAAAGGAGGTTGCCGGCG
ACAGGGAGAAAGTGGTTGAGATTGAGGGAGAGGGAATGTTTGAGGCTTGGAATTAG
Protein sequenceShow/hide protein sequence
MTAREETSSDLISSSTSSKLFHPGNKITTVKLDEENFLLWKLQILTTLRGYGLEDYVNLDATVPSQLLPSSGDTMATPILNTEYSHWICQDSLITAWLLGSMSNSLLSEM
LACKTARERVYSLLLAQENRIERHSTINPDGSLPSVNLTIHNRVKQSSTVNNDPNRRGKNSGQKFNNRRLWNNNGHSSSQPPFNVFTLQHELNKESQWFPNFGASNHVTN
DFSNLTIGSKYLGDNKVLVGNGAGIRSLNLAWMVAGEVVEVVVGVSNGEGNGVVSGIEDEGKGRVVRPKEVAGDREKVVEIEGEGMFEAWN