; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g17800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g17800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:13480588..13489629
RNA-Seq ExpressionMoc08g17800
SyntenyMoc08g17800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEW43769.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Tanacetum cinerariifolium]2.6e-1124Show/hide
Query:  KFNGANFGYWKMQIKDFLTCKKLQKAL-KEQPTGMKDEDWVDTNEQAIAIIIFVFVNECG----KEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNY-
        KF+G+++G+WKMQI+D L  K L   +  EQP  M DE+W   + QA+ ++              E   ++ +++  ++   + +C++  +K H +    
Subjt:  KFNGANFGYWKMQIKDFLTCKKLQKAL-KEQPTGMKDEDWVDTNEQAIAIIIFVFVNECG----KEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNY-

Query:  --RILKENLKRYMAEANAVVDNVLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLLYQSE
          +  K+ +    +E +   D ++ CVEN     ++   W++D+  S H  S   +  + + GN   V + + ++     +R  +L +  + +   +   
Subjt:  --RILKENLKRYMAEANAVVDNVLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLLYQSE

Query:  FGGNQWKLIKESKLVAVGHRRYTVY
        FGG QWK+IK + ++A G ++ ++Y
Subjt:  FGGNQWKLIKESKLVAVGHRRYTVY

KAF7115099.1 hypothetical protein RHSIM_RhsimUnG0064500 [Rhododendron simsii]1.1e-1429.9Show/hide
Query:  GKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKRYMA-------EANAVVDN---VLVCVENNTKTGNQSSEWVIDNAVSVHISS
        G ++  + DR KS +RG    +    CFH HK  H++K  RIL++ LK+          +  AV  +   ++VC +       Q   WVID+  S H++S
Subjt:  GKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKRYMA-------EANAVVDN---VLVCVENNTKTGNQSSEWVIDNAVSVHISS

Query:  NKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRL
         +  F S+  G+  HV+MGN  +SK  G+ +  L+TN+  +LLL                       Y+++FG  +WKL K S +VA G +  T+Y  + 
Subjt:  NKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRL

Query:  SVAR
         +++
Subjt:  SVAR

KAF7121453.1 hypothetical protein RHSIM_Rhsim13G0116100 [Rhododendron simsii]2.0e-1429.41Show/hide
Query:  GKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKR-YMAEANAVVDN---------VLVCVENNTKTGNQSSEWVIDNAVSVHISS
        G ++  + DR +S +RG    +    CFH HK  H++K  RIL++ LK+  + E++   D          ++VC +       Q   WVID+  S H++S
Subjt:  GKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKR-YMAEANAVVDN---------VLVCVENNTKTGNQSSEWVIDNAVSVHISS

Query:  NKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRL
         +  F S+  G+  HV+MGN  +SK  G+ +  L+TN+  +LLL                       Y+++FG  +WKL K S +VA G +  T+Y  + 
Subjt:  NKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRL

Query:  SVAR
         +++
Subjt:  SVAR

KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii]6.7e-1528.94Show/hide
Query:  KEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKRYMA-------EANAVVDN--
        KEQ T +  E  V  N     I         G ++  + DR +S +RG    +    CFH HK  H++K  RIL++ LK+          +  AV  +  
Subjt:  KEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKRYMA-------EANAVVDN--

Query:  -VLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQ
         ++VC +       Q   WVID+  S H++S +  F S+  G+  HV+MGN  +SK  G+ +  L+TN+  +LLL                       Y+
Subjt:  -VLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQ

Query:  SEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR
        ++FG  +WKL K S +VA G +  T+Y  +  +++
Subjt:  SEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR

KAF7129546.1 hypothetical protein RHSIM_Rhsim10G0154200 [Rhododendron simsii]8.8e-1528.94Show/hide
Query:  KEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKRYMA-------EANAVVDN--
        KEQ T +  E  V  N     I         G ++    DR KS + G    +    CFH HK  H++K  RIL++ LK+          +  AV  +  
Subjt:  KEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILKENLKRYMA-------EANAVVDN--

Query:  -VLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQ
         ++VC +       Q   WVID+  S H++S +  F S+  G+  HV+MGN  +SK  G+ +  L+TN+  +LLL                       Y+
Subjt:  -VLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQ

Query:  SEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR
        ++FG  +WKL K S +VA G +  T+Y  ++ +++
Subjt:  SEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR

TrEMBL top hitse value%identityAlignment
A0A2N9GCZ3 Uncharacterized protein1.2e-1224.91Show/hide
Query:  GIMKFNGANFGYWKMQIKDFLTCKKLQ-KALKEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNYRI
        GI KF+G NFGYWK+QI+D+L  KKL    L ++P  M+D +W   + Q    +  +       EK + +++   + + + N K   G         +  
Subjt:  GIMKFNGANFGYWKMQIKDFLTCKKLQ-KALKEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNYRI

Query:  LKENLKRYMAE---------------------------ANAVVDNVLVCV-----ENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGN
        +   L     E                             AVV++  V V     + N    N   EWV+D+A + H+   K LFT+++  +   V MGN
Subjt:  LKENLKRYMAE---------------------------ANAVVDNVLVCV-----ENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGN

Query:  VKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVARR
           SK  GI +  +KTN    ++L                       Y +  G  +WKL K   +VA G     +Y +R+   ++
Subjt:  VKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVARR

A0A2N9H4J8 Uncharacterized protein1.2e-1225.79Show/hide
Query:  IMKFNGANFGYWKMQIKDFLTCKKLQ-KALKEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNYRIL
        I KF+G +FGYWKMQI+D+L  KKL    L ++P  M+D +W   + Q     + +     G  +  +D+ +  + + + N K   G         +  +
Subjt:  IMKFNGANFGYWKMQIKDFLTCKKLQ-KALKEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNYRIL

Query:  KENLKRYMAEANAVVDNVLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL---------
           L     E N  +  VL+ + +   +       V+D+  + H+   K LFT+++ G+   VKMGN   SK  GI +  +KTN    ++L         
Subjt:  KENLKRYMAEANAVVDNVLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL---------

Query:  --------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVARR
                      Y +  G  +WKL K   +VA G     +Y +R+   ++
Subjt:  --------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVARR

A0A438BTH6 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-1127.89Show/hide
Query:  RQKSIARGIGNTKCFHGHKKCHIKKNYRILK-------ENLKRYMAEANAVVDNVLVCVENNTKTG--NQSSEWVIDNAVSVHISSNKRLFTSFRRGNHD
        R  SI++   + +C++ HKK H+K+  R LK       +N ++   +   V D  L+ + ++       Q ++WVID+  S H++S    FTS+ +G+  
Subjt:  RQKSIARGIGNTKCFHGHKKCHIKKNYRILK-------ENLKRYMAEANAVVDNVLVCVENNTKTG--NQSSEWVIDNAVSVHISSNKRLFTSFRRGNHD

Query:  HVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR
        +V+MGN  +SK  G+ +  L+TN+  +LLL                       Y + F   +WKL K S +VA G +  ++YT +  + +
Subjt:  HVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR

A0A438HI91 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-1127.89Show/hide
Query:  RQKSIARGIGNTKCFHGHKKCHIKKNYRILK-------ENLKRYMAEANAVVDNVLVCVENNTKTG--NQSSEWVIDNAVSVHISSNKRLFTSFRRGNHD
        R  SI++   + +C++ HKK H+K+  R LK       +N ++   +   V D  L+ + ++       Q ++WVID+  S H++S    FTS+ +G+  
Subjt:  RQKSIARGIGNTKCFHGHKKCHIKKNYRILK-------ENLKRYMAEANAVVDNVLVCVENNTKTG--NQSSEWVIDNAVSVHISSNKRLFTSFRRGNHD

Query:  HVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR
        +V+MGN  +SK  G+ +  L+TN+  +LLL                       Y + F   +WKL K S +VA G +  ++YT +  + +
Subjt:  HVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLSVAR

A0A438IBT7 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-1128.64Show/hide
Query:  ECGKEK-VMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILK-------ENLKRYMAEANAVVDNVLVCVENNTKTG--NQSSEWVIDNAVSVHI
        E GK K   + +R KS  R    +K    C++ HKK H+K+  R LK       +N ++   +   V D  L+ + ++       Q ++WVID+  S H+
Subjt:  ECGKEK-VMTDDRQKSIARGIGNTK----CFHGHKKCHIKKNYRILK-------ENLKRYMAEANAVVDNVLVCVENNTKTG--NQSSEWVIDNAVSVHI

Query:  SSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTS
        +S    FTS+ +G+  +V+MGN  +SK  G+ +  L+TN+  +LLL                       Y + F   +WKL K S +VA G +  ++YT 
Subjt:  SSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTS

Query:  RLSVAR
        +  + +
Subjt:  RLSVAR

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-0726.21Show/hide
Query:  GKEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNYRILKENLKRYMAEANAVVDNVLVCVENNTKT-------------GNQSSEWVIDNAVSVHISSN
        GK K    +R KS  R      C++ ++  H K++    ++       + N   DN    V+NN                    SEWV+D A S H +  
Subjt:  GKEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNYRILKENLKRYMAEANAVVDNVLVCVENNTKT-------------GNQSSEWVIDNAVSVHISSN

Query:  KRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLS
        + LF  +  G+   VKMGN   SK  GI +  +KTN    L+L                       Y+S F   +W+L K S ++A G  R T+Y +   
Subjt:  KRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLL-----------------------YQSEFGGNQWKLIKESKLVAVGHRRYTVYTSRLS

Query:  VARRSL
        + +  L
Subjt:  VARRSL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCATGGGATCGGTAGAGAAGTCGGGTTTTGAAGGAATCATGAAGTTCAATGGGGCCAATTTTGGATATTGGAAGATGCAGATTAAGGATTTTCTGACTTGCAAGAA
GTTACAAAAAGCTTTAAAAGAACAACCAACAGGGATGAAAGATGAAGACTGGGTAGATACAAATGAACAGGCAATTGCCATTATCATATTTGTGTTTGTCAATGAATGTG
GTAAAGAGAAAGTTATGACAGACGACCGACAGAAAAGCATAGCAAGAGGAATTGGAAATACTAAATGTTTTCACGGCCACAAAAAATGCCACATCAAGAAAAACTATAGA
ATATTAAAGGAGAATCTTAAAAGGTATATGGCGGAGGCAAATGCAGTTGTGGACAATGTCCTCGTTTGTGTTGAGAACAACACAAAAACAGGAAACCAATCATCGGAGTG
GGTAATAGACAACGCAGTTTCAGTACACATATCTTCAAATAAGAGATTGTTTACATCTTTCAGAAGAGGCAATCACGACCACGTGAAGATGGGGAATGTAAAGCTTTCCA
AGACTAAAGGGATTAGAAACACACGATTGAAGACCAATAGTGAAACTGAGTTATTACTTTATCAGAGTGAGTTTGGTGGAAACCAATGGAAGCTCATCAAAGAATCTAAG
TTGGTGGCAGTTGGCCATAGAAGATATACAGTTTATACATCGAGATTGAGTGTTGCCAGAAGATCATTGAAACAATGGATGCAAGCTGCATATGGTGTCCAAAGAGAGAA
TAAGAACACATTCTGTCAGGGATTATTCTCAATTGTTAGACGGATGAGCGAATTGATGAAGTCGTGTCAGCGAACAGCTGCATCTGAGAAGATGAACTCAGTAGGTGTTG
AGGTAAAGAGTGGAGTTTCAATTCTAGCAACATGCTTGAATAGAGTTGTCAAGTCATCAATTGGAAGTTCTTTCTTCAAGAATCAGTGTTTGAGAACGGAGAAAAAAGCT
TTAGATATCGCCACTTATCGCACCTTCTGGGTTTGTAGTGTCGTGGCGTCCTATGGTAGCGTCGCGGCACTAGGGACAAGACCTAGGCCCCTTGCTAAAGTGCCTAGGCG
CTACTGCTTTGGATCCCCACCAGCTTTCGCCTTGGTTCGTCCCCAAACCACATCATACTGGGAGAGGTTTCGCTCTGATACCATTTGTAGCGCCCTAGGTAGGAGAACAT
TGTCTTTTACAATGAGTATAGTAGATATTACTTCCAAGTGGCGGAGAATGAAACTCACCTCGGATGACGACGATTCAGCAGATTTATCCAAGGGTGAAGACAAGTGGGCC
ATGATTTATGACCTTCCTCTAGGGTGTATGCACAAGAACAACCTTTTGGAGCTGGGAAGAAAATTGGGAGAGGTGGAAGACATTGAACAAAATGCTAATGGAGACTTCTT
TGGCGTTTTGCAAGTTGCTGAGAAGAGGCCTAAAAATCAAAGTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCATGGGATCGGTAGAGAAGTCGGGTTTTGAAGGAATCATGAAGTTCAATGGGGCCAATTTTGGATATTGGAAGATGCAGATTAAGGATTTTCTGACTTGCAAGAA
GTTACAAAAAGCTTTAAAAGAACAACCAACAGGGATGAAAGATGAAGACTGGGTAGATACAAATGAACAGGCAATTGCCATTATCATATTTGTGTTTGTCAATGAATGTG
GTAAAGAGAAAGTTATGACAGACGACCGACAGAAAAGCATAGCAAGAGGAATTGGAAATACTAAATGTTTTCACGGCCACAAAAAATGCCACATCAAGAAAAACTATAGA
ATATTAAAGGAGAATCTTAAAAGGTATATGGCGGAGGCAAATGCAGTTGTGGACAATGTCCTCGTTTGTGTTGAGAACAACACAAAAACAGGAAACCAATCATCGGAGTG
GGTAATAGACAACGCAGTTTCAGTACACATATCTTCAAATAAGAGATTGTTTACATCTTTCAGAAGAGGCAATCACGACCACGTGAAGATGGGGAATGTAAAGCTTTCCA
AGACTAAAGGGATTAGAAACACACGATTGAAGACCAATAGTGAAACTGAGTTATTACTTTATCAGAGTGAGTTTGGTGGAAACCAATGGAAGCTCATCAAAGAATCTAAG
TTGGTGGCAGTTGGCCATAGAAGATATACAGTTTATACATCGAGATTGAGTGTTGCCAGAAGATCATTGAAACAATGGATGCAAGCTGCATATGGTGTCCAAAGAGAGAA
TAAGAACACATTCTGTCAGGGATTATTCTCAATTGTTAGACGGATGAGCGAATTGATGAAGTCGTGTCAGCGAACAGCTGCATCTGAGAAGATGAACTCAGTAGGTGTTG
AGGTAAAGAGTGGAGTTTCAATTCTAGCAACATGCTTGAATAGAGTTGTCAAGTCATCAATTGGAAGTTCTTTCTTCAAGAATCAGTGTTTGAGAACGGAGAAAAAAGCT
TTAGATATCGCCACTTATCGCACCTTCTGGGTTTGTAGTGTCGTGGCGTCCTATGGTAGCGTCGCGGCACTAGGGACAAGACCTAGGCCCCTTGCTAAAGTGCCTAGGCG
CTACTGCTTTGGATCCCCACCAGCTTTCGCCTTGGTTCGTCCCCAAACCACATCATACTGGGAGAGGTTTCGCTCTGATACCATTTGTAGCGCCCTAGGTAGGAGAACAT
TGTCTTTTACAATGAGTATAGTAGATATTACTTCCAAGTGGCGGAGAATGAAACTCACCTCGGATGACGACGATTCAGCAGATTTATCCAAGGGTGAAGACAAGTGGGCC
ATGATTTATGACCTTCCTCTAGGGTGTATGCACAAGAACAACCTTTTGGAGCTGGGAAGAAAATTGGGAGAGGTGGAAGACATTGAACAAAATGCTAATGGAGACTTCTT
TGGCGTTTTGCAAGTTGCTGAGAAGAGGCCTAAAAATCAAAGTCAATGA
Protein sequenceShow/hide protein sequence
MVMGSVEKSGFEGIMKFNGANFGYWKMQIKDFLTCKKLQKALKEQPTGMKDEDWVDTNEQAIAIIIFVFVNECGKEKVMTDDRQKSIARGIGNTKCFHGHKKCHIKKNYR
ILKENLKRYMAEANAVVDNVLVCVENNTKTGNQSSEWVIDNAVSVHISSNKRLFTSFRRGNHDHVKMGNVKLSKTKGIRNTRLKTNSETELLLYQSEFGGNQWKLIKESK
LVAVGHRRYTVYTSRLSVARRSLKQWMQAAYGVQRENKNTFCQGLFSIVRRMSELMKSCQRTAASEKMNSVGVEVKSGVSILATCLNRVVKSSIGSSFFKNQCLRTEKKA
LDIATYRTFWVCSVVASYGSVAALGTRPRPLAKVPRRYCFGSPPAFALVRPQTTSYWERFRSDTICSALGRRTLSFTMSIVDITSKWRRMKLTSDDDDSADLSKGEDKWA
MIYDLPLGCMHKNNLLELGRKLGEVEDIEQNANGDFFGVLQVAEKRPKNQSQ