CuGenDBv2

Moc07g09220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g09220
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr7:7078657..7080914
RNA-Seq Expression: Moc07g09220
Synteny: Moc07g09220
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_006589923.1 uncharacterized protein LOC102667168 [Glycine max]; e value: 6.3e-25; % identity: 42.68
Query:  TFKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNK--------VEIEKWDRKWSSTSPKTIRQRRVPLWQNSP--L
        TF ++++ N++ QVN+IP LN  NFK  KE I+IVLGCM+LDLALR +RP ST E  N+         EIE++  K        +  + + +       +
Subjt:  TFKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNK--------VEIEKWDRKWSSTSPKTIRQRRVPLWQNSP--L

Query:  QNTLVK--ETATKLKALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVDDNR
        +  +++    A+KLK+LKLE+ ED LVHLVL SLPA +   +GC+WSR PSD E FI++ D  +
Subjt:  QNTLVK--ETATKLKALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVDDNR

XP_022152232.1 uncharacterized protein LOC111020001 [Momordica charantia]; e value: 3.7e-25; % identity: 90.91
Query:  FKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDR
        FKV NSDNMSTQVNN PRLN ANFK WKEDIQIVLGCM+LDLALRVDRPTS EENPNKVEIEKWDR
Subjt:  FKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDR

XP_022155096.1 uncharacterized protein LOC111022228 [Momordica charantia]; e value: 2.6e-31; % identity: 54.22
Query:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP-LWQNSPLQNTLVK------------
        MSTQVNNIPRLN ANFK WKEDIQIVLGCM+LDLALRVDRPTSTEENPNKVEIEKWDR  S+     I +R +P  ++ S ++ T  K            
Subjt:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP-LWQNSPLQNTLVK------------

Query:  --------------------------------ETATKLKALKLEVSEDFLVHLVLNSLPAEYSHVR
                                          ATKLKALKL+VSE+FLVHLVLNSL AEYSH R
Subjt:  --------------------------------ETATKLKALKLEVSEDFLVHLVLNSLPAEYSHVR

XP_022156979.1 uncharacterized protein LOC111023808 [Momordica charantia]; e value: 2.8e-25; % identity: 79.01
Query:  KVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP
        KV NSDNMSTQVNNIPRLN ANFK WKEDIQIVLGCM+LDLALRVDRPTSTEENPNKVEI+KWDR  S+     I +R +P
Subjt:  KVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP

XP_022158724.1 uncharacterized protein LOC111025186 [Momordica charantia]; e value: 3.4e-31; % identity: 56.29
Query:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRV-PLWQNSPLQNT-----LVKETATKLK
        MSTQVNNIPRLNEANFK WKEDIQIVL CM+LDLALRVDRPTS EENPNKVEIEKWDR  S+     I +R +   ++ S ++ T     L ++     K
Subjt:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRV-PLWQNSPLQNT-----LVKETATKLK

Query:  ALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVDDNRQK
          K EV      H+ ++        ++ CIWSRPPSDAEAFIY+ DDNR K
Subjt:  ALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVDDNRQK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DFM1 uncharacterized protein LOC111020001; e value: 1.8e-25; % identity: 90.91
Query:  FKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDR
        FKV NSDNMSTQVNN PRLN ANFK WKEDIQIVLGCM+LDLALRVDRPTS EENPNKVEIEKWDR
Subjt:  FKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDR

A0A6J1DQP2 uncharacterized protein LOC111022228; e value: 1.3e-31; % identity: 54.22
Query:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP-LWQNSPLQNTLVK------------
        MSTQVNNIPRLN ANFK WKEDIQIVLGCM+LDLALRVDRPTSTEENPNKVEIEKWDR  S+     I +R +P  ++ S ++ T  K            
Subjt:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP-LWQNSPLQNTLVK------------

Query:  --------------------------------ETATKLKALKLEVSEDFLVHLVLNSLPAEYSHVR
                                          ATKLKALKL+VSE+FLVHLVLNSL AEYSH R
Subjt:  --------------------------------ETATKLKALKLEVSEDFLVHLVLNSLPAEYSHVR

A0A6J1DV67 uncharacterized protein LOC111023808; e value: 1.4e-25; % identity: 79.01
Query:  KVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP
        KV NSDNMSTQVNNIPRLN ANFK WKEDIQIVLGCM+LDLALRVDRPTSTEENPNKVEI+KWDR  S+     I +R +P
Subjt:  KVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRVP

A0A6J1E084 uncharacterized protein LOC111025186; e value: 1.7e-31; % identity: 56.29
Query:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRV-PLWQNSPLQNT-----LVKETATKLK
        MSTQVNNIPRLNEANFK WKEDIQIVL CM+LDLALRVDRPTS EENPNKVEIEKWDR  S+     I +R +   ++ S ++ T     L ++     K
Subjt:  MSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWSSTSPKTIRQRRV-PLWQNSPLQNT-----LVKETATKLK

Query:  ALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVDDNRQK
          K EV      H+ ++        ++ CIWSRPPSDAEAFIY+ DDNR K
Subjt:  ALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVDDNRQK

A5BW89 Integrase catalytic domain-containing protein; e value: 1.3e-20; % identity: 37.93
Query:  VNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRP-----TSTEENPNKV--------------------EIEKWDRKWSSTSPKTIRQ
        +++ ++S  +NN+P LNE NFK WKE++ I+LGCM++DLALR+ +P      ST+E+   +                    EI+K   K       T+  
Subjt:  VNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRP-----TSTEENPNKV--------------------EIEKWDRKWSSTSPKTIRQ

Query:  RRVPL-WQNSPLQNTLVKE---TATKLKALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVD
          + + ++        + E    A+KLKALKLE+S+D LVHLVL SLPA+++  +GC+  R PSDAE  IY+VD
Subjt:  RRVPL-WQNSPLQNTLVKE---TATKLKALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G53670.1 unknown protein; e value: 8.2e-07; % identity: 37.88
Query:  FKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDR
        F V+   +  + V++IP L+ +NF  WKE + +VL  M+LDL+L  +RP+S +      E++ WDR
Subjt:  FKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDR


Sequences
CDS sequence
ATGCATAGCATTCCAAGTGAGATGCCCAAGATAAAAGGAACGAGGGAAAGAATCGACGAGGAAACCACGCAGATATTGCTACAGGAATTTGTAATTTCTACTCATGGATC
TAGGTATCTCACTTTTAAGGTTGTTAATTCTGATAATATGTCCACTCAAGTCAACAACATTCCTAGACTGAATGAGGCTAATTTTAAGGGCTGGAAAGAAGACATCCAGA
TAGTACTTGGGTGTATGAATTTAGACCTTGCATTAAGGGTAGACCGTCCTACTTCAACTGAGGAAAATCCTAATAAGGTTGAAATTGAGAAGTGGGATAGGAAATGGAGC
AGTACTTCACCAAAAACGATAAGGCAGAGGCGAGTACCCTTATGGCAAAACTCACCTCTTCAAAATACGTTGGTAAAGGAAACTGCAACAAAACTTAAGGCACTAAAGTT
GGAAGTTTCTGAAGATTTTTTAGTGCATTTGGTTTTGAACTCTCTTCCAGCAGAGTATAGCCACGTCAGGGGTTGCATTTGGAGTCGACCGCCAAGTGATGCTGAGGCTT
TCATCTACCTGGTTGACGATAATAGGCAAAAGTAG
mRNA sequence
ATGCATAGCATTCCAAGTGAGATGCCCAAGATAAAAGGAACGAGGGAAAGAATCGACGAGGAAACCACGCAGATATTGCTACAGGAATTTGTAATTTCTACTCATGGATC
TAGGTATCTCACTTTTAAGGTTGTTAATTCTGATAATATGTCCACTCAAGTCAACAACATTCCTAGACTGAATGAGGCTAATTTTAAGGGCTGGAAAGAAGACATCCAGA
TAGTACTTGGGTGTATGAATTTAGACCTTGCATTAAGGGTAGACCGTCCTACTTCAACTGAGGAAAATCCTAATAAGGTTGAAATTGAGAAGTGGGATAGGAAATGGAGC
AGTACTTCACCAAAAACGATAAGGCAGAGGCGAGTACCCTTATGGCAAAACTCACCTCTTCAAAATACGTTGGTAAAGGAAACTGCAACAAAACTTAAGGCACTAAAGTT
GGAAGTTTCTGAAGATTTTTTAGTGCATTTGGTTTTGAACTCTCTTCCAGCAGAGTATAGCCACGTCAGGGGTTGCATTTGGAGTCGACCGCCAAGTGATGCTGAGGCTT
TCATCTACCTGGTTGACGATAATAGGCAAAAGTAG
Protein sequence
MHSIPSEMPKIKGTRERIDEETTQILLQEFVISTHGSRYLTFKVVNSDNMSTQVNNIPRLNEANFKGWKEDIQIVLGCMNLDLALRVDRPTSTEENPNKVEIEKWDRKWS
STSPKTIRQRRVPLWQNSPLQNTLVKETATKLKALKLEVSEDFLVHLVLNSLPAEYSHVRGCIWSRPPSDAEAFIYLVDDNRQK
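
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard nuclear genetic code, the `translate` helper is a hypothetical name introduced here, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 nucleotides of the CDS listed above (both in frame).
first_codons = "ATGCATAGCATTCCAAGTGAGATGCCCAAG"
last_codons = "AATAGGCAAAAGTAG"

print(translate(first_codons))
print(translate(last_codons))
```

The two printed fragments, `MHSIPSEMPK` and `NRQK*`, match the start and end of the protein sequence above, with `*` corresponding to the terminal TAG stop codon. Passing the full 585-nucleotide CDS to the same function should give the 194-residue protein followed by `*`.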