; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g16660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g16660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:11027723..11036048
RNA-Seq ExpressionMoc03g16660
SyntenyMoc03g16660
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF8394168.1 hypothetical protein HHK36_020374 [Tetracentron sinense]2.4e-5359.69Show/hide
Query:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------
        +V+ IP L+G+NFK  KE ++IVLGC DLDLALR D+PT+T ENPN+V+IEKWDRSNRMCLMIMK SIPE F+GSI+E  +                   
Subjt:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------

Query:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREK
                       KGN+REYIM+MS++A+KLK+LKLE+S+D LVHLVL SLPA +  F VSYNTQKDKWS+NELISHCVQEEER QR+K
Subjt:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREK

KAF8413461.1 hypothetical protein HHK36_001448 [Tetracentron sinense]2.2e-5460.94Show/hide
Query:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------
        +V+NIP L+G+NFK  KE ++IVLGC DLDLALR D+PT+T ENPN+V+IEKWDRSNRMCLMIMK SIPE FRGSI E  +                   
Subjt:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------

Query:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT
                       KGN+REYIM+MS++A+KLK+LKLE+S+D LVHLVL SLPA +  F VSYNTQKDKWS+NELISHCVQEEER QR+KT
Subjt:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT

RZC02438.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]2.1e-5251.1Show/hide
Query:  VFGVLDDSPPLEHYNLSSHGLVLRAPLSIVIG-PKEVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMK
        +F   DD+  ++ +N S   ++ R  ++       +VN+IP LNG+NFK  KE ++IV GC DLDLALR++RP ST E  N+V+IEKWDRSNRMCLMIMK
Subjt:  VFGVLDDSPPLEHYNLSSHGLVLRAPLSIVIG-PKEVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMK

Query:  HSIPETFRGSIVEGTNV---------------------------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYN
         S+PE FRGSI +G +                                  KGN+REYIM+MSN+ +KL +LKLE+ +D LVHLVL S PA +  F VSYN
Subjt:  HSIPETFRGSIVEGTNV---------------------------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYN

Query:  TQKDKWSMNELISHCVQEEERMQREKT
        TQKDKWS+NELISHCVQEEER+QR++T
Subjt:  TQKDKWSMNELISHCVQEEERMQREKT

RZC20139.1 hypothetical protein D0Y65_006823 [Glycine soja]7.1e-5359.38Show/hide
Query:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------
        +VN+IP LNG+NFK  KE ++IVLGC DLDLALR +RP ST E  N+V+IEKWDRSNRMCLMIMK SIPE FRGSI EG +                   
Subjt:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------

Query:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT
                       KGN+REYIM+M N+A+KLK+LKLE+ +D  VHLVL SLPA +  F VSYNTQKDKWS+NELISHCVQEEER+QR++T
Subjt:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT

XP_022155096.1 uncharacterized protein LOC111022228 [Momordica charantia]3.4e-6373.08Show/hide
Query:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------
        +VNNIPRLN +NFKD KEDIQIVLGC DLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMK SIPETFRGSIVEGTN                   
Subjt:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------

Query:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQ
                       KGN+REY MQMS+VATKLKALKL+VS++FLVHLVLNSL AEYSHF VSYNTQKDKWS+NELISHCVQ
Subjt:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQ

TrEMBL top hitse value%identityAlignment
A0A151RB35 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-5058.38Show/hide
Query:  LNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV-------------------------
        LNG+NFK  KE ++I+LGC DLDLALR ++PT   ENP++ ++EKW+RSNRMCLMIMK S+PE FRGSI E  N                          
Subjt:  LNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV-------------------------

Query:  --------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT
                KGN+REYIM+MSN+A+KLKALKLE+S D LVHLVL SLP  +  F VSYNTQKDKW++NELISHCVQEEER QREKT
Subjt:  --------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT

A0A151RDF9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)8.5e-5257.29Show/hide
Query:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------
        ++N IP LNG+NFK  KE ++I+LGC DLDLALR ++PT   ENP++ ++EKW+RSNRMCLMIMK S+PE FR SI E  N                   
Subjt:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------

Query:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT
                       KGN+REYIM+MSN+A+KLKALKLE+S D LVHLVL SLP  +  F VSYNTQKDKW++NELISHCVQEEER QREKT
Subjt:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT

A0A445JVF2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-5251.1Show/hide
Query:  VFGVLDDSPPLEHYNLSSHGLVLRAPLSIVIG-PKEVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMK
        +F   DD+  ++ +N S   ++ R  ++       +VN+IP LNG+NFK  KE ++IV GC DLDLALR++RP ST E  N+V+IEKWDRSNRMCLMIMK
Subjt:  VFGVLDDSPPLEHYNLSSHGLVLRAPLSIVIG-PKEVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMK

Query:  HSIPETFRGSIVEGTNV---------------------------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYN
         S+PE FRGSI +G +                                  KGN+REYIM+MSN+ +KL +LKLE+ +D LVHLVL S PA +  F VSYN
Subjt:  HSIPETFRGSIVEGTNV---------------------------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYN

Query:  TQKDKWSMNELISHCVQEEERMQREKT
        TQKDKWS+NELISHCVQEEER+QR++T
Subjt:  TQKDKWSMNELISHCVQEEERMQREKT

A0A445LAJ7 Uncharacterized protein3.4e-5359.38Show/hide
Query:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------
        +VN+IP LNG+NFK  KE ++IVLGC DLDLALR +RP ST E  N+V+IEKWDRSNRMCLMIMK SIPE FRGSI EG +                   
Subjt:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------

Query:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT
                       KGN+REYIM+M N+A+KLK+LKLE+ +D  VHLVL SLPA +  F VSYNTQKDKWS+NELISHCVQEEER+QR++T
Subjt:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREKT

A0A6J1DQP2 uncharacterized protein LOC1110222281.6e-6373.08Show/hide
Query:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------
        +VNNIPRLN +NFKD KEDIQIVLGC DLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMK SIPETFRGSIVEGTN                   
Subjt:  EVNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNV------------------

Query:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQ
                       KGN+REY MQMS+VATKLKALKL+VS++FLVHLVLNSL AEYSHF VSYNTQKDKWS+NELISHCVQ
Subjt:  ---------------KGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53670.1 unknown protein3.0e-1732.52Show/hide
Query:  VNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSI-----------------------VE
        V++IP L+GSNF + KE + +VL   DLDL+L  +RP+S +      E++ WDRSNR+ +MIMK  IP+ FRG +                        E
Subjt:  VNNIPRLNGSNFKDRKEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSI-----------------------VE

Query:  GTNVKG-----------NMREYIMQMSNVATKLKALKLE--VSKD-FLVHLVLNSLPAEYSHFWVSYNTQKDK-------------WSMNELISHCVQEE
         + V+            N+RE IM+M  +  K K L +    S D  L H  +  LP +Y      Y+  + K             WS  ELIS C  EE
Subjt:  GTNVKG-----------NMREYIMQMSNVATKLKALKLE--VSKD-FLVHLVLNSLPAEYSHFWVSYNTQKDK-------------WSMNELISHCVQEE

Query:  ERMQRE
        E ++ E
Subjt:  ERMQRE

AT5G53690.1 unknown protein6.9e-0653.49Show/hide
Query:  VLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREK
        VL+SLP++Y     +Y+  K +WS ++LISHCVQEEER+  EK
Subjt:  VLNSLPAEYSHFWVSYNTQKDKWSMNELISHCVQEEERMQREK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAGAAGAAAACATAATGGGTAAAGAAATTTTACAACCCATTTTGAGAAAAGAGAAGAAAACCGAAGATGGAGATGACAGCTCGCAGCAGCAGCCGAATATCCT
CCTCTGTCGTCAGCTTTCCTCTGAAAATGGAAAACAACATGTGGCTTTCAATCGTGCACTAATTCTTCTAGTGTTTGGAGTGCTCGATGATTCTCCACCCCTCGAACATT
ACAACTTAAGCTCTCATGGGCTTGTTCTGCGAGCTCCACTGTCGATTGTGATTGGCCCAAAGGAAGTCAACAACATTCCTAGACTGAATGGGTCTAATTTTAAGGACCGG
AAAGAAGACATCCAGATAGTACTTGGGTGTAGGGATTTAGACCTTGCATTAAGGGTAGACCGTCCTACTTCAACTGAGGAAAATCCTAATAAAGTTGAAATTGAGAAATG
GGATAGGTCTAATCGCATGTGTCTAATGATCATGAAGCACTCAATTCCAGAAACATTTAGAGGCTCTATTGTTGAGGGAACGAATGTCAAAGGAAACATGAGGGAATACA
TAATGCAAATGTCAAATGTTGCAACAAAACTTAAGGCACTGAAGTTGGAAGTTTCTAAAGACTTTTTAGTGCATTTGGTTTTGAACTCTCTTCCAGCAGAGTATAGCCAC
TTTTGGGTGAGTTACAACACTCAGAAGGATAAATGGTCCATGAACGAGCTAATCTCTCACTGTGTTCAAGAGGAAGAGAGGATGCAGCGAGAAAAGACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAGAAGAAAACATAATGGGTAAAGAAATTTTACAACCCATTTTGAGAAAAGAGAAGAAAACCGAAGATGGAGATGACAGCTCGCAGCAGCAGCCGAATATCCT
CCTCTGTCGTCAGCTTTCCTCTGAAAATGGAAAACAACATGTGGCTTTCAATCGTGCACTAATTCTTCTAGTGTTTGGAGTGCTCGATGATTCTCCACCCCTCGAACATT
ACAACTTAAGCTCTCATGGGCTTGTTCTGCGAGCTCCACTGTCGATTGTGATTGGCCCAAAGGAAGTCAACAACATTCCTAGACTGAATGGGTCTAATTTTAAGGACCGG
AAAGAAGACATCCAGATAGTACTTGGGTGTAGGGATTTAGACCTTGCATTAAGGGTAGACCGTCCTACTTCAACTGAGGAAAATCCTAATAAAGTTGAAATTGAGAAATG
GGATAGGTCTAATCGCATGTGTCTAATGATCATGAAGCACTCAATTCCAGAAACATTTAGAGGCTCTATTGTTGAGGGAACGAATGTCAAAGGAAACATGAGGGAATACA
TAATGCAAATGTCAAATGTTGCAACAAAACTTAAGGCACTGAAGTTGGAAGTTTCTAAAGACTTTTTAGTGCATTTGGTTTTGAACTCTCTTCCAGCAGAGTATAGCCAC
TTTTGGGTGAGTTACAACACTCAGAAGGATAAATGGTCCATGAACGAGCTAATCTCTCACTGTGTTCAAGAGGAAGAGAGGATGCAGCGAGAAAAGACATAA
Protein sequenceShow/hide protein sequence
MEEEENIMGKEILQPILRKEKKTEDGDDSSQQQPNILLCRQLSSENGKQHVAFNRALILLVFGVLDDSPPLEHYNLSSHGLVLRAPLSIVIGPKEVNNIPRLNGSNFKDR
KEDIQIVLGCRDLDLALRVDRPTSTEENPNKVEIEKWDRSNRMCLMIMKHSIPETFRGSIVEGTNVKGNMREYIMQMSNVATKLKALKLEVSKDFLVHLVLNSLPAEYSH
FWVSYNTQKDKWSMNELISHCVQEEERMQREKT