CuGenDBv2

Moc04g21140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g21140
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon opus
Genome location: chr4:15388976..15390671
RNA-Seq Expression: Moc04g21140
Synteny: Moc04g21140
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
XP_022144016.1 uncharacterized protein LOC111013805 [Momordica charantia] (e value 2.6e-19, 48.84% identity)
Query:  MINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPT
        M NND TVQSQ ASLRNLELQVGQLA DLKSR  GA PSD EVPKRDG                        +  E T V+ G++Q  +DS+PAE++  T
Subjt:  MINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPT

Query:  PSTHTVEQPREVQNSSSEEVNPVNIKAAD
        P   T  QP++ QN+S + VNPV ++A +
Subjt:  PSTHTVEQPREVQNSSSEEVNPVNIKAAD

XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia] (e value 1.3e-18, 31.67% identity)
Query:  MNAPNPPPRPPIPLNVRIGEIVYRVPVAADPEVAVPPLNVVLLIDDIDREIR--------------TMKEATKKCCGLSYSRSHSE-----------MKP
        MN PNP    PIP NVRI EIV  VPVA + EV VP LNVVLL   IDREIR              T +E T     L     +++           ++ 
Subjt:  MNAPNPPPRPPIPLNVRIGEIVYRVPVAADPEVAVPPLNVVLLIDDIDREIR--------------TMKEATKKCCGLSYSRSHSE-----------MKP

Query:  EHGSQLANLGSVS-------------------------SDF------------------------CKH--------------GL--TTRLVINALANRAL
        E  + L +L S S                         SD                         C H              GL   TRLV     N AL
Subjt:  EHGSQLANLGSVS-------------------------SDF------------------------CKH--------------GL--TTRLVINALANRAL

Query:  LAKPYAEAFNILERISSNNHSWSDPRAIQAKLIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVV-----------NEP
        LAKPYAEAFNILERISSN HS SD RAIQ +  K+  +N   +  +  + + N+   V +  T  +    GA    A      G             N P
Subjt:  LAKPYAEAFNILERISSNNHSWSDPRAIQAKLIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVV-----------NEP

Query:  THVEQ----GQSQRAKDSEPAEIVSPTPSTH--------------------------------TVEQPREVQNSSSEEVNPVNIKAADVESMQIRVLEKR
         + E     G +  ++++  +   +P    H                                 VEQ RE QNSS+EEVNPVN  A+   S QIRV +KR
Subjt:  THVEQ----GQSQRAKDSEPAEIVSPTPSTH--------------------------------TVEQPREVQNSSSEEVNPVNIKAADVESMQIRVLEKR

Query:  KQAEDDNAPEEYRPAPPSPK
        KQ E ++A  EY+ APP PK
Subjt:  KQAEDDNAPEEYRPAPPSPK

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia] (e value 2.0e-32, 36.86% identity)
Query:  PEHGSQLANLGSVSSDFCKHGL--TTRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAK-------------------------------
        P HG      GS+  +    GL   TRLVI+A  N ALL KPYA+A NILERISS+NHSWSD RAI+ K                               
Subjt:  PEHGSQLANLGSVSSDFCKHGL--TTRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAK-------------------------------

Query:  ----------------------------------------------------------------LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKS
                                                                        ++KQYM NND TVQSQAASLRNLELQVGQLA DLKS
Subjt:  ----------------------------------------------------------------LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKS

Query:  RPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEEVNPVNIKAADVE
        RP GALPSD EVPKRD                         +  EP  + QG+ Q  +DSEPAE+V P P     EQP+E QN+S + VNPV  +A +  
Subjt:  RPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEEVNPVNIKAADVE

Query:  SMQIRVLEKRKQ
        S Q  + EK  +
Subjt:  SMQIRVLEKRKQ

XP_022158740.1 uncharacterized protein LOC111025203 [Momordica charantia] (e value 2.8e-21, 48.61% identity)
Query:  LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEE
        L+KQYM  N+VTVQS AASLRNLELQVGQLATDLKSRPYGALPSD +V                                       EQP++ Q+ +S+E
Subjt:  LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEE

Query:  VNPVNIKAADVESMQIRVLEKRKQAEDDNAPEEYRPAPPSPKWL
        VNPVN KA++  +   +V EKRK+ E ++AP E+RP PP PK L
Subjt:  VNPVNIKAADVESMQIRVLEKRKQAEDDNAPEEYRPAPPSPKWL

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia] (e value 1.5e-19, 38.73% identity)
Query:  TRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAKLIK-----------------------------------------------------
        TRLVI+A AN ALLAKPYAEAFNILERISSNN SWSDPRAI  K  K                                                     
Subjt:  TRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAKLIK-----------------------------------------------------

Query:  ------------------------------------------------------QYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVP
                                                              +YM NND TVQSQA SLRNLE+QVGQLATDLKS+P G LPSD +VP
Subjt:  ------------------------------------------------------QYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVP

Query:  KRDG
        KRDG
Subjt:  KRDG

TrEMBL top hits (e value, %identity, alignment)
A0A6J1CS22 uncharacterized protein LOC111013805 (e value 1.3e-19, 48.84% identity)
Query:  MINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPT
        M NND TVQSQ ASLRNLELQVGQLA DLKSR  GA PSD EVPKRDG                        +  E T V+ G++Q  +DS+PAE++  T
Subjt:  MINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPT

Query:  PSTHTVEQPREVQNSSSEEVNPVNIKAAD
        P   T  QP++ QN+S + VNPV ++A +
Subjt:  PSTHTVEQPREVQNSSSEEVNPVNIKAAD

A0A6J1DAK9 uncharacterized protein LOC111018910 (e value 6.2e-19, 31.67% identity)
Query:  MNAPNPPPRPPIPLNVRIGEIVYRVPVAADPEVAVPPLNVVLLIDDIDREIR--------------TMKEATKKCCGLSYSRSHSE-----------MKP
        MN PNP    PIP NVRI EIV  VPVA + EV VP LNVVLL   IDREIR              T +E T     L     +++           ++ 
Subjt:  MNAPNPPPRPPIPLNVRIGEIVYRVPVAADPEVAVPPLNVVLLIDDIDREIR--------------TMKEATKKCCGLSYSRSHSE-----------MKP

Query:  EHGSQLANLGSVS-------------------------SDF------------------------CKH--------------GL--TTRLVINALANRAL
        E  + L +L S S                         SD                         C H              GL   TRLV     N AL
Subjt:  EHGSQLANLGSVS-------------------------SDF------------------------CKH--------------GL--TTRLVINALANRAL

Query:  LAKPYAEAFNILERISSNNHSWSDPRAIQAKLIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVV-----------NEP
        LAKPYAEAFNILERISSN HS SD RAIQ +  K+  +N   +  +  + + N+   V +  T  +    GA    A      G             N P
Subjt:  LAKPYAEAFNILERISSNNHSWSDPRAIQAKLIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVV-----------NEP

Query:  THVEQ----GQSQRAKDSEPAEIVSPTPSTH--------------------------------TVEQPREVQNSSSEEVNPVNIKAADVESMQIRVLEKR
         + E     G +  ++++  +   +P    H                                 VEQ RE QNSS+EEVNPVN  A+   S QIRV +KR
Subjt:  THVEQ----GQSQRAKDSEPAEIVSPTPSTH--------------------------------TVEQPREVQNSSSEEVNPVNIKAADVESMQIRVLEKR

Query:  KQAEDDNAPEEYRPAPPSPK
        KQ E ++A  EY+ APP PK
Subjt:  KQAEDDNAPEEYRPAPPSPK

A0A6J1DWK1 uncharacterized protein LOC111025053 (e value 9.9e-33, 36.86% identity)
Query:  PEHGSQLANLGSVSSDFCKHGL--TTRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAK-------------------------------
        P HG      GS+  +    GL   TRLVI+A  N ALL KPYA+A NILERISS+NHSWSD RAI+ K                               
Subjt:  PEHGSQLANLGSVSSDFCKHGL--TTRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAK-------------------------------

Query:  ----------------------------------------------------------------LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKS
                                                                        ++KQYM NND TVQSQAASLRNLELQVGQLA DLKS
Subjt:  ----------------------------------------------------------------LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKS

Query:  RPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEEVNPVNIKAADVE
        RP GALPSD EVPKRD                         +  EP  + QG+ Q  +DSEPAE+V P P     EQP+E QN+S + VNPV  +A +  
Subjt:  RPYGALPSDAEVPKRDG------------------------VVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEEVNPVNIKAADVE

Query:  SMQIRVLEKRKQ
        S Q  + EK  +
Subjt:  SMQIRVLEKRKQ

A0A6J1DWN2 uncharacterized protein LOC111025203 (e value 1.3e-21, 48.61% identity)
Query:  LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEE
        L+KQYM  N+VTVQS AASLRNLELQVGQLATDLKSRPYGALPSD +V                                       EQP++ Q+ +S+E
Subjt:  LIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVVNEPTHVEQGQSQRAKDSEPAEIVSPTPSTHTVEQPREVQNSSSEE

Query:  VNPVNIKAADVESMQIRVLEKRKQAEDDNAPEEYRPAPPSPKWL
        VNPVN KA++  +   +V EKRK+ E ++AP E+RP PP PK L
Subjt:  VNPVNIKAADVESMQIRVLEKRKQAEDDNAPEEYRPAPPSPKWL

A0A6J1DXK5 uncharacterized protein LOC111025500 (e value 7.4e-20, 38.73% identity)
Query:  TRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAKLIK-----------------------------------------------------
        TRLVI+A AN ALLAKPYAEAFNILERISSNN SWSDPRAI  K  K                                                     
Subjt:  TRLVINALANRALLAKPYAEAFNILERISSNNHSWSDPRAIQAKLIK-----------------------------------------------------

Query:  ------------------------------------------------------QYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVP
                                                              +YM NND TVQSQA SLRNLE+QVGQLATDLKS+P G LPSD +VP
Subjt:  ------------------------------------------------------QYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVP

Query:  KRDG
        KRDG
Subjt:  KRDG

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATGCACCTAACCCACCTCCGCGCCCGCCTATTCCATTGAATGTGAGGATTGGGGAAATAGTATACAGGGTTCCCGTTGCTGCTGACCCTGAGGTAGCAGTGCCCCC
TCTCAATGTCGTATTACTAATAGATGACATCGACAGAGAAATCAGGACAATGAAGGAGGCAACAAAGAAGTGCTGTGGCTTAAGCTATTCCCGTAGTCACTCAGAGATGA
AGCCAGAACATGGGAGTCAGTTAGCGAATCTTGGGAGCGTTTCTAGCGACTTTTGCAAACATGGCCTCACGACGCGCTTAGTCATCAATGCGTTAGCAAATAGGGCTTTG
CTAGCGAAACCCTATGCTGAAGCATTCAACATCTTGGAAAGGATATCGTCCAACAACCACTCATGGTCTGACCCTAGAGCTATTCAAGCAAAGCTAATAAAGCAGTACAT
GATAAATAATGACGTCACTGTGCAAAGTCAGGCCGCATCACTAAGAAACCTAGAGTTGCAAGTAGGCCAGTTAGCAACCGATTTGAAAAGCAGACCTTACGGAGCATTAC
CTAGCGATGCTGAAGTGCCAAAGAGAGATGGGGTAGTAAATGAGCCTACTCACGTAGAACAAGGACAATCCCAGAGAGCAAAAGATAGTGAGCCAGCAGAAATAGTTTCA
CCTACCCCATCAACGCATACTGTTGAGCAACCAAGAGAAGTTCAAAATTCTTCCAGTGAAGAGGTTAACCCAGTGAATATTAAGGCAGCTGATGTAGAATCAATGCAGAT
TAGAGTGCTCGAGAAAAGAAAGCAGGCAGAGGATGATAATGCTCCAGAAGAATACAGACCAGCACCACCATCTCCTAAGTGGTTGTAG
mRNA sequence
ATGAATGCACCTAACCCACCTCCGCGCCCGCCTATTCCATTGAATGTGAGGATTGGGGAAATAGTATACAGGGTTCCCGTTGCTGCTGACCCTGAGGTAGCAGTGCCCCC
TCTCAATGTCGTATTACTAATAGATGACATCGACAGAGAAATCAGGACAATGAAGGAGGCAACAAAGAAGTGCTGTGGCTTAAGCTATTCCCGTAGTCACTCAGAGATGA
AGCCAGAACATGGGAGTCAGTTAGCGAATCTTGGGAGCGTTTCTAGCGACTTTTGCAAACATGGCCTCACGACGCGCTTAGTCATCAATGCGTTAGCAAATAGGGCTTTG
CTAGCGAAACCCTATGCTGAAGCATTCAACATCTTGGAAAGGATATCGTCCAACAACCACTCATGGTCTGACCCTAGAGCTATTCAAGCAAAGCTAATAAAGCAGTACAT
GATAAATAATGACGTCACTGTGCAAAGTCAGGCCGCATCACTAAGAAACCTAGAGTTGCAAGTAGGCCAGTTAGCAACCGATTTGAAAAGCAGACCTTACGGAGCATTAC
CTAGCGATGCTGAAGTGCCAAAGAGAGATGGGGTAGTAAATGAGCCTACTCACGTAGAACAAGGACAATCCCAGAGAGCAAAAGATAGTGAGCCAGCAGAAATAGTTTCA
CCTACCCCATCAACGCATACTGTTGAGCAACCAAGAGAAGTTCAAAATTCTTCCAGTGAAGAGGTTAACCCAGTGAATATTAAGGCAGCTGATGTAGAATCAATGCAGAT
TAGAGTGCTCGAGAAAAGAAAGCAGGCAGAGGATGATAATGCTCCAGAAGAATACAGACCAGCACCACCATCTCCTAAGTGGTTGTAG
Protein sequence
MNAPNPPPRPPIPLNVRIGEIVYRVPVAADPEVAVPPLNVVLLIDDIDREIRTMKEATKKCCGLSYSRSHSEMKPEHGSQLANLGSVSSDFCKHGLTTRLVINALANRAL
LAKPYAEAFNILERISSNNHSWSDPRAIQAKLIKQYMINNDVTVQSQAASLRNLELQVGQLATDLKSRPYGALPSDAEVPKRDGVVNEPTHVEQGQSQRAKDSEPAEIVS
PTPSTHTVEQPREVQNSSSEEVNPVNIKAADVESMQIRVLEKRKQAEDDNAPEEYRPAPPSPKWL