CuGenDBv2

Moc03g20070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g20070
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr3:13526617..13527705
RNA-Seq Expression: Moc03g20070
Synteny: Moc03g20070
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022153526.1 uncharacterized protein LOC111021009 [Momordica charantia] | e value: 3.0e-70 | identity: 50.18%
Query:  NPFILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQ
        N F LP I+DDALRL LFPFSL  QA  W N+F P SI +W  +V+KFL+KYF PT++AD++EEI++F Q + E V+EAWERFKEL++ CPN  +PAC+Q
Subjt:  NPFILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQ

Query:  IEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPK--------------------------------------------
        IE+F+RG + PTKMMLN AANG FT  T+NEIV IL  L  HN+LWCS+RS+  PK                                            
Subjt:  IEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPK--------------------------------------------

Query:  -----------------NDNHVYENCPHNPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQN
                          D+H  ENCP  PAS+ Y+GQGN RNF+PYSNTYN GWRHHPNFSW GQGSS+ A Q QN
Subjt:  -----------------NDNHVYENCPHNPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQN

XP_022155016.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 [Momordica charantia] | e value: 3.4e-82 | identity: 73.24%
Query:  AFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVE
        AFQNLDS ILNPIP+AANFEL P                                 F  PNITDDALRLTLFPFSLKD+ARTW N F PGSITTW+ LVE
Subjt:  AFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVE

Query:  KFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELW
        KFL+KYF PTRHADI EEIVTF QYDREPVHEAWERFKELL+KCPNHGLPACIQIE+FFRGL+HPTKMMLNNAANGAFTK TFNEIVDIL+DLASHNELW
Subjt:  KFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELW

Query:  CSQRSKPTPKNDN
        CSQRSKP PK  +
Subjt:  CSQRSKPTPKNDN

XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia] | e value: 6.5e-73 | identity: 43.68%
Query:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNPFI---------------------------------LPNITDDALRLTLFPFSLKDQAR
        N I +AD +D  MR+YAAT  ++L+S ++NP+P  A FE  P +                                 LP I+DDALRLTLFPFSL  QA 
Subjt:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNPFI---------------------------------LPNITDDALRLTLFPFSLKDQAR

Query:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN
         W N+F  G+ITTW  +V+KFL KYF PTR+AD++EEI++F Q + E V+ AWE FK+L++ CPN G+PAC+QIE+FFRG + PTKMMLN AANG FT  
Subjt:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN

Query:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH
        +FNEIV+IL  L+ HN+ W S+RS+                                                 P+P             D H  ENCP 
Subjt:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH

Query:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ
        NP+S+YY+GQ N + FNPYSNTY+ GW+ HPNFSW GQGSSS   Q Q
Subjt:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] | e value: 7.9e-87 | identity: 54.79%
Query:  MAGQQQNNEFNHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTL
        M    +N+EFN+I MADNRDV MREYAATAFQN DS I+NPIP   NFEL P                                 F LP ITDDA  LTL
Subjt:  MAGQQQNNEFNHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTL

Query:  FPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLN
        FPFSLKDQAR   N+F  GSITTW  LVEKFL+K+F PTRHADI+EEI++F QYDREPVHEAWERFKEL++KC NHGLPAC QIE+FFRGL+HPTKMMLN
Subjt:  FPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLN

Query:  NAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPKNDNHVYENCPHNPASVYYIGQGNNRN------------------------FNPYSNTYNSG
        NAANGAFTK TFNEIVDIL DLASHNELWCSQRS+  PK           +PA V  +    +                            P  + Y + 
Subjt:  NAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPKNDNHVYENCPHNPASVYYIGQGNNRN------------------------FNPYSNTYNSG

Query:  -----------WRHHPNFSWEGQGSSSRANQGQN
                   WRHHPNFSW GQG SS  NQGQ+
Subjt:  -----------WRHHPNFSWEGQGSSSRANQGQN

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia] | e value: 1.1e-69 | identity: 42.82%
Query:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQAR
        N I +AD RD  MR+YAA   ++L+S ++N  P  A FE  P                                 F LP I+DDALRLTLFPFS+  QA 
Subjt:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQAR

Query:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN
         W N+F   +ITTW  +V+KFL KYF PTR+AD++EEI++F Q + E V+ AWERFK+L+  CPN G+PAC+QIE+FFRG +  TKMMLN AANG FT  
Subjt:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN

Query:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH
        +FNEIV+IL  L+ HN  WCS++S+                                                 P+P             D H  ENCP 
Subjt:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH

Query:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ
        NP+S+YY+GQ N + FNPYSNTYN GW+ HPNFSW GQGSS+     Q
Subjt:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DKX0 uncharacterized protein LOC111021009 | e value: 1.5e-70 | identity: 50.18%
Query:  NPFILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQ
        N F LP I+DDALRL LFPFSL  QA  W N+F P SI +W  +V+KFL+KYF PT++AD++EEI++F Q + E V+EAWERFKEL++ CPN  +PAC+Q
Subjt:  NPFILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQ

Query:  IEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPK--------------------------------------------
        IE+F+RG + PTKMMLN AANG FT  T+NEIV IL  L  HN+LWCS+RS+  PK                                            
Subjt:  IEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPK--------------------------------------------

Query:  -----------------NDNHVYENCPHNPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQN
                          D+H  ENCP  PAS+ Y+GQGN RNF+PYSNTYN GWRHHPNFSW GQGSS+ A Q QN
Subjt:  -----------------NDNHVYENCPHNPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQN

A0A6J1DQF5 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 | e value: 1.7e-82 | identity: 73.24%
Query:  AFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVE
        AFQNLDS ILNPIP+AANFEL P                                 F  PNITDDALRLTLFPFSLKD+ARTW N F PGSITTW+ LVE
Subjt:  AFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVE

Query:  KFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELW
        KFL+KYF PTRHADI EEIVTF QYDREPVHEAWERFKELL+KCPNHGLPACIQIE+FFRGL+HPTKMMLNNAANGAFTK TFNEIVDIL+DLASHNELW
Subjt:  KFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELW

Query:  CSQRSKPTPKNDN
        CSQRSKP PK  +
Subjt:  CSQRSKPTPKNDN

A0A6J1DSZ5 uncharacterized protein LOC111024107 | e value: 3.1e-73 | identity: 43.68%
Query:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNPFI---------------------------------LPNITDDALRLTLFPFSLKDQAR
        N I +AD +D  MR+YAAT  ++L+S ++NP+P  A FE  P +                                 LP I+DDALRLTLFPFSL  QA 
Subjt:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNPFI---------------------------------LPNITDDALRLTLFPFSLKDQAR

Query:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN
         W N+F  G+ITTW  +V+KFL KYF PTR+AD++EEI++F Q + E V+ AWE FK+L++ CPN G+PAC+QIE+FFRG + PTKMMLN AANG FT  
Subjt:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN

Query:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH
        +FNEIV+IL  L+ HN+ W S+RS+                                                 P+P             D H  ENCP 
Subjt:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH

Query:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ
        NP+S+YY+GQ N + FNPYSNTY+ GW+ HPNFSW GQGSSS   Q Q
Subjt:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ

A0A6J1DW02 uncharacterized protein LOC111024897 | e value: 3.8e-87 | identity: 54.79%
Query:  MAGQQQNNEFNHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTL
        M    +N+EFN+I MADNRDV MREYAATAFQN DS I+NPIP   NFEL P                                 F LP ITDDA  LTL
Subjt:  MAGQQQNNEFNHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTL

Query:  FPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLN
        FPFSLKDQAR   N+F  GSITTW  LVEKFL+K+F PTRHADI+EEI++F QYDREPVHEAWERFKEL++KC NHGLPAC QIE+FFRGL+HPTKMMLN
Subjt:  FPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLN

Query:  NAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPKNDNHVYENCPHNPASVYYIGQGNNRN------------------------FNPYSNTYNSG
        NAANGAFTK TFNEIVDIL DLASHNELWCSQRS+  PK           +PA V  +    +                            P  + Y + 
Subjt:  NAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPKNDNHVYENCPHNPASVYYIGQGNNRN------------------------FNPYSNTYNSG

Query:  -----------WRHHPNFSWEGQGSSSRANQGQN
                   WRHHPNFSW GQG SS  NQGQ+
Subjt:  -----------WRHHPNFSWEGQGSSSRANQGQN

A0A6J1DY39 uncharacterized protein LOC111025653 | e value: 5.6e-70 | identity: 42.82%
Query:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQAR
        N I +AD RD  MR+YAA   ++L+S ++N  P  A FE  P                                 F LP I+DDALRLTLFPFS+  QA 
Subjt:  NHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNP---------------------------------FILPNITDDALRLTLFPFSLKDQAR

Query:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN
         W N+F   +ITTW  +V+KFL KYF PTR+AD++EEI++F Q + E V+ AWERFK+L+  CPN G+PAC+QIE+FFRG +  TKMMLN AANG FT  
Subjt:  TWFNSFSPGSITTWDLLVEKFLSKYFSPTRHADIKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKN

Query:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH
        +FNEIV+IL  L+ HN  WCS++S+                                                 P+P             D H  ENCP 
Subjt:  TFNEIVDILKDLASHNELWCSQRSK-------------------------------------------------PTPK-----------NDNHVYENCPH

Query:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ
        NP+S+YY+GQ N + FNPYSNTYN GW+ HPNFSW GQGSS+     Q
Subjt:  NPASVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQ

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCAGGTCAACAACAAAACAATGAGTTCAATCATATCCTAATGGCAGATAATAGAGACGTGGTCATGCGAGAATATGCTGCCACAGCATTCCAGAATTTGGATTCTGT
CATCCTAAATCCCATTCCAGAAGCCGCCAACTTCGAATTGAATCCGTTTATACTTCCAAATATAACTGATGATGCATTAAGGTTAACTCTTTTCCCATTTTCTCTTAAGG
ATCAGGCAAGAACATGGTTCAACTCGTTCTCGCCAGGATCAATTACAACATGGGATTTGTTAGTAGAGAAGTTCCTTTCAAAGTATTTCTCTCCCACTCGCCATGCTGAC
ATCAAGGAAGAGATTGTCACTTTTATACAATATGACCGTGAACCAGTGCATGAGGCATGGGAGAGATTCAAGGAGTTACTGCAGAAGTGTCCGAATCATGGATTACCAGC
ATGTATCCAGATTGAGTATTTCTTCAGAGGTTTGAACCACCCCACTAAGATGATGCTCAACAATGCCGCAAATGGAGCTTTCACGAAAAATACCTTCAACGAAATAGTTG
ATATTTTGAAGGACCTAGCTTCCCACAACGAGCTATGGTGTTCTCAAAGGTCGAAACCGACACCTAAGAATGATAACCATGTTTATGAGAATTGTCCCCATAACCCAGCT
TCTGTTTATTACATAGGTCAGGGGAACAACCGCAACTTCAACCCCTATTCGAACACATACAATTCAGGGTGGAGACACCATCCCAACTTTTCATGGGAAGGTCAAGGAAG
TTCCAGTAGAGCAAACCAAGGGCAGAACTAG
mRNA sequence
ATGGCAGGTCAACAACAAAACAATGAGTTCAATCATATCCTAATGGCAGATAATAGAGACGTGGTCATGCGAGAATATGCTGCCACAGCATTCCAGAATTTGGATTCTGT
CATCCTAAATCCCATTCCAGAAGCCGCCAACTTCGAATTGAATCCGTTTATACTTCCAAATATAACTGATGATGCATTAAGGTTAACTCTTTTCCCATTTTCTCTTAAGG
ATCAGGCAAGAACATGGTTCAACTCGTTCTCGCCAGGATCAATTACAACATGGGATTTGTTAGTAGAGAAGTTCCTTTCAAAGTATTTCTCTCCCACTCGCCATGCTGAC
ATCAAGGAAGAGATTGTCACTTTTATACAATATGACCGTGAACCAGTGCATGAGGCATGGGAGAGATTCAAGGAGTTACTGCAGAAGTGTCCGAATCATGGATTACCAGC
ATGTATCCAGATTGAGTATTTCTTCAGAGGTTTGAACCACCCCACTAAGATGATGCTCAACAATGCCGCAAATGGAGCTTTCACGAAAAATACCTTCAACGAAATAGTTG
ATATTTTGAAGGACCTAGCTTCCCACAACGAGCTATGGTGTTCTCAAAGGTCGAAACCGACACCTAAGAATGATAACCATGTTTATGAGAATTGTCCCCATAACCCAGCT
TCTGTTTATTACATAGGTCAGGGGAACAACCGCAACTTCAACCCCTATTCGAACACATACAATTCAGGGTGGAGACACCATCCCAACTTTTCATGGGAAGGTCAAGGAAG
TTCCAGTAGAGCAAACCAAGGGCAGAACTAG
Protein sequence
MAGQQQNNEFNHILMADNRDVVMREYAATAFQNLDSVILNPIPEAANFELNPFILPNITDDALRLTLFPFSLKDQARTWFNSFSPGSITTWDLLVEKFLSKYFSPTRHAD
IKEEIVTFIQYDREPVHEAWERFKELLQKCPNHGLPACIQIEYFFRGLNHPTKMMLNNAANGAFTKNTFNEIVDILKDLASHNELWCSQRSKPTPKNDNHVYENCPHNPA
SVYYIGQGNNRNFNPYSNTYNSGWRHHPNFSWEGQGSSSRANQGQN
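
One way to check that the CDS and protein sequences listed above agree is to translate the CDS and compare the result with the protein. The Python sketch below is illustrative only and is not part of the CuGenDB record: it assumes the standard genetic code (NCBI translation table 1), and the function name `translate` is a hypothetical helper. Whitespace and line breaks in the pasted sequence are ignored, and any codon containing a character other than A, C, G, or T is reported as `X`.

```python
from itertools import product

# Standard genetic code (NCBI table 1, assumed here), written in the
# conventional TCAG order so that codons and amino acids line up by position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First seven codons of the CDS listed above.
    print(translate("ATGGCAGGTCAACAACAAAAC"))  # MAGQQQN
```

To compare full sequences, paste the complete CDS into `translate`, strip the trailing `*` from the result, and test it for equality against the protein sequence with its line breaks removed.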