CuGenDBv2

Moc03g03880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g03880
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr3:2872726..2873673
RNA-Seq Expression: Moc03g03880
Synteny: Moc03g03880
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
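
The genome location above uses a chromosome:start..end form. As a minimal sketch, it could be split into its parts as follows. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr3:2872726..2873673")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.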


Homology
GenBank top hits (e value, % identity, alignment)
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia] (e value: 2.4e-59; % identity: 57.31)
Query:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAE
        MNRN QDPP PQNPPVNGDM GEG ANRA +IPN I LADNRDVAMRNY+T AFHN NSGINN L QAAQ E+K VM  M                    
Subjt:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAE

Query:  LREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLP-TCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDI
           + + ++  LT                    ++ +   K  +    +  LP       +   GLDRSSRMMLNTAANGSLLEKSVNEI+DILNKM DI
Subjt:  LREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLP-TCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDI

Query:  NDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVIS
        NDQGE GRSL KKQVSA +FELDTVA MQA+MA MNQMLKQ TMEKETKTV S
Subjt:  NDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVIS

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia] (e value: 2.3e-86; % identity: 69.03)
Query:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGI--NNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTW
        MN NPQDPP P NPPV+GD  GEG ANRA ++PN I L DNRDVA+RNY+THAFHN NS +  + P+ +A   +     S +       NA + + ++  
Subjt:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGI--NNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTW

Query:  AELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTD
        A LR         L  N DLREDIVSFRQKENEAVQE WERFKELLRRC SHGLPTCVQIEQFYRGLDR SRMMLNTAAN SL EKS++EIIDILNKMTD
Subjt:  AELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTD

Query:  INDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVISAIPERPPVLQISDI
         NDQGEIGRSLPKKQVSARVFELDTVASMQA+MAT+NQMLKQLTMEKETKT  SA+ E    LQISDI
Subjt:  INDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVISAIPERPPVLQISDI

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] (e value: 5.5e-48; % identity: 41.25)
Query:  PQDPPRPQNPPVNGDMTGEGVANRARKIP-NLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM-----------------------
        P+DP  P  P VNG+M      + AR    N I +ADNRDVAMR Y   AF N +SGI NP+     FE+K +M QM                       
Subjt:  PQDPPRPQNPPVNGDMTGEGVANRARKIP-NLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM-----------------------

Query:  --------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPT
                                  D AR  LNA    SI TW  L EKFL K+   TR+ D+RE+I+SFRQ + E V EAWERFKEL+R+C +HGLP 
Subjt:  --------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPT

Query:  CVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQ--GEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTM--EKETKTV
        C QIE F+RGLD  ++MMLN AANG+  +K+ NEI+DILN +   N+    +  R+ PKKQ  A V  LD   SMQ EM TMNQ LK++ +  +    T 
Subjt:  CVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQ--GEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTM--EKETKTV

Query:  ISAIPE----RPPVLQISDI
        I  +        PV Q++D+
Subjt:  ISAIPE----RPPVLQISDI

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia] (e value: 3.5e-119; % identity: 73.02)
Query:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM--------------------
        MNRN QDPP PQNPPVNGDM GE  ANR  +IPNLI LADNRDVAMRNY+THAFHN NSGINNPL QAAQFE+K VM Q+                    
Subjt:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM--------------------

Query:  -----------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
                                     DGARTW+NALEPNSINTWAEL +KFLAKYHTLT+N DLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
Subjt:  -----------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG

Query:  LPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVI
        LP+CVQIEQFYRGLDRSS+MMLNT ANGSLLEKSVNEI+D+LNKMTDINDQGE+GRSLPKKQVS  +FELDTVASMQA+MA MNQMLKQLTMEKETKTV 
Subjt:  LPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVI

Query:  SAIPERPPVLQISDI
        SAIPE  P+LQISDI
Subjt:  SAIPERPPVLQISDI

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia] (e value: 1.5e-69; % identity: 65.77)
Query:  MRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLR
        ++++I  A   Q  G++    +   F   L     DGA TW+N LE N I TWAEL +KFLAKYHTLTRN DL+EDIVSFRQ+E+EAVQEAWERFKELL+
Subjt:  MRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLR

Query:  RCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEK
        RC SHGLPTCVQI+QFYRGLD   RMM +TAAN SLLEKSVNEIIDILNKM DINDQ E+GRSLPKKQ SA +FELDTV S+QA+++ M+QMLKQLTM+K
Subjt:  RCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEK

Query:  ETKTVISAIPERP-PVLQISDI
          K   S I   P  +LQISDI
Subjt:  ETKTVISAIPERP-PVLQISDI

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 (e value: 1.2e-59; % identity: 57.31)
Query:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAE
        MNRN QDPP PQNPPVNGDM GEG ANRA +IPN I LADNRDVAMRNY+T AFHN NSGINN L QAAQ E+K VM  M                    
Subjt:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAE

Query:  LREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLP-TCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDI
           + + ++  LT                    ++ +   K  +    +  LP       +   GLDRSSRMMLNTAANGSLLEKSVNEI+DILNKM DI
Subjt:  LREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLP-TCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDI

Query:  NDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVIS
        NDQGE GRSL KKQVSA +FELDTVA MQA+MA MNQMLKQ TMEKETKTV S
Subjt:  NDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVIS

A0A6J1DW02 uncharacterized protein LOC111024897 (e value: 2.7e-48; % identity: 41.25)
Query:  PQDPPRPQNPPVNGDMTGEGVANRARKIP-NLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM-----------------------
        P+DP  P  P VNG+M      + AR    N I +ADNRDVAMR Y   AF N +SGI NP+     FE+K +M QM                       
Subjt:  PQDPPRPQNPPVNGDMTGEGVANRARKIP-NLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM-----------------------

Query:  --------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPT
                                  D AR  LNA    SI TW  L EKFL K+   TR+ D+RE+I+SFRQ + E V EAWERFKEL+R+C +HGLP 
Subjt:  --------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPT

Query:  CVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQ--GEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTM--EKETKTV
        C QIE F+RGLD  ++MMLN AANG+  +K+ NEI+DILN +   N+    +  R+ PKKQ  A V  LD   SMQ EM TMNQ LK++ +  +    T 
Subjt:  CVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQ--GEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTM--EKETKTV

Query:  ISAIPE----RPPVLQISDI
        I  +        PV Q++D+
Subjt:  ISAIPE----RPPVLQISDI

A0A6J1DYY9 uncharacterized protein LOC111025557 (e value: 5.6e-70; % identity: 66.22)
Query:  MRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLR
        ++++I  A   Q  G++    +   F   L     DGA TWLN LE N I TWAEL +KFLAKYHTLTRN DL+EDIVSFRQ+E+EAVQEAWERFKELL+
Subjt:  MRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLR

Query:  RCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEK
        RC SHGLPTCVQI+QFYRGLD   RMM +TAAN SLLEKSVNEIIDILNKM DINDQ E+GRSLPKKQ SA +FELDTV S+QA+++ M+QMLKQLTM+K
Subjt:  RCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEK

Query:  ETKTVISAIPERP-PVLQISDI
          K   S I   P  +LQISDI
Subjt:  ETKTVISAIPERP-PVLQISDI

A0A6J1DZ19 uncharacterized protein LOC111024824 (e value: 1.1e-86; % identity: 69.03)
Query:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGI--NNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTW
        MN NPQDPP P NPPV+GD  GEG ANRA ++PN I L DNRDVA+RNY+THAFHN NS +  + P+ +A   +     S +       NA + + ++  
Subjt:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGI--NNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTW

Query:  AELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTD
        A LR         L  N DLREDIVSFRQKENEAVQE WERFKELLRRC SHGLPTCVQIEQFYRGLDR SRMMLNTAAN SL EKS++EIIDILNKMTD
Subjt:  AELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTD

Query:  INDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVISAIPERPPVLQISDI
         NDQGEIGRSLPKKQVSARVFELDTVASMQA+MAT+NQMLKQLTMEKETKT  SA+ E    LQISDI
Subjt:  INDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVISAIPERPPVLQISDI

A0A6J1E251 uncharacterized protein LOC111025302 (e value: 1.7e-119; % identity: 73.02)
Query:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM--------------------
        MNRN QDPP PQNPPVNGDM GE  ANR  +IPNLI LADNRDVAMRNY+THAFHN NSGINNPL QAAQFE+K VM Q+                    
Subjt:  MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQM--------------------

Query:  -----------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
                                     DGARTW+NALEPNSINTWAEL +KFLAKYHTLT+N DLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
Subjt:  -----------------------------DGARTWLNALEPNSINTWAELREKFLAKYHTLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG

Query:  LPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVI
        LP+CVQIEQFYRGLDRSS+MMLNT ANGSLLEKSVNEI+D+LNKMTDINDQGE+GRSLPKKQVS  +FELDTVASMQA+MA MNQMLKQLTMEKETKTV 
Subjt:  LPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFELDTVASMQAEMATMNQMLKQLTMEKETKTVI

Query:  SAIPERPPVLQISDI
        SAIPE  P+LQISDI
Subjt:  SAIPERPPVLQISDI

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGAATAGAAATCCACAAGATCCTCCACGGCCACAAAATCCACCTGTGAATGGAGATATGACAGGTGAAGGAGTAGCAAATCGAGCAAGAAAAATTCCCAATCTGATCTT
TCTAGCAGACAACCGAGATGTAGCCATGCGGAATTATATCACTCATGCGTTCCACAACCAAAATTCAGGGATAAATAATCCTTTATTCCAAGCCGCACAGTTCGAGGTTA
AGCTAGTCATGTCCCAGATGGATGGTGCAAGGACTTGGCTAAACGCACTAGAACCAAATTCTATCAACACATGGGCGGAACTGAGGGAGAAATTTTTGGCAAAGTACCAC
ACTTTGACTAGGAACGTAGACCTTCGAGAGGACATTGTGTCTTTTAGACAGAAGGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAATTACTGAGAAGATG
CCCGAGCCATGGATTGCCCACATGTGTGCAGATTGAACAATTCTATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACTGCAGCCAATGGCTCGTTGTTAGAGA
AGTCGGTAAATGAGATCATTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGCCAGAGTCTTTGAG
TTAGACACAGTAGCTTCAATGCAAGCCGAAATGGCGACTATGAACCAGATGTTAAAACAGTTGACAATGGAGAAGGAAACTAAAACCGTCATTTCGGCGATACCTGAACG
CCCTCCTGTTTTACAAATTTCAGATATCTAG
mRNA sequence
ATGAATAGAAATCCACAAGATCCTCCACGGCCACAAAATCCACCTGTGAATGGAGATATGACAGGTGAAGGAGTAGCAAATCGAGCAAGAAAAATTCCCAATCTGATCTT
TCTAGCAGACAACCGAGATGTAGCCATGCGGAATTATATCACTCATGCGTTCCACAACCAAAATTCAGGGATAAATAATCCTTTATTCCAAGCCGCACAGTTCGAGGTTA
AGCTAGTCATGTCCCAGATGGATGGTGCAAGGACTTGGCTAAACGCACTAGAACCAAATTCTATCAACACATGGGCGGAACTGAGGGAGAAATTTTTGGCAAAGTACCAC
ACTTTGACTAGGAACGTAGACCTTCGAGAGGACATTGTGTCTTTTAGACAGAAGGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAATTACTGAGAAGATG
CCCGAGCCATGGATTGCCCACATGTGTGCAGATTGAACAATTCTATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACTGCAGCCAATGGCTCGTTGTTAGAGA
AGTCGGTAAATGAGATCATTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGCCAGAGTCTTTGAG
TTAGACACAGTAGCTTCAATGCAAGCCGAAATGGCGACTATGAACCAGATGTTAAAACAGTTGACAATGGAGAAGGAAACTAAAACCGTCATTTCGGCGATACCTGAACG
CCCTCCTGTTTTACAAATTTCAGATATCTAG
Protein sequence
MNRNPQDPPRPQNPPVNGDMTGEGVANRARKIPNLIFLADNRDVAMRNYITHAFHNQNSGINNPLFQAAQFEVKLVMSQMDGARTWLNALEPNSINTWAELREKFLAKYH
TLTRNVDLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPTCVQIEQFYRGLDRSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGRSLPKKQVSARVFE
LDTVASMQAEMATMNQMLKQLTMEKETKTVISAIPERPPVLQISDI
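
The relationship between the CDS and the protein sequence listed above can be checked with a small translation routine. The following is only a sketch: the `translate` helper is hypothetical, it assumes the standard nuclear genetic code (NCBI translation table 1), and it is run here on the first and last codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last four codons of the CDS shown on this page.
print(translate("ATGAATAGAAATCCACAAGATCCTCCACGG"))  # MNRNPQDPPR
print(translate("TCAGATATCTAG"))                    # SDI
```

Both fragments agree with the start (MNRNPQDPPR) and end (SDI) of the protein sequence above. Passing the full CDS, with line breaks removed, would allow a complete comparison.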