; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022453 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022453
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein, putative, unclassified, expressed
Genome locationchr7:29444150..29445575
RNA-Seq ExpressionLag0022453
SyntenyLag0022453
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]9.6e-2535.44Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRWCVPK---------------------
        +LAK  WRI+  P+S+L+R+LKGRYFKD  F+EA +  NPS  WRS + G+DL  KG RWR+GN        D W VP                      
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRWCVPK---------------------

Query:  ------------VQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKS--------------LWRIV
                    V+D F P + K IL+IP+G  +  D +IW+ +K G +SV+S Y +AL     + A +S +S   + W +              LWR+ 
Subjt:  ------------VQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKS--------------LWRIV

Query:  LKHLPT
        L  LPT
Subjt:  LKHLPT

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]1.1e-2342.04Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN------------------------------------
        MLAK SWRI++ P SLLA+ L+G+YFK   FL+A LGA PS  WRS + G+DLF KGYRW+VGN                                    
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN------------------------------------

Query:  ---DRWCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLAL
            RW   KV++ F   +   IL  PL      DEIIW  DK G FSV+SAYHL +
Subjt:  ---DRWCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLAL

XP_024039324.1 uncharacterized protein LOC112097962 [Citrus clementina]2.4e-2335.94Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--DRWCVPKVQDHFHPQDVKDILNIPLGDPSTRDEI
        M+AK  WR+I+ P+SL++++L+ RYF+   FL A  G+NPS  WRS + G+ +  KG RW +GN  +   V K+  HF  +D + I+ IPL      DE+
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--DRWCVPKVQDHFHPQDVKDILNIPLGDPSTRDEI

Query:  IWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRIVLKHLPTRSASAKWEHWTTLDYWD---WMRKNLKDEDLETTNASKE
        +W  DK G +SV+S Y +AL      +  +S ++S  K WK +W +    LP +     W     L       W RK LKD   +  N   E
Subjt:  IWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRIVLKHLPTRSASAKWEHWTTLDYWD---WMRKNLKDEDLETTNASKE

XP_030502714.1 uncharacterized protein LOC115717884 [Cannabis sativa]1.4e-2334.48Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------
        ML K +W+I+K+P+SLL R+LK RYF +   LEA  G NPS  WRS + G+DL   G  W++G+                                    
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------

Query:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLK---KSWKS---------LWRIVLKHLP
            W   K+++HF+  D++DIL +P+   S +D+IIWS +  G F+VKSAYHLAL+   ++ +S+S  +S K   K W S         +WR+V   +P
Subjt:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLK---KSWKS---------LWRIVLKHLP

Query:  TRS
          S
Subjt:  TRS

XP_030507775.1 uncharacterized protein LOC115722655 [Cannabis sativa]1.1e-2335.78Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------
        MLAK +W+I K+P SLL R+LK RYF +   LEA  G NPS  WRS + G+DL   G  W++G+                                    
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------

Query:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLK---KSWKS---------LWRIVLKHLP
            W   K++ +F   DV+DILN+P+   S +DE+IWS    G F+VKSAYHLAL+   ++ +++SDT+S K   K W S         +WR+V   +P
Subjt:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLK---KSWKS---------LWRIVLKHLP

Query:  TRSA
          S+
Subjt:  TRSA

TrEMBL top hitse value%identityAlignment
A0A6J1DAR4 uncharacterized protein LOC1110189544.6e-2535.44Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRWCVPK---------------------
        +LAK  WRI+  P+S+L+R+LKGRYFKD  F+EA +  NPS  WRS + G+DL  KG RWR+GN        D W VP                      
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRWCVPK---------------------

Query:  ------------VQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKS--------------LWRIV
                    V+D F P + K IL+IP+G  +  D +IW+ +K G +SV+S Y +AL     + A +S +S   + W +              LWR+ 
Subjt:  ------------VQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKS--------------LWRIV

Query:  LKHLPT
        L  LPT
Subjt:  LKHLPT

A0A6J1DRA0 uncharacterized protein LOC1110224235.1e-2442.04Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN------------------------------------
        MLAK SWRI++ P SLLA+ L+G+YFK   FL+A LGA PS  WRS + G+DLF KGYRW+VGN                                    
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN------------------------------------

Query:  ---DRWCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLAL
            RW   KV++ F   +   IL  PL      DEIIW  DK G FSV+SAYHL +
Subjt:  ---DRWCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLAL

A0A803Q6R6 Uncharacterized protein5.1e-2428.33Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------
        +LAK +WRI   P SLL R+LK RYF +  FL+A++G +PSLTW+S   GK+L +KG R++VGN                                    
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------

Query:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRIVLKHLPTRSASAKWEHWT
            W VP +  +F P D   IL+IPL   +  D +IW     G ++V+S +H A +   ++ +ST D                     +S  AKW+   
Subjt:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRIVLKHLPTRSASAKWEHWT

Query:  TLDYWDWMRKNLKDEDLETTNASKERGGLGWIVRNSGGSPICGGMKMINENWPMKLLEAKAIWTVLRGIESLPEKPSSIIVNSDCWELFKLLNHADINLS
            ++  + N+       TN   ++ G+G ++R+  G+ I    K++  N+    +EAKA++  L  I +L  + S   + +D   +   +NHA  +LS
Subjt:  TLDYWDWMRKNLKDEDLETTNASKERGGLGWIVRNSGGSPICGGMKMINENWPMKLLEAKAIWTVLRGIESLPEKPSSIIVNSDCWELFKLLNHADINLS

A0A803QBU0 Uncharacterized protein6.7e-2435.71Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------
        +LAK +WR+++ P+SLLA +LK +YF +  FL+AS+G +PSLTW+    G++L +KG RW++G  R                                  
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDR----------------------------------

Query:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRI
            W V  +Q +F P DV  IL +PL    +RD++IW P   G F+V+SAYHLA     E D S++ TS++   WK  W +
Subjt:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRI

A0A803QI38 Uncharacterized protein2.1e-2539.56Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN----------------------------DR------
        MLAK +WR++  P+SLLA ILK +YFK   FLEA LG  PS TW S + G+DL ++G  W+VGN                            D+      
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN----------------------------DR------

Query:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRI
            W   K+Q +F    V  IL IP+G P   D +IW+ D  G FSVKSAYH  +A    M  S+SD+S  K  WK+LW +
Subjt:  ----WCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSVKSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRI

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003101.0e-0840Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRW
        +LAK S+RII +P +LL+R+L+ RYF     +E S+G  PS  WRS + G++L  +G    +G+        DRW
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRW

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.3e-0839.06Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN
        +L K  WR++  P SL+A++ K RYF     L A LG+ PS  W+S    +++  +G R  VGN
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.1e-1040Show/hide
Query:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRW
        +LAK S+RII +P +LL+R+L+ RYF     +E S+G  PS  WRS + G++L  +G    +G+        DRW
Subjt:  MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGN--------DRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGCTAAGTTGAGTTGGAGGATAATCAAAGAGCCGTCGAGCCTTCTTGCAAGAATCCTCAAAGGGAGGTATTTTAAGGATAGGCCTTTCCTTGAAGCCTCGCTGGG
GGCTAATCCTTCCCTAACGTGGAGAAGTACCATGTTGGGCAAGGATCTCTTCCTTAAAGGCTACAGGTGGAGGGTGGGTAACGATAGGTGGTGTGTGCCCAAGGTGCAAG
ATCATTTCCACCCCCAAGATGTCAAAGATATTCTGAACATTCCTTTAGGCGACCCAAGCACAAGAGATGAAATCATCTGGAGTCCGGATAAGAAGGGGAGATTCTCGGTT
AAGAGTGCCTACCACTTAGCTTTGGCTAAAGCAGGAGAGATGGATGCTTCGACTTCAGACACTAGTAGTTTAAAGAAAAGCTGGAAGAGTCTTTGGAGAATCGTATTGAA
ACATCTACCCACTCGATCTGCGAGTGCAAAGTGGGAACACTGGACGACTCTAGATTACTGGGATTGGATGCGGAAAAATTTAAAAGATGAAGATTTAGAAACAACGAACG
CTTCGAAAGAGAGAGGAGGATTGGGATGGATCGTTCGCAACTCTGGAGGTTCTCCCATCTGTGGAGGCATGAAAATGATCAATGAAAATTGGCCGATGAAGCTTTTGGAA
GCAAAAGCAATCTGGACAGTCCTAAGAGGTATCGAATCCTTACCGGAGAAGCCGTCGTCGATTATTGTGAATTCAGACTGTTGGGAGCTGTTCAAGTTGCTGAATCATGC
TGATATCAATCTCTCGGAAGTTAAGCCCTTCGTTGACGCCATCCTAAATCTGGCCGATTCTATGGGGGGATTTCTTTCGAGCATTGCCCTAGGGAGCAAAACTGCGTTGC
TCACTCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAGCTAAGTTGAGTTGGAGGATAATCAAAGAGCCGTCGAGCCTTCTTGCAAGAATCCTCAAAGGGAGGTATTTTAAGGATAGGCCTTTCCTTGAAGCCTCGCTGGG
GGCTAATCCTTCCCTAACGTGGAGAAGTACCATGTTGGGCAAGGATCTCTTCCTTAAAGGCTACAGGTGGAGGGTGGGTAACGATAGGTGGTGTGTGCCCAAGGTGCAAG
ATCATTTCCACCCCCAAGATGTCAAAGATATTCTGAACATTCCTTTAGGCGACCCAAGCACAAGAGATGAAATCATCTGGAGTCCGGATAAGAAGGGGAGATTCTCGGTT
AAGAGTGCCTACCACTTAGCTTTGGCTAAAGCAGGAGAGATGGATGCTTCGACTTCAGACACTAGTAGTTTAAAGAAAAGCTGGAAGAGTCTTTGGAGAATCGTATTGAA
ACATCTACCCACTCGATCTGCGAGTGCAAAGTGGGAACACTGGACGACTCTAGATTACTGGGATTGGATGCGGAAAAATTTAAAAGATGAAGATTTAGAAACAACGAACG
CTTCGAAAGAGAGAGGAGGATTGGGATGGATCGTTCGCAACTCTGGAGGTTCTCCCATCTGTGGAGGCATGAAAATGATCAATGAAAATTGGCCGATGAAGCTTTTGGAA
GCAAAAGCAATCTGGACAGTCCTAAGAGGTATCGAATCCTTACCGGAGAAGCCGTCGTCGATTATTGTGAATTCAGACTGTTGGGAGCTGTTCAAGTTGCTGAATCATGC
TGATATCAATCTCTCGGAAGTTAAGCCCTTCGTTGACGCCATCCTAAATCTGGCCGATTCTATGGGGGGATTTCTTTCGAGCATTGCCCTAGGGAGCAAAACTGCGTTGC
TCACTCAATAG
Protein sequenceShow/hide protein sequence
MLAKLSWRIIKEPSSLLARILKGRYFKDRPFLEASLGANPSLTWRSTMLGKDLFLKGYRWRVGNDRWCVPKVQDHFHPQDVKDILNIPLGDPSTRDEIIWSPDKKGRFSV
KSAYHLALAKAGEMDASTSDTSSLKKSWKSLWRIVLKHLPTRSASAKWEHWTTLDYWDWMRKNLKDEDLETTNASKERGGLGWIVRNSGGSPICGGMKMINENWPMKLLE
AKAIWTVLRGIESLPEKPSSIIVNSDCWELFKLLNHADINLSEVKPFVDAILNLADSMGGFLSSIALGSKTALLTQ