CuGenDBv2

Lag0008219 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008219
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr9:15138153..15139073
RNA-Seq Expression: Lag0008219
Synteny: Lag0008219
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
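The genome location field above follows a `chromosome:start..end` pattern. A minimal sketch of how such a string could be split into its parts is given here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr9:15138153..15139073")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```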


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAF4366980.1 hypothetical protein F8388_022768 [Cannabis sativa] (e value: 2.9e-15; % identity: 23.28)
Query:  FWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRMMINLSK
        +WK  WK  + PR K+  W++ N+ +P+K N+  +G+++NP+C +CG  +E+  H +W C   ++ W +  P       L     +  D  + +   LS+
Subjt:  FWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRMMINLSK

Query:  EDLDKGITIMWKIWNFHNHNGDHNQIRAIEYLVHDIQTSITECEASYLKTQASKRPGNQGGLRW---------------------------IVRDSNGSL
         + +  + IMW IW   N   +   +     L+  + T+     ++ ++ +  K       + W                           I RD NG++
Subjt:  EDLDKGITIMWKIWNFHNHNGDHNQIRAIEYLVHDIQTSITECEASYLKTQASKRPGNQGGLRW---------------------------IVRDSNGSL

Query:  VYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSD
        +  GM  I     I + EA EI   LK         + P+E++SD   VV  +       SD
Subjt:  VYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSD

KAF4384439.1 hypothetical protein G4B88_028513 [Cannabis sativa] (e value: 1.8e-17; % identity: 24.82)
Query:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD
        ++S N  Q   +WK  W  N+ PR K+  WK+  + +P+K N++ +G+ +N +C  CG+  E+ +H +W C+  ++ W   L  S N+    R N +  D
Subjt:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD

Query:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYLVHDIQTSI-------------------TECEASY-LKTQASKRPGNQG-GLRW
                L KE+ +  + ++W IW   N    +    N  R +E++ +     I                   T    +Y +   A+ +P  +G GL +
Subjt:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYLVHDIQTSI-------------------TECEASY-LKTQASKRPGNQG-GLRW

Query:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP
        + +D  G+++  GM+       +K+ EAK + A L       +    P E+ +D+ +++  IV     L D+KP
Subjt:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP

KAF4391976.1 hypothetical protein F8388_004305 [Cannabis sativa] (e value: 6.2e-18; % identity: 24.82)
Query:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD
        ++S N  Q   +WK  W  N+ PR K+  WK+  + +P+K N++ +G+ +NP+C  CG+  E+ +H +W C+  ++ W   L  S N+    R N +  D
Subjt:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD

Query:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDHNQI----RAIEYLVHDI------QTSITECEASYLKTQ--------------ASKRPGNQG-GLRW
                L+KE+ +  + ++W IW   N    +  I    R ++++ +        Q ++T+   +  K Q              A+ +P  +G G+ +
Subjt:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDHNQI----RAIEYLVHDI------QTSITECEASYLKTQ--------------ASKRPGNQG-GLRW

Query:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP
        + +D  G+++  GM+       + I EA+ + A L     + +    P E+ +D+  +V+ IV     L D+KP
Subjt:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP

KAF4399153.1 hypothetical protein G4B88_023747 [Cannabis sativa] (e value: 2.9e-15; % identity: 26.8)
Query:  KINAH--HSSSQSDNSKQALFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSL
        +IN H   SS+  D  K   +WK LW  ++ PR K+  W++ ++ +P+K N+  +G+D+N  C +CG + E+ TH +W C   +  W + +P     +  
Subjt:  KINAH--HSSSQSDNSKQALFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSL

Query:  CR--DNWTTKDSWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYL---VHDIQTSITECEASYLKTQASK---RP------------
        C    N +  D    +  +L K + ++ I IMW IW   N   +     N I+ ++++     D + S  +     +K Q  K   RP            
Subjt:  CR--DNWTTKDSWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYL---VHDIQTSITECEASYLKTQASK---RP------------

Query:  ---GNQG-GLRWIVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVV-------NVIVEVAEDLSDLKPNFYTSCI
           G  G G  +I RD  G+L+  GM        +++ EA  IL  LKH   T N   + +E++SD  ++V       N +  V+  L  ++    +S  
Subjt:  ---GNQG-GLRWIVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVV-------NVIVEVAEDLSDLKPNFYTSCI

Query:  TNIYQI
        TNI  +
Subjt:  TNIYQI

XP_021847414.1 uncharacterized protein LOC110787151 [Spinacia oleracea] (e value: 1.5e-16; % identity: 24.36)
Query:  NSKQALFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRM
        +S  +  WK +WK NVLPR KV AW+  ++ +P++  + K+    + +CG+C  + ES  H + ECK+ R  W       +  +  C    +  D W R 
Subjt:  NSKQALFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRM

Query:  MINLSKEDLDKGITIMWKIWNFHNHNGDHNQIRAIEYLVHD----IQTSITECEASYLKTQASKRPGNQGGLRW--------------------------
           L  E +++ IT+ W +W   N          +E L  D    ++ + T C     +     R  + GG  W                          
Subjt:  MINLSKEDLDKGITIMWKIWNFHNHNGDHNQIRAIEYLVHD----IQTSITECEASYLKTQASKRPGNQGGLRW--------------------------

Query:  ----IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSD
            ++RD  G +V   +++   K    + EAK +L G++       QR +   VESD L V+  +   A   S+
Subjt:  ----IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSD

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 2.4e-15; % identity: 28.64)
Query:  KKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRMMINLSKEDLDKGITIMWKIWNFHNHN---GDHNQIRAIE-----YLV------
        KK E+T H +WECK+ ++ W  C P   N F + R NWTTK+ W  +M    +E+  + + I  +IW   N +   G H++ R I+     Y++      
Subjt:  KKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRMMINLSKEDLDKGITIMWKIWNFHNHN---GDHNQIRAIE-----YLV------

Query:  ----------HDIQTSITECEASY---------LKTQASKRPG-NQGGLRWIVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPL
                  H I+       A +         L T A+ R   N  G+ WI+RD  G ++  G + I  +  I  LE   I  GL+ I     +   P+
Subjt:  ----------HDIQTSITECEASY---------LKTQASKRPG-NQGGLRWIVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPL

Query:  EVESDALEVVNVI
         +ESD+LE ++++
Subjt:  EVESDALEVVNVI

A0A7J6FB29 Uncharacterized protein (e value: 1.4e-15; % identity: 23.28)
Query:  FWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRMMINLSK
        +WK  WK  + PR K+  W++ N+ +P+K N+  +G+++NP+C +CG  +E+  H +W C   ++ W +  P       L     +  D  + +   LS+
Subjt:  FWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSWSRMMINLSK

Query:  EDLDKGITIMWKIWNFHNHNGDHNQIRAIEYLVHDIQTSITECEASYLKTQASKRPGNQGGLRW---------------------------IVRDSNGSL
         + +  + IMW IW   N   +   +     L+  + T+     ++ ++ +  K       + W                           I RD NG++
Subjt:  EDLDKGITIMWKIWNFHNHNGDHNQIRAIEYLVHDIQTSITECEASYLKTQASKRPGNQGGLRW---------------------------IVRDSNGSL

Query:  VYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSD
        +  GM  I     I + EA EI   LK         + P+E++SD   VV  +       SD
Subjt:  VYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSD

A0A7J6H9M1 Uncharacterized protein (e value: 3.0e-18; % identity: 24.82)
Query:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD
        ++S N  Q   +WK  W  N+ PR K+  WK+  + +P+K N++ +G+ +NP+C  CG+  E+ +H +W C+  ++ W   L  S N+    R N +  D
Subjt:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD

Query:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDHNQI----RAIEYLVHDI------QTSITECEASYLKTQ--------------ASKRPGNQG-GLRW
                L+KE+ +  + ++W IW   N    +  I    R ++++ +        Q ++T+   +  K Q              A+ +P  +G G+ +
Subjt:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDHNQI----RAIEYLVHDI------QTSITECEASYLKTQ--------------ASKRPGNQG-GLRW

Query:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP
        + +D  G+++  GM+       + I EA+ + A L     + +    P E+ +D+  +V+ IV     L D+KP
Subjt:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP

A0A7J6HAE9 Uncharacterized protein (e value: 8.8e-18; % identity: 24.82)
Query:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD
        ++S N  Q   +WK  W  N+ PR K+  WK+  + +P+K N++ +G+ +N +C  CG+  E+ +H +W C+  ++ W   L  S N+    R N +  D
Subjt:  SQSDNSKQA-LFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKD

Query:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYLVHDIQTSI-------------------TECEASY-LKTQASKRPGNQG-GLRW
                L KE+ +  + ++W IW   N    +    N  R +E++ +     I                   T    +Y +   A+ +P  +G GL +
Subjt:  SWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYLVHDIQTSI-------------------TECEASY-LKTQASKRPGNQG-GLRW

Query:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP
        + +D  G+++  GM+       +K+ EAK + A L       +    P E+ +D+ +++  IV     L D+KP
Subjt:  IVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKP

A0A7J6HVA4 Uncharacterized protein (e value: 1.4e-15; % identity: 26.8)
Query:  KINAH--HSSSQSDNSKQALFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSL
        +IN H   SS+  D  K   +WK LW  ++ PR K+  W++ ++ +P+K N+  +G+D+N  C +CG + E+ TH +W C   +  W + +P     +  
Subjt:  KINAH--HSSSQSDNSKQALFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSL

Query:  CR--DNWTTKDSWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYL---VHDIQTSITECEASYLKTQASK---RP------------
        C    N +  D    +  +L K + ++ I IMW IW   N   +     N I+ ++++     D + S  +     +K Q  K   RP            
Subjt:  CR--DNWTTKDSWSRMMINLSKEDLDKGITIMWKIWNFHNHNGDH----NQIRAIEYL---VHDIQTSITECEASYLKTQASK---RP------------

Query:  ---GNQG-GLRWIVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVV-------NVIVEVAEDLSDLKPNFYTSCI
           G  G G  +I RD  G+L+  GM        +++ EA  IL  LKH   T N   + +E++SD  ++V       N +  V+  L  ++    +S  
Subjt:  ---GNQG-GLRWIVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKHIVGTCNQRSIPLEVESDALEVV-------NVIVEVAEDLSDLKPNFYTSCI

Query:  TNIYQI
        TNI  +
Subjt:  TNIYQI

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 6.5e-05; % identity: 28.57)
Query:  LWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWEC
        +W   + P+ K+  WK +N+ +P  A +L + + + P C  C +  E+ TH ++ C
Subjt:  LWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWEC


Sequences
CDS sequence
ATGAAAATCAATGCTCATCACTCGTCCTCCCAATCAGACAACAGCAAGCAAGCCTTATTTTGGAAGAGTTTGTGGAAATCTAATGTGCTCCCTAGATCCAAAGTGTGTGC
TTGGAAGATCGTCAACGACATCATCCCCTCCAAAGCCAACATTCTAAAAAAGGGAGTAGATTTAAACCCTCTTTGTGGCATTTGTGGAAAGAAGCTAGAATCAACCACTC
ATCCCATTTGGGAGTGTAAAATCCCGAGGGAAGGGTGGACTCGATGCCTACCTAGCTCACTGAACCTATTCTCTCTTTGCAGGGACAATTGGACAACAAAGGACTCTTGG
AGCAGGATGATGATTAACCTTAGCAAAGAAGACCTTGATAAAGGAATCACAATCATGTGGAAGATATGGAATTTTCACAACCATAATGGAGATCACAACCAAATTAGAGC
AATAGAGTACTTAGTACATGATATTCAGACTAGTATTACCGAATGTGAAGCTTCTTACCTCAAGACTCAAGCCTCGAAAAGGCCAGGGAACCAGGGAGGGCTACGTTGGA
TCGTGCGTGACTCGAATGGATCCTTGGTCTATTTCGGAATGCAAAAAATCACGAGAAAATCGCCCATCAAAATCCTAGAGGCAAAAGAAATCCTAGCGGGTTTGAAACAC
ATTGTGGGTACCTGTAATCAACGCTCAATCCCCTTGGAAGTAGAATCTGACGCATTGGAAGTCGTCAATGTGATCGTCGAAGTCGCTGAAGACTTATCGGATCTGAAACC
CAATTTTTATACATCATGCATAACAAATATATACCAAATTTTAGATTATCCACCAATAAGAATTTAA
mRNA sequence
ATGAAAATCAATGCTCATCACTCGTCCTCCCAATCAGACAACAGCAAGCAAGCCTTATTTTGGAAGAGTTTGTGGAAATCTAATGTGCTCCCTAGATCCAAAGTGTGTGC
TTGGAAGATCGTCAACGACATCATCCCCTCCAAAGCCAACATTCTAAAAAAGGGAGTAGATTTAAACCCTCTTTGTGGCATTTGTGGAAAGAAGCTAGAATCAACCACTC
ATCCCATTTGGGAGTGTAAAATCCCGAGGGAAGGGTGGACTCGATGCCTACCTAGCTCACTGAACCTATTCTCTCTTTGCAGGGACAATTGGACAACAAAGGACTCTTGG
AGCAGGATGATGATTAACCTTAGCAAAGAAGACCTTGATAAAGGAATCACAATCATGTGGAAGATATGGAATTTTCACAACCATAATGGAGATCACAACCAAATTAGAGC
AATAGAGTACTTAGTACATGATATTCAGACTAGTATTACCGAATGTGAAGCTTCTTACCTCAAGACTCAAGCCTCGAAAAGGCCAGGGAACCAGGGAGGGCTACGTTGGA
TCGTGCGTGACTCGAATGGATCCTTGGTCTATTTCGGAATGCAAAAAATCACGAGAAAATCGCCCATCAAAATCCTAGAGGCAAAAGAAATCCTAGCGGGTTTGAAACAC
ATTGTGGGTACCTGTAATCAACGCTCAATCCCCTTGGAAGTAGAATCTGACGCATTGGAAGTCGTCAATGTGATCGTCGAAGTCGCTGAAGACTTATCGGATCTGAAACC
CAATTTTTATACATCATGCATAACAAATATATACCAAATTTTAGATTATCCACCAATAAGAATTTAA
Protein sequence
MKINAHHSSSQSDNSKQALFWKSLWKSNVLPRSKVCAWKIVNDIIPSKANILKKGVDLNPLCGICGKKLESTTHPIWECKIPREGWTRCLPSSLNLFSLCRDNWTTKDSW
SRMMINLSKEDLDKGITIMWKIWNFHNHNGDHNQIRAIEYLVHDIQTSITECEASYLKTQASKRPGNQGGLRWIVRDSNGSLVYFGMQKITRKSPIKILEAKEILAGLKH
IVGTCNQRSIPLEVESDALEVVNVIVEVAEDLSDLKPNFYTSCITNIYQILDYPPIRI
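
The protein sequence above appears to be the conceptual translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (the record does not say which table was used). The helper name `translate` is hypothetical, and only the first 21 and the last 18 nucleotides of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGAAAATCAATGCTCATCAC"  # first 21 nt of the CDS above
cds_end = "CCACCAATAAGAATTTAA"       # last 18 nt of the CDS above
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS, joined into one string with line breaks removed, to the same function should give a result that can be compared against the listed protein sequence.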