CuGenDBv2

Lag0038975 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038975
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr2:32342300..32343084
RNA-Seq Expression: Lag0038975
Synteny: Lag0038975
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
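
The genome location field above follows a `chromosome:start..end` pattern. The sketch below shows one way to split such a string into its parts and compute the span length. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr2:32342300..32343084")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 785 bp on chr2.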


Homology
GenBank top hits    e value    % identity    Alignment
KAA3455898.1 reverse transcriptase [Gossypium australe]    4.3e-20    29.69
Query:  CPMCLSRIESTDHCLFTCPRAKEIW-KVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEI
        CP C  R E+ DH    CP   E+W ++ F+ E +  N +    ++WL  + E ++  + RI     WAIWGD N  I N  + + A    +IINYLAEI
Subjt:  CPMCLSRIESTDHCLFTCPRAKEIW-KVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEI

Query:  DCAEVRRKDGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQ
        D +     DG     N    S     +  G ++ + +K           E +A +E V   +  G  ++I+E D +        ++   S +G+ +  IQ
Subjt:  DCAEVRRKDGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQ

Query:  MKSLSGEHIVFSFVPRECNVIADSITKRA
        M   S    VF  VPR  N++A SI  +A
Subjt:  MKSLSGEHIVFSFVPRECNVIADSITKRA

KAA3469709.1 reverse transcriptase [Gossypium australe]    4.2e-15    26.12
Query:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID
        CP C  R E+ +H    CP   E+W        L  + N+ F ++WL  + E L     RI +   WAIW DRN  I N           +IINY+AE+D
Subjt:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID

Query:  CAEVRRK---------------------DGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYN
          E R+                      D    G N    S   + N+ G I+ +C++           E +A ++ V   +      +I+E D +    
Subjt:  CAEVRRK---------------------DGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYN

Query:  LVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSI
          + K+   S +G+ +  IQ      +H VF  +PR  N +A  I
Subjt:  LVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSI

XP_022131661.1 uncharacterized protein LOC111004786 [Momordica charantia]    4.2e-15    30
Query:  IESTDHCLFTCPRAKEIWKVAFKHEALWS---NFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVR
        +E T H +  C RAK+IW + F   +L++   N N SF+D WL L + L+  +  + A T WAIW DRN     + + + A+R  WI +Y      A+  
Subjt:  IESTDHCLFTCPRAKEIWKVAFKHEALWS---NFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVR

Query:  RKDGCHSGS------------------NLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNL
        ++      S                  N +   RS S        +  G ++ A + FL   L PL  E+  I E +  A S    RL+VE+DC +A  L
Subjt:  RKDGCHSGS------------------NLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNL

Query:  VSRKEVSWSE
        V     S  E
Subjt:  VSRKEVSWSE

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]    6.4e-16    26.69
Query:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKH-EALWSNFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID
        C +C  R ES  H  F C RA++IW+  F     L +  N SF++ W +L E L   +L + A+T W IW DRN  I    +     +  W+  +L    
Subjt:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKH-EALWSNFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID

Query:  CAEV------------------RRKDGCHSGSNLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCI
         A++                  R         N +   R  S        +    +V A +  + F L PL+ E+  I EG+ FA +     L VE+D +
Subjt:  CAEV------------------RRKDGCHSGSNLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCI

Query:  QAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSITK
         A  L+  +  +  +  + V  IQ  +     I FS   R+CN  A  + K
Subjt:  QAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSITK

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia]    1.1e-15    32.82
Query:  LSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVRRKDGCHSGSNLERVSRSRSW----------------NERGDIVGA---
        LS  ELR+  +TCWA+W DR+  I    IP   I+  WI+ Y       EVR K   H+G    R+ R   W                +E G  VG    
Subjt:  LSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVRRKDGCHSGSNLERVSRSRSW----------------NERGDIVGA---

Query:  ------CNKFLDFSL--PPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIA
                  +DF +   PL+ ++LAI+EG+  A  LG  R++VETD ++A NL+  K     E  S V+ I+  +   + I F  V RE N +A
Subjt:  ------CNKFLDFSL--PPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIA

TrEMBL top hits    e value    % identity    Alignment
A0A5B6UF59 Reverse transcriptase    2.1e-20    29.69
Query:  CPMCLSRIESTDHCLFTCPRAKEIW-KVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEI
        CP C  R E+ DH    CP   E+W ++ F+ E +  N +    ++WL  + E ++  + RI     WAIWGD N  I N  + + A    +IINYLAEI
Subjt:  CPMCLSRIESTDHCLFTCPRAKEIW-KVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEI

Query:  DCAEVRRKDGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQ
        D +     DG     N    S     +  G ++ + +K           E +A +E V   +  G  ++I+E D +        ++   S +G+ +  IQ
Subjt:  DCAEVRRKDGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQ

Query:  MKSLSGEHIVFSFVPRECNVIADSITKRA
        M   S    VF  VPR  N++A SI  +A
Subjt:  MKSLSGEHIVFSFVPRECNVIADSITKRA

A0A5B6VK61 Reverse transcriptase    2.0e-15    26.12
Query:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID
        CP C  R E+ +H    CP   E+W        L  + N+ F ++WL  + E L     RI +   WAIW DRN  I N           +IINY+AE+D
Subjt:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKHEALWSNFNQSFVDRWLA-LNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID

Query:  CAEVRRK---------------------DGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYN
          E R+                      D    G N    S   + N+ G I+ +C++           E +A ++ V   +      +I+E D +    
Subjt:  CAEVRRK---------------------DGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYN

Query:  LVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSI
          + K+   S +G+ +  IQ      +H VF  +PR  N +A  I
Subjt:  LVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSI

A0A6J1BQ49 uncharacterized protein LOC111004786    2.0e-15    30
Query:  IESTDHCLFTCPRAKEIWKVAFKHEALWS---NFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVR
        +E T H +  C RAK+IW + F   +L++   N N SF+D WL L + L+  +  + A T WAIW DRN     + + + A+R  WI +Y      A+  
Subjt:  IESTDHCLFTCPRAKEIWKVAFKHEALWS---NFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVR

Query:  RKDGCHSGS------------------NLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNL
        ++      S                  N +   RS S        +  G ++ A + FL   L PL  E+  I E +  A S    RL+VE+DC +A  L
Subjt:  RKDGCHSGS------------------NLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNL

Query:  VSRKEVSWSE
        V     S  E
Subjt:  VSRKEVSWSE

A0A6J1DX30 uncharacterized protein LOC111024874    3.1e-16    26.69
Query:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKH-EALWSNFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID
        C +C  R ES  H  F C RA++IW+  F     L +  N SF++ W +L E L   +L + A+T W IW DRN  I    +     +  W+  +L    
Subjt:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKH-EALWSNFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEID

Query:  CAEV------------------RRKDGCHSGSNLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCI
         A++                  R         N +   R  S        +    +V A +  + F L PL+ E+  I EG+ FA +     L VE+D +
Subjt:  CAEV------------------RRKDGCHSGSNLERVSRSRSW-------NERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCI

Query:  QAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSITK
         A  L+  +  +  +  + V  IQ  +     I FS   R+CN  A  + K
Subjt:  QAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIADSITK

A0A6J1DZK3 uncharacterized protein LOC111024968    5.3e-16    32.82
Query:  LSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVRRKDGCHSGSNLERVSRSRSW----------------NERGDIVGA---
        LS  ELR+  +TCWA+W DR+  I    IP   I+  WI+ Y       EVR K   H+G    R+ R   W                +E G  VG    
Subjt:  LSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVRRKDGCHSGSNLERVSRSRSW----------------NERGDIVGA---

Query:  ------CNKFLDFSL--PPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIA
                  +DF +   PL+ ++LAI+EG+  A  LG  R++VETD ++A NL+  K     E  S V+ I+  +   + I F  V RE N +A
Subjt:  ------CNKFLDFSL--PPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPRECNVIA

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    1.2e-04    28.92
Query:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKHEALWSNFNQSFVD--RWLALNEALSGDELRIVA--VTCWAIWGDRNKEIQNN
        C +C    ES DH LF+C  A ++W++AF           S+ +   W+  + + +   LR V+     + IW  RN  + NN
Subjt:  CPMCLSRIESTDHCLFTCPRAKEIWKVAFKHEALWSNFNQSFVD--RWLALNEALSGDELRIVA--VTCWAIWGDRNKEIQNN


Sequences
CDS sequence
ATGGATATCCCATCGATTTGCCCTATGTGCTTATCGAGGATAGAATCCACAGACCACTGTTTGTTTACATGCCCTAGAGCTAAGGAGATTTGGAAAGTTGCCTTTAAACA
TGAAGCGCTATGGAGTAACTTCAATCAGAGCTTTGTGGATAGATGGTTGGCCCTCAATGAAGCTTTATCTGGAGATGAGCTCCGCATTGTTGCAGTCACCTGTTGGGCTA
TCTGGGGAGATCGAAATAAGGAGATTCAGAATAATCATATCCCTTCTCATGCAATTCGCAGTAGGTGGATTATCAATTACCTGGCAGAGATTGACTGTGCCGAAGTCAGG
CGTAAGGATGGTTGCCATTCAGGCTCCAATTTGGAGCGCGTGTCCCGGTCTCGGAGCTGGAACGAACGGGGCGATATTGTTGGAGCCTGTAACAAGTTTTTGGATTTCTC
TCTTCCCCCTCTGATTGTTGAATTATTAGCCATTAAAGAAGGCGTGGATTTTGCTATTTCTTTGGGAGGCGAGAGGCTTATTGTGGAGACTGATTGTATTCAAGCTTATA
ATCTGGTGAGTCGAAAGGAGGTTTCGTGGAGTGAAGTCGGATCGATTGTTGATGGTATCCAAATGAAGTCTTTATCAGGTGAACATATTGTTTTTTCTTTTGTTCCGAGA
GAGTGTAATGTAATTGCCGATTCCATAACGAAGAGAGCTAAGTGA
mRNA sequence
ATGGATATCCCATCGATTTGCCCTATGTGCTTATCGAGGATAGAATCCACAGACCACTGTTTGTTTACATGCCCTAGAGCTAAGGAGATTTGGAAAGTTGCCTTTAAACA
TGAAGCGCTATGGAGTAACTTCAATCAGAGCTTTGTGGATAGATGGTTGGCCCTCAATGAAGCTTTATCTGGAGATGAGCTCCGCATTGTTGCAGTCACCTGTTGGGCTA
TCTGGGGAGATCGAAATAAGGAGATTCAGAATAATCATATCCCTTCTCATGCAATTCGCAGTAGGTGGATTATCAATTACCTGGCAGAGATTGACTGTGCCGAAGTCAGG
CGTAAGGATGGTTGCCATTCAGGCTCCAATTTGGAGCGCGTGTCCCGGTCTCGGAGCTGGAACGAACGGGGCGATATTGTTGGAGCCTGTAACAAGTTTTTGGATTTCTC
TCTTCCCCCTCTGATTGTTGAATTATTAGCCATTAAAGAAGGCGTGGATTTTGCTATTTCTTTGGGAGGCGAGAGGCTTATTGTGGAGACTGATTGTATTCAAGCTTATA
ATCTGGTGAGTCGAAAGGAGGTTTCGTGGAGTGAAGTCGGATCGATTGTTGATGGTATCCAAATGAAGTCTTTATCAGGTGAACATATTGTTTTTTCTTTTGTTCCGAGA
GAGTGTAATGTAATTGCCGATTCCATAACGAAGAGAGCTAAGTGA
Protein sequence
MDIPSICPMCLSRIESTDHCLFTCPRAKEIWKVAFKHEALWSNFNQSFVDRWLALNEALSGDELRIVAVTCWAIWGDRNKEIQNNHIPSHAIRSRWIINYLAEIDCAEVR
RKDGCHSGSNLERVSRSRSWNERGDIVGACNKFLDFSLPPLIVELLAIKEGVDFAISLGGERLIVETDCIQAYNLVSRKEVSWSEVGSIVDGIQMKSLSGEHIVFSFVPR
ECNVIADSITKRAK
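
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and that the CDS is in frame from its first base. The `translate` function and `CODON_TABLE` are written here for illustration and are not taken from the database. The two fragments used are the first 21 bases and the final line of the CDS as listed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the CDS.
cds_start = "ATGGATATCCCATCGATTTGC"
# Final 45 bases of the CDS (its last line), which begin on a codon boundary.
cds_end = "GAGTGTAATGTAATTGCCGATTCCATAACGAAGAGAGCTAAGTGA"

print(translate(cds_start))  # MDIPSIC
print(translate(cds_end))    # ECNVIADSITKRAK
```

Both results match the corresponding ends of the listed protein sequence: it opens with MDIPSIC and closes with ECNVIADSITKRAK, followed by the TGA stop codon.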