CuGenDBv2

Lag0036525 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036525
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:47864696..47865512
RNA-Seq Expression: Lag0036525
Synteny: Lag0036525
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | %identity
KAG7571790.1 Reverse transcriptase domain [Arabidopsis suecica] | 8.4e-20 | 24.07
Query:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAKCICEI
        ++ ++ + +L    SDH+P+    +  I   +PRG + +R+D  W+  EGL + +SD W + GN +   N   K   C   +S+W ++      K I E+
Subjt:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAKCICEI

Query:  -----------------------------------------------------------------NRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASS
                                                                         NRI+GL+D    W  +   +  +   YF ++F S+
Subjt:  -----------------------------------------------------------------NRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASS

Query:  DPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDLI
        +P   DFE +L+EV   + + +N+ L    +  EVR A   +HP K+PGPDG++  F++  W  V  DL+
Subjt:  DPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDLI

XP_021737580.1 uncharacterized protein LOC110704109 [Chenopodium quinoa] | 2.9e-20 | 28.57
Query:  AIVRHLGYNKSDHRPLELSRLPVIRGSIPR-GMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAK----CICE
        + V HL   KSDH P+ L     +     R   ++YR+ + WL  E  S I+S+AW   G      +L +K A     +S W + K G++ K    C  +
Subjt:  AIVRHLGYNKSDHRPLELSRLPVIRGSIPR-GMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAK----CICE

Query:  INRIVG-------------LYDRFDNWQQDPPL------LLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPN
        + +++G             + DR D  ++   +      + +VL +YF ++FA +  S ++ +  + +V P V E  + +L   F  +EV+    Q+HP 
Subjt:  INRIVG-------------LYDRFDNWQQDPPL------LLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPN

Query:  KSPGPDGLSGSFYKHHWEFVGNDL
        K+PGPDG+S  F++++W+ VG D+
Subjt:  KSPGPDGLSGSFYKHHWEFVGNDL

XP_021838584.1 uncharacterized protein LOC110778326 [Spinacia oleracea] | 1.7e-20 | 24.15
Query:  VRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNY-------AKCICE
        V HL  ++SDH+PL +S LP     +  G + +R++D WL   G  +++ DAW  + ++   +++  K  +C + + +W + + GN         K +C 
Subjt:  VRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNY-------AKCICE

Query:  I------------------------------------------------------------NRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSA
        +                                                            N I G+++    W +D   +  V  +YF ++F + +P  
Subjt:  I------------------------------------------------------------NRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSA

Query:  MDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDL
         +FE AL  +  RV + +NQ L+  F   EV  A  Q+HP+K+PGPDG++ +F++ +W  VG D+
Subjt:  MDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDL

XP_030494678.1 uncharacterized protein LOC115710455 [Cannabis sativa] | 3.1e-22 | 33.01
Query:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGM-RIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAKCICE
        ++   ++ +L  + SDH P+ L    V +G+IP      +R+++AWL      +IV   WG +G     + +  K  RC E++ RWG          I +
Subjt:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGM-RIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAKCICE

Query:  INRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEF
         N +      + NW+   P   +V+T YF  +F S   S+++    L  + P V    N+ALLE  S +EVR A  Q+HP+KSPGPDG++ +FY+ HW  
Subjt:  INRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEF

Query:  VGNDLI
        V  D+I
Subjt:  VGNDLI

XP_030939839.1 uncharacterized protein LOC115964720 [Quercus lobata] | 8.1e-23 | 31.55
Query:  VRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGN--------YAKCIC
        V HL  + SDH  L +S   V++   P+  R + ++  W H E   +++ DAW    +   P  LA    +C    SR    K G+         A    
Subjt:  VRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGN--------YAKCIC

Query:  EINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWE
        + N I+GL+D    W      + +V   YF +I+ SS P++  F+     +  RV + MN+ L++ FS DE+  A  Q+HP K+PGPDG+S  F+  +W+
Subjt:  EINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWE

Query:  FVGNDL
         VGN +
Subjt:  FVGNDL

TrEMBL top hits | e value | %identity
A0A2N9EDG5 CCHC-type domain-containing protein | 8.2e-21 | 25.37
Query:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGN---VVC-----------------------PSNLATK
        ++  A+VRH+    SDH  L L  +P +     +  R +R+D  W+  E    ++++AW  + +   + C                       P N+ +K
Subjt:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGN---VVC-----------------------PSNLATK

Query:  NARCMELMSR----WGRSKVGNYAKCICEI-------------------------------------NRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFA
            +EL S+    +   +V    K +C +                                     N + GL D  D WQ DP ++  ++ +YF ++F 
Subjt:  NARCMELMSR----WGRSKVGNYAKCICEI-------------------------------------NRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFA

Query:  SSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVG
        SS P+ ++    ++ V  R+   MN ALL  FS++EVR+A  QI P+K+PGPDG++  F++ +W  VG
Subjt:  SSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVG

A0A2N9GJ35 Uncharacterized protein | 3.7e-21 | 27.88
Query:  AIVRHLGYNKSDHRPLEL-SRLPVIRGSIP-RGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAKCI-----
        A + HL    SDH  L L SR    R  +P R  R++R++ +WL   G  +++  AW V         +A K  +C   + +W +S V    K I     
Subjt:  AIVRHLGYNKSDHRPLEL-SRLPVIRGSIP-RGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAKCI-----

Query:  --------------------------------------------------------------CEINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASS
                                                                       +IN I+GL D+  NW+ +P  +  +  DYFSS+FASS
Subjt:  --------------------------------------------------------------CEINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASS

Query:  DPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDL
        +P A+D    L EV   V   MN  L+  F+ +E++ A  Q+HP+KSPGPDG+S  F++ +W  V +D+
Subjt:  DPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDL

A0A2N9GLP4 Reverse transcriptase domain-containing protein | 1.1e-20 | 29.8
Query:  HLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNAR-CMELMSRWGRSKVGNYAKCIC----EINRI
        +L +  S+H P+ LS    I  ++ R  +++R++  W   E   +++  AWG+  +   P  +  +  + C   +  W + K G+ A  I      + R+
Subjt:  HLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNAR-CMELMSRWGRSKVGNYAKCIC----EINRI

Query:  VGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVG
        V      D    D   + ++  DYF SIF+S++P +      L  V   V + MN  +L+ FSA+EV  A  Q++P K+PGPDG+S  FY+ +W+ VG
Subjt:  VGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVG

A0A803P924 Uncharacterized protein | 8.2e-21 | 29.96
Query:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGM-RIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWG-------------
        ++   ++ +L  + SDH P+ L    V +G+IP      +R+++AWL      +IV   WG +G     + +  K  RC E++ RWG             
Subjt:  MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGM-RIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWG-------------

Query:  ---------RSK----------------VGNYAK---CICEINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMN
                 RSK                + N  K   CI ++    G++    NW+   P   +V+T YF  +F S   S+++    L  + P V    N
Subjt:  ---------RSK----------------VGNYAK---CICEINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMN

Query:  QALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDLI
        +ALLE  S +EVR A  Q+HP+KSPGPDG++ +FY+ HW  V  D+I
Subjt:  QALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDLI

A0A803PJG3 Uncharacterized protein | 6.3e-21 | 35.12
Query:  YRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGN-YAKCICEINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDP
        +R+++AWL      ++V D W  +G     +N+  K  +C E++ +WGR   G+ +A      N I  L D    W      L  V+T YF  +F +S+ 
Subjt:  YRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGN-YAKCICEINRIVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDP

Query:  SAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDLI
          +D    L  V P V    N  LL     DEVR+A  Q+HP+KSPGPDG++ +FY+ HW  VG D++
Subjt:  SAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDLI

SwissProt top hits | e value | %identity
P08548 LINE-1 reverse transcriptase homolog | 3.9e-04 | 20.45
Query:  KNARCMELMSR---WGRSKVGNYAKCICEINR-------IVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVG-PRVDENMNQALL
        +N R ++ +++   W   K+    K +  + R       I  + +  D    DP  +  +L +Y+  +++    +  + ++ L     PR+ +   + L 
Subjt:  KNARCMELMSR---WGRSKVGNYAKCICEINR-------IVGLYDRFDNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVG-PRVDENMNQALL

Query:  ETFSADEVRAAFMQIHPNKSPGPDGLSGSFYK
           S+ E+ +    +   KSPGPDG +  FY+
Subjt:  ETFSADEVRAAFMQIHPNKSPGPDGLSGSFYK

P14381 Transposon TX1 uncharacterized 149 kDa protein | 4.6e-05 | 31.46
Query:  QDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGND
        +DP  + D    ++ ++F S DP + D    L +  P V E   + L    + DE+  A   +  NKSPG DGL+  F++  W+ +G D
Subjt:  QDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGND

Arabidopsis top hits | e value | %identity
No hits found

Sequences
CDS sequence
ATGTATTTCAGCGCTATAGTTCGTCATCTGGGCTATAATAAATCTGACCATCGACCTCTTGAACTTTCCCGACTTCCTGTGATACGTGGGTCCATACCCAGGGGGATGAG
GATTTATCGCTATGATGATGCTTGGCTCCATTATGAGGGTCTCTCGAAGATTGTTAGTGATGCATGGGGTGTGACGGGGAATGTTGTTTGTCCGTCGAATCTGGCGACCA
AGAATGCCCGGTGTATGGAGTTGATGTCCCGTTGGGGTCGTTCAAAAGTTGGCAACTACGCTAAGTGCATTTGTGAGATTAATCGGATAGTTGGCTTGTATGATCGGTTT
GATAATTGGCAACAAGATCCACCACTTCTGCTTGATGTGCTCACTGACTACTTTTCTTCCATTTTTGCTAGCTCTGATCCAAGTGCTATGGATTTTGAGAGGGCGTTGCG
TGAGGTTGGGCCTCGGGTGGACGAGAATATGAATCAAGCCTTGTTGGAGACGTTTAGTGCTGATGAAGTTCGGGCTGCTTTTATGCAAATTCATCCAAATAAATCCCCGG
GACCAGATGGCCTTTCAGGATCCTTTTACAAACATCATTGGGAGTTTGTTGGTAACGACCTTATTTAG
mRNA sequence
ATGTATTTCAGCGCTATAGTTCGTCATCTGGGCTATAATAAATCTGACCATCGACCTCTTGAACTTTCCCGACTTCCTGTGATACGTGGGTCCATACCCAGGGGGATGAG
GATTTATCGCTATGATGATGCTTGGCTCCATTATGAGGGTCTCTCGAAGATTGTTAGTGATGCATGGGGTGTGACGGGGAATGTTGTTTGTCCGTCGAATCTGGCGACCA
AGAATGCCCGGTGTATGGAGTTGATGTCCCGTTGGGGTCGTTCAAAAGTTGGCAACTACGCTAAGTGCATTTGTGAGATTAATCGGATAGTTGGCTTGTATGATCGGTTT
GATAATTGGCAACAAGATCCACCACTTCTGCTTGATGTGCTCACTGACTACTTTTCTTCCATTTTTGCTAGCTCTGATCCAAGTGCTATGGATTTTGAGAGGGCGTTGCG
TGAGGTTGGGCCTCGGGTGGACGAGAATATGAATCAAGCCTTGTTGGAGACGTTTAGTGCTGATGAAGTTCGGGCTGCTTTTATGCAAATTCATCCAAATAAATCCCCGG
GACCAGATGGCCTTTCAGGATCCTTTTACAAACATCATTGGGAGTTTGTTGGTAACGACCTTATTTAG
Protein sequence
MYFSAIVRHLGYNKSDHRPLELSRLPVIRGSIPRGMRIYRYDDAWLHYEGLSKIVSDAWGVTGNVVCPSNLATKNARCMELMSRWGRSKVGNYAKCICEINRIVGLYDRF
DNWQQDPPLLLDVLTDYFSSIFASSDPSAMDFERALREVGPRVDENMNQALLETFSADEVRAAFMQIHPNKSPGPDGLSGSFYKHHWEFVGNDLI
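
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of CuGenDB: the `translate` function and the `CODON_TABLE` name are hypothetical helpers introduced for illustration, and the code assumes the standard nuclear genetic code and an in-frame CDS that starts at its first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Hypothetical helper: whitespace and line breaks are removed first,
    so the multi-line CDS block can be pasted in unchanged.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTATTTCAGCGCTATAGTTCGTCATCTG"))
```

Passing the full CDS block to `translate` and comparing the result with the listed protein sequence (with line breaks removed) is one way to confirm that the two records agree.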