CuGenDBv2

Lag0035163 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035163
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:15975737..15978920
RNA-Seq Expression: Lag0035163
Synteny: Lag0035163
Gene Ontology terms: NA
InterPro domains: IPR031646 - Vacuolar protein sorting-associated protein 13, second N-terminal domain


Homology
GenBank top hits (columns: e value, % identity, alignment)
BBN69274.1 TatD related DNase [Prunus dulcis] (e value: 2.3e-30; % identity: 32.6)
Query:  YVNFYKKKLEFLHNDQCFGLILFDFLSIFVKWNSEVVR------FGTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHT
        ++ F    + FL   + + L L   L +F   +   +        G  FS+  L  +   WG     W   YLG PL G P++  F +PV++K++KRL  
Subjt:  YVNFYKKKLEFLHNDQCFGLILFDFLSIFVKWNSEVVR------FGTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHT

Query:  WGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKER
        W    L +    +     +   P++ +SL  +    VT K+E+L +NFL  G  E K  HL+ W RV    EEGGLGI  ++++N +L AKW+WRF  E 
Subjt:  WGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKER

Query:  GALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYSNGSIKVGDGKNTPFWKDTWLENLSLQRL
         +LW R+I SKYG    G    +I K+  ++PW+ I              VG+G+   FW+D WL+   L+ L
Subjt:  GALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYSNGSIKVGDGKNTPFWKDTWLENLSLQRL

RVW39368.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 3.3e-29; % identity: 39.25)
Query:  RIWGWLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK---------PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEF
        R+  W  +YLG PL G PK+  F DPVVE+I +RL  W       KKA+   G  I          P++ LSL  +    +  KIEK+ +NFL  G  E 
Subjt:  RIWGWLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK---------PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEF

Query:  KGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKLI-QSPWKSIESTKFLVYSNGSIKVGDGKN
        K  HL+ WE V  P E GGLG   I  +N +LL KW+WRF +ER  LW +VIGS YG+   G   + + +   + PWK+I            + VG+G+ 
Subjt:  KGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKLI-QSPWKSIESTKFLVYSNGSIKVGDGKN

Query:  TPFWKDTWLENLSL
          FW+D W  N SL
Subjt:  TPFWKDTWLENLSL

TQD93576.1 hypothetical protein C1H46_020784 [Malus baccata] (e value: 5.1e-30; % identity: 40.38)
Query:  WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTI--KPAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIH
        WG     W   YLG PL G+P++  F DPVVEK++KRL +W    L R    +     +   P + +SL  +    V  ++EKL + FL  G  E K  H
Subjt:  WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTI--KPAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIH

Query:  LLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFG---LKPDKISKLIQSPWKSIESTKFLVYSNGSIKVGDGKNTP
        L++WE V    EEGGLG+  ++ +N +LLAKW+WRF KE  +LW +VI SKYG  + G     P + S   +SPWK I S   L       +VG+G+   
Subjt:  LLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFG---LKPDKISKLIQSPWKSIESTKFLVYSNGSIKVGDGKNTP

Query:  FWKDTWLE
        FW+D WLE
Subjt:  FWKDTWLE

VVA30963.1 Hypothetical predicted protein, partial [Prunus dulcis] (e value: 1.1e-29; % identity: 35.86)
Query:  GTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLF
        G  FS+  L  +   WG     W   YLG PL G P++  F +PV +K++KRL  W    L +    +     +   P++ +SL  +    V  K+E+L 
Subjt:  GTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLF

Query:  QNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYS
        +NFL  G  E K  HL+ WERV    EEGGLGI  ++++N +L AKW+WRF  E  +LW R+I SKYG    G    +I K+  ++PW+ I         
Subjt:  QNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYS

Query:  NGSIKVGDGKNTPFWKDTWLENLSLQ----RLRTLER
             VG+G+   FW+D WL+   L+    RL +L R
Subjt:  NGSIKVGDGKNTPFWKDTWLENLSLQ----RLRTLER

XP_031736165.1 uncharacterized protein LOC101204937 isoform X4 [Cucumis sativus] (e value: 3.3e-29; % identity: 88.89)
Query:  RYGRFRPW-FPLSRKVTGWQILWWQYAQQSVLSDVRKKLKKCSWRYLGQRLIYRRRYVNFYKKKLEFLHNDQ
        RYGRFRPW FPLSRKV GWQ+LWW YAQQSVL DVRK+LKKCSWRYLGQRL+YRRRYV FYKKKLEFLHNDQ
Subjt:  RYGRFRPW-FPLSRKVTGWQILWWQYAQQSVLSDVRKKLKKCSWRYLGQRLIYRRRYVNFYKKKLEFLHNDQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A540M4H0 zf-RVT domain-containing protein (e value: 2.5e-30; % identity: 40.38)
Query:  WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTI--KPAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIH
        WG     W   YLG PL G+P++  F DPVVEK++KRL +W    L R    +     +   P + +SL  +    V  ++EKL + FL  G  E K  H
Subjt:  WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTI--KPAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIH

Query:  LLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFG---LKPDKISKLIQSPWKSIESTKFLVYSNGSIKVGDGKNTP
        L++WE V    EEGGLG+  ++ +N +LLAKW+WRF KE  +LW +VI SKYG  + G     P + S   +SPWK I S   L       +VG+G+   
Subjt:  LLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFG---LKPDKISKLIQSPWKSIESTKFLVYSNGSIKVGDGKNTP

Query:  FWKDTWLE
        FW+D WLE
Subjt:  FWKDTWLE

A0A5H2XQW2 TatD related DNase (e value: 1.1e-30; % identity: 32.6)
Query:  YVNFYKKKLEFLHNDQCFGLILFDFLSIFVKWNSEVVR------FGTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHT
        ++ F    + FL   + + L L   L +F   +   +        G  FS+  L  +   WG     W   YLG PL G P++  F +PV++K++KRL  
Subjt:  YVNFYKKKLEFLHNDQCFGLILFDFLSIFVKWNSEVVR------FGTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHT

Query:  WGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKER
        W    L +    +     +   P++ +SL  +    VT K+E+L +NFL  G  E K  HL+ W RV    EEGGLGI  ++++N +L AKW+WRF  E 
Subjt:  WGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKER

Query:  GALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYSNGSIKVGDGKNTPFWKDTWLENLSLQRL
         +LW R+I SKYG    G    +I K+  ++PW+ I              VG+G+   FW+D WL+   L+ L
Subjt:  GALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYSNGSIKVGDGKNTPFWKDTWLENLSLQRL

M5VS59 Reverse transcriptase domain-containing protein (Fragment) (e value: 3.2e-30; % identity: 35.86)
Query:  GTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLF
        G  FS   L  +   WG     W   YLG PL G P++  F +PV++K++KRL  W    L +    +     +   P++ +SL  +    V  K+E+L 
Subjt:  GTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLF

Query:  QNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYS
        +NFL  G  E K  HL+ WERV    EEGGLGI  ++++N +L AKW+WRF  E  +LW R+I SKYG    G    +I K+  ++PW+ I         
Subjt:  QNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYS

Query:  NGSIKVGDGKNTPFWKDTWLENLSLQ----RLRTLER
             VG+G+   FW+D WL+   L+    RL +L R
Subjt:  NGSIKVGDGKNTPFWKDTWLENLSLQ----RLRTLER

M5WZA0 Reverse transcriptase domain-containing protein (Fragment) (e value: 3.2e-30; % identity: 35.86)
Query:  GTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLF
        G  FS+  L  +   WG     W   YLG PL G P++  F +PVV+K++KRL  W    L +    +     +   P++ +SL  +    V  K+E+L 
Subjt:  GTIFSVAGLRRI---WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLF

Query:  QNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYS
        +NFL  G  E K  HL+ WERV    EEGGLGI  +++++ +L AKW+WRF  E  +LW R+I SKYG    G    +I K+  ++PW+ I         
Subjt:  QNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYS

Query:  NGSIKVGDGKNTPFWKDTWLENLSLQ----RLRTLER
             VG+G+   FW+D WL+   L+    RL +L R
Subjt:  NGSIKVGDGKNTPFWKDTWLENLSLQ----RLRTLER

M5X4S0 Reverse transcriptase domain-containing protein (Fragment) (e value: 4.2e-30; % identity: 36.94)
Query:  WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIH
        WG     W   YLG PL G P++  F +PV+EK++KRL  W    L +    +     +   P++ +SL  +    V  K+E+L +NFL  G  E K  H
Subjt:  WG-----WLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIK--PAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIH

Query:  LLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYSNGSIKVGDGKNTPFW
        L+ WERV    EEGGLGI  ++++N +L AKW+WRF  E  +LW R+I SKYG    G    +I K+  ++PW+ I              VG+G+   FW
Subjt:  LLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKL-IQSPWKSIESTKFLVYSNGSIKVGDGKNTPFW

Query:  KDTWLENLSLQ----RLRTLER
        +D WL+   L+    RL +L R
Subjt:  KDTWLENLSLQ----RLRTLER

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.5e-11; % identity: 29.1)
Query:  VTKKIEKLFQNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKLIQS----PW
        V K+I  +  +F  R   E KG+H   W+ +     EGG+G   I+  N +LL K +WR      +L  +V  S+Y   H   K D ++  + S     W
Subjt:  VTKKIEKLFQNFLCRGNNEFKGIHLLEWERVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKLIQS----PW

Query:  KSIESTKFLVYSNGSIKVGDGKNTPFWKDTWLEN
        KSI +++ ++       VG+G++   W+  WL++
Subjt:  KSIESTKFLVYSNGSIKVGDGKNTPFWKDTWLEN

AT5G24740.1 Protein of unknown function (DUF1162) (e value: 3.2e-06; % identity: 70.59)
Query:  RYGRFRPW-FPLSRKVTGWQILWWQYAQQSVLSD
        RYGR+RP    LSRK  GWQ LWW YAQ SVLSD
Subjt:  RYGRFRPW-FPLSRKVTGWQILWWQYAQQSVLSD
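
The % identity values listed with each hit appear to be the number of identical alignment columns divided by the alignment length, gaps included. That reading is an assumption inferred from the numbers on this page, not something the page states. A minimal Python sketch, using a hypothetical helper `percent_identity` and the AT5G24740.1 query and midline lines copied from above:

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    Assumes BLAST-style midlines: a letter marks an identity, '+' a
    conservative substitution, and a space a mismatch or gap.
    """
    # Pad the midline in case trailing spaces were trimmed during copying.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query and midline of the AT5G24740.1 hit, as shown on this page.
query = "RYGRFRPW-FPLSRKVTGWQILWWQYAQQSVLSD"
midline = "RYGR+RP    LSRK  GWQ LWW YAQ SVLSD"

print(percent_identity(query, midline))  # 24 identities over 34 columns -> 70.59
```

Under this assumption the result matches the 70.59 reported for AT5G24740.1.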

AT5G24740.2 Protein of unknown function (DUF1162) (e value: 1.1e-19; % identity: 63.41)
Query:  RYGRFRPW-FPLSRKVTGWQILWWQYAQQSVLSDVRKKLKKCSWRYLGQRLIYRRRYVNFYKKKLEFLHNDQCFGLILFDFL
        RYGR+RP    LSRK  GWQ LWW YAQ SVLSDVRKKL K SWR+L QR+  RRRY+NFYK KL+ L  +Q    ILF  L
Subjt:  RYGRFRPW-FPLSRKVTGWQILWWQYAQQSVLSDVRKKLKKCSWRYLGQRLIYRRRYVNFYKKKLEFLHNDQCFGLILFDFL


Sequences
CDS sequence
ATGCACATAATGGCTATATATCACAGTCTACCTAACCTCTTATTCTGCAGGTATGGAAGATTTCGCCCCTGGTTTCCTCTGTCAAGGAAAGTTACGGGTTGGCAAATATT
ATGGTGGCAGTATGCACAGCAATCTGTCTTGTCAGATGTACGAAAGAAATTAAAGAAATGTTCATGGAGATATCTTGGACAGAGACTAATCTATCGTCGGAGGTATGTCA
ACTTTTACAAGAAGAAGTTGGAGTTTCTTCACAATGACCAGTGTTTTGGACTAATTCTCTTCGATTTTTTGAGTATTTTCGTCAAATGGAATTCCGAAGTTGTAAGATTT
GGAACAATTTTTTCTGTTGCTGGTTTGAGGAGGATATGGGGATGGCTGGACACTTACCTTGGTCCACCGTTGAATGGTAAACCAAAATCCTCTCCCTTTTTGGATCCTGT
TGTTGAGAAAATTAAGAAAAGGCTTCATACATGGGGCTCCACATCTCTCCGAAGGAAGAAGGCTCACTCTTTTTCTGGCCACACTATCAAACCTGCCCATATATTATCTC
TCCCTCTTCAAGGTTCCCACAAGGTGACTAAAAAGATTGAGAAACTATTTCAGAATTTTTTATGTAGAGGCAACAACGAATTCAAAGGCATCCATCTTTTGGAATGGGAG
AGAGTAAAAGTTCCCATCGAGGAAGGCGGCCTAGGCATTATGGGTATCCAACAGAAAAACTCTTCTCTCCTGGCAAAATGGATTTGGCGTTTTCATAAAGAGAGAGGAGC
TCTATGGCGTCGAGTTATAGGTTCTAAATATGGGTCAATTCATTTTGGCCTAAAGCCAGACAAAATCTCTAAACTCATCCAAAGCCCATGGAAATCTATTGAGAGCACGA
AATTTTTGGTGTATTCCAATGGTTCGATCAAGGTGGGTGATGGTAAAAACACTCCCTTTTGGAAGGACACATGGTTGGAGAATTTGAGCCTTCAACGTTTGAGGACTTTG
GAAAGGGGGGGTAGGAAGAAAAGGTAG
mRNA sequence
ATGCACATAATGGCTATATATCACAGTCTACCTAACCTCTTATTCTGCAGGTATGGAAGATTTCGCCCCTGGTTTCCTCTGTCAAGGAAAGTTACGGGTTGGCAAATATT
ATGGTGGCAGTATGCACAGCAATCTGTCTTGTCAGATGTACGAAAGAAATTAAAGAAATGTTCATGGAGATATCTTGGACAGAGACTAATCTATCGTCGGAGGTATGTCA
ACTTTTACAAGAAGAAGTTGGAGTTTCTTCACAATGACCAGTGTTTTGGACTAATTCTCTTCGATTTTTTGAGTATTTTCGTCAAATGGAATTCCGAAGTTGTAAGATTT
GGAACAATTTTTTCTGTTGCTGGTTTGAGGAGGATATGGGGATGGCTGGACACTTACCTTGGTCCACCGTTGAATGGTAAACCAAAATCCTCTCCCTTTTTGGATCCTGT
TGTTGAGAAAATTAAGAAAAGGCTTCATACATGGGGCTCCACATCTCTCCGAAGGAAGAAGGCTCACTCTTTTTCTGGCCACACTATCAAACCTGCCCATATATTATCTC
TCCCTCTTCAAGGTTCCCACAAGGTGACTAAAAAGATTGAGAAACTATTTCAGAATTTTTTATGTAGAGGCAACAACGAATTCAAAGGCATCCATCTTTTGGAATGGGAG
AGAGTAAAAGTTCCCATCGAGGAAGGCGGCCTAGGCATTATGGGTATCCAACAGAAAAACTCTTCTCTCCTGGCAAAATGGATTTGGCGTTTTCATAAAGAGAGAGGAGC
TCTATGGCGTCGAGTTATAGGTTCTAAATATGGGTCAATTCATTTTGGCCTAAAGCCAGACAAAATCTCTAAACTCATCCAAAGCCCATGGAAATCTATTGAGAGCACGA
AATTTTTGGTGTATTCCAATGGTTCGATCAAGGTGGGTGATGGTAAAAACACTCCCTTTTGGAAGGACACATGGTTGGAGAATTTGAGCCTTCAACGTTTGAGGACTTTG
GAAAGGGGGGGTAGGAAGAAAAGGTAG
Protein sequence
MHIMAIYHSLPNLLFCRYGRFRPWFPLSRKVTGWQILWWQYAQQSVLSDVRKKLKKCSWRYLGQRLIYRRRYVNFYKKKLEFLHNDQCFGLILFDFLSIFVKWNSEVVRF
GTIFSVAGLRRIWGWLDTYLGPPLNGKPKSSPFLDPVVEKIKKRLHTWGSTSLRRKKAHSFSGHTIKPAHILSLPLQGSHKVTKKIEKLFQNFLCRGNNEFKGIHLLEWE
RVKVPIEEGGLGIMGIQQKNSSLLAKWIWRFHKERGALWRRVIGSKYGSIHFGLKPDKISKLIQSPWKSIESTKFLVYSNGSIKVGDGKNTPFWKDTWLENLSLQRLRTL
ERGGRKKR
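
The protein sequence above looks like a direct translation of the CDS under the standard genetic code. The sketch below is one way to check that locally. It assumes the standard code (NCBI translation table 1), and the `translate` helper is hypothetical, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted directly.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i : i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS listed on this page.
print(translate("ATGCACATAATGGCTATATATCACAGTCTACCTAACCTC"))  # MHIMAIYHSLPNL
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence would confirm whether the two records agree.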