; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017499 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017499
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:4437091..4438169
RNA-Seq ExpressionLag0017499
SyntenyLag0017499
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]6.0e-4942.16Show/hide
Query:  KGSNSRSWKRRARDILSDVTNRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHG--
        K    RSW     D        D S     APPG MS I WN RG G+PR  R L +L QDKRP + FL ETK  AS M   K  LGFD  F VDC G  
Subjt:  KGSNSRSWKRRARDILSDVTNRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHG--

Query:  --RSGGLALMWVSSVSFNLLSFSKNHIDGWIS--WDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSA
          R+GGLAL W +++   L SFS NHID  +S    + +WRLTG +GFP      Q+W LL  L    D  WL  GDFN I+   EK GG  K    + A
Subjt:  --RSGGLALMWVSSVSFNLLSFSKNHIDGWIS--WDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSA

Query:  FQSVVDLCGLVDLGFAGDRL---RGAIDDRGGTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLSPPPQCWSQRVSRL
        F+   + CG  D+GF G       G  D     I ERLDRVF T  W   +P   VNH+ +  SDH  +E+     P     R  RL
Subjt:  FQSVVDLCGLVDLGFAGDRL---RGAIDDRGGTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLSPPPQCWSQRVSRL

OMO59710.1 reverse transcriptase [Corchorus capsularis]1.1e-4542.02Show/hide
Query:  MAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISW
        + PPG M  + WN RG G+P T R L +L++ +RP V FL ETK+   ++ S +       CF V   GRSGGLA+ W  SV   L+SFS++HID W+  
Subjt:  MAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISW

Query:  DDR--RWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRL---RGAIDDRGGT
        + R  +WRLTGFYG         SW LL +     +  W   GDFN +L Q EKDGGR++P +++ AF+  +D CGL D+G+ G+     RG  ++    
Subjt:  DDR--RWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRL---RGAIDDRGGT

Query:  IYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVEL
        I+ERLDR   T  WT  +P   + HL  S SDH P+ L
Subjt:  IYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVEL

XP_024021734.1 uncharacterized protein LOC112091706 [Morus notabilis]5.6e-4742.86Show/hide
Query:  MSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISWDDR-RW
        MS+I WNVRG G+PR F +L  L++D  P + FL ET++ +    S K   GF   F VDC GRSGGL LMW   V  ++ S+S++HID  + +     W
Subjt:  MSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISWDDR-RW

Query:  RLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAIDDRGGT-IYERLDRVF
        R TGFYG PS      SW+LL RL+   +  W++ GDFN IL   +K+GG ++    ++ F+  V+ C LVDLGF G +        GG  I ERLDR  
Subjt:  RLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAIDDRGGT-IYERLDRVF

Query:  GTTAWTDLYPNYAVNHLDYSQSDHRPVELVL
        G   W +L+P Y V ++D+  SDHR + L L
Subjt:  GTTAWTDLYPNYAVNHLDYSQSDHRPVELVL

XP_031099776.1 uncharacterized protein LOC116003973 [Ipomoea triloba]3.6e-4641Show/hide
Query:  MSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISWD--DRR
        MS++ WN  G G+P T + L  LV  K+P + FL+ET    +++S  K+ LGF   FCV+  G SGGLAL+W      ++ S+SK+HID  IS    D  
Subjt:  MSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISWD--DRR

Query:  WRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAG-----DRLRGAIDDRGGTIYER
        WR TGFYG P     P+SW+LL +L       W + GDFN IL QDEK GG  +PL  +S F+  V+  GL D GF G     +R +G        +  +
Subjt:  WRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAG-----DRLRGAIDDRGGTIYER

Query:  LDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLSPP
        LDR+  T +W DL+PN     +  ++SDH P+ L + PP
Subjt:  LDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLSPP

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis]1.7e-4843.04Show/hide
Query:  MSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWI--SWDDRR
        M  I WN RG G+PR  R L  LV+ + P V FL ETK+  ++M   K +LG++ CF V   GRSGGLALMW    + N+ S+SKNHID  I  +  D +
Subjt:  MSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWI--SWDDRR

Query:  WRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAID-DRGGTIYERLDRV
        W+ TG YG P  +L  ++W+ +  LRG V   WL+ GDFN +LC  EK GGR++P  ++  F+ ++D C  VDLGF G         D   T+ ERLDR 
Subjt:  WRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAID-DRGGTIYERLDRV

Query:  FGTTAWTDLYPNYAVNHLDYSQSDHRPVEL
             W D +P + V H   + SDH P+ L
Subjt:  FGTTAWTDLYPNYAVNHLDYSQSDHRPVEL

TrEMBL top hitse value%identityAlignment
A0A1R3GNW3 Reverse transcriptase5.1e-4642.02Show/hide
Query:  MAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISW
        + PPG M  + WN RG G+P T R L +L++ +RP V FL ETK+   ++ S +       CF V   GRSGGLA+ W  SV   L+SFS++HID W+  
Subjt:  MAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISW

Query:  DDR--RWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRL---RGAIDDRGGT
        + R  +WRLTGFYG         SW LL +     +  W   GDFN +L Q EKDGGR++P +++ AF+  +D CGL D+G+ G+     RG  ++    
Subjt:  DDR--RWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRL---RGAIDDRGGT

Query:  IYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVEL
        I+ERLDR   T  WT  +P   + HL  S SDH P+ L
Subjt:  IYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVEL

A0A2N9F086 Reverse transcriptase domain-containing protein6.0e-4740.74Show/hide
Query:  GAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGW
        G   APP  M +I WN RG G+P   R L  LV+ + P+V FL ETK+  S M   +  LGF++ F V   GRSGGLA+ W   ++  + +++ +HID +
Subjt:  GAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGW

Query:  I-SWDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAIDDRG-GT
        I   +D  WRLTGFYG P      +SW+L+ +L G     WL  GDFN I+ Q+EK G   +PL  +  F+ V+  C L+D+G+ G       + RG   
Subjt:  I-SWDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAIDDRG-GT

Query:  IYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLSPP
        + ERLDR   + AWTDL+PN  V+H+  S SDH P+ + +  P
Subjt:  IYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLSPP

A0A2N9G8I6 Reverse transcriptase domain-containing protein1.8e-4641.43Show/hide
Query:  KRRARDILSDVTNRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWV
        KR    IL    N +       APP  M ++ WN +G G+P   R L  +V+ K P+V FL ETK+ A RM   +  LGFDN F V   GRSGGLAL+W 
Subjt:  KRRARDILSDVTNRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWV

Query:  SSVSFNLLSFSKNHIDGWI-SWDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDL
        +     + ++S++HID  + S   + WRLTGFYG P      +SW+LL  L       W   GDFN IL  +EK GGR++ L ++  FQ  V++C  VDL
Subjt:  SSVSFNLLSFSKNHIDGWI-SWDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDL

Query:  GFAGDRLRGAID-DRGGTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDH
        GF G       + D    I  RLDR   T  W DL+P Y+V+H+  S SDH
Subjt:  GFAGDRLRGAID-DRGGTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDH

A0A2N9H936 Uncharacterized protein4.2e-4841.36Show/hide
Query:  DSVGQGKGVDLGG-----KGSNSRSWKRRARD------ILSDVT----NRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSET
        +S+ + K VDLGG     K     +WKR AR+      I S+ +     RD   G    PP  MS +FWN RG G+P+T   LT +V+ K P V FLSET
Subjt:  DSVGQGKGVDLGG-----KGSNSRSWKRRARD------ILSDVT----NRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSET

Query:  KVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISWD-DRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGD
        K+   ++   +   GF   F V   G+SGGLA+ W S +  ++ S+S +HID  + +D +  WR TGFYG P+      +W LL  LRG     WL GGD
Subjt:  KVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISWD-DRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGD

Query:  FNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAIDDRG-GTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLS
        FN IL  +EK G   +P S++SAF+ VVD CG VDLGF G          G   + ERLDR   T  W   +P   VNHL    SDHRP+ + LS
Subjt:  FNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDLGFAGDRLRGAIDDRG-GTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLS

A0A2N9IBI9 Reverse transcriptase domain-containing protein1.8e-4641.43Show/hide
Query:  KRRARDILSDVTNRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWV
        KR    IL    N +       APP  M ++ WN +G G+P   R L  +V+ K P+V FL ETK+ A RM   +  LGFDN F V   GRSGGLAL+W 
Subjt:  KRRARDILSDVTNRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCVDCHGRSGGLALMWV

Query:  SSVSFNLLSFSKNHIDGWI-SWDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDL
        +     + ++S++HID  + S   + WRLTGFYG P      +SW+LL  L       W   GDFN IL  +EK GGR++ L ++  FQ  V++C  VDL
Subjt:  SSVSFNLLSFSKNHIDGWI-SWDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGLVDL

Query:  GFAGDRLRGAID-DRGGTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDH
        GF G       + D    I  RLDR   T  W DL+P Y+V+H+  S SDH
Subjt:  GFAGDRLRGAID-DRGGTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTGGACTCTGTGGGGCAGGGCAAGGGGGTGGATTTGGGGGGGAAAGGGAGTAACAGTAGGAGTTGGAAGAGGAGGGCGAGGGATATCTTGTCCGATGTGACCAA
TCGTGATGGTTCTCTGGGAGCTGTTATGGCCCCGCCAGGGATCATGAGTGTGATATTCTGGAATGTTCGGGGGTCGGGGTCACCCCGAACATTCAGGCGCCTGACCAAGC
TGGTTCAGGATAAACGACCTCAGGTGTTCTTCTTATCTGAAACGAAGGTGTCTGCCAGTAGGATGTCTTCTGCGAAGAGTGTGCTGGGATTCGATAACTGTTTCTGTGTT
GACTGTCATGGACGGAGTGGTGGGTTGGCCCTTATGTGGGTTTCCTCGGTATCTTTCAACCTCCTCTCTTTCTCCAAGAATCACATTGATGGTTGGATTTCATGGGATGA
CCGTAGGTGGCGACTAACTGGTTTCTATGGATTTCCTTCAGCGGACCTGCATCCTCAGTCCTGGTCTCTCCTATCTAGACTGAGGGGATGTGTTGACACTCTGTGGCTGA
TTGGTGGTGATTTTAACGCCATCCTATGTCAGGATGAGAAAGATGGGGGCAGGGATAAGCCACTCTCTGAGCTGTCTGCTTTTCAGAGTGTGGTTGACTTATGTGGCTTG
GTGGATTTGGGGTTTGCGGGAGATCGTTTACGTGGTGCAATAGACGACCGGGGTGGAACGATTTATGAACGGTTGGATCGGGTTTTTGGCACAACAGCTTGGACAGACCT
CTATCCAAACTATGCAGTTAATCACCTCGACTATAGTCAGTCTGATCATAGACCAGTGGAGCTAGTCCTTAGCCCCCCTCCGCAGTGCTGGTCCCAGAGGGTCAGCAGAT
TGCCCGTTTTGAGGAAACTTGGCTTAGGCAGCGGATTTGTGCAGTTGGTGGATGTTCGTGGAGCGCAGGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGTGGACTCTGTGGGGCAGGGCAAGGGGGTGGATTTGGGGGGGAAAGGGAGTAACAGTAGGAGTTGGAAGAGGAGGGCGAGGGATATCTTGTCCGATGTGACCAA
TCGTGATGGTTCTCTGGGAGCTGTTATGGCCCCGCCAGGGATCATGAGTGTGATATTCTGGAATGTTCGGGGGTCGGGGTCACCCCGAACATTCAGGCGCCTGACCAAGC
TGGTTCAGGATAAACGACCTCAGGTGTTCTTCTTATCTGAAACGAAGGTGTCTGCCAGTAGGATGTCTTCTGCGAAGAGTGTGCTGGGATTCGATAACTGTTTCTGTGTT
GACTGTCATGGACGGAGTGGTGGGTTGGCCCTTATGTGGGTTTCCTCGGTATCTTTCAACCTCCTCTCTTTCTCCAAGAATCACATTGATGGTTGGATTTCATGGGATGA
CCGTAGGTGGCGACTAACTGGTTTCTATGGATTTCCTTCAGCGGACCTGCATCCTCAGTCCTGGTCTCTCCTATCTAGACTGAGGGGATGTGTTGACACTCTGTGGCTGA
TTGGTGGTGATTTTAACGCCATCCTATGTCAGGATGAGAAAGATGGGGGCAGGGATAAGCCACTCTCTGAGCTGTCTGCTTTTCAGAGTGTGGTTGACTTATGTGGCTTG
GTGGATTTGGGGTTTGCGGGAGATCGTTTACGTGGTGCAATAGACGACCGGGGTGGAACGATTTATGAACGGTTGGATCGGGTTTTTGGCACAACAGCTTGGACAGACCT
CTATCCAAACTATGCAGTTAATCACCTCGACTATAGTCAGTCTGATCATAGACCAGTGGAGCTAGTCCTTAGCCCCCCTCCGCAGTGCTGGTCCCAGAGGGTCAGCAGAT
TGCCCGTTTTGAGGAAACTTGGCTTAGGCAGCGGATTTGTGCAGTTGGTGGATGTTCGTGGAGCGCAGGGCTAG
Protein sequenceShow/hide protein sequence
MLVDSVGQGKGVDLGGKGSNSRSWKRRARDILSDVTNRDGSLGAVMAPPGIMSVIFWNVRGSGSPRTFRRLTKLVQDKRPQVFFLSETKVSASRMSSAKSVLGFDNCFCV
DCHGRSGGLALMWVSSVSFNLLSFSKNHIDGWISWDDRRWRLTGFYGFPSADLHPQSWSLLSRLRGCVDTLWLIGGDFNAILCQDEKDGGRDKPLSELSAFQSVVDLCGL
VDLGFAGDRLRGAIDDRGGTIYERLDRVFGTTAWTDLYPNYAVNHLDYSQSDHRPVELVLSPPPQCWSQRVSRLPVLRKLGLGSGFVQLVDVRGAQG