; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009399 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009399
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:38852231..38853682
RNA-Seq ExpressionLag0009399
SyntenyLag0009399
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_012461392.1 PREDICTED: uncharacterized protein LOC105781396 [Gossypium raimondii]1.2e-2751.91Show/hide
Query:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT
        +R+L  P+S+E++   +  M PTKA G DG  A+ YQK W  VG+D++  CL IL    E+ PIN T I  IPKV  P  M +FRPISLCNV+YK+IAK 
Subjt:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT

Query:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL
        LANR ++V G  I   QS F+P R  SDNVL
Subjt:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL

XP_015384883.1 uncharacterized protein LOC107176610 [Citrus sinensis]8.6e-2951.15Show/hide
Query:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSW-DVGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT
        +R+LD+PF+ E++  T   MSPTKA GPDG+ A  +QK W  V   +I  CL IL  +  ++P+N T I+ IPKV +P R+S FRPISLCNV+Y+++AKT
Subjt:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSW-DVGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT

Query:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL
        +ANR+K +   +IS TQS FIP R  SDN +
Subjt:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL

XP_024033492.1 uncharacterized protein LOC112095617 [Citrus clementina]8.6e-2951.15Show/hide
Query:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSW-DVGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT
        +R+LD+PF+ E++  T   MSPTKA GPDG+ A  +QK W  V   +I  CL IL  +  ++P+N T I+ IPKV +P R+S FRPISLCNV+Y+++AKT
Subjt:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSW-DVGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT

Query:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL
        +ANR+K +   +IS TQS FIP R  SDN +
Subjt:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL

XP_024039324.1 uncharacterized protein LOC112097962 [Citrus clementina]2.3e-2941.94Show/hide
Query:  WIVGGDLNEIMYNKEKNEIMYNKEKKGGVPKPFKLLNDFCETIRYCDLIDVGFSGDRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGE
        W+  GD N   Y   K      K K  G+      + D  + I       V  S + +L+ PF++E++   L  MSPTKA GPDG+ A+ +QK W  V  
Subjt:  WIVGGDLNEIMYNKEKNEIMYNKEKKGGVPKPFKLLNDFCETIRYCDLIDVGFSGDRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGE

Query:  DIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL
         +I  C+ IL     +  +N T I+ IPK  +PK++++FRPISLCNV+Y++IAKT+ANR+K +   IISPTQS FIP R  SDNV+
Subjt:  DIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.2e-2750Show/hide
Query:  DLIDVGFSGDRR--LDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISL
        D +D   + D R  L   F+ E+++A L  M PTKA GPDG++A+ YQK W  VG+ ++   L  L +   +  IN T I  IPKV+ P+RMS+FRPISL
Subjt:  DLIDVGFSGDRR--LDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISL

Query:  CNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL
        CNV+YK+I+K LANR+K V   IIS TQS F+P R  +DNVL
Subjt:  CNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL

TrEMBL top hitse value%identityAlignment
A0A5B6V614 Reverse transcriptase2.3e-2750.34Show/hide
Query:  RYCDLIDVGFSGD--RRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRP
        R  DL++   S D    L  PFS+E++   +KSM+P KA G DG  A+ +QK W+ +G DI + CL IL  + EM  INKT I  IPKV  PK +S+FRP
Subjt:  RYCDLIDVGFSGD--RRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRP

Query:  ISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL
        ISLCNV+YK+IAK + NRM  V G  I   Q  FI  RQ SDN L
Subjt:  ISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL

A0A5B6VYT4 Reverse transcriptase6.6e-2738.07Show/hide
Query:  WEDHDTMSLPWIVGGDLNEIMYNK------EKNEIMYNKEKKG-GVPKPFKLLN-------------DFCETIRYCDLIDVGFSGDR--RLDTPFSKEKL
        WE    ++  W+  GD N   +++      ++N+I+  ++K+G  V +  ++LN             +     R  DL++   + D    L  PFS+E++
Subjt:  WEDHDTMSLPWIVGGDLNEIMYNK------EKNEIMYNKEKKG-GVPKPFKLLN-------------DFCETIRYCDLIDVGFSGDR--RLDTPFSKEKL

Query:  EATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSII
           +KS +P KA G DG  A+ +QK WD VG DI + CL +L  + EM+ INKT I  IPKV  PK++S+F PISLCNV+YK+IAK + NRM  + G  I
Subjt:  EATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSII

Query:  SPTQSTFIPRRQTSDNVL
           Q  FI  RQ SDN L
Subjt:  SPTQSTFIPRRQTSDNVL

A0A5B6WI49 Reverse transcriptase3.9e-2733.21Show/hide
Query:  PWIVGGDLNEIMYNKEKNEIMYNKEKKGGVPKPFKLLNDFCETIRYCDLIDVGFSGD---------------RRLD------------------------
        PW+V  D          NEI+Y  EKKGG+P+  K + +F + +  C L D+G+SG+                RLD                        
Subjt:  PWIVGGDLNEIMYNKEKNEIMYNKEKKGGVPKPFKLLNDFCETIRYCDLIDVGFSGD---------------RRLD------------------------

Query:  ---------------------------------------TPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINK
                                                 F+KE+++  LK M PTKA G DG+ A+ YQK W  +GED+   CL  L    ++  INK
Subjt:  ---------------------------------------TPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINK

Query:  TLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL
        T I  IPKV  P  +S+FRPISLCNV+YK+IAK +ANR++ V    I   QS F+P R   DNVL
Subjt:  TLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVL

A0A6J1DX30 uncharacterized protein LOC1110248743.9e-2747.33Show/hide
Query:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT
        + +L  P++KE++E  ++ M PTKALGPDG  A+ YQ  W  VG   +  CL  L + ++++  N T I+ IPK+K+P+ +S FRPISLCNV YK+I+K+
Subjt:  DRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKT

Query:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL
        + NR+K V G +IS  QS F+P R  SDNV+
Subjt:  LANRMKTVFGSIISPTQSTFIPRRQTSDNVL

A0A6J5UD52 Reverse transcriptase domain-containing protein3.9e-2749.23Show/hide
Query:  PFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMK
        P+S++++E  L S+ P+KA GPDG+ A+ YQK W  VG D+  +C+ +L   + +   N TL++ IPKV  P R+S++RPISLCNV+YK+I+KTLANR+K
Subjt:  PFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMK

Query:  TVFGSIISPTQSTFIPRRQTSDNVLTLLIT
         V   +IS  QS FIP R   DNVL    T
Subjt:  TVFGSIISPTQSTFIPRRQTSDNVLTLLIT

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.8e-0531.3Show/hide
Query:  LDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGEDIIRVCLGIL--THKEEMEP--INKTLISPIPKV-KEPKRMSKFRPISLCNVVYKLIAK
        L+ P +  ++ A + S+   K+ GPDG  A  YQ+     E+++   L +     KE + P    +  I  IPK  ++  +   FRPISL N+  K++ K
Subjt:  LDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGEDIIRVCLGIL--THKEEMEP--INKTLISPIPKV-KEPKRMSKFRPISLCNVVYKLIAK

Query:  TLANRMKTVFGSIISPTQSTFIPRRQTSDNV
         LANR++     +I   Q  FIP  Q   N+
Subjt:  TLANRMKTVFGSIISPTQSTFIPRRQTSDNV

P08548 LINE-1 reverse transcriptase homolog6.0e-0926.37Show/hide
Query:  NEIMYNKEKNEIMYNKEKKGGVPKPFKLLNDFCETIRYCDLIDVGFSGDRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGEDIIRVCLG
        +EI  +  + + + N+  K      ++ L +  + +  C L  +       L+ P S  ++ +T++++   K+ GPDG  +  YQ      E+++ + L 
Subjt:  NEIMYNKEKNEIMYNKEKKGGVPKPFKLLNDFCETIRYCDLIDVGFSGDRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGEDIIRVCLG

Query:  ILTHKEEMEPINKTL----ISPIPKV-KEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNV
        +  + E+   +  T     I+ IPK  K+P R   +RPISL N+  K++ K L NR++     II   Q  FIP  Q   N+
Subjt:  ILTHKEEMEPINKTL----ISPIPKV-KEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNV

P11369 LINE-1 retrotransposable element ORF2 protein7.6e-1238.64Show/hide
Query:  LDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGEDIIRVCLGILTHKEEME-----PINKTLISPIPK-VKEPKRMSKFRPISLCNVVYKLIA
        L++P S +++EA + S+   K+ GPDG  A  YQ      ED+I + L  L HK E+E        +  I+ IPK  K+P ++  FRPISL N+  K++ 
Subjt:  LDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGEDIIRVCLGILTHKEEME-----PINKTLISPIPK-VKEPKRMSKFRPISLCNVVYKLIA

Query:  KTLANRMKTVFGSIISPTQSTFIPRRQTSDNV
        K LANR++    +II P Q  FIP  Q   N+
Subjt:  KTLANRMKTVFGSIISPTQSTFIPRRQTSDNV

P14381 Transposon TX1 uncharacterized 149 kDa protein8.9e-1330.57Show/hide
Query:  PFKLLNDFCETIRYCDLIDVGFSGDRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVK
        P  +  D CE + +  L  V      RL+TP + ++L   L+ M   K+ G DG+    +Q  WD +G D  RV        E      + ++S +PK  
Subjt:  PFKLLNDFCETIRYCDLIDVGFSGDRRLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWD-VGEDIIRVCLGILTHKEEMEPINKTLISPIPKVK

Query:  EPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVLTL
        + + +  +RP+SL +  YK++AK ++ R+K+V   +I P QS  +P R   DNV  +
Subjt:  EPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVLTL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.6e-0735.79Show/hide
Query:  RLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGED-IIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLI
        RL    S +++ A + +M   KA GPD   A  + +SW V +D  I            ++  N T I+ IPKV    ++S FRP+S C VVYK+I
Subjt:  RLDTPFSKEKLEATLKSMSPTKALGPDGVHAMLYQKSWDVGED-IIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGGTGGAGATTTACAGGATTTTATGGCAATCCATACCAAAGCAAGAGGAGCGAGTCCTGGAAGCTGCTGGGAAGACCATGATACAATGAGCTTACCTTGGATCGT
TGGAGGTGACCTCAACGAGATTATGTACAACAAGGAAAAGAACGAGATTATGTACAACAAGGAAAAGAAAGGGGGTGTTCCTAAACCTTTCAAACTCTTAAATGATTTTT
GTGAAACTATAAGATATTGTGACTTGATTGATGTGGGTTTTTCTGGTGACAGGAGGCTAGACACCCCGTTCTCGAAAGAAAAGCTAGAAGCAACCCTCAAAAGCATGAGC
CCCACAAAGGCACTGGGTCCTGACGGTGTACATGCTATGCTTTATCAAAAGTCCTGGGACGTTGGGGAGGATATCATCAGAGTTTGCCTAGGTATCCTAACCCATAAAGA
GGAGATGGAACCAATTAATAAAACCTTAATATCCCCCATTCCAAAAGTCAAGGAGCCTAAGAGAATGAGCAAATTCAGACCTATTAGTTTGTGCAATGTGGTGTATAAAC
TAATAGCTAAAACACTAGCAAATAGAATGAAGACGGTCTTTGGCTCGATCATCTCCCCCACCCAATCAACGTTTATCCCTAGAAGACAGACTTCAGACAATGTTTTGACG
CTCTTAATAACAGAAGGAAAGGCAAGACAAGGCATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGGTGGAGATTTACAGGATTTTATGGCAATCCATACCAAAGCAAGAGGAGCGAGTCCTGGAAGCTGCTGGGAAGACCATGATACAATGAGCTTACCTTGGATCGT
TGGAGGTGACCTCAACGAGATTATGTACAACAAGGAAAAGAACGAGATTATGTACAACAAGGAAAAGAAAGGGGGTGTTCCTAAACCTTTCAAACTCTTAAATGATTTTT
GTGAAACTATAAGATATTGTGACTTGATTGATGTGGGTTTTTCTGGTGACAGGAGGCTAGACACCCCGTTCTCGAAAGAAAAGCTAGAAGCAACCCTCAAAAGCATGAGC
CCCACAAAGGCACTGGGTCCTGACGGTGTACATGCTATGCTTTATCAAAAGTCCTGGGACGTTGGGGAGGATATCATCAGAGTTTGCCTAGGTATCCTAACCCATAAAGA
GGAGATGGAACCAATTAATAAAACCTTAATATCCCCCATTCCAAAAGTCAAGGAGCCTAAGAGAATGAGCAAATTCAGACCTATTAGTTTGTGCAATGTGGTGTATAAAC
TAATAGCTAAAACACTAGCAAATAGAATGAAGACGGTCTTTGGCTCGATCATCTCCCCCACCCAATCAACGTTTATCCCTAGAAGACAGACTTCAGACAATGTTTTGACG
CTCTTAATAACAGAAGGAAAGGCAAGACAAGGCATCTGA
Protein sequenceShow/hide protein sequence
MIGGDLQDFMAIHTKARGASPGSCWEDHDTMSLPWIVGGDLNEIMYNKEKNEIMYNKEKKGGVPKPFKLLNDFCETIRYCDLIDVGFSGDRRLDTPFSKEKLEATLKSMS
PTKALGPDGVHAMLYQKSWDVGEDIIRVCLGILTHKEEMEPINKTLISPIPKVKEPKRMSKFRPISLCNVVYKLIAKTLANRMKTVFGSIISPTQSTFIPRRQTSDNVLT
LLITEGKARQGI