CuGenDBv2

Lag0018361 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018361
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr5:24434505..24435065
RNA-Seq Expression: Lag0018361
Synteny: Lag0018361
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
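The genome location can be turned into a chromosome name, a pair of coordinates and a span length. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the `chrom:start..end` string uses 1-based, inclusive coordinates, which the page does not state.

```python
import re

# Location string copied from the record. The coordinates are ASSUMED to be
# 1-based and inclusive; the page itself does not say so.
LOCATION = "chr5:24434505..24435065"


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


chrom, start, end = parse_location(LOCATION)
length = span_length(start, end)
print(chrom, start, end, length)
```

Under the inclusive assumption the span is 561 bp, the same length as the CDS listed under Sequences (five lines of 110 nt plus one of 11 nt). That agreement would be consistent with a gene model without introns, although the record does not say so explicitly.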


Homology
GenBank top hits
KAA3455882.1 reverse transcriptase [Gossypium australe] (e value: 2.1e-38; % identity: 45.99)
Query:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG
        M +IA  YF +LFES      D   +L GV P ISES N+ L  PF ++E+  ALK M L    G+DG  ++F+Q YW I+G++T+  C  +LN G+S+ 
Subjt:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG

Query:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG
         +NKTL+  I  T  P  + NF PISL   IYK+I K+++NRLK +LE+ I  +Q+ F+P RLI+DNV+L++E +H L N+R GR+G
Subjt:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG

KAA3457116.1 reverse transcriptase [Gossypium australe] (e value: 3.4e-36; % identity: 44.39)
Query:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG
        M +IA  YF  LFES      DME +L G+   IS+S N+ +   F++ +I  A+  M      G DG  ++F+Q +W IVG+DTT  CL +LN G S+ 
Subjt:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG

Query:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG
         +N+TLI  I  T  P  + NF PISL  V+YK+IAKA++N+L+ +L+  I  +Q+ FVP RLI+DNV+L++E +H L N+R GR+G
Subjt:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG

KAA3473268.1 reverse transcriptase [Gossypium australe] (e value: 2.4e-37; % identity: 45.99)
Query:  DIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPL
        +IA  YF NLFES      D+  +L GV P ISES N+ L  PF+K EI  ALK M      G +G  ++F++ +W I+G DT+  CLE+LN G S+  +
Subjt:  DIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPL

Query:  NKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL
        N+T++  I     P  + NF PISL  VIYK+I K+++NRLK +LE  I   Q+ FVP RLI+DNV+L++E +H L N+R GR+G +
Subjt:  NKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 6.9e-37; % identity: 44.81)
Query:  YFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSM----NLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPLNKTLI
        +F+ LF SS PS+  +   L+G++P +S+  N  L  PF+  +I  AL  M      G DG+ + FFQ +W+IVGE  T+ CL ILNE  ++  LN T I
Subjt:  YFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSM----NLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPLNKTLI

Query:  AFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTLL
        A I   + P+++  F PISL NV+Y+++AKA++NRLK IL  +ISP Q+ F+P RLI+DNVI+ +EC+H +    KGRR  L+
Subjt:  AFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTLL

XP_042972810.1 uncharacterized protein LOC122304618 [Carya illinoinensis] (e value: 2.0e-36; % identity: 45.99)
Query:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG
        + D+ S YF+ LF SS PS   +E  L  V P +    N +L++ FS  E++AAL  MN     G DG  +LF+QS+WE+VG+D T   LEILN      
Subjt:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG

Query:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG
         +N T I  I   K+P+ + +F PISLYNVIYK+++K LSNRLK IL  +I+PTQ+ F+P R+I DNVI++FE +H ++ + KG++G
Subjt:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG

TrEMBL top hits
A0A2N9ELB0 Uncharacterized protein (e value: 3.3e-37; % identity: 47.09)
Query:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAAL----KSMNLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG
        +  IA  YF NLF SS P  E ++ V++ V P +S   N SL +P+S  EI  AL     S   G DG+ +LFFQ YW IVG D +   L+ LN G+ +G
Subjt:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAAL----KSMNLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG

Query:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL
         +N T IA I   K+P+ M NF PISL NV+YK+++K L NR+K IL +VIS +Q+ FVP RLI+DNVI++FE +H+L N R G    +
Subjt:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL

A0A2N9ESR2 Uncharacterized protein (e value: 6.7e-38; % identity: 44.81)
Query:  SYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPLNKTL
        +Y++ LF ++    ED+E +L+G+ P +++  N  L+ PF++ EIE A+K M      G DG+  +F+QSYW++VG D +   L+ LN G     LN T 
Subjt:  SYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPLNKTL

Query:  IAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL
        +  I  TK+P+ +T + PISL NVIYKLI+K L+NRLK +L KVIS TQ+ FVP RLI+DN++++FE +HH++N+R G+ G++
Subjt:  IAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL

A0A2N9I9F4 Reverse transcriptase domain-containing protein (e value: 6.7e-38; % identity: 44.81)
Query:  SYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPLNKTL
        +Y++ LF ++    ED+E +L+G+ P +++  N  L+ PF+ +EIE A+K M      G DG+  +F+QSYW++VG D +   L+ LN G     LN T 
Subjt:  SYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPLNKTL

Query:  IAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL
        +  I  TK+P+ +T + PISL NVIYKLI+K L+NRLK +L KVIS TQ+ FVP RLI+DN++++FE +HH++N+R G+ G++
Subjt:  IAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL

A0A5B6UI36 Reverse transcriptase (e value: 1.0e-38; % identity: 45.99)
Query:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG
        M +IA  YF +LFES      D   +L GV P ISES N+ L  PF ++E+  ALK M L    G+DG  ++F+Q YW I+G++T+  C  +LN G+S+ 
Subjt:  MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNL----GLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIG

Query:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG
         +NKTL+  I  T  P  + NF PISL   IYK+I K+++NRLK +LE+ I  +Q+ F+P RLI+DNV+L++E +H L N+R GR+G
Subjt:  PLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRG

A0A5B6VWD5 Reverse transcriptase (e value: 1.1e-37; % identity: 45.99)
Query:  DIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPL
        +IA  YF NLFES      D+  +L GV P ISES N+ L  PF+K EI  ALK M      G +G  ++F++ +W I+G DT+  CLE+LN G S+  +
Subjt:  DIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMN----LGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPL

Query:  NKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL
        N+T++  I     P  + NF PISL  VIYK+I K+++NRLK +LE  I   Q+ FVP RLI+DNV+L++E +H L N+R GR+G +
Subjt:  NKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTL

SwissProt top hits
P08548 LINE-1 reverse transcriptase homolog (e value: 2.1e-12; % identity: 29.44)
Query:  IASSYFKNLFESSTPSKEDMERVLEGV-IPSISESQNRSLSRPFSKAEIEAAL----KSMNLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEG---KSI
        I + Y+K L+     + +++++ LE   +P +S+ +   L+RP S +EI + +    K  + G DG  S F+Q++ E +      +   I  EG    + 
Subjt:  IASSYFKNLFESSTPSKEDMERVLEGV-IPSISESQNRSLSRPFSKAEIEAAL----KSMNLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEG---KSI

Query:  GPLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLN
           N TLI   +  KDP    N+ PISL N+  K++ K L+NR++  ++K+I   Q  F+P      N+  S   I H+N
Subjt:  GPLNKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLN

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 2.1e-20; % identity: 33.91)
Query:  DIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSM----NLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPL
        D A S+++NLF     S +  E + +G +P +SE +   L  P +  E+  AL+ M    + GLDG+   FFQ +W+ +G D  R+  E   +G+     
Subjt:  DIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSM----NLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPL

Query:  NKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIH
         + +++ +    D + + N+ P+SL +  YK++AKA+S RLK +L +VI P Q+  VP R I DNV L  + +H
Subjt:  NKTLIAFIRITKDPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIH

Arabidopsis top hits
AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 2.1e-04; % identity: 38.78)
Query:  RLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTLL
        RLK ++  +I P QA+F+P R+ +DN++   E +H +  R+KG +G +L
Subjt:  RLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTLL
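The % identity figures can be related to the alignment rows. The sketch below uses the short AT4G20520.1 alignment and assumes the usual BLAST-style convention: a letter in the middle row marks an identical residue, a `+` marks a conservative substitution, and identity is the number of identical positions divided by the alignment length. The page does not document how it computes the value, and the helper names are hypothetical.

```python
# Query and middle (match) rows copied from the AT4G20520.1 alignment.
QUERY = "RLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTLL"
MIDLINE = "RLK ++  +I P QA+F+P R+ +DN++   E +H +  R+KG +G +L"


def count_identities(midline):
    """Count positions where the match row shows a letter (identical residue)."""
    return sum(ch.isalpha() for ch in midline)


def count_positives(midline):
    """Count identical positions plus '+' (conservative) positions."""
    return sum(ch.isalpha() or ch == "+" for ch in midline)


def percent_identity(midline, aligned_length):
    """Identity as a percentage of the alignment length (ASSUMED definition)."""
    return 100.0 * count_identities(midline) / aligned_length


aligned_length = len(QUERY)
identity = percent_identity(MIDLINE, aligned_length)
print(count_identities(MIDLINE), aligned_length, round(identity, 2))
```

For this hit the count is 19 identical positions out of 49, which rounds to the 38.78 reported in the record, so the assumed definition at least agrees with this one value.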


Sequences
CDS sequence
ATGGGAGATATTGCTTCTAGCTACTTTAAGAATCTTTTCGAATCTTCTACTCCCTCGAAGGAAGACATGGAAAGGGTGTTAGAAGGAGTCATCCCATCCATTTCTGAATC
TCAAAACAGAAGCCTGAGCAGACCGTTTTCAAAAGCTGAAATTGAAGCAGCTCTTAAGTCTATGAATCTAGGTTTAGATGGGGTACATTCCTTGTTTTTCCAATCTTACT
GGGAGATTGTGGGAGAGGACACTACAAGAATTTGCCTGGAGATCCTCAACGAAGGGAAGAGCATTGGGCCTTTAAACAAAACTCTTATAGCTTTTATCCGTATAACAAAG
GACCCAAAAGAGATGACTAATTTCATACCTATAAGCCTTTACAATGTCATCTACAAACTGATTGCCAAAGCATTATCGAACAGGCTGAAGGGGATCCTTGAAAAAGTCAT
TTCTCCCACTCAGGCTACTTTTGTCCCAAAAAGGCTCATTTCGGACAATGTGATCTTGAGCTTCGAATGTATTCACCATCTGAACAATAGAAGAAAAGGAAGACGGGGTA
CATTGCTATGA
mRNA sequence
ATGGGAGATATTGCTTCTAGCTACTTTAAGAATCTTTTCGAATCTTCTACTCCCTCGAAGGAAGACATGGAAAGGGTGTTAGAAGGAGTCATCCCATCCATTTCTGAATC
TCAAAACAGAAGCCTGAGCAGACCGTTTTCAAAAGCTGAAATTGAAGCAGCTCTTAAGTCTATGAATCTAGGTTTAGATGGGGTACATTCCTTGTTTTTCCAATCTTACT
GGGAGATTGTGGGAGAGGACACTACAAGAATTTGCCTGGAGATCCTCAACGAAGGGAAGAGCATTGGGCCTTTAAACAAAACTCTTATAGCTTTTATCCGTATAACAAAG
GACCCAAAAGAGATGACTAATTTCATACCTATAAGCCTTTACAATGTCATCTACAAACTGATTGCCAAAGCATTATCGAACAGGCTGAAGGGGATCCTTGAAAAAGTCAT
TTCTCCCACTCAGGCTACTTTTGTCCCAAAAAGGCTCATTTCGGACAATGTGATCTTGAGCTTCGAATGTATTCACCATCTGAACAATAGAAGAAAAGGAAGACGGGGTA
CATTGCTATGA
Protein sequence
MGDIASSYFKNLFESSTPSKEDMERVLEGVIPSISESQNRSLSRPFSKAEIEAALKSMNLGLDGVHSLFFQSYWEIVGEDTTRICLEILNEGKSIGPLNKTLIAFIRITK
DPKEMTNFIPISLYNVIYKLIAKALSNRLKGILEKVISPTQATFVPKRLISDNVILSFECIHHLNNRRKGRRGTLL
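The relationship between the CDS and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code, which the record does not state, and uses a hypothetical helper `translate`. To keep it short it translates only the first 48 and last 33 nucleotides of the CDS rather than the whole sequence.

```python
from itertools import product

# Standard genetic code (ASSUMED), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Excerpts copied from the CDS in this record (both start on a codon boundary).
CDS_START = "ATGGGAGATATTGCTTCTAGCTACTTTAAGAATCTTTTCGAATCTTCT"
CDS_END = "AGAAGAAAAGGAAGACGGGGTACATTGCTATGA"


def translate(cds):
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate(CDS_START))
print(translate(CDS_END))
```

The two excerpts translate to the beginning (MGDIASSYFKNLFESS) and the end (RRKGRRGTLL) of the listed protein. The 561 nt CDS also corresponds to 187 codons, that is, the 186 residues shown plus a final TGA stop codon.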