CuGenDBv2

Lag0021725 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021725
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:11193661..11194850
RNA-Seq Expression: Lag0021725
Synteny: Lag0021725
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAF7116955.1 hypothetical protein RHSIM_RhsimUnG0010200 [Rhododendron simsii] | e value: 5.1e-46 | % identity: 55.62
Query:  SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGI
        S  Q    G R IT+NV+++ E  H I  KRSG +G    KLDM KAYDRVEW YL++VM  +GF   W   VM CI +  FSI VN    E F PSRGI
Subjt:  SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGI

Query:  RQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        RQG  LSPYLFLLC EGFS L     S   L GFRIN HCP ISHL FADDS+IFCRA+++DC  + ++++LY  ASG
Subjt:  RQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

KAF7127078.1 hypothetical protein RHSIM_Rhsim11G0146500 [Rhododendron simsii] | e value: 1.3e-46 | % identity: 53.37
Query:  SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGI
        S  Q    G R IT+NV+++ E  H I  KRSG +G    KLDM KAYDRV+W YL++VM  +GF  +W   VM C+ +  FSI VN    E F PSRGI
Subjt:  SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGI

Query:  RQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        RQG  LSPYLFLLC EGF+ L N   S   LYGFRIN+HCP +SHL FADDS+++C+A E+DC  +  ++ LY  ASG
Subjt:  RQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

KAF7129689.1 hypothetical protein RHSIM_Rhsim10G0093500 [Rhododendron simsii] | e value: 1.3e-46 | % identity: 53.37
Query:  SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGI
        S  Q    G R IT+NV+++ E  H I  KRSG +G    KLDM KAYDRV+W YL++VM  +GF  +W   VM C+ +  FSI VN    E F PSRGI
Subjt:  SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGI

Query:  RQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        RQG  LSPYLFLLC EGF+ L N   S   LYGFRIN+HCP +SHL FADDS+++C+A E+DC  +  ++ LY  ASG
Subjt:  RQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa] | e value: 1.5e-45 | % identity: 50.3
Query:  RRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPY
        +RLIT+NV+++FE +H++  ++ G +G+  +KLDM+KA+DRVEW Y+ ++M  MGF  +  N ++RC+ S  +S L+N  +  + KPSRGIRQGD LSPY
Subjt:  RRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPY

Query:  LFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        LFL+CAEG S L   +E+  +LYG R+++  PS+SHLFFADDS++FCRAN +  R I++V+++Y +ASG
Subjt:  LFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

XP_030931068.1 uncharacterized protein LOC115956947 [Quercus lobata] | e value: 3.9e-46 | % identity: 45.87
Query:  PKKRGPQDLPE-----LIDLRHSPPSRLAISRDGKI--TLKSNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKV
        PKK+ P +L +     L ++ +   S+   +R  KI   L S  Q      RLIT+N++++FE +H +  KR GK GF+ +KLDM+KAYDRVEW +LDK+
Subjt:  PKKRGPQDLPE-----LIDLRHSPPSRLAISRDGKI--TLKSNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKV

Query:  MDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRAN
        ++I+GF   WRN V  C  S  FS+++N      F PSRG+RQGD LSPYLFLLCAEG  +L  +      L G  + K  P I+HLFFADDSL+FCRA 
Subjt:  MDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRAN

Query:  EKDCRIIKKVINLYGKAS
        E DC+ +  ++ +Y +AS
Subjt:  EKDCRIIKKVINLYGKAS

TrEMBL top hits (e value, % identity, alignment)
A0A803P996 Uncharacterized protein | e value: 3.2e-46 | % identity: 44.55
Query:  PKKRGPQDLPE-----LIDLRHSPPSRLAISRDGKITLK---SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDK
        PK + PQ + E     L ++     +++ +SR  KI L    S  Q      RLIT+NV+++FE +H I  K +G+ G    KLDM+KA+DRVEW ++++
Subjt:  PKKRGPQDLPE-----LIDLRHSPPSRLAISRDGKITLK---SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDK

Query:  VMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRA
        VM  MGF+  W + +M C+ +  FS ++N  +  +  PSRG++QG  LSPYLFL+C+EGFS L   E+   NL GF++ +H P I+HLFFADDSL+FC+A
Subjt:  VMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRA

Query:  NEKDCRIIKKVINLYGKASG
        NE+ C  IK+V++ Y KASG
Subjt:  NEKDCRIIKKVINLYGKASG

A0A803PNH7 Uncharacterized protein | e value: 1.6e-45 | % identity: 50.6
Query:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYL
        RLIT+NV+++FE ++ I  K SG++G   +KLDM+KA+DRVEW +++KVM  MGF+  W   +M C+ +  FS ++N  ++    PSRG+RQG  LSPYL
Subjt:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYL

Query:  FLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        FL+C+E  S L   EE + +L GF++ +H PSISHLFFADDSL+FC+ANE  C  +K+ +++Y KASG
Subjt:  FLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

A0A803QAN3 Uncharacterized protein | e value: 1.2e-45 | % identity: 51.19
Query:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYL
        RLIT+N++++FE IH +  K  G++GF  +KLDM+KA+DRVEW YL+ VM  MGF+  W   +M CI ++ FS  +N  +    +PSRG+RQGD LSPYL
Subjt:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYL

Query:  FLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        FL+C+EG S L   EE++ NL G R+ +H PS+SHL FADDSL+FCRAN +    I++ ++ Y +ASG
Subjt:  FLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

A0A803QC75 Uncharacterized protein | e value: 1.9e-46 | % identity: 44.55
Query:  PKKRGPQDLPE-----LIDLRHSPPSRLAISRDGKITLK---SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDK
        PK + PQ + E     L ++     +++ +SR  K+ L    S  Q      RLIT+NV+++FE +H I  K +G+ G    KLDM+KA+DRVEW ++++
Subjt:  PKKRGPQDLPE-----LIDLRHSPPSRLAISRDGKITLK---SNRQCPNEGRRLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDK

Query:  VMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRA
        VM  MGF+  W + +M C+ +  FS ++N  +     PSRG+RQG  LSPYLFL+C+EGFS L   E+  +NL GF++ +H P I+HLFFADDSL+FC+A
Subjt:  VMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRA

Query:  NEKDCRIIKKVINLYGKASG
        NE+ C  IK+V++ Y KASG
Subjt:  NEKDCRIIKKVINLYGKASG

A0A803QCS5 Uncharacterized protein | e value: 1.4e-46 | % identity: 50.6
Query:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYL
        RLIT+N++++FE +H I  K +G+ G   +KLDM+KA+DRVEW ++ +VM  MGF+  W N +M C+ +  F+ L+N  ++    PSRG+RQG  LSPYL
Subjt:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYL

Query:  FLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        FL+C+EG S L ++E+S+ NL GF++ +H P ISHL FADDSL+FC+ANE  C  IK+V+++Y +ASG
Subjt:  FLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

SwissProt top hits (e value, % identity, alignment)
P0CV25 Secreted RxLR effector protein 78 | e value: 1.0e-04 | % identity: 37.97
Query:  IVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAE
        +++ LD  KAYD V   +L  V+    FS M+   + +         LVN  LSE  +   GIRQG  L+P LF+L AE
Subjt:  IVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAE

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 1.9e-11 | % identity: 28.22
Query:  NVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCA
        N+  S   IH IN  +   +  +++ LD  KA+D+++  ++ KV++  G  G + N +         +I VN    E      G RQG  LSPYLF +  
Subjt:  NVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCA

Query:  EGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG
        E  +    +++ I    G +I K    IS L  ADD +++    +   R +  +IN +G+  G
Subjt:  EGFSTLQNREESISNLYGFRINKHCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG

P92555 Uncharacterized mitochondrial protein AtMg01250 | e value: 7.2e-11 | % identity: 54.39
Query:  PSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDS
        PSRG+RQGD LSPYLF+LC E  S L  R +    L G R++ + P I+HL FADD+
Subjt:  PSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) | e value: 2.2e-07 | % identity: 29.23
Query:  HTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNR
        H I  +R   + + V+ LD+ KA+D V    + + M   G     ++ +M  I  A+ +I+V    + +     G++QGD LSP LF +  +   T  N 
Subjt:  HTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNR

Query:  EESISNLYGFRINKHCPSISHLFFADDSLI
        E+      G  +   C  I+ L FADD L+
Subjt:  EESISNLYGFRINKHCPSISHLFFADDSLI

Arabidopsis top hits (e value, % identity, alignment)
AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 3.1e-09 | % identity: 37.14
Query:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSA
        R+ T+N+V   E +H++  K+ G +G++++KLD+ KAYDR+ W YL+  +   GF  +W  ++ R  F A
Subjt:  RLITNNVVMSFECIHTINTKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSA

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 5.1e-12 | % identity: 54.39
Query:  PSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDS
        PSRG+RQGD LSPYLF+LC E  S L  R +    L G R++ + P I+HL FADD+
Subjt:  PSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINKHCPSISHLFFADDS


Sequences
CDS sequence
ATGAAGGGGAAAGCCCGACCTCGAGATCGACGGTCTTCATTCCGTTCCATTTTAGATCTGAGAGATGCGAGCCATTTGTTCTGTTTCATTCCAGATTTAACGAGATGCAA
ACCCTTTGTTCCATCCCAATTCCGACGAAACCGACCTAAAAAACGAGGACCTCAAGATCTGCCCGAACTCATTGACCTTCGTCACTCTCCACCGAGTCGTCTCGCCATCT
CACGTGATGGAAAGATCACCCTCAAGTCCAACAGACAATGCCCAAATGAAGGAAGGAGGCTTATCACAAACAATGTGGTCATGAGTTTCGAGTGCATCCACACCATCAAC
ACTAAGAGATCAGGAAAAGAGGGATTCATTGTTATGAAGCTCGACATGAACAAAGCCTACGACAGAGTAGAATGGTACTACCTAGACAAGGTGATGGACATAATGGGCTT
CAGTGGAATGTGGAGGAATAAGGTCATGAGATGCATCTTTTCAGCATTTTTCTCAATATTAGTCAATGACAATCTGAGCGAGGAGTTTAAACCAAGTAGGGGGATCAGGC
AAGGGGATCTGTTGTCCCCATACCTGTTCCTCCTATGTGCTGAAGGCTTCTCGACCCTTCAAAACAGGGAAGAATCCATTTCTAACCTTTATGGCTTTAGAATCAACAAA
CATTGCCCCTCTATATCTCACCTATTTTTTGCTGATGATAGTCTCATTTTTTGCAGGGCCAATGAGAAGGATTGTAGGATTATCAAGAAGGTGATCAATTTATATGGCAA
AGCATCAGGCTAA
mRNA sequence
ATGAAGGGGAAAGCCCGACCTCGAGATCGACGGTCTTCATTCCGTTCCATTTTAGATCTGAGAGATGCGAGCCATTTGTTCTGTTTCATTCCAGATTTAACGAGATGCAA
ACCCTTTGTTCCATCCCAATTCCGACGAAACCGACCTAAAAAACGAGGACCTCAAGATCTGCCCGAACTCATTGACCTTCGTCACTCTCCACCGAGTCGTCTCGCCATCT
CACGTGATGGAAAGATCACCCTCAAGTCCAACAGACAATGCCCAAATGAAGGAAGGAGGCTTATCACAAACAATGTGGTCATGAGTTTCGAGTGCATCCACACCATCAAC
ACTAAGAGATCAGGAAAAGAGGGATTCATTGTTATGAAGCTCGACATGAACAAAGCCTACGACAGAGTAGAATGGTACTACCTAGACAAGGTGATGGACATAATGGGCTT
CAGTGGAATGTGGAGGAATAAGGTCATGAGATGCATCTTTTCAGCATTTTTCTCAATATTAGTCAATGACAATCTGAGCGAGGAGTTTAAACCAAGTAGGGGGATCAGGC
AAGGGGATCTGTTGTCCCCATACCTGTTCCTCCTATGTGCTGAAGGCTTCTCGACCCTTCAAAACAGGGAAGAATCCATTTCTAACCTTTATGGCTTTAGAATCAACAAA
CATTGCCCCTCTATATCTCACCTATTTTTTGCTGATGATAGTCTCATTTTTTGCAGGGCCAATGAGAAGGATTGTAGGATTATCAAGAAGGTGATCAATTTATATGGCAA
AGCATCAGGCTAA
Protein sequence
MKGKARPRDRRSSFRSILDLRDASHLFCFIPDLTRCKPFVPSQFRRNRPKKRGPQDLPELIDLRHSPPSRLAISRDGKITLKSNRQCPNEGRRLITNNVVMSFECIHTIN
TKRSGKEGFIVMKLDMNKAYDRVEWYYLDKVMDIMGFSGMWRNKVMRCIFSAFFSILVNDNLSEEFKPSRGIRQGDLLSPYLFLLCAEGFSTLQNREESISNLYGFRINK
HCPSISHLFFADDSLIFCRANEKDCRIIKKVINLYGKASG