; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022454 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022454
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:29445650..29446252
RNA-Seq ExpressionLag0022454
SyntenyLag0022454
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7129943.1 hypothetical protein RHSIM_Rhsim10G0167400 [Rhododendron simsii]6.0e-3946.2Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        M C SSVT+S L+NG PS   +PSRG RQG PLSPYLFL+C EGFS +L++ E   ++   +     P +SHL FADD+++F +AS KE      ++  Y
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
          ASGQL+N +K A   SKN G  L  +   ++ +P    LG YLG+ + N K+K+ LF  +KDKV  +L  WKEKL +  GKE
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]2.1e-4449.73Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        MNC  SV F+ L+NG P D+F P+RG RQG PLSPYLF++C EG S L+  EE   N+   K N  CP +SHLF+ADD LLFF+AS   C   KG++ +Y
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKA-SGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         KA SGQ +N +K  ++ SKN    +     + L V  T+SLG YLG+ S   +NK  +F  +KD+VWK LQ WK KLFS  G+E
Subjt:  GKA-SGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

XP_030502375.1 uncharacterized protein LOC115717530 [Cannabis sativa]5.5e-4045.11Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        M+C ++ +FS L+NG    +  PSRG RQG PLSPYLFL+C+EGFS LL+ ++ I NL  FK   H P ++HLFFADDSLLF +A+E+ C   K V+ TY
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         KASGQ +N  K     S N     ++   + L +P  +    YLG+ S + ++K  +F+ +K+K+WK +  W EK F A GKE
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

XP_030505385.1 uncharacterized protein LOC115720373 [Cannabis sativa]6.7e-3844.57Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        M+C ++  FS L+NG  +    P RG RQG PLSPYLFL+C EG S LL+RE+ + NL  FK     P +SHL FADDSLL  +A E  C   K V+ TY
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         +ASGQL+N +K     S N     ++   + L +P ++    YLG+ S + ++K  LF+ +K+++WK +Q W EKLFSA G+E
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

XP_030923215.1 uncharacterized protein LOC115950106 [Quercus lobata]6.7e-3846.2Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        M C ++V++S L+NG PS    PSRG RQG P+SPYLFLL TEG   L+ +  +  ++       + P L+HLFFADDSLLF+RAS +ECN  + +++TY
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         +ASGQ +N EK     SKN G  ++     +LGVP  K    YLG+ S   K K    A +KD++W  LQ WKEKL S  G+E
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

TrEMBL top hitse value%identityAlignment
A0A2N9HME5 Reverse transcriptase domain-containing protein6.5e-3948.37Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        MNC +SV++S +VNG P+    P+RG RQG PLSPYLFLLC EG S LL +     +L     +   P ++HLFFADDSLLF +A+ +EC I    +S Y
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         KASGQLVN EK +   S+N    L++    ILGVP  K    YLG+ S   ++K   F ++K+KVWK +  WKEKL S  GKE
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

A0A6J1DUG8 uncharacterized protein LOC1110241351.0e-4449.73Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        MNC  SV F+ L+NG P D+F P+RG RQG PLSPYLF++C EG S L+  EE   N+   K N  CP +SHLF+ADD LLFF+AS   C   KG++ +Y
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKA-SGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         KA SGQ +N +K  ++ SKN    +     + L V  T+SLG YLG+ S   +NK  +F  +KD+VWK LQ WK KLFS  G+E
Subjt:  GKA-SGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

A0A803PIC3 Uncharacterized protein1.5e-4046.74Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        M C S+ +FS L+N     +  PSRG RQG PLSPYLFL+C+EGFS LL  EES+  L  FK     P +SHLFFADD+LLFF+A+E  C   K V+ TY
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         +ASGQ++N +K     S N   G ++   + L +P ++    YLG+ S + ++K  LF+ +KDK+WK +  W EK+FSA G+E
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

A0A803PQ30 Uncharacterized protein2.6e-4045.11Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        M+C ++ +FS L+NG    +  PSRG RQG PLSPYLFL+C+EGFS LL+ ++ I NL  FK   H P ++HLFFADDSLLF +A+E+ C   K V+ TY
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         KASGQ +N  K     S N     ++   + L +P  +    YLG+ S + ++K  +F+ +K+K+WK +  W EK F A GKE
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

A0A803QC75 Uncharacterized protein2.0e-4044.57Show/hide
Query:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY
        M+C ++  FS ++NG    +  PSRG RQG PLSPYLFL+C+EGFS LL+ E+  +NL  FK   H P ++HLFFADDSLLF +A+E+ C   K V+ TY
Subjt:  MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTY

Query:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE
         KASGQ +N +K     S N     ++   + L +P  +    YLG+ S + ++K  +F+ +K+++WK +  W EK+FSA GKE
Subjt:  GKASGQLVNFEKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKE

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein6.3e-0723.56Show/hide
Query:  FSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTYGKA
        +S    +  VNG   +      G RQG PLSPYLF +  E  +  +++++ I  +   K       +     ADD +++    +        +I+++G+ 
Subjt:  FSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTYGKA

Query:  SGQLVNFEK-FAYMTSKNVGRGLEIKCG----------KILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKV--WKTLQL-WKEKL------------
         G  +N  K  A++ +KN     EI+            K LGV  TK + +        +KN   L  ++K+ +  WK L   W  ++            
Subjt:  SGQLVNFEK-FAYMTSKNVGRGLEIKCG----------KILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKV--WKTLQL-WKEKL------------

Query:  --FSAVGKETPNNIFKEIDGICAKF
          F+A+  + P   F E++G   KF
Subjt:  --FSAVGKETPNNIFKEIDGICAKF

P92555 Uncharacterized mitochondrial protein AtMg012508.5e-1247.06Show/hide
Query:  LVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDS
        ++NG+P     PSRG RQG PLSPYLF+LCTE  S L +R +    L   + +++ P ++HL FADD+
Subjt:  LVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDS

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)6.1e-1347.06Show/hide
Query:  LVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDS
        ++NG+P     PSRG RQG PLSPYLF+LCTE  S L +R +    L   + +++ P ++HL FADD+
Subjt:  LVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGCTTCTCTTCAGTTACTTTCTCAGCCCTTGTCAATGGAAGTCCTAGCGATGATTTCAAGCCTAGCAGGGGGTTTAGGCAAGGGGTTCCTCTATCCCCTTACCT
ATTTCTGTTATGCACTGAAGGTTTCTCTACTCTCCTTAAAAGGGAAGAATCCATCTCAAATCTTTTTAGCTTTAAAACTAATCATCACTGCCCTTCCTTATCTCATTTGT
TTTTCGCTGATGATAGTCTTCTTTTTTTCAGGGCCTCCGAGAAGGAATGCAACATCACTAAAGGAGTGATATCTACATATGGGAAAGCTTCTGGACAGCTTGTAAATTTT
GAAAAATTTGCCTACATGACTAGTAAAAACGTGGGGAGAGGGCTTGAGATCAAGTGTGGCAAGATCTTAGGAGTCCCGTGGACCAAAAGCTTGGGGAATTACCTAGGGAT
GTCCTCCTCAAACAACAAAAACAAGAGCCACCTTTTTGCCAAGGTCAAGGACAAAGTCTGGAAAACTCTCCAACTGTGGAAGGAGAAACTTTTCTCTGCGGTGGGAAAGG
AAACTCCGAACAACATTTTCAAGGAAATTGATGGAATTTGCGCGAAATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTGCTTCTCTTCAGTTACTTTCTCAGCCCTTGTCAATGGAAGTCCTAGCGATGATTTCAAGCCTAGCAGGGGGTTTAGGCAAGGGGTTCCTCTATCCCCTTACCT
ATTTCTGTTATGCACTGAAGGTTTCTCTACTCTCCTTAAAAGGGAAGAATCCATCTCAAATCTTTTTAGCTTTAAAACTAATCATCACTGCCCTTCCTTATCTCATTTGT
TTTTCGCTGATGATAGTCTTCTTTTTTTCAGGGCCTCCGAGAAGGAATGCAACATCACTAAAGGAGTGATATCTACATATGGGAAAGCTTCTGGACAGCTTGTAAATTTT
GAAAAATTTGCCTACATGACTAGTAAAAACGTGGGGAGAGGGCTTGAGATCAAGTGTGGCAAGATCTTAGGAGTCCCGTGGACCAAAAGCTTGGGGAATTACCTAGGGAT
GTCCTCCTCAAACAACAAAAACAAGAGCCACCTTTTTGCCAAGGTCAAGGACAAAGTCTGGAAAACTCTCCAACTGTGGAAGGAGAAACTTTTCTCTGCGGTGGGAAAGG
AAACTCCGAACAACATTTTCAAGGAAATTGATGGAATTTGCGCGAAATTTTAA
Protein sequenceShow/hide protein sequence
MNCFSSVTFSALVNGSPSDDFKPSRGFRQGVPLSPYLFLLCTEGFSTLLKREESISNLFSFKTNHHCPSLSHLFFADDSLLFFRASEKECNITKGVISTYGKASGQLVNF
EKFAYMTSKNVGRGLEIKCGKILGVPWTKSLGNYLGMSSSNNKNKSHLFAKVKDKVWKTLQLWKEKLFSAVGKETPNNIFKEIDGICAKF