; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025669 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025669
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:17436785..17439069
RNA-Seq ExpressionLag0025669
SyntenyLag0025669
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABA94593.2 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]8.0e-2634.29Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-
        +  +R ISLCNVIYK+++K L N L+  L +++S   SAFV + LI+DN  + FEC H +   +         KLD+SK  +RV+W +L + MHKM    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-

Query:  -FFKATEGEGSRIKSVLETYEKATA--PLGS--------NPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSV
         +      EG   K   ++ ++     P GS        N    WR I  G +L KKG+ WR+GNG  I + +DPW+ ++ +  PI  KG     ++  +
Subjt:  -FFKATEGEGSRIKSVLETYEKATA--PLGS--------NPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSV

Query:  LVLQGRWKLN
        L   G W+++
Subjt:  LVLQGRWKLN

EEC84864.1 hypothetical protein OsI_31997 [Oryza sativa Indica Group]2.6e-2431.75Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-
        + +FR ISLCNVIYKI++K L N L+ +L D+ISQ  SAFV   +I+DN  L FEC H++   +         KLD+SK  DRV+W +L + M+K+    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-

Query:  --------------FFKATEGEGSRIKS---------VLETYEKATAP----------------------LGSNPFAKWRSIVWGRKLFKKGMRWRIGNG
                        KA +  G   K            + +     P                         N    WR+I +G +L KKG  WR+GNG
Subjt:  --------------FFKATEGEGSRIKS---------VLETYEKATAP----------------------LGSNPFAKWRSIVWGRKLFKKGMRWRIGNG

Query:  NRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSVLVLQGRWKLNTNATWKDN
          + I +DPWI ++ +  PI  KG     ++  +L L G W +      KDN
Subjt:  NRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSVLVLQGRWKLNTNATWKDN

KAA3469681.1 reverse transcriptase [Gossypium australe]1.8e-2542.93Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDL--
        +SQFR ISLCNVIYK+IAKV+AN L  V+   I    SAFV   LI+DNV L +E +HTL +++ GK GL  +KLDMSK  DRVEW ++ K    +    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDL--

Query:  -----VFFKATEG-------EGSRIKSVLETY----EKATAPLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWI-GKNGNNV
             V   A +G       +   +K +   Y      A A LG+ P   WRS+  G+ L +KGM WR+G G++I I  D WI GK  + V
Subjt:  -----VFFKATEG-------EGSRIKSVLETY----EKATAPLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWI-GKNGNNV

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]2.1e-2632.04Show/hide
Query:  MSQFRLISL----CNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKM
        M  F  ISL    CNVIYKII+KVLAN LK+VL ++IS T SAFV   LI+DN  +GFECIH+L  R KGK G   LKLDM K  DRVEW YLR V    
Subjt:  MSQFRLISL----CNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKM

Query:  DLVFFKATE-----------------GEGSRIKSVLETYEKAT---------------------------------------------------------
          +F    +                 G+   IK+V +     T                                                         
Subjt:  DLVFFKATE-----------------GEGSRIKSVLETYEKAT---------------------------------------------------------

Query:  ------------------------------------APLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWE--KGRNY-IP
                                            A LG+ P   WRSI+WGR LFKKG RW++GNG  I +  DPW+ ++GN  P++     RN+ + 
Subjt:  ------------------------------------APLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWE--KGRNY-IP

Query:  SVLVLQGRW
         ++  +GRW
Subjt:  SVLVLQGRW

XP_030958840.1 uncharacterized protein LOC115980760 [Quercus lobata]8.0e-2644.31Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-
        +S++R ISLCNV+YKI +KVLAN LKK + ++I++  SAF  + LISDNV + FE +H + +   GKTG   +KLDMSK  DRVEW YL+K+M  M    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-

Query:  -FFKATEGEGSRIKSVLETYEKATAPLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWI
          FKA         S +E  E  T          W+SI+ GR++ K G R+++GNG  I I Q  W+
Subjt:  -FFKATEGEGSRIKSVLETYEKATAPLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWI

TrEMBL top hitse value%identityAlignment
A0A0A9DL65 Reverse transcriptase domain-containing protein1.2e-2437.78Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDL--
        M+ FR ISLCNVIYKII+KVLAN LK VL +IIS+   AFV   LI+DNV + +E  H++ +RRKG+ GL  +KLDM+K  DRVEW +L  +M K+    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDL--

Query:  ----------------VFFKATEGEGSRIK----------------------SVLETYEKATAPL-------GSNPFAKWRSIVWGRKLFKKGMRWRIGN
                        V+F  T    + I                       S L +Y +A   L       GS+    W+SI+ GR   K G  WRIGN
Subjt:  ----------------VFFKATEGEGSRIK----------------------SVLETYEKATAPL-------GSNPFAKWRSIVWGRKLFKKGMRWRIGN

Query:  GNRIIIDQDPWIGKNGNNVPIWEKG
        G+ + I  D WI  + N   +  +G
Subjt:  GNRIIIDQDPWIGKNGNNVPIWEKG

A0A5B6VKR7 Reverse transcriptase8.6e-2642.93Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDL--
        +SQFR ISLCNVIYK+IAKV+AN L  V+   I    SAFV   LI+DNV L +E +HTL +++ GK GL  +KLDMSK  DRVEW ++ K    +    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDL--

Query:  -----VFFKATEG-------EGSRIKSVLETY----EKATAPLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWI-GKNGNNV
             V   A +G       +   +K +   Y      A A LG+ P   WRS+  G+ L +KGM WR+G G++I I  D WI GK  + V
Subjt:  -----VFFKATEG-------EGSRIKSVLETY----EKATAPLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWI-GKNGNNV

A0A6J1DRA0 uncharacterized protein LOC1110224231.0e-2632.04Show/hide
Query:  MSQFRLISL----CNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKM
        M  F  ISL    CNVIYKII+KVLAN LK+VL ++IS T SAFV   LI+DN  +GFECIH+L  R KGK G   LKLDM K  DRVEW YLR V    
Subjt:  MSQFRLISL----CNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKM

Query:  DLVFFKATE-----------------GEGSRIKSVLETYEKAT---------------------------------------------------------
          +F    +                 G+   IK+V +     T                                                         
Subjt:  DLVFFKATE-----------------GEGSRIKSVLETYEKAT---------------------------------------------------------

Query:  ------------------------------------APLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWE--KGRNY-IP
                                            A LG+ P   WRSI+WGR LFKKG RW++GNG  I +  DPW+ ++GN  P++     RN+ + 
Subjt:  ------------------------------------APLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWE--KGRNY-IP

Query:  SVLVLQGRW
         ++  +GRW
Subjt:  SVLVLQGRW

B8BDD1 Post-SET domain-containing protein1.2e-2431.75Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-
        + +FR ISLCNVIYKI++K L N L+ +L D+ISQ  SAFV   +I+DN  L FEC H++   +         KLD+SK  DRV+W +L + M+K+    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-

Query:  --------------FFKATEGEGSRIKS---------VLETYEKATAP----------------------LGSNPFAKWRSIVWGRKLFKKGMRWRIGNG
                        KA +  G   K            + +     P                         N    WR+I +G +L KKG  WR+GNG
Subjt:  --------------FFKATEGEGSRIKS---------VLETYEKATAP----------------------LGSNPFAKWRSIVWGRKLFKKGMRWRIGNG

Query:  NRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSVLVLQGRWKLNTNATWKDN
          + I +DPWI ++ +  PI  KG     ++  +L L G W +      KDN
Subjt:  NRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSVLVLQGRWKLNTNATWKDN

Q2R1K6 Retrotransposon protein, putative, unclassified3.9e-2634.29Show/hide
Query:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-
        +  +R ISLCNVIYK+++K L N L+  L +++S   SAFV + LI+DN  + FEC H +   +         KLD+SK  +RV+W +L + MHKM    
Subjt:  MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLV-

Query:  -FFKATEGEGSRIKSVLETYEKATA--PLGS--------NPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSV
         +      EG   K   ++ ++     P GS        N    WR I  G +L KKG+ WR+GNG  I + +DPW+ ++ +  PI  KG     ++  +
Subjt:  -FFKATEGEGSRIKSVLETYEKATA--PLGS--------NPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWEKGR---NYIPSV

Query:  LVLQGRWKLN
        L   G W+++
Subjt:  LVLQGRWKLN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.7e-0531.88Show/hide
Query:  LKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVM
        LK ++ ++I    ++F+   + +DN+    E +H++  R+KG  G  +LKLD+ K  DR+ W YL   +
Subjt:  LKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAATTCAGGCTAATTAGCCTTTGCAATGTAATATACAAAATCATTGCAAAGGTCTTGGCAAACATATTGAAAAAAGTCCTAGGAGATATCATATCTCAGACCCC
ATCAGCTTTTGTTTCTGACATGCTTATTTCAGATAACGTCAATCTGGGCTTTGAATGCATTCATACTCTAAACAGCAGAAGAAAGGGTAAAACAGGGCTAGGCGTGCTAA
AGCTCGACATGAGCAAGACCGACGATAGAGTAGAGTGGATCTACCTCAGGAAAGTGATGCACAAAATGGATTTGGTGTTCTTCAAAGCTACCGAAGGTGAAGGGAGTCGA
ATCAAAAGTGTGCTAGAGACATACGAAAAAGCGACAGCCCCTTTGGGATCAAATCCATTTGCAAAGTGGAGAAGCATAGTATGGGGGAGAAAGTTGTTTAAGAAGGGCAT
GAGATGGAGAATAGGCAATGGGAACAGAATCATTATTGACCAAGACCCCTGGATAGGAAAAAATGGCAACAACGTCCCAATCTGGGAGAAAGGAAGAAACTATATCCCAT
CTGTTCTGGTTTTGCAAGGTAGATGGAAACTAAACACAAATGCGACCTGGAAAGATAACCTCAATCTAGGCCGGATTGGCTGCGTCATTCGTGACTCTCCAGGTTCTCTG
ATTGTTGTTGGAGGTAATCAAATGGAGAGAAATTGGTCAATCAAAAGCATTGAAGGAATGGCAATCCTTGAAGGGCCAAAGGCTTACCGGAAGCTCGTCTCTAGCACTGC
CGTTGGCACTGATTTTTTGCTCGATGTCGAATCTGATCATTTGAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAATTCAGGCTAATTAGCCTTTGCAATGTAATATACAAAATCATTGCAAAGGTCTTGGCAAACATATTGAAAAAAGTCCTAGGAGATATCATATCTCAGACCCC
ATCAGCTTTTGTTTCTGACATGCTTATTTCAGATAACGTCAATCTGGGCTTTGAATGCATTCATACTCTAAACAGCAGAAGAAAGGGTAAAACAGGGCTAGGCGTGCTAA
AGCTCGACATGAGCAAGACCGACGATAGAGTAGAGTGGATCTACCTCAGGAAAGTGATGCACAAAATGGATTTGGTGTTCTTCAAAGCTACCGAAGGTGAAGGGAGTCGA
ATCAAAAGTGTGCTAGAGACATACGAAAAAGCGACAGCCCCTTTGGGATCAAATCCATTTGCAAAGTGGAGAAGCATAGTATGGGGGAGAAAGTTGTTTAAGAAGGGCAT
GAGATGGAGAATAGGCAATGGGAACAGAATCATTATTGACCAAGACCCCTGGATAGGAAAAAATGGCAACAACGTCCCAATCTGGGAGAAAGGAAGAAACTATATCCCAT
CTGTTCTGGTTTTGCAAGGTAGATGGAAACTAAACACAAATGCGACCTGGAAAGATAACCTCAATCTAGGCCGGATTGGCTGCGTCATTCGTGACTCTCCAGGTTCTCTG
ATTGTTGTTGGAGGTAATCAAATGGAGAGAAATTGGTCAATCAAAAGCATTGAAGGAATGGCAATCCTTGAAGGGCCAAAGGCTTACCGGAAGCTCGTCTCTAGCACTGC
CGTTGGCACTGATTTTTTGCTCGATGTCGAATCTGATCATTTGAGCTAA
Protein sequenceShow/hide protein sequence
MSQFRLISLCNVIYKIIAKVLANILKKVLGDIISQTPSAFVSDMLISDNVNLGFECIHTLNSRRKGKTGLGVLKLDMSKTDDRVEWIYLRKVMHKMDLVFFKATEGEGSR
IKSVLETYEKATAPLGSNPFAKWRSIVWGRKLFKKGMRWRIGNGNRIIIDQDPWIGKNGNNVPIWEKGRNYIPSVLVLQGRWKLNTNATWKDNLNLGRIGCVIRDSPGSL
IVVGGNQMERNWSIKSIEGMAILEGPKAYRKLVSSTAVGTDFLLDVESDHLS