; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006144 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006144
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr6:38191485..38192161
RNA-Seq ExpressionLag0006144
SyntenyLag0006144
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017979793.1 PREDICTED: uncharacterized protein LOC18594299 [Theobroma cacao]5.6e-0730.77Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDNKKWRFTSLYGHPDSPNVIIHGNFYVVFTTTMILCGLLVGISMKFCGVMRNQVALSGIVINFRLSERFY
        SGGL ++W+   DV + ++ RYH+D  I     KWRF   YGHP +       +      +   L  L +G   +     + +  L+G     R  ER  
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDNKKWRFTSLYGHPDSPNVIIHGNFYVVFTTTMILCGLLVGISMKFCGVMRNQVALSGIVINFRLSERFY

Query:  TIVIYRIWASLETFLLG
         IV+Y IW  L+  +LG
Subjt:  TIVIYRIWASLETFLLG

XP_021751647.1 uncharacterized protein LOC110717303 [Chenopodium quinoa]4.3e-0751.02Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPN
        SGG+ LLWKD +DV+I++F   H+DA+++W  N +WRFT +YGH +  N
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPN

XP_030936665.1 uncharacterized protein LOC115961908 [Quercus lobata]1.5e-0727.83Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDN--KKWRFTSLYGHPDS------------------PNVIIHGNFYVV-------------------FTT
        SGGL +LWK+  DV++++    H+D  +   N  + WR T  YGHPDS                     I+ G+F  +                   F  
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDN--KKWRFTSLYGHPDS------------------PNVIIHGNFYVV-------------------FTT

Query:  TMILCGLL----VGISMKFC----GVMRNQVALSGIVINFRLSERFYTIVIY-RIWASLETFLLGVTEEMRGHNLPNHLARGKHRSSFKFEEWWTHHEEC
         +  CGL+    VG    +C    G  R  V L  +V N      F    +Y R  A+ +  LL ++  MRG   P  + + +    F FEE WT  E C
Subjt:  TMILCGLL----VGISMKFC----GVMRNQVALSGIVINFRLSERFYTIVIY-RIWASLETFLLGVTEEMRGHNLPNHLARGKHRSSFKFEEWWTHHEEC

Query:  RRLIINSGLWEP
        R +I     W+P
Subjt:  RRLIINSGLWEP

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]7.3e-0757.45Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDN-KKWRFTSLYGHPDS
        SGGL LLW  D+ V++R+F +YH+D +IK D+   WRFT LYGHPD+
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDN-KKWRFTSLYGHPDS

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]4.3e-0757.45Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDN-KKWRFTSLYGHPDS
        SGGL LLW  D+ V++R+F +YH+D +IK D+   WRFT LYGHPD+
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDN-KKWRFTSLYGHPDS

TrEMBL top hitse value%identityAlignment
A0A2N9GPL3 Uncharacterized protein4.6e-0744.44Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPNVIIHGN---FYVVFTTTMILCGLLVGISMKFC
        SGGL LLW D   + I+NF + HVD++++  D  KWRFT  YG      VI +GN    ++ FT  ++L G   GI MKFC
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPNVIIHGN---FYVVFTTTMILCGLLVGISMKFC

A0A2N9I611 Uncharacterized protein2.1e-0727.84Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDNKKWRFTSLYGHPDSPNVIIHGNFYVVFTTTMILCGLLVGISMKFCGVMRNQVALSGIVIN--------
        SGGL LLW +DVD+ I ++ R+H+DA IK     WRFT  YGHP++      G++ ++     I    L  + M     + +    +G+  N        
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDNKKWRFTSLYGHPDSPNVIIHGNFYVVFTTTMILCGLLVGISMKFCGVMRNQVALSGIVIN--------

Query:  --FRLSERFYTIVIYRIWASL--ETFLLGVTEEMRGHN------LPNHLARGKHRSSFKFEEWWTHHEECRRLIIN
           RL   F  +    +W  +     ++ +      H       L N   R +    F+FE+ WT HEEC ++I++
Subjt:  --FRLSERFYTIVIYRIWASL--ETFLLGVTEEMRGHN------LPNHLARGKHRSSFKFEEWWTHHEECRRLIIN

A0A5B6V9C1 Reverse transcriptase3.5e-0729.94Show/hide
Query:  GLCLLWKDDVDVQIRNFLRYHVDAYIKWDN--KKWRFTSLYGHPDSPNVIIHGNFYVVFTTTMILCGLLVGISMKFCGVMRNQVALSGIVINFRLSERFY
        GLCL WK +V V++R F + ++D  I  +N   KWRFT  YG P   +     N +       I   +   +  K  G +  +   S         ERFY
Subjt:  GLCLLWKDDVDVQIRNFLRYHVDAYIKWDN--KKWRFTSLYGHPDSPNVIIHGNFYVVFTTTMILCGLLVGISMKFCGVMRNQVALSGIVINFRLSERFY

Query:  TIVIYRIWASLE-------------TFLLGVT----EEMRGHNLPNHLARGKHRSSFKFEEWWTHHE
         IV+  IW SLE             T++ G+T     E+R         +G   ++F+FE WWT  E
Subjt:  TIVIYRIWASLE-------------TFLLGVT----EEMRGHNLPNHLARGKHRSSFKFEEWWTHHE

A0A803N3V6 Uncharacterized protein2.1e-0751.02Show/hide
Query:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPN
        SGG+ LLWKD +DV+I++F   H+DA+++W  N +WRFT +YGH +  N
Subjt:  SGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPN

A0A803NM27 Uncharacterized protein1.6e-0726.61Show/hide
Query:  MSGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPN----------------------------VIIHG----------NFYVVFT
        + GGL LLW DDV+V + +    + D YI + D  +W F+++YG P++ N                            +  H           N  + F 
Subjt:  MSGGLCLLWKDDVDVQIRNFLRYHVDAYIKW-DNKKWRFTSLYGHPDSPN----------------------------VIIHG----------NFYVVFT

Query:  TTMILCGLL----VGISMKFCGVMRN----QVALSGIVINFRLSERFYTIVIYRI--WASLETFLLGVTEEMRGHNLPNHLARGKHRSSFKFEEWWTHHE
        TT+ LC LL     G    +C   RN    Q  L  + IN   ++RF T  +  +  + S    +L         NL + ++  K+RS F+FE+ W    
Subjt:  TTMILCGLL----VGISMKFCGVMRN----QVALSGIVINFRLSERFYTIVIYRI--WASLETFLLGVTEEMRGHNLPNHLARGKHRSSFKFEEWWTHHE

Query:  ECRRLIINSGLWEPGRAN
        EC  +I  S  W    AN
Subjt:  ECRRLIINSGLWEPGRAN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGTGGTTTATGTTTACTGTGGAAAGATGATGTGGATGTCCAAATTCGAAATTTTTTGCGTTACCATGTGGATGCATATATTAAATGGGATAATAAAAAATGGAG
ATTTACGAGTCTTTATGGTCATCCTGATTCACCCAACGTCATCATACATGGAAACTTTTACGTCGTCTTTACAACTACGATGATTCTCTGTGGCTTATTGGTGGGGATCT
CAATGAAATTTTGTGGAGTAATGAGAAATCAAGTGGCTCTGAGTGGGATAGTAATCAACTTTCGACTTTCAGAGAGGTTTTACACGATTGTCATTTACAGGATATGGGCT
TCTCTGGAAACATTTTTACTTGGTGTAACAGAAGAGATGCGGGGGCACAATCTTCCGAATCATCTTGCTCGAGGAAAGCATCGTTCATCATTCAAGTTTGAAGAATGGTG
GACACATCACGAGGAATGTAGAAGACTCATTATAAATTCTGGACTATGGGAGCCTGGTAGGGCAAATGAATGTTCTCTTGGGAGAATCTTCAATGTTGTGCCTCTGCTCT
AG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGTGGTTTATGTTTACTGTGGAAAGATGATGTGGATGTCCAAATTCGAAATTTTTTGCGTTACCATGTGGATGCATATATTAAATGGGATAATAAAAAATGGAG
ATTTACGAGTCTTTATGGTCATCCTGATTCACCCAACGTCATCATACATGGAAACTTTTACGTCGTCTTTACAACTACGATGATTCTCTGTGGCTTATTGGTGGGGATCT
CAATGAAATTTTGTGGAGTAATGAGAAATCAAGTGGCTCTGAGTGGGATAGTAATCAACTTTCGACTTTCAGAGAGGTTTTACACGATTGTCATTTACAGGATATGGGCT
TCTCTGGAAACATTTTTACTTGGTGTAACAGAAGAGATGCGGGGGCACAATCTTCCGAATCATCTTGCTCGAGGAAAGCATCGTTCATCATTCAAGTTTGAAGAATGGTG
GACACATCACGAGGAATGTAGAAGACTCATTATAAATTCTGGACTATGGGAGCCTGGTAGGGCAAATGAATGTTCTCTTGGGAGAATCTTCAATGTTGTGCCTCTGCTCT
AG
Protein sequenceShow/hide protein sequence
MSGGLCLLWKDDVDVQIRNFLRYHVDAYIKWDNKKWRFTSLYGHPDSPNVIIHGNFYVVFTTTMILCGLLVGISMKFCGVMRNQVALSGIVINFRLSERFYTIVIYRIWA
SLETFLLGVTEEMRGHNLPNHLARGKHRSSFKFEEWWTHHEECRRLIINSGLWEPGRANECSLGRIFNVVPLL