; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022063 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022063
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:17215940..17217598
RNA-Seq ExpressionLag0022063
SyntenyLag0022063
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU50750.1 hypothetical protein TSUD_410580 [Trifolium subterraneum]1.7e-1224.35Show/hide
Query:  MGHSHRDCSVAVETAVVDGCFPFSDWIRASPVRPMRQSSNREEGQGPKRHGGRPFGGF---GRGRSRDHGQLGLSEANIVEAPGQASDAMNEMEPSSNIV
        +GHS   C V       DG   +S  +R                  P+R GGRP   +    RG     G+ G S+                        
Subjt:  MGHSHRDCSVAVETAVVDGCFPFSDWIRASPVRPMRQSSNREEGQGPKRHGGRPFGGF---GRGRSRDHGQLGLSEANIVEAPGQASDAMNEMEPSSNIV

Query:  EMSEAMVGHPESTTPMPTTVSVGLPVDKRKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVG
                 P  T   PT+   GLP           P +V      G  +P A+  L  + Q  +P ++FLSET   S  +E V+  LG+DCC +ID  G
Subjt:  EMSEAMVGHPESTTPMPTTVSVGLPVDKRKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVG

Query:  QSCGLALLW----DFSD-----------------------SIVSCGLVDLGFVGERYTWCNWRPGVHHLD-------------------------YCGLD
        +S GLA+LW    DF+D                       ++  C L D+   G  YTW   R   H ++                             D
Subjt:  QSCGLALLW----DFSD-----------------------SIVSCGLVDLGFVGERYTWCNWRPGVHHLD-------------------------YCGLD

Query:  HRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRFMAGGGKTKLDSFPRRIREA
        H P+ L    V+  G RR    +FE  WL+ SD+++VV   WG    V  +++   + +  + C   + G G+ K   F   I ++
Subjt:  HRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRFMAGGGKTKLDSFPRRIREA

KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa]8.6e-1223.57Show/hide
Query:  RKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW-------------------
        +K  +E    VV +    G G+P AL  L  VV+   P ++FLSETK+     E ++R + F   F +DCVG+S GL LLW                   
Subjt:  RKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW-------------------

Query:  --------------------------------------------------------------------DFSDSIVSCGLVDLGFVGERYTWCNWRPGVHH
                                                                            DF  ++  C LVD+GF G+ +TW N R GV H
Subjt:  --------------------------------------------------------------------DFSDSIVSCGLVDLGFVGERYTWCNWRPGVHH

Query:  L-------------------------DYCGLDHRPLDLNL-SLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRF
        +                         D+   DHRP+   L ++V N    ++   RFE  WL+ ++ +D+V+Q+W S  V   +     +  +  +C   
Subjt:  L-------------------------DYCGLDHRPLDLNL-SLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRF

Query:  MAGGGKTKLDSFPR
        +    KTK  S P+
Subjt:  MAGGGKTKLDSFPR

KAG2725981.1 hypothetical protein I3760_01G090600 [Carya illinoinensis]9.1e-1424.03Show/hide
Query:  GSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLWD------------------------------------
        G G+P  +R L  +V+ + P+V+FL ETK+TS QME V+  +GF+CCF++DC G+  G+ALLW                                     
Subjt:  GSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLWD------------------------------------

Query:  -----------------------------------FSDSIVSCGLVDLGFVGERYTWCN--------------------WRPGVHHLDY-----CGLDHR
                                           F   I  C L+DLGF G++YTWCN                    W+    H           DH 
Subjt:  -----------------------------------FSDSIVSCGLVDLGFVGERYTWCN--------------------WRPGVHHLDY-----CGLDHR

Query:  PLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRFMAGGGKTKLDSFPRRIREASQRVQFAIVDLKVSGT
        P+ L  S   N    R  + RFE +W+   +  DV+S+ W     V ++     +    + C + +    + K     ++I+ A + +Q   +    S T
Subjt:  PLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRFMAGGGKTKLDSFPRRIREASQRVQFAIVDLKVSGT

Query:  RDELIQAE
        R+E+ +A+
Subjt:  RDELIQAE

KAG6687674.1 hypothetical protein I3842_11G085000 [Carya illinoinensis]2.0e-1331Show/hide
Query:  GSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLWD------------------------------------
        G G+P  +R L  +V+ +RP V+FLSETK    +ME ++  L FD  F +DCVG+S GLA  W                                     
Subjt:  GSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLWD------------------------------------

Query:  FSDSIVSCGLVDLGFVGERYTWCNWRPGVH----HLDYCGLDHRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLA
        F   +  CGL DLGF G+ YTW N R G       LD   ++   LDL           R CI R+E  W    + + +V Q+W    ++ S + P+ +A
Subjt:  FSDSIVSCGLVDLGFVGERYTWCNWRPGVH----HLDYCGLDHRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLA

XP_030498017.1 uncharacterized protein LOC115713672 [Cannabis sativa]3.8e-1223.47Show/hide
Query:  GSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW-------------------------------------
        G G+P AL  L  VV+   P ++FLSETK+     E ++R + F   F +DCVG+S GL LLW                                     
Subjt:  GSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW-------------------------------------

Query:  --------------------------------DFSDSIVSCGLVDLGFVGERYTWCNWRPGVHHL-------------------------DYCGLDHRPL
                                        +F  ++ +C LVD+GF G+ +TW N R G  H+                         D+   DHRP+
Subjt:  --------------------------------DFSDSIVSCGLVDLGFVGERYTWCNWRPGVHHL-------------------------DYCGLDHRPL

Query:  DLNL-SLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQG-------LAQLTNQCMRFMAGGGKTKLDSFPRRIREASQRV
           L ++V +     +   RFE  WL+ ++  D+V+Q+W         +SP G       +  +  +C   +    K+K  S P+ +RE  +++
Subjt:  DLNL-SLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQG-------LAQLTNQCMRFMAGGGKTKLDSFPRRIREASQRV

TrEMBL top hitse value%identityAlignment
A0A2N9FDP4 Reverse transcriptase domain-containing protein2.2e-1327.73Show/hide
Query:  KGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW---------------------------------
        KG  G G+P A+R L  +V+ + P V+FL ETK+ + +ME ++  LGFD  F++  +G+S GLALLW                                 
Subjt:  KGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW---------------------------------

Query:  --------------------------DFSDSIVSCGLVDLGFVGERYTWCN--------------------W-----RPGVHHLDYCGLDHRPLDLNLSL
                                  DF +++ +C LVDLGF G +YTW N                    W     R  V H      DH  L+L +S 
Subjt:  --------------------------DFSDSIVSCGLVDLGFVGERYTWCN--------------------W-----RPGVHHLDYCGLDHRPLDLNLSL

Query:  VLNCGT--RRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQC
        V    T  R++ + RFEE W    D + ++ +SW     V S M    L Q  ++C
Subjt:  VLNCGT--RRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQC

A0A2N9GN60 zf-RVT domain-containing protein3.4e-1427.98Show/hide
Query:  SEAMVGHPESTTPMPTTVSVGLPVDKRKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQS
        S  + G P++T  + T  S  +  ++    +++S   +   G     SP A+  L  +++ ++P+++FLSET+++S  +E ++  LG      I   G  
Subjt:  SEAMVGHPESTTPMPTTVSVGLPVDKRKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQS

Query:  CGLALLW-DFSDSIVSCGLVDLGFVGERYTWCNWRP-GVHHLDYCGLDHRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSD
         GLALLW  F D + +CG  +LG+ G  + WCN     V H+     DH  L   L+   N   +R+   +F+E W++    +D++ ++W +D
Subjt:  CGLALLW-DFSDSIVSCGLVDLGFVGERYTWCNWRP-GVHHLDYCGLDHRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSD

A0A2N9I0P4 Reverse transcriptase domain-containing protein1.6e-1130.26Show/hide
Query:  ALRCLTKVVQAQRPMVMFLSETKVTSFQMEAV--KRGLGFDCCFSID---CVG--------------QSCGLAL-----LWDFSDSIVSCGLVDLGFVGE
        A++ L+++V+ + P+V+F+ ET +   ++E +      G  C F I+   C+G              +  G  +     + DF D+I SCG +DLGFVGE
Subjt:  ALRCLTKVVQAQRPMVMFLSETKVTSFQMEAV--KRGLGFDCCFSID---CVG--------------QSCGLAL-----LWDFSDSIVSCGLVDLGFVGE

Query:  RYTWCNWRPG-------------------------VHHLDYCGLDHRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPS
         +TWCN R G                         +HH+D    DH PL LNL       + ++   RFEE+WL     +D+V+Q+W  D  V S
Subjt:  RYTWCNWRPG-------------------------VHHLDYCGLDHRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPS

A0A2Z6PHQ3 Reverse transcriptase domain-containing protein8.4e-1324.35Show/hide
Query:  MGHSHRDCSVAVETAVVDGCFPFSDWIRASPVRPMRQSSNREEGQGPKRHGGRPFGGF---GRGRSRDHGQLGLSEANIVEAPGQASDAMNEMEPSSNIV
        +GHS   C V       DG   +S  +R                  P+R GGRP   +    RG     G+ G S+                        
Subjt:  MGHSHRDCSVAVETAVVDGCFPFSDWIRASPVRPMRQSSNREEGQGPKRHGGRPFGGF---GRGRSRDHGQLGLSEANIVEAPGQASDAMNEMEPSSNIV

Query:  EMSEAMVGHPESTTPMPTTVSVGLPVDKRKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVG
                 P  T   PT+   GLP           P +V      G  +P A+  L  + Q  +P ++FLSET   S  +E V+  LG+DCC +ID  G
Subjt:  EMSEAMVGHPESTTPMPTTVSVGLPVDKRKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVG

Query:  QSCGLALLW----DFSD-----------------------SIVSCGLVDLGFVGERYTWCNWRPGVHHLD-------------------------YCGLD
        +S GLA+LW    DF+D                       ++  C L D+   G  YTW   R   H ++                             D
Subjt:  QSCGLALLW----DFSD-----------------------SIVSCGLVDLGFVGERYTWCNWRPGVHHLD-------------------------YCGLD

Query:  HRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRFMAGGGKTKLDSFPRRIREA
        H P+ L    V+  G RR    +FE  WL+ SD+++VV   WG    V  +++   + +  + C   + G G+ K   F   I ++
Subjt:  HRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRFMAGGGKTKLDSFPRRIREA

A0A7J6DZ24 CCHC-type domain-containing protein4.1e-1223.57Show/hide
Query:  RKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW-------------------
        +K  +E    VV +    G G+P AL  L  VV+   P ++FLSETK+     E ++R + F   F +DCVG+S GL LLW                   
Subjt:  RKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLW-------------------

Query:  --------------------------------------------------------------------DFSDSIVSCGLVDLGFVGERYTWCNWRPGVHH
                                                                            DF  ++  C LVD+GF G+ +TW N R GV H
Subjt:  --------------------------------------------------------------------DFSDSIVSCGLVDLGFVGERYTWCNWRPGVHH

Query:  L-------------------------DYCGLDHRPLDLNL-SLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRF
        +                         D+   DHRP+   L ++V N    ++   RFE  WL+ ++ +D+V+Q+W S  V   +     +  +  +C   
Subjt:  L-------------------------DYCGLDHRPLDLNL-SLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRF

Query:  MAGGGKTKLDSFPR
        +    KTK  S P+
Subjt:  MAGGGKTKLDSFPR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACACTCTCATCGCGACTGCTCGGTTGCGGTTGAGACAGCGGTTGTGGATGGTTGCTTTCCCTTTAGTGATTGGATACGAGCTTCCCCTGTGAGGCCGATGAGGCA
GAGTTCGAACAGGGAGGAGGGACAAGGACCTAAGCGCCATGGTGGAAGGCCTTTTGGCGGATTTGGCAGAGGCAGGAGTAGGGATCATGGACAGTTGGGGTTGAGTGAAG
CGAATATTGTTGAGGCTCCTGGTCAGGCTTCAGATGCTATGAATGAGATGGAACCTTCATCGAATATTGTTGAAATGTCGGAGGCTATGGTCGGGCACCCCGAGTCTACG
ACCCCTATGCCAACAACTGTTTCTGTGGGGTTGCCGGTGGATAAGAGAAAGGCAGTGGTTGAATCATCTCCTTCTGTAGTGGCTAAGAAGGGTGTTGCGGGCTCAGGGTC
CCCTTGGGCGCTCCGATGCCTGACTAAGGTGGTTCAAGCACAACGACCCATGGTGATGTTTCTGTCTGAGACAAAGGTTACGTCGTTTCAAATGGAGGCTGTAAAACGTG
GTTTGGGATTTGATTGTTGTTTTTCTATTGATTGTGTTGGCCAGAGTTGTGGTCTGGCCCTCCTTTGGGATTTTTCGGATTCGATTGTTTCTTGTGGTCTTGTGGATCTG
GGGTTTGTGGGTGAGCGTTACACGTGGTGTAACTGGCGACCAGGGGTGCATCACCTAGATTACTGTGGTTTGGATCATCGCCCTTTAGATTTGAATCTTTCTCTGGTGCT
TAATTGTGGGACTAGGCGGCAGTGTATTAATCGTTTTGAGGAGGTGTGGTTGCGATACTCAGACCTTCAGGATGTGGTAAGTCAATCATGGGGTTCAGACTCTGTTGTAC
CCTCCTCTATGAGTCCACAGGGCCTTGCTCAACTGACTAACCAGTGTATGCGGTTTATGGCTGGAGGGGGGAAGACAAAGCTGGACAGTTTTCCTAGACGGATTAGGGAG
GCTTCACAGAGGGTGCAATTTGCTATTGTTGACTTGAAAGTTTCAGGCACTCGTGATGAGCTTATTCAGGCTGAAGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGACACTCTCATCGCGACTGCTCGGTTGCGGTTGAGACAGCGGTTGTGGATGGTTGCTTTCCCTTTAGTGATTGGATACGAGCTTCCCCTGTGAGGCCGATGAGGCA
GAGTTCGAACAGGGAGGAGGGACAAGGACCTAAGCGCCATGGTGGAAGGCCTTTTGGCGGATTTGGCAGAGGCAGGAGTAGGGATCATGGACAGTTGGGGTTGAGTGAAG
CGAATATTGTTGAGGCTCCTGGTCAGGCTTCAGATGCTATGAATGAGATGGAACCTTCATCGAATATTGTTGAAATGTCGGAGGCTATGGTCGGGCACCCCGAGTCTACG
ACCCCTATGCCAACAACTGTTTCTGTGGGGTTGCCGGTGGATAAGAGAAAGGCAGTGGTTGAATCATCTCCTTCTGTAGTGGCTAAGAAGGGTGTTGCGGGCTCAGGGTC
CCCTTGGGCGCTCCGATGCCTGACTAAGGTGGTTCAAGCACAACGACCCATGGTGATGTTTCTGTCTGAGACAAAGGTTACGTCGTTTCAAATGGAGGCTGTAAAACGTG
GTTTGGGATTTGATTGTTGTTTTTCTATTGATTGTGTTGGCCAGAGTTGTGGTCTGGCCCTCCTTTGGGATTTTTCGGATTCGATTGTTTCTTGTGGTCTTGTGGATCTG
GGGTTTGTGGGTGAGCGTTACACGTGGTGTAACTGGCGACCAGGGGTGCATCACCTAGATTACTGTGGTTTGGATCATCGCCCTTTAGATTTGAATCTTTCTCTGGTGCT
TAATTGTGGGACTAGGCGGCAGTGTATTAATCGTTTTGAGGAGGTGTGGTTGCGATACTCAGACCTTCAGGATGTGGTAAGTCAATCATGGGGTTCAGACTCTGTTGTAC
CCTCCTCTATGAGTCCACAGGGCCTTGCTCAACTGACTAACCAGTGTATGCGGTTTATGGCTGGAGGGGGGAAGACAAAGCTGGACAGTTTTCCTAGACGGATTAGGGAG
GCTTCACAGAGGGTGCAATTTGCTATTGTTGACTTGAAAGTTTCAGGCACTCGTGATGAGCTTATTCAGGCTGAAGCTTAG
Protein sequenceShow/hide protein sequence
MGHSHRDCSVAVETAVVDGCFPFSDWIRASPVRPMRQSSNREEGQGPKRHGGRPFGGFGRGRSRDHGQLGLSEANIVEAPGQASDAMNEMEPSSNIVEMSEAMVGHPEST
TPMPTTVSVGLPVDKRKAVVESSPSVVAKKGVAGSGSPWALRCLTKVVQAQRPMVMFLSETKVTSFQMEAVKRGLGFDCCFSIDCVGQSCGLALLWDFSDSIVSCGLVDL
GFVGERYTWCNWRPGVHHLDYCGLDHRPLDLNLSLVLNCGTRRQCINRFEEVWLRYSDLQDVVSQSWGSDSVVPSSMSPQGLAQLTNQCMRFMAGGGKTKLDSFPRRIRE
ASQRVQFAIVDLKVSGTRDELIQAEA