; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001969 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001969
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:37756280..37758793
RNA-Seq ExpressionLag0001969
SyntenyLag0001969
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_012857002.1 PREDICTED: uncharacterized protein LOC105976270 [Erythranthe guttata]6.1e-3835.71Show/hide
Query:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISET----------KEWWACYLMGTD---------------VSFSLISYSNNHIDGWVL--WNECRW
        MS +FWN +GLG+P     L  +++  RPL++F+SET          K  W     G D               V   LISYSNNHID  VL   +  +W
Subjt:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISET----------KEWWACYLMGTD---------------VSFSLISYSNNHIDGWVL--WNECRW

Query:  RLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLD---
        R++GFYGFP +    +SWA+L +L+   + PW++GGD+N +L  +EK GGL +  A++ AF++ +D CGL DL F G RFTW N R    T+  RLD   
Subjt:  RLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLD---

Query:  --------------HY----------------PVQSHWGRYKQRIDRFEETWLRYPDLQDVVRRAW
                      H+                P Q+   R K+R  RFE  WLR  D +++V++ W
Subjt:  --------------HY----------------PVQSHWGRYKQRIDRFEETWLRYPDLQDVVRRAW

XP_018816246.1 uncharacterized protein LOC108987722 [Juglans regia]6.3e-3542.03Show/hide
Query:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISE----TKEWWAC-YLMG---------------------TDVSFSLISYSNNHIDGWVLWNECR--
        M +  WNARGLG+PR    L  L+Q + P VLF+ E    T+E  +C Y +G                      ++  S+I+YS+NH+D  +     R  
Subjt:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISE----TKEWWAC-YLMG---------------------TDVSFSLISYSNNHIDGWVLWNECR--

Query:  -WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDH
         W L+  YGFP   L  QSW+LL  L    D PWL+ GDFN +L   EK GG  +P  +L+AF++VVD C L DLGFSG   TW NRR G+  I ERLDH
Subjt:  -WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDH

Query:  YPVQSHW
          V S W
Subjt:  YPVQSHW

XP_023871634.1 uncharacterized protein LOC111984238 [Quercus suber]1.4e-3432.82Show/hide
Query:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISET----------KEWW----------------ACYLMGTDVSFSLISYSNNHIDGWV-LWNECRW
        M++L WN RGLGSP+    L  L++A  P ++F++ET          ++ W                       DV FS+ SYS NHID  +    E  W
Subjt:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISET----------KEWW----------------ACYLMGTDVSFSLISYSNNHIDGWV-LWNECRW

Query:  RLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL----
        R +GFYG      H  SWA L RLK  +  PW+  GDFN ++   EK GG  +P  ++  F+DV+D CG  DLG++G++FTWCN      T++ER+    
Subjt:  RLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL----

Query:  ----------------------DHYPVQSHWGRYKQRID---RFEETWLRYPDLQDVVRRAW
                              DH P+  H     +R++   RFE+ W+R    ++V+  AW
Subjt:  ----------------------DHYPVQSHWGRYKQRID---RFEETWLRYPDLQDVVRRAW

XP_030498017.1 uncharacterized protein LOC115713672 [Cannabis sativa]4.1e-3435.41Show/hide
Query:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETKEW--WACYLMGTDVSFSLISYSNN-HIDG-------WVLWNEC-----RWRLSGFYGFPSLE
        MSLL WNARGLG+P A   L  +V+   P ++F+SETK +  WA      +     IS+SN+ H+D         +LWN+       + +  FYG P   
Subjt:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETKEW--WACYLMGTDVSFSLISYSNN-HIDG-------WVLWNEC-----RWRLSGFYGFPSLE

Query:  LHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDHYPVQSHWGRY----
            SW LL RLKG +D PW+ GGDFN +L  +E+ GG+D+ ++ +  FQ  +D C LVD+GF G  FTW N+R G A + ERLD Y     W       
Subjt:  LHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDHYPVQSHWGRY----

Query:  -----------------------------KQRIDRFEETWLRYPDLQDVVRRAWVLP
                                      +R  RFE  WL+  +  D+V +AW+ P
Subjt:  -----------------------------KQRIDRFEETWLRYPDLQDVVRRAWVLP

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis]1.2e-3335.92Show/hide
Query:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK-------------EWWACYLMGT-------------DVSFSLISYSNNHIDGWV--LWNECR
        M  + WN RGLG+PR    L  LV+ + P+VLF+ ETK              +  C+ + +             + + ++ SYS NHID  +    ++ +
Subjt:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK-------------EWWACYLMGT-------------DVSFSLISYSNNHIDGWV--LWNECR

Query:  WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDHY
        W+ +G YG P  EL  ++W  +  L+G    PWL+ GDFN VLC  EK GG ++P  ++  F+ ++D C  VDLGF G  FTWCN+R  + T+ ERLD Y
Subjt:  WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDHY

Query:  PVQSHW
             W
Subjt:  PVQSHW

TrEMBL top hitse value%identityAlignment
A0A2I4EA22 uncharacterized protein LOC1089877223.1e-3542.03Show/hide
Query:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISE----TKEWWAC-YLMG---------------------TDVSFSLISYSNNHIDGWVLWNECR--
        M +  WNARGLG+PR    L  L+Q + P VLF+ E    T+E  +C Y +G                      ++  S+I+YS+NH+D  +     R  
Subjt:  MSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISE----TKEWWAC-YLMG---------------------TDVSFSLISYSNNHIDGWVLWNECR--

Query:  -WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDH
         W L+  YGFP   L  QSW+LL  L    D PWL+ GDFN +L   EK GG  +P  +L+AF++VVD C L DLGFSG   TW NRR G+  I ERLDH
Subjt:  -WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERLDH

Query:  YPVQSHW
          V S W
Subjt:  YPVQSHW

A0A2N9EWI9 Reverse transcriptase domain-containing protein1.8e-3534.57Show/hide
Query:  PPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------------------EWWACY--------LMGTDVSFSLISYSNNHIDGWV-LWN
        PP  M+L+ WN RGLG+ R    L  LV+ K P VLF+ ETK                      CY          G D+   + SYS +HID  V + +
Subjt:  PPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------------------EWWACY--------LMGTDVSFSLISYSNNHIDGWV-LWN

Query:  ECRWRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL
           WRL+GFYG P      +SWALL  L+     PW   GDFN +L Q EK G  D+P  ++ AF+DV+  C L D+GF G  FTW NRR G A +  RL
Subjt:  ECRWRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL

Query:  DHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW
        D     + W                                 R ++R+ RFE+ W+ +P+ + VV  AW
Subjt:  DHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW

A0A2N9H936 Uncharacterized protein5.9e-3932.16Show/hide
Query:  QGVLSSSVDNGKTVVGDQPPNLGLEVGSAPGHGRKGWKRLARGTLKDVTNVMEHESSKQKRSLSIGENLENGKDGKRRKGADVDAMVCASSDQNVAAAGQ
        + +  S VD G    G Q PN  L     PG     WKRLAR T        E    KQKR +  G N                                
Subjt:  QGVLSSSVDNGKTVVGDQPPNLGLEVGSAPGHGRKGWKRLARGTLKDVTNVMEHESSKQKRSLSIGENLENGKDGKRRKGADVDAMVCASSDQNVAAAGQ

Query:  PPPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------EWWACY--------------------LMGTDVSFSLISYSNNHIDGWVLWN
         PP  MS LFWN RGLG+P+    L  +V+ K PLVLF+SETK      E   CY                       +D+  S+ SYS++HID  + ++
Subjt:  PPPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------EWWACY--------------------LMGTDVSFSLISYSNNHIDGWVLWN

Query:  -ECRWRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYER
         E  WR +GFYG P++     +W LL  L+G +  PWL GGDFN +L  +EK G + +P ++++AF+ VVD CG VDLGF G  +TW N++ G A + ER
Subjt:  -ECRWRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYER

Query:  LDHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW
        LD     + W                                 R +++  RFEE W  +   +D ++ AW
Subjt:  LDHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW

A0A2N9I239 RNase H domain-containing protein4.7e-3635.32Show/hide
Query:  PPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------EWWACY--------------------LMGTDVSFSLISYSNNHIDGWVLWNE
        PP  MS LFWN RGLG+P     L  +V+ K PL LFISETK      E   CY                         VS ++ SYS +HID  V  ++
Subjt:  PPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------EWWACY--------------------LMGTDVSFSLISYSNNHIDGWVLWNE

Query:  CR-WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL
         + WR++GFYG P+      +W +L  L   +  PWL GGDFN +L  +EK G + +  A++A F+ VVD CG +DLGFSG ++TW N+R G A + ERL
Subjt:  CR-WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL

Query:  DHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW
        D     + W                                 R +++  RFEE W   P  ++ VR+AW
Subjt:  DHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW

A0A2N9IFR8 RNase H domain-containing protein3.6e-3635.32Show/hide
Query:  PPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------EWWACY--------------------LMGTDVSFSLISYSNNHIDGWVLWNE
        PP  MS LFWN RGLG+P     L  +V+ K PL LFISETK      E   CY                         VS ++ SYS++HID  V  ++
Subjt:  PPGVMSLLFWNARGLGSPRAFCRLNKLVQAKRPLVLFISETK------EWWACY--------------------LMGTDVSFSLISYSNNHIDGWVLWNE

Query:  CR-WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL
         + WR++GFYG P+      +W +L  L   +  PWL GGDFN +L  +EK G + +  A++A F+ VVD CG +DLGFSG ++TW N+R G A + ERL
Subjt:  CR-WRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVVDWCGLVDLGFSGDRFTWCNRRPGEATIYERL

Query:  DHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW
        D     + W                                 R +++  RFEE W   P  ++ VR+AW
Subjt:  DHYPVQSHW--------------------------------GRYKQRIDRFEETWLRYPDLQDVVRRAW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCTGCTCAAAAAAATCAACCAAAGTGGAATTCTCAAACACTCTTTGATCGATTAAGAGGGAGTCGATGTGAGCTGGTTTGCTACCACACGCGGCATATTATGGG
CGAATCTATTGCTCGATCTTGGACATCAAGCAAAGACAAGCCTCCAAAGAGACATGAAGTGGAAGACGTTCTGACAGGAGAAGCAGGACCATCTGCGGCTGTTTCTGTGG
GGCCTACGGTCCAACCGCAGGTGGTGGATGAGGGTCCTTTGCAGGGTGTACTATCATCTTCTGTAGACAACGGGAAGACTGTGGTGGGGGATCAGCCTCCGAATCTGGGA
TTAGAGGTCGGTTCTGCGCCGGGGCATGGTCGGAAAGGGTGGAAGAGACTTGCCCGGGGTACCTTGAAAGACGTGACTAATGTGATGGAGCATGAGAGTAGTAAGCAGAA
GAGGAGTCTTTCTATAGGTGAGAATTTGGAGAACGGAAAGGATGGGAAGCGAAGGAAAGGGGCGGATGTGGATGCGATGGTGTGTGCATCTAGTGACCAAAATGTGGCGG
CGGCTGGCCAGCCGCCGCCAGGAGTTATGAGTCTCTTATTTTGGAACGCCCGAGGTTTAGGGTCACCTCGAGCGTTCTGTCGCTTGAACAAGTTGGTTCAGGCAAAACGA
CCCTTGGTGCTGTTCATTTCCGAAACTAAAGAGTGGTGGGCTTGCTATCTTATGGGCACAGATGTCTCCTTCAGCCTCATCTCCTACTCAAACAATCATATCGATGGGTG
GGTGTTGTGGAACGAGTGTAGGTGGCGGCTGTCGGGTTTTTATGGCTTTCCCTCCTTAGAACTTCACGACCAGTCATGGGCTCTACTAAGTAGATTGAAGGGGTGCTATG
ATACTCCTTGGCTTATTGGAGGTGATTTTAATGCAGTCCTCTGCCAAGATGAGAAGGGGGGTGGGTTAGATAAGCCGATGGCTGAGTTGGCAGCTTTTCAGGACGTTGTT
GATTGGTGTGGGCTTGTGGATCTGGGCTTTTCTGGTGACCGTTTCACTTGGTGTAATAGAAGGCCTGGAGAGGCGACTATTTATGAGCGATTGGACCATTATCCTGTGCA
GTCCCATTGGGGGAGGTATAAGCAGCGTATAGATCGGTTTGAGGAGACATGGTTACGTTATCCTGATTTGCAGGATGTGGTTCGTAGAGCTTGGGTGTTGCCTCCACTAG
TTCCCTCTCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGCTGCTCAAAAAAATCAACCAAAGTGGAATTCTCAAACACTCTTTGATCGATTAAGAGGGAGTCGATGTGAGCTGGTTTGCTACCACACGCGGCATATTATGGG
CGAATCTATTGCTCGATCTTGGACATCAAGCAAAGACAAGCCTCCAAAGAGACATGAAGTGGAAGACGTTCTGACAGGAGAAGCAGGACCATCTGCGGCTGTTTCTGTGG
GGCCTACGGTCCAACCGCAGGTGGTGGATGAGGGTCCTTTGCAGGGTGTACTATCATCTTCTGTAGACAACGGGAAGACTGTGGTGGGGGATCAGCCTCCGAATCTGGGA
TTAGAGGTCGGTTCTGCGCCGGGGCATGGTCGGAAAGGGTGGAAGAGACTTGCCCGGGGTACCTTGAAAGACGTGACTAATGTGATGGAGCATGAGAGTAGTAAGCAGAA
GAGGAGTCTTTCTATAGGTGAGAATTTGGAGAACGGAAAGGATGGGAAGCGAAGGAAAGGGGCGGATGTGGATGCGATGGTGTGTGCATCTAGTGACCAAAATGTGGCGG
CGGCTGGCCAGCCGCCGCCAGGAGTTATGAGTCTCTTATTTTGGAACGCCCGAGGTTTAGGGTCACCTCGAGCGTTCTGTCGCTTGAACAAGTTGGTTCAGGCAAAACGA
CCCTTGGTGCTGTTCATTTCCGAAACTAAAGAGTGGTGGGCTTGCTATCTTATGGGCACAGATGTCTCCTTCAGCCTCATCTCCTACTCAAACAATCATATCGATGGGTG
GGTGTTGTGGAACGAGTGTAGGTGGCGGCTGTCGGGTTTTTATGGCTTTCCCTCCTTAGAACTTCACGACCAGTCATGGGCTCTACTAAGTAGATTGAAGGGGTGCTATG
ATACTCCTTGGCTTATTGGAGGTGATTTTAATGCAGTCCTCTGCCAAGATGAGAAGGGGGGTGGGTTAGATAAGCCGATGGCTGAGTTGGCAGCTTTTCAGGACGTTGTT
GATTGGTGTGGGCTTGTGGATCTGGGCTTTTCTGGTGACCGTTTCACTTGGTGTAATAGAAGGCCTGGAGAGGCGACTATTTATGAGCGATTGGACCATTATCCTGTGCA
GTCCCATTGGGGGAGGTATAAGCAGCGTATAGATCGGTTTGAGGAGACATGGTTACGTTATCCTGATTTGCAGGATGTGGTTCGTAGAGCTTGGGTGTTGCCTCCACTAG
TTCCCTCTCGGTGA
Protein sequenceShow/hide protein sequence
MRAAQKNQPKWNSQTLFDRLRGSRCELVCYHTRHIMGESIARSWTSSKDKPPKRHEVEDVLTGEAGPSAAVSVGPTVQPQVVDEGPLQGVLSSSVDNGKTVVGDQPPNLG
LEVGSAPGHGRKGWKRLARGTLKDVTNVMEHESSKQKRSLSIGENLENGKDGKRRKGADVDAMVCASSDQNVAAAGQPPPGVMSLLFWNARGLGSPRAFCRLNKLVQAKR
PLVLFISETKEWWACYLMGTDVSFSLISYSNNHIDGWVLWNECRWRLSGFYGFPSLELHDQSWALLSRLKGCYDTPWLIGGDFNAVLCQDEKGGGLDKPMAELAAFQDVV
DWCGLVDLGFSGDRFTWCNRRPGEATIYERLDHYPVQSHWGRYKQRIDRFEETWLRYPDLQDVVRRAWVLPPLVPSR