CuGenDBv2

Lag0000594 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000594
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:10942559..10943119
RNA-Seq Expression: Lag0000594
Synteny: Lag0000594
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
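The genome location in this record follows the `chrom:start..end` convention. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span length can be computed as follows. The function name `parse_location` is hypothetical and not part of any database tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr4:10942559..10943119")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, span_length)
```

Under that assumption the span is 561 bp, which is consistent with the length of the CDS listed under Sequences.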


Homology
GenBank top hits
TXG57064.1 hypothetical protein EZV62_018377 [Acer yangbiense]; e value: 5.8e-20; % identity: 34.34
Query:  FREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCEECK
        FRE ID C L+D GF GP+ TW N       + ER+DR L +T   +      VQHL + +SDHRP++  +   +Q   +   ++  +FE  W K EEC 
Subjt:  FREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCEECK

Query:  DIISQAWRSHNTQDAP----SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQ
         +I +AW   N  + P     ++ K+  C   L  WS  ++ GSLR  I   +KE++E + R  ++
Subjt:  DIISQAWRSHNTQDAP----SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQ

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]; e value: 1.3e-32; % identity: 41.4
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEW-DRDYQDKNQMGLKRLLRFEESWTKC
        M +F++ +D CGL+DPGF G  FTWC+ H   + IWERLDRFL+NT + +      ++HL F ASDHRPI+AEW         +   +R  RFEE W   
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEW-DRDYQDKNQMGLKRLLRFEESWTKC

Query:  EECKDIISQAWRSHNTQDAPSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKMKEKELAQLLEDDE
        +ECK+I+ + W           Q KI  CL +L KW+  R  GSLRGAI R E E+Q  V          +   +++L +LLE++E
Subjt:  EECKDIISQAWRSHNTQDAPSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKMKEKELAQLLEDDE

XP_030479239.1 uncharacterized protein LOC115696480 [Cannabis sativa]; e value: 1.5e-20; % identity: 33.86
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE
        M DFR++ID CG+    +TG +FTW N + N  ++ ERLDR  +N   +E  S   + HL + +SDHR I        QD+     +   RFE+ W K E
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE

Query:  ECKDIISQAWRSHNTQD-APSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKM--KEKELAQLLEDDEI
        EC+++I   W+S  T D  PS+ + I  C   L+ W  ++Y G ++  I++ +K+  E    +   +    +M   EK L  LLE +E+
Subjt:  ECKDIISQAWRSHNTQD-APSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKM--KEKELAQLLEDDEI

XP_030969676.1 uncharacterized protein LOC115989953 [Quercus lobata]; e value: 4.5e-20; % identity: 38.51
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE
        M  FR+ +D CG +D GF+GPEFTW   H   E+IWERLD  + N     +     ++HL    SDHRPI+   D +   +NQ   ++  RFE  W    
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE

Query:  ECKDIISQAWRSHNTQDAPSV--QEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWV
        ECK ++S AW + N +  P V   +KIK C   LK W+ D + GS+   I + ++ L  WV
Subjt:  ECKDIISQAWRSHNTQDAPSV--QEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWV

XP_042983236.1 uncharacterized protein LOC122312642 [Carya illinoinensis]; e value: 4.0e-21; % identity: 34.74
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE
        M  FR A+D CGL D GF G +FTW NN   ++   ERLDR L N+ +    S+ ++  L+ Q+SDH P+      D    N    + L R+E SW K E
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE

Query:  ECKDIISQAWRSHNTQDAP--SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVE--KELQEWVCRNDVQSMGFVKMKEKELAQLLEDDEI
        EC+ I+ + WRS  T+ +   SV   + +C   L +W    Y+ +    +  ++    LQE    N  QS   +K K+K + QLL +D++
Subjt:  ECKDIISQAWRSHNTQDAP--SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVE--KELQEWVCRNDVQSMGFVKMKEKELAQLLEDDEI

TrEMBL top hits
A0A2N9ERX7 CCHC-type domain-containing protein; e value: 1.5e-21; % identity: 37.63
Query:  DFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCEEC
        DFREA+D C L D GF G  FTW      S  I ERLDR L + +   +     V HLA  +SDH P++ E     Q  ++   KR+  F+  W K ++C
Subjt:  DFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCEEC

Query:  KDIISQAWRSHNTQDAP--SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKMKEKELAQLLEDDEI
        K +I+QAW S  +  +P   V EK+K C   L  WS DR+ GSL   I    K+LQ         +   +   + EL  LLE +EI
Subjt:  KDIISQAWRSHNTQDAP--SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKMKEKELAQLLEDDEI

A0A2N9EXU0 Reverse transcriptase domain-containing protein; e value: 2.0e-21; % identity: 40
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE
        M  FR  ID CG ID GF G  FTWCNN   S   W RLDRF+       +     V HL   ASDH+PI   W   Y    Q   +RL RFE+ W    
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE

Query:  ECKDIISQAWRSHNTQDAP--SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMG
         C+ II+ AW    T+ +P   VQ KI  C   LKKWS D +     G +++  KE  E + R +  S G
Subjt:  ECKDIISQAWRSHNTQDAP--SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMG

A0A2N9H727 Uncharacterized protein; e value: 5.7e-21; % identity: 35.83
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE
        MS FR+ +D CG +D G+TGP FTWCNNH+    + ERLDR L  +    + S   V HL    SDH P+  E     Q K+    ++  RFEE WT   
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE

Query:  ECKDIISQAWR-SHNTQDAPSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQ-EWVCRNDVQSMGFVKMKEKELAQLLEDDE
         C++ I QAW           V EKIK     L+ WS   + GS+R +I    ++L+ E       Q++  +K   +EL +L   +E
Subjt:  ECKDIISQAWR-SHNTQDAPSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQ-EWVCRNDVQSMGFVKMKEKELAQLLEDDE

A0A2N9IXK4 RNase H domain-containing protein; e value: 4.7e-23; % identity: 37.97
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE
        M DFR+AID CG  D GF GP FTWCNN + S  +WERLDR L  T          VQHL   +SDH PI  ++      + +    R+ RFEE W    
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCE

Query:  ECKDIISQAWRSHNTQDAP-SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQE
         CK+ I+ AW++     A   V +K++ C   L++WS D + G++   + +  + L+E
Subjt:  ECKDIISQAWRSHNTQDAP-SVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQE

A0A6J1DRA0 uncharacterized protein LOC111022423; e value: 6.5e-33; % identity: 41.4
Query:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEW-DRDYQDKNQMGLKRLLRFEESWTKC
        M +F++ +D CGL+DPGF G  FTWC+ H   + IWERLDRFL+NT + +      ++HL F ASDHRPI+AEW         +   +R  RFEE W   
Subjt:  MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEW-DRDYQDKNQMGLKRLLRFEESWTKC

Query:  EECKDIISQAWRSHNTQDAPSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKMKEKELAQLLEDDE
        +ECK+I+ + W           Q KI  CL +L KW+  R  GSLRGAI R E E+Q  V          +   +++L +LLE++E
Subjt:  EECKDIISQAWRSHNTQDAPSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKMKEKELAQLLEDDE
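In BLAST-style alignments such as those above, the unlabeled middle line marks identical residues with a letter and conservative substitutions with `+`. As a rough sketch of how an identity percentage relates to that line, the snippet below counts both over a short fragment: the first 12 columns of the TXG57064.1 alignment. The helper name `midline_stats` is hypothetical, and a fragment's percentage will not match the value reported for the full alignment.

```python
def midline_stats(query: str, midline: str) -> dict[str, int]:
    """Count identities and positives from a BLAST-style midline."""
    # Trailing spaces of the midline are often stripped; pad to the query length.
    midline = midline.ljust(len(query))
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {"length": len(query), "identities": identities, "positives": positives}

# Fragment: first 12 alignment columns of the TXG57064.1 hit.
stats = midline_stats("FREAIDTCGLID", "FRE ID C L+D")
percent_identity = 100.0 * stats["identities"] / stats["length"]
print(stats, round(percent_identity, 2))
```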

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTGATTTCAGGGAGGCTATAGACACGTGTGGTTTAATCGATCCTGGCTTTACTGGCCCAGAGTTTACTTGGTGTAACAACCATGTGAACAGTGAAATTATATGGGA
AAGGTTGGATCGTTTCTTAATGAACACTGTGATGCAAGAGAAGTGTAGTTTCTTTAATGTGCAACACCTCGCTTTTCAAGCTTCGGACCATCGACCAATTATAGCAGAAT
GGGATAGAGATTACCAAGACAAAAATCAGATGGGTCTGAAACGTTTGCTAAGATTTGAGGAGTCATGGACCAAATGCGAAGAATGTAAAGATATTATTTCGCAGGCCTGG
AGGAGTCATAATACTCAAGATGCTCCCTCAGTCCAAGAAAAGATAAAGATGTGCTTAATTGACCTCAAAAAATGGAGTTGTGATAGGTACAGAGGTTCTTTGAGAGGGGC
AATTTCAAGAGTAGAGAAAGAGCTTCAAGAGTGGGTGTGCAGAAATGATGTTCAGAGTATGGGGTTTGTGAAAATGAAGGAAAAGGAATTAGCACAACTGTTGGAGGATG
ATGAGATCTAG
mRNA sequence
ATGTCTGATTTCAGGGAGGCTATAGACACGTGTGGTTTAATCGATCCTGGCTTTACTGGCCCAGAGTTTACTTGGTGTAACAACCATGTGAACAGTGAAATTATATGGGA
AAGGTTGGATCGTTTCTTAATGAACACTGTGATGCAAGAGAAGTGTAGTTTCTTTAATGTGCAACACCTCGCTTTTCAAGCTTCGGACCATCGACCAATTATAGCAGAAT
GGGATAGAGATTACCAAGACAAAAATCAGATGGGTCTGAAACGTTTGCTAAGATTTGAGGAGTCATGGACCAAATGCGAAGAATGTAAAGATATTATTTCGCAGGCCTGG
AGGAGTCATAATACTCAAGATGCTCCCTCAGTCCAAGAAAAGATAAAGATGTGCTTAATTGACCTCAAAAAATGGAGTTGTGATAGGTACAGAGGTTCTTTGAGAGGGGC
AATTTCAAGAGTAGAGAAAGAGCTTCAAGAGTGGGTGTGCAGAAATGATGTTCAGAGTATGGGGTTTGTGAAAATGAAGGAAAAGGAATTAGCACAACTGTTGGAGGATG
ATGAGATCTAG
Protein sequence
MSDFREAIDTCGLIDPGFTGPEFTWCNNHVNSEIIWERLDRFLMNTVMQEKCSFFNVQHLAFQASDHRPIIAEWDRDYQDKNQMGLKRLLRFEESWTKCEECKDIISQAW
RSHNTQDAPSVQEKIKMCLIDLKKWSCDRYRGSLRGAISRVEKELQEWVCRNDVQSMGFVKMKEKELAQLLEDDEI
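The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state. The names `CODON_TABLE` and `translate` are illustrative. It translates only the first 30 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS shown in this record.
print(translate("ATGTCTGATTTCAGGGAGGCTATAGACACG"))
print(translate("GATGAGATCTAG"))
```

Joining the CDS lines into one string and passing it to `translate` extends the same check to the full sequence.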