; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018262 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018262
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE2
Genome locationchr5:20723744..20724994
RNA-Seq ExpressionLag0018262
SyntenyLag0018262
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU17915.1 hypothetical protein TSUD_330400, partial [Trifolium subterraneum]3.2e-1529.18Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVVL-----DGLYQL-------HLDLPKSYKRPTAHIYEAMLSNLV------SSPSLE
        GASNHVT   +  Q   ++ G   + VGNG +LEI     TG+ +L     DGLYQL       ++ + +S+ R   H     +  LV       +P + 
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVVL-----DGLYQL-------HLDLPKSYKRPTAHIYEAMLSNLV------SSPSLE

Query:  SKYPSPVKFSCHFVFSVEKH-----------------GVKIIWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG------------
        S   S  K+  HF+    +                    K +      +RIK +Q + GGE++         GI+FR  C +TS+QNG            
Subjt:  SKYPSPVKFSCHFVFSVEKH-----------------GVKIIWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG------------

Query:  -------IVMPLRYWWDAIVTTTYIINHLPTPI
                 MPL YWW+A  T  Y+IN LP+P+
Subjt:  -------IVMPLRYWWDAIVTTTYIINHLPTPI

GAU26774.1 hypothetical protein TSUD_317720 [Trifolium subterraneum]8.0e-1428.39Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN-DKVTGKVVLDGLYQLHLDLPKSYKRPTA------HIYEAMLSNLVSSPSLESKY-----PSP
        GAS+HVT D+SNL       G++ + +GNG  L I +        + + L   ++ +P  +   TA      H   A  S  V S S+E        P+P
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN-DKVTGKVVLDGLYQLHLDLPKSYKRPTA------HIYEAMLSNLVSSPSLESKY-----PSP

Query:  VKFSCHFVF--SVEKHGVKIIW---------------------HQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV----------
        VK S  + +  +      + +W                         G +IK+VQ++  GEFRPF   L   G+  R  C HT  QNG+V          
Subjt:  VKFSCHFVF--SVEKHGVKIIW---------------------HQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV----------

Query:  ---------MPLRYWWDAIVTTTYIINHLPTPILDH
                 +PL++W  A +T T++IN LPTP+L++
Subjt:  ---------MPLRYWWDAIVTTTYIINHLPTPILDH

KAA0067212.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]1.4e-1330.04Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIA-----------------------NDKVTGKVVL-----DGLYQLHLDLPKSYKR------PTAH
        GA+NH+T  LSNL    +Y G   I   NGS L I                        +D  TG+V+L     DGLY+    +  S+KR       T  
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIA-----------------------NDKVTGKVVL-----DGLYQLHLDLPKSYKR------PTAH

Query:  IYEAMLSNLVSSPSLE--------SKYPSPVKFSCHFVFSVEKHGVKIIWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV---
        ++  ++    ++P L+           P+      H   S      K    + LGQ IK +Q++ G EF+PF   L   GIE R  C +TS+QN IV   
Subjt:  IYEAMLSNLVSSPSLE--------SKYPSPVKFSCHFVFSVEKHGVKIIWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV---

Query:  ----------------MPLRYWWDAIVTTTYIINHLPTPILDH
                        +PL +W +A  T+ Y+IN LPTP+LD+
Subjt:  ----------------MPLRYWWDAIVTTTYIINHLPTPILDH

KAG8501032.1 hypothetical protein CXB51_003112 [Gossypium anomalum]5.5e-1528.86Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVVLDGLYQLHLDLPKSYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFSVE
        G SNHVT DL NL     Y GN  + +GNG  + +A+   +     + ++ L   L  +    +A+++   L+   S+ S+   +   +   C+      
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVVLDGLYQLHLDLPKSYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFSVE

Query:  KHGVKI--IWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV-------------------MPLRYWWDAIVTTTYIINHLPTPI
           + +  +     G +IK +Q+++GGEF  F  +L   GI  R  C HTSEQNG+V                   MP+ +W  A ++ TY+IN LPT +
Subjt:  KHGVKI--IWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV-------------------MPLRYWWDAIVTTTYIINHLPTPI

Query:  L
        L
Subjt:  L

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]6.1e-1427.43Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN----------------------------DKVTGKVVLDGLYQLHL-DLPKSYKRP------TA
        GA++HVT + +N++ + DY G E + V NG++L I++                            DK +G+ +L G  + +L  L +S++ P      TA
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN----------------------------DKVTGKVVLDGLYQLHL-DLPKSYKRP------TA

Query:  HIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFSVEKHGVKIIWHQHLGQ---RIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG---------
         ++   + +L S+ +L S+ P+P   S  F   +       +WH+ LG    ++  +QS+ GGE++P  +L +  GI+ R    +TS QNG         
Subjt:  HIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFSVEKHGVKIIWHQHLGQ---RIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG---------

Query:  ----------IVMPLRYWWDAIVTTTYIINHLPTPIL
                    M L YWWD  +T T +IN +P  +L
Subjt:  ----------IVMPLRYWWDAIVTTTYIINHLPTPIL

TrEMBL top hitse value%identityAlignment
A0A2Z6M2G8 Integrase catalytic domain-containing protein3.9e-1428.39Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN-DKVTGKVVLDGLYQLHLDLPKSYKRPTA------HIYEAMLSNLVSSPSLESKY-----PSP
        GAS+HVT D+SNL       G++ + +GNG  L I +        + + L   ++ +P  +   TA      H   A  S  V S S+E        P+P
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN-DKVTGKVVLDGLYQLHLDLPKSYKRPTA------HIYEAMLSNLVSSPSLESKY-----PSP

Query:  VKFSCHFVF--SVEKHGVKIIW---------------------HQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV----------
        VK S  + +  +      + +W                         G +IK+VQ++  GEFRPF   L   G+  R  C HT  QNG+V          
Subjt:  VKFSCHFVF--SVEKHGVKIIW---------------------HQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIV----------

Query:  ---------MPLRYWWDAIVTTTYIINHLPTPILDH
                 +PL++W  A +T T++IN LPTP+L++
Subjt:  ---------MPLRYWWDAIVTTTYIINHLPTPILDH

A0A803NRU8 Uncharacterized protein1.0e-1429.67Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN------------------------------------------------------DKVTGKVVL
        GA+NH T DL NL    +Y G E I VGNG+ L I N                                                      D+VT  ++L
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIAN------------------------------------------------------DKVTGKVVL

Query:  -----DGLY---QLHLD-LPKSYKRPTAHIYEAMLSN----LVSSPSLESKYPSPVKFSCHFVFSVEKHGVKIIWH------QHLGQRIKIVQSEFGGEF
             +GLY     H   LP S       I   M SN    LV+S  ++   P PV F+  ++   +   +K   H        LG++IK+ QS++GGE+
Subjt:  -----DGLY---QLHLD-LPKSYKRPTAHIYEAMLSN----LVSSPSLESKYPSPVKFSCHFVFSVEKHGVKIIWH------QHLGQRIKIVQSEFGGEF

Query:  RPFATLLKTRGIEFRHPCSHTSEQNGIV-------------------MPLRYWWDAIVTTTYIINHLPTPILD
        R F+  LK  GI  RHPC  T EQNG+V                   MPL++W +A     Y+ N LPTP+L+
Subjt:  RPFATLLKTRGIEFRHPCSHTSEQNGIV-------------------MPLRYWWDAIVTTTYIINHLPTPILD

A0A803NU85 Uncharacterized protein3.9e-1426.45Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEI----------------------------------------------------ANDKVTGKVVL--
        GASNH+T D   +Q + +Y G E I++G+GS+L I                                                      ++ TG+VVL  
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEI----------------------------------------------------ANDKVTGKVVL--

Query:  ---DGLYQLH----LDLPKSYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFS--VEKHGVKI------------IWHQHLGQRIKIVQSEFGG
           DGLYQLH        KS   P A +     ++ V   SL+  +   +    + V +  ++   VK+             +      +IK +++++GG
Subjt:  ---DGLYQLH----LDLPKSYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFS--VEKHGVKI------------IWHQHLGQRIKIVQSEFGG

Query:  EFRPFATLLKTRGIEFRHPCSHTSEQNG-------------------IVMPLRYWWDAIVTTTYIINHLPTPILDH
        EF+ F+ L+   GI F H C HTSEQNG                     +PL+YW DA  T  Y+IN LPT +L +
Subjt:  EFRPFATLLKTRGIEFRHPCSHTSEQNG-------------------IVMPLRYWWDAIVTTTYIINHLPTPILDH

A0A803NUC9 Uncharacterized protein7.8e-1528.02Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVVLDGLYQLHLDLPKSYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFSVE
        GASNH+T D   +Q + +Y+G E I++G+G +L I++       V  G+ Q     P   K    HI  ++  NL+S   L +     ++F   F    +
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVVLDGLYQLHLDLPKSYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFSVE

Query:  KHGVKIIWHQHL-------------------------------GQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG----------------
        +   K++ H+ L                               G+++K +++++GGE + F+TL+   GI F H C +T  QNG                
Subjt:  KHGVKIIWHQHL-------------------------------GQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG----------------

Query:  ---IVMPLRYWWDAIVTTTYIINHLPTPILDH
             MP +YW DA  T  Y IN LPT IL+H
Subjt:  ---IVMPLRYWWDAIVTTTYIINHLPTPILDH

A0A803NZ12 Uncharacterized protein2.9e-1729.81Show/hide
Query:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVV----LDGLYQLHL--DLPK-SYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSC
        G SNHVT + S++  + +Y G E ++VG+G+ L++ + ++ G       ++  Y +H   D  + ++  P  H  EA+L+       +E+K+        
Subjt:  GASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVV----LDGLYQLHL--DLPK-SYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSC

Query:  HFVFSVEKHGVKIIWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG-------------------IVMPLRYWWDAIVTTTYIINH
                            ++IK ++++ GGE++ F+ L++  GIEF H C HTS QNG                     MPL+YW DA  T  Y+IN 
Subjt:  HFVFSVEKHGVKIIWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNG-------------------IVMPLRYWWDAIVTTTYIINH

Query:  LPTPILDH
        LPTPIL H
Subjt:  LPTPILDH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATGAGTGACTACCTGAGTGTTATGAAACAAGCTGCATACAACTTAGCTCTAGCTGGAGCACCCGTTGGAACAGATGATCTCACTTCCTACGTGATTTCTGGACT
TGACACAAAATATTTGTCGATAACCTGTCAAATCAACAAGGATGAACTGACATGTGGAGCATCCAACCACGTCACTCTAGATCTATCTAATCTTCAAATTCAGTTCGATT
ATCAGGGTAATGAATATATTTCAGTGGGTAATGGTTCTAGACTTGAGATTGCTAATGACAAGGTAACAGGGAAGGTGGTGCTTGATGGCCTCTACCAACTTCATTTAGAT
CTACCCAAGTCCTACAAAAGGCCAACAGCTCATATCTATGAAGCCATGTTATCCAACTTAGTGTCTAGCCCGAGTCTCGAGTCTAAGTATCCAAGCCCTGTTAAGTTTTC
ATGTCATTTTGTGTTTAGTGTTGAAAAACATGGAGTAAAGATAATATGGCATCAACATTTAGGTCAAAGAATTAAAATTGTTCAAAGTGAATTCGGAGGAGAGTTTAGGC
CTTTTGCCACTCTTCTCAAAACCAGAGGGATAGAATTTCGACACCCTTGCTCCCATACTAGTGAACAAAATGGCATTGTTATGCCATTGAGGTATTGGTGGGATGCCATT
GTTACTACAACCTACATTATTAATCACCTACCTACACCTATCCTAGACCATTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATGAGTGACTACCTGAGTGTTATGAAACAAGCTGCATACAACTTAGCTCTAGCTGGAGCACCCGTTGGAACAGATGATCTCACTTCCTACGTGATTTCTGGACT
TGACACAAAATATTTGTCGATAACCTGTCAAATCAACAAGGATGAACTGACATGTGGAGCATCCAACCACGTCACTCTAGATCTATCTAATCTTCAAATTCAGTTCGATT
ATCAGGGTAATGAATATATTTCAGTGGGTAATGGTTCTAGACTTGAGATTGCTAATGACAAGGTAACAGGGAAGGTGGTGCTTGATGGCCTCTACCAACTTCATTTAGAT
CTACCCAAGTCCTACAAAAGGCCAACAGCTCATATCTATGAAGCCATGTTATCCAACTTAGTGTCTAGCCCGAGTCTCGAGTCTAAGTATCCAAGCCCTGTTAAGTTTTC
ATGTCATTTTGTGTTTAGTGTTGAAAAACATGGAGTAAAGATAATATGGCATCAACATTTAGGTCAAAGAATTAAAATTGTTCAAAGTGAATTCGGAGGAGAGTTTAGGC
CTTTTGCCACTCTTCTCAAAACCAGAGGGATAGAATTTCGACACCCTTGCTCCCATACTAGTGAACAAAATGGCATTGTTATGCCATTGAGGTATTGGTGGGATGCCATT
GTTACTACAACCTACATTATTAATCACCTACCTACACCTATCCTAGACCATTGTTGA
Protein sequenceShow/hide protein sequence
MKMSDYLSVMKQAAYNLALAGAPVGTDDLTSYVISGLDTKYLSITCQINKDELTCGASNHVTLDLSNLQIQFDYQGNEYISVGNGSRLEIANDKVTGKVVLDGLYQLHLD
LPKSYKRPTAHIYEAMLSNLVSSPSLESKYPSPVKFSCHFVFSVEKHGVKIIWHQHLGQRIKIVQSEFGGEFRPFATLLKTRGIEFRHPCSHTSEQNGIVMPLRYWWDAI
VTTTYIINHLPTPILDHC