; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001242 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001242
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:27624600..27625827
RNA-Seq ExpressionLag0001242
SyntenyLag0001242
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB5561215.1 hypothetical protein DKX38_006172 [Salix brachista]4.6e-0825.59Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVS------------DTR----
        M DE+W +LD + +  IR+ L+  VA  V    T   LM  L+  WE M+TT+SNS   + L + ++ DL + EE+RR+ +            +TR    
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVS------------DTR----

Query:  -----------------------------CSNHSSDWILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLEMIPGSVSWPQESTLYKCQLNV
                                      S+    W+LDS AS H    +++  ++  G +G+V +   EP ++ G+  +   +  P  ST    Q+  
Subjt:  -----------------------------CSNHSSDWILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLEMIPGSVSWPQESTLYKCQLNV

Query:  AKGSKRQWMSV
            K+  +SV
Subjt:  AKGSKRQWMSV

KAE8660039.1 hypothetical protein F3Y22_tig00116959pilonHSYRG00493 [Hibiscus syriacus]2.2e-1035.57Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS---DWILDSA
        M +EDW  LD +A+  IR+ LS +VA  +A   T   LM AL+NSW    T VS+S+ NN LKF +V DL  A  +  +  D    + +S    WILDS 
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS---DWILDSA

Query:  ASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLE----MIPGSVSW
        AS H    + +  ++  G  G V +   E  K+ G       +P   +W
Subjt:  ASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLE----MIPGSVSW

KAG6525408.1 hypothetical protein ZIOFF_015364 [Zingiber officinale]3.3e-0634.26Show/hide
Query:  AVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS----------DWILDSAASVHIALDRSLFTSFTGGHHGLVRMGM
        A+ L+ +L N+WE M+  VSNS  N  L F++V D  +AEE+R   S    +++S+           W+LDS AS H    R +  ++  G+HG V +  
Subjt:  AVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS----------DWILDSAASVHIALDRSLFTSFTGGHHGLVRMGM

Query:  VEPPKLDG
         EP  + G
Subjt:  VEPPKLDG

RVX04508.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.9e-0732.03Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEAL---------TNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS-
        M  E+W  LD + +  IR+ LS  VA  V    T   LM+AL         T SWE M+  VSNST    LK++++ DL +AEEIRR+ +     + S+ 
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEAL---------TNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS-

Query:  -----------------------DWILDSAASVHIALDRSLFTSFTGGHHGLV
                               DW+LDS A  H    R +  ++  G  G V
Subjt:  -----------------------DWILDSAASVHIALDRSLFTSFTGGHHGLV

TMW80639.1 hypothetical protein EJD97_017495, partial [Solanum chilense]4.4e-1134.03Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVA-SLVAHVTT-----AVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQ-VSDTRCSNHSSD--
        MT+E W+  D +A+  IR+ LS +VA ++V   TT     A+ LM +L  SW+ +  T+S+S  +  LKF E+CD+ ++E IR+Q V D+  S  S D  
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVA-SLVAHVTT-----AVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQ-VSDTRCSNHSSD--

Query:  --------------------WILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLD--GLEMIPGSVSWPQESTLYKCQLNVAKGSK
                            WILDS AS H +  + LF +F  G+ G          KLD  G     G  SW     + KC + VA+G+K
Subjt:  --------------------WILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLD--GLEMIPGSVSWPQESTLYKCQLNVAKGSK

TrEMBL top hitse value%identityAlignment
A0A438J6C8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-0732.03Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEAL---------TNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS-
        M  E+W  LD + +  IR+ LS  VA  V    T   LM+AL         T SWE M+  VSNST    LK++++ DL +AEEIRR+ +     + S+ 
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEAL---------TNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS-

Query:  -----------------------DWILDSAASVHIALDRSLFTSFTGGHHGLV
                               DW+LDS A  H    R +  ++  G  G V
Subjt:  -----------------------DWILDSAASVHIALDRSLFTSFTGGHHGLV

A0A5N5N166 Uncharacterized protein2.2e-0825.59Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVS------------DTR----
        M DE+W +LD + +  IR+ L+  VA  V    T   LM  L+  WE M+TT+SNS   + L + ++ DL + EE+RR+ +            +TR    
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVS------------DTR----

Query:  -----------------------------CSNHSSDWILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLEMIPGSVSWPQESTLYKCQLNV
                                      S+    W+LDS AS H    +++  ++  G +G+V +   EP ++ G+  +   +  P  ST    Q+  
Subjt:  -----------------------------CSNHSSDWILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLEMIPGSVSWPQESTLYKCQLNV

Query:  AKGSKRQWMSV
            K+  +SV
Subjt:  AKGSKRQWMSV

A0A6A2XNT2 NB-ARC domain-containing protein1.1e-1035.57Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS---DWILDSA
        M +EDW  LD +A+  IR+ LS +VA  +A   T   LM AL+NSW    T VS+S+ NN LKF +V DL  A  +  +  D    + +S    WILDS 
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSS---DWILDSA

Query:  ASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLE----MIPGSVSW
        AS H    + +  ++  G  G V +   E  K+ G       +P   +W
Subjt:  ASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLDGLE----MIPGSVSW

A0A6N2AG08 Uncharacterized protein (Fragment)2.1e-1134.03Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVA-SLVAHVTT-----AVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQ-VSDTRCSNHSSD--
        MT+E W+  D +A+  IR+ LS +VA ++V   TT     A+ LM +L  SW+ +  T+S+S  +  LKF E+CD+ ++E IR+Q V D+  S  S D  
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVA-SLVAHVTT-----AVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQ-VSDTRCSNHSSD--

Query:  --------------------WILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLD--GLEMIPGSVSWPQESTLYKCQLNVAKGSK
                            WILDS AS H +  + LF +F  G+ G          KLD  G     G  SW     + KC + VA+G+K
Subjt:  --------------------WILDSAASVHIALDRSLFTSFTGGHHGLVRMGMVEPPKLD--GLEMIPGSVSWPQESTLYKCQLNVAKGSK

A0A7N2LJ68 Uncharacterized protein5.0e-0835.77Show/hide
Query:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSSDWILDS---A
        M  ++W  LD + +  I++ LS  VA  V    T   LM+AL+  WE M+  V+NSTR   LK++++ DL +AEEIR++ +    S  SS   L++    
Subjt:  MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSSDWILDS---A

Query:  ASVHIALDRSLFTSFTGGHHGLV
        AS HI   R +  ++  G  G V
Subjt:  ASVHIALDRSLFTSFTGGHHGLV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAAGCATAAGGATGTGTTTGTCAATGGATGTGGCAAGTCTAGTAGCCCATGTGACAACTGCAGTTAA
ATTGATGGAAGCGCTTACAAACAGTTGGGAAATGATGAAGACAACAGTGTCTAATTCGACTAGAAATAATACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTG
AGGAAATTCGTAGGCAAGTAAGTGACACAAGGTGTAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTACACATAGCTTTAGATAGGAGTTTGTTCACA
TCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGAATGGTAGAACCTCCAAAACTAGATGGATTGGAGATGATCCCAGGTAGTGTCAGTTGGCCACAGGAATCTAC
ACTGTACAAATGTCAGTTGAATGTTGCCAAAGGATCAAAGAGACAGTGGATGTCGGTTAAAGCTGCATATGGCAGTTGTAGAGGTACAGTTAAGCCAACAACAATGATAG
CCAATTTCGATCTGTCCAATCAAGATCCTTCAGTTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCATTGTTAGACGCTCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAAGCATAAGGATGTGTTTGTCAATGGATGTGGCAAGTCTAGTAGCCCATGTGACAACTGCAGTTAA
ATTGATGGAAGCGCTTACAAACAGTTGGGAAATGATGAAGACAACAGTGTCTAATTCGACTAGAAATAATACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTG
AGGAAATTCGTAGGCAAGTAAGTGACACAAGGTGTAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTACACATAGCTTTAGATAGGAGTTTGTTCACA
TCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGAATGGTAGAACCTCCAAAACTAGATGGATTGGAGATGATCCCAGGTAGTGTCAGTTGGCCACAGGAATCTAC
ACTGTACAAATGTCAGTTGAATGTTGCCAAAGGATCAAAGAGACAGTGGATGTCGGTTAAAGCTGCATATGGCAGTTGTAGAGGTACAGTTAAGCCAACAACAATGATAG
CCAATTTCGATCTGTCCAATCAAGATCCTTCAGTTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCATTGTTAGACGCTCGATGA
Protein sequenceShow/hide protein sequence
MTDEDWEALDEEAVASIRMCLSMDVASLVAHVTTAVKLMEALTNSWEMMKTTVSNSTRNNTLKFSEVCDLAIAEEIRRQVSDTRCSNHSSDWILDSAASVHIALDRSLFT
SFTGGHHGLVRMGMVEPPKLDGLEMIPGSVSWPQESTLYKCQLNVAKGSKRQWMSVKAAYGSCRGTVKPTTMIANFDLSNQDPSVQKQLGSPGEKVDGYRESPLLDAR