; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010781 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010781
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:6471518..6472580
RNA-Seq ExpressionLag0010781
SyntenyLag0010781
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.8e-3139.15Show/hide
Query:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT + ++ L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV
        WLLGSMS  +L++M+ C++A+E                       L  +KKG++ L++YFLK+   VDAL +  + +S +DH+L IL  LG++Y S ++V
Subjt:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV

Query:  IFYHPDAQGTGE
        I    D+    E
Subjt:  IFYHPDAQGTGE

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]3.5e-2937.44Show/hide
Query:  SSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIAWLLGSMSNS
        SS + V +  + I   GNKI+ +KL +DNFLLWK QILT + ++ L++  + + E PSKYL +  SS     + PN  Y  W R + LI  WLLGSMS  
Subjt:  SSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIAWLLGSMSNS

Query:  LLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVIFYHPDAQG
        +L++M+ C++A+E                       L  +KKG++ L++YFLK++  VDAL +  + +S +DH+L ILV LG +Y S +++I    D+  
Subjt:  LLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVIFYHPDAQG

Query:  TGE
          E
Subjt:  TGE

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]5.4e-3039.6Show/hide
Query:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIA
        M S S+       E SS   +I    NKI+ +KL +DNFLLWK QILT + ++ L++ L+ +SE PSKYL +  SS     + PN  Y  W RQD LI +
Subjt:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV
        WLLGSMS  +L++M+ C++A+E                       L  +KK ++ L++YFLK+++ VDAL +  + +S +DH+L IL  LG++Y S ++V
Subjt:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV

Query:  IF
        IF
Subjt:  IF

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.8e-3139.15Show/hide
Query:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT + ++ L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV
        WLLGSMS  +L++M+ C++A+E                       L  +KKG++ L++YFLK+   VDAL +  + +S +DH+L IL  LG++Y S ++V
Subjt:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV

Query:  IFYHPDAQGTGE
        I    D+    E
Subjt:  IFYHPDAQGTGE

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.8e-3442.39Show/hide
Query:  QTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYL-----ANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMEC
        Q  K INPG+K++ ++L++DN LLWK QI T +  +GL+ ++D + + P++++      +  SS   N AY  W++QD LI AWLLGSM+  +LS+M++C
Subjt:  QTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYL-----ANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMEC

Query:  ETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI
        ++ARE                       LE  KKGNL L+DYFLK+KNLVD+L   G+K+S EDH++ IL  LG E+D+ ++VI
Subjt:  ETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-3139.15Show/hide
Query:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT + ++ L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV
        WLLGSMS  +L++M+ C++A+E                       L  +KKG++ L++YFLK+   VDAL +  + +S +DH+L IL  LG++Y S ++V
Subjt:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV

Query:  IFYHPDAQGTGE
        I    D+    E
Subjt:  IFYHPDAQGTGE

A0A5A7UB21 Keratin, type II cytoskeletal 1-like1.7e-2937.44Show/hide
Query:  SSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIAWLLGSMSNS
        SS + V +  + I   GNKI+ +KL +DNFLLWK QILT + ++ L++  + + E PSKYL +  SS     + PN  Y  W R + LI  WLLGSMS  
Subjt:  SSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIAWLLGSMSNS

Query:  LLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVIFYHPDAQG
        +L++M+ C++A+E                       L  +KKG++ L++YFLK++  VDAL +  + +S +DH+L ILV LG +Y S +++I    D+  
Subjt:  LLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVIFYHPDAQG

Query:  TGE
          E
Subjt:  TGE

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like2.6e-3039.6Show/hide
Query:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIA
        M S S+       E SS   +I    NKI+ +KL +DNFLLWK QILT + ++ L++ L+ +SE PSKYL +  SS     + PN  Y  W RQD LI +
Subjt:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSS-----KIPNSAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV
        WLLGSMS  +L++M+ C++A+E                       L  +KK ++ L++YFLK+++ VDAL +  + +S +DH+L IL  LG++Y S ++V
Subjt:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV

Query:  IF
        IF
Subjt:  IF

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-3139.15Show/hide
Query:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT + ++ L++ L+ +SE PSKYL + +SS       PN AY  W RQD LI +
Subjt:  MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSK-----IPNSAYDHWVRQDSLIIA

Query:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV
        WLLGSMS  +L++M+ C++A+E                       L  +KKG++ L++YFLK+   VDAL +  + +S +DH+L IL  LG++Y S ++V
Subjt:  WLLGSMSNSLLSEMMECETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNV

Query:  IFYHPDAQGTGE
        I    D+    E
Subjt:  IFYHPDAQGTGE

A0A6J1DLT9 uncharacterized protein LOC1110217572.3e-3442.39Show/hide
Query:  QTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYL-----ANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMEC
        Q  K INPG+K++ ++L++DN LLWK QI T +  +GL+ ++D + + P++++      +  SS   N AY  W++QD LI AWLLGSM+  +LS+M++C
Subjt:  QTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYL-----ANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMEC

Query:  ETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI
        ++ARE                       LE  KKGNL L+DYFLK+KNLVD+L   G+K+S EDH++ IL  LG E+D+ ++VI
Subjt:  ETARE-----------------------LELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-0723.53Show/hide
Query:  SSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEM
        + E+ +++ ++  +N  N     KL   N+L+W  Q+      + L   LD  + +P   +   D++   N  Y  W RQD LI + +LG++S S+   +
Subjt:  SSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEM

Query:  MECETAREL-ELLK----------------------KGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI
            TA ++ E L+                      KG   ++DY   +    D L   G+ + H++ V ++L  L  EY   ++ I
Subjt:  MECETAREL-ELLK----------------------KGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.7e-0625.81Show/hide
Query:  NKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMECETAREL-ELLKK--
        N     KL   N+L+W  Q+      + L   LD  + +P   +   D+    N  Y  W RQD LI + +LG++S S+   +    TA ++ E L+K  
Subjt:  NKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMECETAREL-ELLKK--

Query:  -----GNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI
             G++      L+     D L   G+ + H++ V ++L  L  +Y   ++ I
Subjt:  -----GNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.2e-0420.36Show/hide
Query:  MKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYD-HWVRQDSLIIAWLLGSMS-NSLLSEMMECETARELELLKK------
        + ++E N+  W+   LT   S  +  H+              D + +P +A D +W ++D ++   L G+++        +   T+R++ L  K      
Subjt:  MKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYD-HWVRQDSLIIAWLLGSMS-NSLLSEMMECETARELELLKK------

Query:  -----------------GNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI
                         G++++ DY+ K+K L D+L      ++  + V+ +L  L  ++D+ +NVI
Subjt:  -----------------GNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.5e-0525.15Show/hide
Query:  TMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMECE-TARELELLKKG-----
        T+ L++ N+ +W+    T   S G+  H+D            G S+  P +    W  +D L+  W+ G++++SLL  +++   TAR+L L  +      
Subjt:  TMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMMECE-TARELELLKKG-----

Query:  ------------------NLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI
                          +L + +Y  K+K+L D L      IS    V+ +L  L  +YD  +NVI
Subjt:  ------------------NLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCTCATTCAACAGAGAAAACCAGTTCTGAGATTGAAGTGTCTTCCCAGACGATGAAGATCATTAATCCAGGCAATAAAATCACGACGATGAAGCTCGACGAAGA
CAATTTTCTTCTCTGGAAGCTTCAAATTCTTACTACTATACCAAGCCATGGGTTGAAGCACCATCTCGATGAAGATTCTGAAATTCCTTCGAAGTACCTTGCAAATGGTG
ATTCTTCGAAAATCCCTAACTCTGCGTATGACCATTGGGTTCGACAGGATAGCCTCATTATCGCCTGGTTACTCGGCTCAATGTCCAACTCTTTGCTCTCCGAAATGATG
GAATGCGAAACCGCTCGAGAACTGGAATTGCTCAAGAAAGGTAATCTTAAGCTTGAAGATTACTTTCTGAAAGTTAAGAATCTCGTTGACGCATTAAATGCTACTGGAAG
AAAGATTTCTCATGAGGATCATGTGTTAAAGATTCTTGTTGAACTAGGAACTGAATACGACTCAACTGTGAATGTGATTTTTTACCACCCCGATGCACAAGGAACTGGCG
AGGCCAATCGGGTAGAAATCGGATTGGGAGATGGACCCGAACTCTCTACTCTCTTTCTCTTGCTCTCTAGTCCTCATACTTTCGTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCTCATTCAACAGAGAAAACCAGTTCTGAGATTGAAGTGTCTTCCCAGACGATGAAGATCATTAATCCAGGCAATAAAATCACGACGATGAAGCTCGACGAAGA
CAATTTTCTTCTCTGGAAGCTTCAAATTCTTACTACTATACCAAGCCATGGGTTGAAGCACCATCTCGATGAAGATTCTGAAATTCCTTCGAAGTACCTTGCAAATGGTG
ATTCTTCGAAAATCCCTAACTCTGCGTATGACCATTGGGTTCGACAGGATAGCCTCATTATCGCCTGGTTACTCGGCTCAATGTCCAACTCTTTGCTCTCCGAAATGATG
GAATGCGAAACCGCTCGAGAACTGGAATTGCTCAAGAAAGGTAATCTTAAGCTTGAAGATTACTTTCTGAAAGTTAAGAATCTCGTTGACGCATTAAATGCTACTGGAAG
AAAGATTTCTCATGAGGATCATGTGTTAAAGATTCTTGTTGAACTAGGAACTGAATACGACTCAACTGTGAATGTGATTTTTTACCACCCCGATGCACAAGGAACTGGCG
AGGCCAATCGGGTAGAAATCGGATTGGGAGATGGACCCGAACTCTCTACTCTCTTTCTCTTGCTCTCTAGTCCTCATACTTTCGTTTTCTGA
Protein sequenceShow/hide protein sequence
MESHSTEKTSSEIEVSSQTMKIINPGNKITTMKLDEDNFLLWKLQILTTIPSHGLKHHLDEDSEIPSKYLANGDSSKIPNSAYDHWVRQDSLIIAWLLGSMSNSLLSEMM
ECETARELELLKKGNLKLEDYFLKVKNLVDALNATGRKISHEDHVLKILVELGTEYDSTVNVIFYHPDAQGTGEANRVEIGLGDGPELSTLFLLLSSPHTFVF