; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021075 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021075
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr7:4430614..4431234
RNA-Seq ExpressionLag0021075
SyntenyLag0021075
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]4.9e-4451.06Show/hide
Query:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ-PPRINPEFEVWYERDQALITLINAT
        SSS +   S+   +S S + LL+NICNL+S+RLDSTN+VLW+FQ++ +LK+HKLF +VDG+   P+         PP+ NP +E W  +DQAL+T+INAT
Subjt:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ-PPRINPEFEVWYERDQALITLINAT

Query:  LTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS
        L+  AL+YV+G  +SK+VWD L K +SS +R+N+V LK++LQ++  K  E++D Y++RIKEI +KLA VS  I+ EDL+IY +NGLP+
Subjt:  LTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]9.9e-4551.87Show/hide
Query:  SSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQP-PRINPEFEVWYERDQALITLINATL
        SS S+  S+  +N  S + LL+NICNL+S++LDSTNYVLW+FQ++ LLK+HKLF ++DG+            QP P  NP ++ W+ +DQAL+T+INATL
Subjt:  SSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQP-PRINPEFEVWYERDQALITLINATL

Query:  TQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS
        +  AL+YV+G  TSK+VW+ L K +SSS+R+N+V LK++LQ++S KS E++D Y++RIKEI +KLA VS +++ EDL+IY +NGLP+
Subjt:  TQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]9.9e-4551.87Show/hide
Query:  SSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQP-PRINPEFEVWYERDQALITLINATL
        SS S+  S+  +N  S + LL+NICNL+S++LDSTNYVLW+FQ++ LLK+HKLF ++DG+            QP P  NP ++ W+ +DQAL+T+INATL
Subjt:  SSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQP-PRINPEFEVWYERDQALITLINATL

Query:  TQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS
        +  AL+YV+G  TSK+VW+ L K +SSS+R+N+V LK++LQ++S KS E++D Y++RIKEI +KLA VS +++ EDL+IY +NGLP+
Subjt:  TQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]1.5e-4552.41Show/hide
Query:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV----RVEGQP------PRINPEFEVWYERDQALITLINA
        +N   + +S + LL+NICNLVS+RLDST+++LW+FQ++ +LK+HKLF ++DGS+ AP   +      E QP      P INP FE W  +DQAL+TLINA
Subjt:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV----RVEGQP------PRINPEFEVWYERDQALITLINA

Query:  TLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGL
        TL+  AL+YV+   TSK+VW+ LEKH+SS++RTN+V LK++LQS+  K+ E++D YV+RIKEI +K A VS+ I+ E L+IY +NGL
Subjt:  TLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGL

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]3.1e-4651.37Show/hide
Query:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV-----RVEGQPPRINPEFEVWYERDQALITLINATLTQT
        S P  + +S + LL+NICNLVS+RLDS+N+VLW+FQ++ +LK+HKL+ ++DGS   P   +          PP  NP F  W  +D AL+TL+NA L+ +
Subjt:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV-----RVEGQPPRINPEFEVWYERDQALITLINATLTQT

Query:  ALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLP
        AL+YV+GC +S++VW  L KH+SSS+RTN+V LK++LQS+S K G ++D YV+RIKE+ +KLA V V++D+EDL+IYT+N LP
Subjt:  ALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLP

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.0e-4349.47Show/hide
Query:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ---PPRINPEFEVWYERDQALITLIN
        SSS +   S+   +S S + LL+NICNL+S+RLDSTN+VLW+FQ++ +LK+HKL+ ++DG+   P            PP+ NP +E W  +DQAL+T+IN
Subjt:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ---PPRINPEFEVWYERDQALITLIN

Query:  ATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS
        ATL+  AL+YV+G  +SK+VWD L K +SS +R+N+V LK++LQ++  K  E++D Y++RIKEI +KLA VS  I+ EDL+IY +NGLP+
Subjt:  ATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X12.0e-4349.47Show/hide
Query:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ---PPRINPEFEVWYERDQALITLIN
        SSS +   S+   +S S + LL+NICNL+S+RLDSTN+VLW+FQ++ +LK+HKL+ ++DG+   P            PP+ NP +E W  +DQAL+T+IN
Subjt:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ---PPRINPEFEVWYERDQALITLIN

Query:  ATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS
        ATL+  AL+YV+G  +SK+VWD L K +SS +R+N+V LK++LQ++  K  E++D Y++RIKEI +KLA VS  I+ EDL+IY +NGLP+
Subjt:  ATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS

A0A5D3CLI6 T4.52.0e-4349.47Show/hide
Query:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ---PPRINPEFEVWYERDQALITLIN
        SSS +   S+   +S S + LL+NICNL+S+RLDSTN+VLW+FQ++ +LK+HKL+ ++DG+   P            PP+ NP +E W  +DQAL+T+IN
Subjt:  SSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQ---PPRINPEFEVWYERDQALITLIN

Query:  ATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS
        ATL+  AL+YV+G  +SK+VWD L K +SS +R+N+V LK++LQ++  K  E++D Y++RIKEI +KLA VS  I+ EDL+IY +NGLP+
Subjt:  ATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS

A0A6J1D9L6 uncharacterized protein LOC1110188927.4e-4652.41Show/hide
Query:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV----RVEGQP------PRINPEFEVWYERDQALITLINA
        +N   + +S + LL+NICNLVS+RLDST+++LW+FQ++ +LK+HKLF ++DGS+ AP   +      E QP      P INP FE W  +DQAL+TLINA
Subjt:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV----RVEGQP------PRINPEFEVWYERDQALITLINA

Query:  TLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGL
        TL+  AL+YV+   TSK+VW+ LEKH+SS++RTN+V LK++LQS+  K+ E++D YV+RIKEI +K A VS+ I+ E L+IY +NGL
Subjt:  TLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGL

A0A6J1E049 uncharacterized protein LOC1110251501.5e-4651.37Show/hide
Query:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV-----RVEGQPPRINPEFEVWYERDQALITLINATLTQT
        S P  + +S + LL+NICNLVS+RLDS+N+VLW+FQ++ +LK+HKL+ ++DGS   P   +          PP  NP F  W  +D AL+TL+NA L+ +
Subjt:  SNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIV-----RVEGQPPRINPEFEVWYERDQALITLINATLTQT

Query:  ALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLP
        AL+YV+GC +S++VW  L KH+SSS+RTN+V LK++LQS+S K G ++D YV+RIKE+ +KLA V V++D+EDL+IYT+N LP
Subjt:  ALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLP

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-0826.53Show/hide
Query:  WRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQPPRINPEFEVWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTEL
        W+ ++  LL    L K +D   K P+ +            + E W + D+   + I   L+   ++ +I   T++ +W RLE  + S T TN + LK +L
Subjt:  WRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQPPRINPEFEVWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTEL

Query:  QSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS
         ++ M  G     ++     ++ +LA + V I+ ED  I  +N LPS
Subjt:  QSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-1224.56Show/hide
Query:  SSVSLLN-NICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQPPRINPEFEVWYERDQALITLINATLTQTALSYVIGCQTSK
        ++ S+LN N+ N+   +L STNY++W  Q+  L   ++L  ++DGS   P A +  +   PR+NP++  W  +D+ + + +   ++ +    V    T+ 
Subjt:  SSVSLLN-NICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQPPRINPEFEVWYERDQALITLINATLTQTALSYVIGCQTSK

Query:  EVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLP
        ++W+ L K +++ +  ++  L+T+L+  + K  +T+D Y++ +    ++LA +   +D ++ +   +  LP
Subjt:  EVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-0825.62Show/hide
Query:  SVSLLN-NICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQPPRINPEFEVWYERDQALITLINATLTQTALSYVIGCQTSKE
        + ++LN N+ N+   +L STNY++W  Q+  L   ++L  ++DGS   P A +  +   PR+NP++  W  +D+ + + I   ++ +    V    T+ +
Subjt:  SVSLLN-NICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQPPRINPEFEVWYERDQALITLINATLTQTALSYVIGCQTSKE

Query:  VWDRLEKHFSSSTRTNIVGLK
        +W+ L K +++ +  ++  L+
Subjt:  VWDRLEKHFSSSTRTNIVGLK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGATCACTGAAGTTGATCCCTCTACTGAAGTTTCCCGAAGTTCGAGTGAATCTGCCCTAATTTCAAATCCGAGTTCAAATTCAAATTCCTCGGTTTCTCTCCTCAA
CAACATCTGCAATTTGGTTTCCGTGCGACTCGATTCGACGAATTATGTTCTTTGGAGGTTTCAAATTTCTCCTCTTCTGAAATCGCACAAACTTTTCAAGTATGTTGATG
GCTCAATCAAGGCTCCTGAAGCTATTGTTAGGGTTGAAGGACAGCCTCCTCGCATTAATCCTGAGTTCGAGGTTTGGTATGAACGTGATCAGGCTCTTATCACACTGATC
AACGCTACCTTGACGCAAACTGCCTTATCGTATGTTATTGGTTGTCAAACTTCCAAAGAGGTTTGGGATCGGTTGGAGAAACACTTTTCGTCATCTACTCGTACTAATAT
TGTTGGCTTGAAGACCGAATTACAGAGCGTTTCCATGAAGTCTGGTGAAACGGTTGATGTGTATGTTCGTCGAATTAAAGAAATAGTTAACAAGTTGGCTGCTGTCTCTG
TCATTATTGATTCCGAGGACCTCATAATCTACACTGTCAATGGTCTTCCATCTGGTGGTTATTATCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGACGATCACTGAAGTTGATCCCTCTACTGAAGTTTCCCGAAGTTCGAGTGAATCTGCCCTAATTTCAAATCCGAGTTCAAATTCAAATTCCTCGGTTTCTCTCCTCAA
CAACATCTGCAATTTGGTTTCCGTGCGACTCGATTCGACGAATTATGTTCTTTGGAGGTTTCAAATTTCTCCTCTTCTGAAATCGCACAAACTTTTCAAGTATGTTGATG
GCTCAATCAAGGCTCCTGAAGCTATTGTTAGGGTTGAAGGACAGCCTCCTCGCATTAATCCTGAGTTCGAGGTTTGGTATGAACGTGATCAGGCTCTTATCACACTGATC
AACGCTACCTTGACGCAAACTGCCTTATCGTATGTTATTGGTTGTCAAACTTCCAAAGAGGTTTGGGATCGGTTGGAGAAACACTTTTCGTCATCTACTCGTACTAATAT
TGTTGGCTTGAAGACCGAATTACAGAGCGTTTCCATGAAGTCTGGTGAAACGGTTGATGTGTATGTTCGTCGAATTAAAGAAATAGTTAACAAGTTGGCTGCTGTCTCTG
TCATTATTGATTCCGAGGACCTCATAATCTACACTGTCAATGGTCTTCCATCTGGTGGTTATTATCAATAA
Protein sequenceShow/hide protein sequence
MTITEVDPSTEVSRSSSESALISNPSSNSNSSVSLLNNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPEAIVRVEGQPPRINPEFEVWYERDQALITLI
NATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTRTNIVGLKTELQSVSMKSGETVDVYVRRIKEIVNKLAAVSVIIDSEDLIIYTVNGLPSGGYYQ