CuGenDBv2

Lag0015043 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015043
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr12:7031136..7031522
RNA-Seq Expression: Lag0015043
Synteny: Lag0015043
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.4e-25; % identity: 53.66)
Query:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALA
        S++ S E +L SPIVLL+NI NLI+++LDS+NYVLW FQL+ LL+AHKLFG+ DG+   P            NP+++ WF KDQAL+T+I+ATLS  ALA
Subjt:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALA

Query:  YVVACSTSQQIWSKLQQHFSSST
        YVV  +TS+Q+W+ L + +SSS+
Subjt:  YVVACSTSQQIWSKLQQHFSSST

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.4e-25; % identity: 53.66)
Query:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALA
        S++ S E +L SPIVLL+NI NLI+++LDS+NYVLW FQL+ LL+AHKLFG+ DG+   P            NP+++ WF KDQAL+T+I+ATLS  ALA
Subjt:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALA

Query:  YVVACSTSQQIWSKLQQHFSSST
        YVV  +TS+Q+W+ L + +SSS+
Subjt:  YVVACSTSQQIWSKLQQHFSSST

TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa] (e value: 6.4e-26; % identity: 46.85)
Query:  MDLTGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQ---------------PNPAFETWF
        M    S++ S  + LSSP+ LLTNI NLI++RLDS+NY LW FQ  P+L+AHKL+G+ D SI  PPK I                       NP +E W 
Subjt:  MDLTGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQ---------------PNPAFETWF

Query:  EKDQALITLIDATLSQSALAYVVACSTSQQIWSKLQQHFSSST
         KDQA + LI+ATLS  AL YVV C +S Q+W  L++H+SS+T
Subjt:  EKDQALITLIDATLSQSALAYVVACSTSQQIWSKLQQHFSSST

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] (e value: 5.4e-25; % identity: 47.76)
Query:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVV--QP---------NPAFETWFEKDQALITL
        S++ + ++ L SPI LL+NI NL+++RLDS++++LW FQL+ +L+AHKLFG+ DGS+  P + +        QP         NP FE W  KDQAL+TL
Subjt:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVV--QP---------NPAFETWFEKDQALITL

Query:  IDATLSQSALAYVVACSTSQQIWSKLQQHFSSST
        I+ATLS  ALAYVV   TS+Q+W  L++H+SS++
Subjt:  IDATLSQSALAYVVACSTSQQIWSKLQQHFSSST

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia] (e value: 2.4e-25; % identity: 52.46)
Query:  QSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGS------IFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAY
        + LSSPI LL+NI NL+++RLDSSN+VLW FQL+ +L+AHKL+G+ DGS        V   +         NPAF  W  KD AL+TL++A LS SALAY
Subjt:  QSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGS------IFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAY

Query:  VVACSTSQQIWSKLQQHFSSST
        VV C +SQQ+W  L +H+SSS+
Subjt:  VVACSTSQQIWSKLQQHFSSST

TrEMBL top hits
A0A5B7C9B1 Retrotran_gag_3 domain-containing protein (e value: 2.9e-24; % identity: 50)
Query:  TGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDG---VVQPNPAFETWFEKDQALITLIDATLS
        + +++ S   S  SPI LL+NI NLIT+ LDS+NYV W FQ+S +LRAH L GY DGS+  P K I  +        NP +  W   DQAL+TLI+ATLS
Subjt:  TGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDG---VVQPNPAFETWFEKDQALITLIDATLS

Query:  QSALAYVVACSTSQQIWSKLQQHFSSST
         SAL YV+  STS+++W  L++ FSSS+
Subjt:  QSALAYVVACSTSQQIWSKLQQHFSSST

A0A5D3CLI6 T4.5 (e value: 4.9e-24; % identity: 49.6)
Query:  AKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGV----VQPNPAFETWFEKDQALITLIDATLSQSA
        + SAE+   SPI LL+NI NLI++RLDS+N+VLW FQL+ +L+AHKL+G+ DG+   PP+            Q NP++E W  KDQAL+T+I+ATLS  A
Subjt:  AKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGV----VQPNPAFETWFEKDQALITLIDATLSQSA

Query:  LAYVVACSTSQQIWSKLQQHFSSST
        LAYVV  ++S+Q+W  L + +SS +
Subjt:  LAYVVACSTSQQIWSKLQQHFSSST

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein (e value: 3.1e-26; % identity: 46.85)
Query:  MDLTGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQ---------------PNPAFETWF
        M    S++ S  + LSSP+ LLTNI NLI++RLDS+NY LW FQ  P+L+AHKL+G+ D SI  PPK I                       NP +E W 
Subjt:  MDLTGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQ---------------PNPAFETWF

Query:  EKDQALITLIDATLSQSALAYVVACSTSQQIWSKLQQHFSSST
         KDQA + LI+ATLS  AL YVV C +S Q+W  L++H+SS+T
Subjt:  EKDQALITLIDATLSQSALAYVVACSTSQQIWSKLQQHFSSST

A0A6J1D9L6 uncharacterized protein LOC111018892 (e value: 2.6e-25; % identity: 47.76)
Query:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVV--QP---------NPAFETWFEKDQALITL
        S++ + ++ L SPI LL+NI NL+++RLDS++++LW FQL+ +L+AHKLFG+ DGS+  P + +        QP         NP FE W  KDQAL+TL
Subjt:  STAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVV--QP---------NPAFETWFEKDQALITL

Query:  IDATLSQSALAYVVACSTSQQIWSKLQQHFSSST
        I+ATLS  ALAYVV   TS+Q+W  L++H+SS++
Subjt:  IDATLSQSALAYVVACSTSQQIWSKLQQHFSSST

A0A6J1E049 uncharacterized protein LOC111025150 (e value: 1.2e-25; % identity: 52.46)
Query:  QSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGS------IFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAY
        + LSSPI LL+NI NL+++RLDSSN+VLW FQL+ +L+AHKL+G+ DGS        V   +         NPAF  W  KD AL+TL++A LS SALAY
Subjt:  QSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGS------IFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAY

Query:  VVACSTSQQIWSKLQQHFSSST
        VV C +SQQ+W  L +H+SSS+
Subjt:  VVACSTSQQIWSKLQQHFSSST

SwissProt top hits
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 5.1e-10; % identity: 29.41)
Query:  AKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAYV
        A + E  L++  +L  N+SN+   +L S+NY++W+ Q+  L   ++L G+ DGS  +PP  I  D   + NP +  W  +D+ + + +   +S S    V
Subjt:  AKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAYV

Query:  VACSTSQQIWSKLQQHFSS
           +T+ QIW  L++ +++
Subjt:  VACSTSQQIWSKLQQHFSS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 8.7e-10; % identity: 31.78)
Query:  VLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAYVVACSTSQQIWSK
        +L  N+SN+   +L S+NY++W+ Q+  L   ++L G+ DGS  +PP  I  D V + NP +  W  +D+ + + I   +S S    V   +T+ QIW  
Subjt:  VLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAYVVACSTSQQIWSK

Query:  LQQHFSS
        L++ +++
Subjt:  LQQHFSS
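
The % identity values listed for each hit can be related to the alignment rows themselves. The following is a minimal sketch, assuming the usual BLAST-style convention that a letter in the middle row marks an identical residue and "+" marks a conservative substitution. The function name percent_identity is hypothetical and is not part of CuGenDB. Because whitespace in the middle row may not survive copying, the alignment length is taken from the Query rows instead. The rows are those of the Q9ZT94 (RE2) hit.

```python
# Rows copied from the Q9ZT94 (RE2) alignment in this record.
query_rows = [
    "VLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALAYVVACSTSQQIWSK",
    "LQQHFSS",
]
midline_rows = [
    "+L  N+SN+   +L S+NY++W+ Q+  L   ++L G+ DGS  +PP  I  D V + NP +  W  +D+ + + I   +S S    V   +T+ QIW  ",
    "L++ +++",
]


def percent_identity(query_rows, midline_rows):
    """Identical columns (letters in the middle row) over aligned columns."""
    # Gap characters in the Query rows count as aligned columns.
    aligned_columns = sum(len(row) for row in query_rows)
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned_columns


pct = percent_identity(query_rows, midline_rows)
print(f"{pct:.2f}")
```

Counting only letters keeps the result independent of how many spaces the middle row retained.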

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGATCTCACTGGTTCAACCGCAAAATCTGCAGAACAATCGCTTTCTTCGCCGATTGTCCTTCTCACTAATATTAGCAATTTAATCACGGTACGTTTGGATTCT
TCCAACTATGTACTTTGGAACTTCCAGCTATCGCCTCTGTTACGAGCACACAAACTTTTTGGCTATTTCGATGGATCAATTTTTGTTCCGCCTAAGGAAATCAAA
ATTGACGGTGTTGTTCAACCTAATCCTGCCTTTGAGACCTGGTTTGAGAAGGATCAAGCCTTGATAACGTTGATCGACGCCACTTTGTCGCAGTCCGCCCTCGCC
TATGTCGTTGCCTGCTCTACGTCTCAGCAGATCTGGTCCAAACTTCAACAACATTTTTCTTCTTCAACATGA
mRNA sequence
ATGGATCTCACTGGTTCAACCGCAAAATCTGCAGAACAATCGCTTTCTTCGCCGATTGTCCTTCTCACTAATATTAGCAATTTAATCACGGTACGTTTGGATTCT
TCCAACTATGTACTTTGGAACTTCCAGCTATCGCCTCTGTTACGAGCACACAAACTTTTTGGCTATTTCGATGGATCAATTTTTGTTCCGCCTAAGGAAATCAAA
ATTGACGGTGTTGTTCAACCTAATCCTGCCTTTGAGACCTGGTTTGAGAAGGATCAAGCCTTGATAACGTTGATCGACGCCACTTTGTCGCAGTCCGCCCTCGCC
TATGTCGTTGCCTGCTCTACGTCTCAGCAGATCTGGTCCAAACTTCAACAACATTTTTCTTCTTCAACATGA
Protein sequence
MDLTGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALA
YVVACSTSQQIWSKLQQHFSSST
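
The three sequences can be cross-checked against each other and against the genome location. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The names CODON_TABLE and translate are hypothetical helpers and are not part of CuGenDB. It translates the CDS listed in this record, compares the result with the listed protein sequence, and compares the CDS length with the span chr12:7031136..7031522.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein copied from this record (line breaks removed).
CDS = (
    "ATGGATCTCACTGGTTCAACCGCAAAATCTGCAGAACAATCGCTTTCTTCGCCGATTGTCCTTCTCACTAATATTAGCAATTTAATCACGGTACGTTTGGATTCT"
    "TCCAACTATGTACTTTGGAACTTCCAGCTATCGCCTCTGTTACGAGCACACAAACTTTTTGGCTATTTCGATGGATCAATTTTTGTTCCGCCTAAGGAAATCAAA"
    "ATTGACGGTGTTGTTCAACCTAATCCTGCCTTTGAGACCTGGTTTGAGAAGGATCAAGCCTTGATAACGTTGATCGACGCCACTTTGTCGCAGTCCGCCCTCGCC"
    "TATGTCGTTGCCTGCTCTACGTCTCAGCAGATCTGGTCCAAACTTCAACAACATTTTTCTTCTTCAACATGA"
)
PROTEIN = (
    "MDLTGSTAKSAEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPLLRAHKLFGYFDGSIFVPPKEIKIDGVVQPNPAFETWFEKDQALITLIDATLSQSALA"
    "YVVACSTSQQIWSKLQQHFSSST"
)


def translate(cds):
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)
span_length = 7031522 - 7031136 + 1  # inclusive genome coordinates

print("CDS length equals genomic span:", len(CDS) == span_length)
print("Translation matches protein:", translated.rstrip("*") == PROTEIN)
```

A CDS length equal to the genomic span is consistent with a single-exon gene model, which would also explain why the CDS and mRNA sequences listed above are identical.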