; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031272 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031272
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr11:6510203..6515209
RNA-Seq ExpressionLag0031272
SyntenyLag0031272
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056218.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.5e-0542.86Show/hide
Query:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPKFEGSHALRCSSFLTVE
        M  L+ K F E N D K+H+  PSRMKRK SV I  E        +PKLH APSP ELK    + SP  +   +FA+  SP  EG+ +L     LT +
Subjt:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPKFEGSHALRCSSFLTVE

KAA0065966.1 hypothetical protein E6C27_scaffold62G00430 [Cucumis melo var. makuwa]5.0e-0548.78Show/hide
Query:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPK
        + +L+ KLF E N D K+HS  PSRMKRK SV I TE        KPKLH APSP ELK         S  P  F+   SPK
Subjt:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPK

RUS92128.1 hypothetical protein EGW08_000152 [Elysia chlorotica]3.2e-0435.29Show/hide
Query:  SPNEIGDWSSRSDITASEFDHQANRPI-QEINKPTD-RSRRSTSQQADHPRDQQANRPI-KKINKSAGRSSKRINKLTSRSNRSSSQQAD-PRDQQANRP
        +P+  G  + R+  T    D + NRP  Q+ N+PTD ++ R T QQ + P DQQ NRP  ++ N+ A +          ++NR + QQA  P DQQANRP
Subjt:  SPNEIGDWSSRSDITASEFDHQANRPI-QEINKPTD-RSRRSTSQQADHPRDQQANRPI-KKINKSAGRSSKRINKLTSRSNRSSSQQAD-PRDQQANRP

Query:  IKKINKSAGRSSKRSTSQPTDQE-----DQQVSRPIIQE-----DQQANKPI-QEIITPTGRSSKKINKP
             +   R + + T++PTDQ+     DQQ +RP  Q+     DQQ N+P  Q+   PT + + +   P
Subjt:  IKKINKSAGRSSKRSTSQPTDQE-----DQQVSRPIIQE-----DQQANKPI-QEIITPTGRSSKKINKP

TrEMBL top hitse value%identityAlignment
A0A433UEB9 Uncharacterized protein1.6e-0435.29Show/hide
Query:  SPNEIGDWSSRSDITASEFDHQANRPI-QEINKPTD-RSRRSTSQQADHPRDQQANRPI-KKINKSAGRSSKRINKLTSRSNRSSSQQAD-PRDQQANRP
        +P+  G  + R+  T    D + NRP  Q+ N+PTD ++ R T QQ + P DQQ NRP  ++ N+ A +          ++NR + QQA  P DQQANRP
Subjt:  SPNEIGDWSSRSDITASEFDHQANRPI-QEINKPTD-RSRRSTSQQADHPRDQQANRPI-KKINKSAGRSSKRINKLTSRSNRSSSQQAD-PRDQQANRP

Query:  IKKINKSAGRSSKRSTSQPTDQE-----DQQVSRPIIQE-----DQQANKPI-QEIITPTGRSSKKINKP
             +   R + + T++PTDQ+     DQQ +RP  Q+     DQQ N+P  Q+   PT + + +   P
Subjt:  IKKINKSAGRSSKRSTSQPTDQE-----DQQVSRPIIQE-----DQQANKPI-QEIITPTGRSSKKINKP

A0A5A7UM99 Ty3-gypsy retrotransposon protein3.2e-0542.86Show/hide
Query:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPKFEGSHALRCSSFLTVE
        M  L+ K F E N D K+H+  PSRMKRK SV I  E        +PKLH APSP ELK    + SP  +   +FA+  SP  EG+ +L     LT +
Subjt:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPKFEGSHALRCSSFLTVE

A0A5A7VHY3 Uncharacterized protein2.4e-0548.78Show/hide
Query:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPK
        + +L+ KLF E N D K+HS  PSRMKRK SV I TE        KPKLH APSP ELK         S  P  F+   SPK
Subjt:  MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCTTCCCGTCACGTATGAAGAGGAAATTCTCTGTTCTCATAACTACAGA
AGGTTCCTTGAAGCTCCTTATCGCAAAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGAGAATGGAAGTTGCTTTCTCCAAGTTCGAAGGTTCCCACGC
GCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAA
ATTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGTCGCTT
CGCTGCAGTTCCTTCCTCCAAGTTGAAGGTTCTCACATCGCTTTGCTTCGCTCACGCGCTTCGCTGCATTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTG
CAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCAAATTCGAAGG
TTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTC
TTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTGAAGGTTCTCA
CGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCC
TCCAAGTTTGAAGGTTCTTACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTTCTTCTCCCTAA
GTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCCTTCCTCCA
AGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGATGGTTCCCTCACGCGCTTC
GCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTTGAAGGTTCTCTCACG
CGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGA
AGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCCCTCACGCACTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTC
TCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCT
GATGACGACCGTTGTAGGCGAGTCTGGTGATGAAGTCACTGCAAGTGAATCTGATGACAACCGTTGTAGGCGAGTCGAGTCTGGTGACCACCCTTGCAGGTTACTCAGAT
CACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTATCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGG
AGTGATATCACTGCAAGCGAATTTGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCC
AAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAAC
AGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAA
CAAGTCAGCAGGCCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGCCCAATAGGTC
GATCCAGGAGATCATCAACCTAACAAGCCGATCATCCAAGAAGATCAACAAGTCCAATAGGTCGATCCAGGAGATCATCAACCTAACAGACCGATCATCCAAGAAGATCA
ACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGTTGATCATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCTTCCCGTCACGTATGAAGAGGAAATTCTCTGTTCTCATAACTACAGA
AGGTTCCTTGAAGCTCCTTATCGCAAAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGAGAATGGAAGTTGCTTTCTCCAAGTTCGAAGGTTCCCACGC
GCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAA
ATTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGTCGCTT
CGCTGCAGTTCCTTCCTCCAAGTTGAAGGTTCTCACATCGCTTTGCTTCGCTCACGCGCTTCGCTGCATTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTG
CAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCAAATTCGAAGG
TTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTC
TTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTGAAGGTTCTCA
CGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCC
TCCAAGTTTGAAGGTTCTTACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTTCTTCTCCCTAA
GTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCCTTCCTCCA
AGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGATGGTTCCCTCACGCGCTTC
GCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTTGAAGGTTCTCTCACG
CGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGA
AGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCCCTCACGCACTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTC
TCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCT
GATGACGACCGTTGTAGGCGAGTCTGGTGATGAAGTCACTGCAAGTGAATCTGATGACAACCGTTGTAGGCGAGTCGAGTCTGGTGACCACCCTTGCAGGTTACTCAGAT
CACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTATCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGG
AGTGATATCACTGCAAGCGAATTTGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCC
AAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAAC
AGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAA
CAAGTCAGCAGGCCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGCCCAATAGGTC
GATCCAGGAGATCATCAACCTAACAAGCCGATCATCCAAGAAGATCAACAAGTCCAATAGGTCGATCCAGGAGATCATCAACCTAACAGACCGATCATCCAAGAAGATCA
ACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGTTGATCATCTAA
Protein sequenceShow/hide protein sequence
MNNLELKLFDEVNSDKKLHSSFPSRMKRKFSVLITTEGSLKLLIAKPKLHDAPSPHELKREWKLLSPSSKVPTRFAAVPSPKFEGSHALRCSSFLTVEGSHALRCSSFPQ
IRRFSRRFAAVPSSKFEGSHAFHCSSFLTVRRFSRRFAAVPSSKLKVLTSLCFAHALRCIPSSKFEGSHTLRSAIPSPKFEGSHALRAVPSSKFEGSHAFRCISFPQIRR
FSRASLQFLPPSSKVLTRFAAVPSSKFKGSHALRCSSFLQVRRFSRASLQFLPPSSKVLTRFAAVPSSKLKVLTRFVAVPSSQFEGSHALRCSSFPQVRRFSRRFAAVPS
SKFEGSYIASLRAALRCSSFLQVRRFSRASMQFLLPKFEGSHALRCSSFLPKFEVPSSKFEGSHALRCTVPSSKFEGSHALRCYLPPSSKVLSRAAAAPSSKFDGSLTRF
ARSFSKFEGASLHCSFSKFEGASLRCYLPPSLKVLSRAAAVPSSKFEGSLTRFARSFSKFEGASLRCYLPPSSKVLSRAAAVPSSKFEGSLTHFARSFSKFEGASLRCYF
SKFEGASLHCSFSKFEGNYSHQSDWSRQVVKSLQLNLMTTVVGESGDEVTASESDDNRCRRVESGDHPCRLLRSPNKMGTGLAGVHEGESGDYPCRLLRSPNEIGDWSSR
SDITASEFDHQANRPIQEINKPTDRSRRSTSQQADHPRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQ
QVSRPIIQEDQQANKPIQEIITPTGRSSKKINKPNRSIQEIINLTSRSSKKINKSNRSIQEIINLTDRSSKKINKPISRSKRSSSQQADPRDHQPSKLII