; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002406 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002406
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:42565549..42570167
RNA-Seq ExpressionLag0002406
SyntenyLag0002406
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5219081.1 Retrovirus-related polyprotein from transposon [Salix suchowensis]8.5e-4058.91Show/hide
Query:  SSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTAKEIWDATRDTYSSSENSS
        +S  ++T HKL G+N+LQ + SV ++ICG+G+D +LTG+   P  +DP+FR+WKT++H++MSWL+NSMT E+GENFLL+ TAKEIWDA R+TYS+S+N+S
Subjt:  SSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTAKEIWDATRDTYSSSENSS

Query:  ALLAIETQLYDLRQGDLNATQYFNLLVRN
         L  IE  L+DLRQG+LN TQYFN L R+
Subjt:  ALLAIETQLYDLRQGDLNATQYFNLLVRN

PKA63925.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Apostasia shenzhenica]3.7e-5168Show/hide
Query:  QMAKHGLASVSYENSSQP--HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL
        Q +   + S S  NSS P   ++S  +VTNHKL GHNFLQ +QSVF+YICGRGKDGHLTG+  AP+  DPK+R+W+TDDHLVMSWL+NSMT EVGENFLL
Subjt:  QMAKHGLASVSYENSSQP--HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL

Query:  FKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR
        F+TAKEIW+A R+TYSS+ENSS L  IET+LYDLRQG+L+ TQYFN L R
Subjt:  FKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR

RVX20378.1 hypothetical protein CK203_004521 [Vitis vinifera]5.5e-3948.75Show/hide
Query:  MAKHGLASVSYENSSQPH-------------NSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS    + + P              +SSP ++T HKL GHN+LQ +QSV ++ICG+GKD +LTG+   P+  +P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLASVSYENSSQPH-------------NSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR
          ++GENFLLF+TAK+IWDA ++TYSSSEN+S L  +E+ L+D RQG+ + TQY+N L R
Subjt:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR

XP_023738515.1 uncharacterized protein LOC111886493 [Lactuca sativa]1.4e-4572.13Show/hide
Query:  TNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIE
        T+HKL G NF Q +QSVF++ICGR KDGHLTG+T A D+KDPKFR+W+T+DHLVMSWL+NSMT EVGENFLL+KTA+EIW+A ++TYSS+ENSS L  +E
Subjt:  TNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIE

Query:  TQLYDLRQGDLNATQYFNLLVR
        T+LYDLRQGDL+ TQYF+LL R
Subjt:  TQLYDLRQGDLNATQYFNLLVR

XP_034898954.1 uncharacterized protein LOC118037156 [Populus alba]5.5e-3954.09Show/hide
Query:  MAKHGLASVSYENSSQPHNSS-----------PQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTP
        MA  G +S+S E +SQP  +S              +T HKL G+N+LQ + SV ++ICG+G+D +LTGD   P+  DP FR+WKT++H+VMSWL+NSMT 
Subjt:  MAKHGLASVSYENSSQPHNSS-----------PQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTP

Query:  EVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVRN
        E+GENFLL+ TAKEIW+A R+TYSSSEN+S L  IE  L+DLRQG+L+ TQ+FN L R+
Subjt:  EVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVRN

TrEMBL top hitse value%identityAlignment
A0A2I0B7Z4 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-5168Show/hide
Query:  QMAKHGLASVSYENSSQP--HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL
        Q +   + S S  NSS P   ++S  +VTNHKL GHNFLQ +QSVF+YICGRGKDGHLTG+  AP+  DPK+R+W+TDDHLVMSWL+NSMT EVGENFLL
Subjt:  QMAKHGLASVSYENSSQP--HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL

Query:  FKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR
        F+TAKEIW+A R+TYSS+ENSS L  IET+LYDLRQG+L+ TQYFN L R
Subjt:  FKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR

A0A438D529 Retrovirus-related Pol polyprotein from transposon TNT 1-947.8e-3948.75Show/hide
Query:  MAKHGLASVSYENSSQP-------------HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS    + + P             ++SSP ++T HKL GHN+LQ +QSV ++ICG+GKD +LTG+   P+  +P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLASVSYENSSQP-------------HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR
          ++GENFLLF TAK+IWDA ++TYSSSEN+S L  +E+ L+D RQG+ + TQY+N L R
Subjt:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR

A0A438IAX1 Retrovirus-related Pol polyprotein from transposon TNT 1-947.8e-3948.75Show/hide
Query:  MAKHGLASVSYENSSQP-------------HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS    + + P             ++SSP ++T HKL GHN+LQ +QSV ++ICG+GKD +LTG+   P+  +P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLASVSYENSSQP-------------HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR
          ++GENFLLF TAK+IWDA ++TYSSSEN+S L  +E+ L+D RQG+ + TQY+N L R
Subjt:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR

A0A438KGR6 Uncharacterized protein2.7e-3948.75Show/hide
Query:  MAKHGLASVSYENSSQPH-------------NSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS    + + P              +SSP ++T HKL GHN+LQ +QSV ++ICG+GKD +LTG+   P+  +P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLASVSYENSSQPH-------------NSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR
          ++GENFLLF+TAK+IWDA ++TYSSSEN+S L  +E+ L+D RQG+ + TQY+N L R
Subjt:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR

A0A438KNE1 Copia protein4.5e-3948.75Show/hide
Query:  MAKHGLASVSYENSSQP-------------HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS    + + P             ++SSP ++T HKL GHN+LQ +QSV ++ICG+GKD +LTG+   P+  +P FR WK +++++MSWL+NSM
Subjt:  MAKHGLASVSYENSSQP-------------HNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR
          ++GENFLLF TAK+IWDA ++TYSSSEN+S L  +E+ L+D RQG+ + TQY+N L R
Subjt:  TPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).9.1e-0830.53Show/hide
Query:  KDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLL
        K G + G  P PD   P ++ W+  + +VM WL+NSMT ++ E+ +  +TA ++W+  R  +    +   +  +  +L  LRQG  +  +YF  L
Subjt:  KDGHLTGDTPAPDAKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTCTTCATAACCTGCAAAATCTATTGGTTGGGAGGAAAGAGAAAGGAGGTGAGTACAACCATGGGCTTTAATGCTCTCTACGGAGCTCGACATTCGATCCTTGA
CCATTTGTACTCAAGGCCTCGAGCTGTGCATCGTCCCATGACCATTTGTATCCAAGGTCTCGATGCTCCTCTTATTCCAATGGTGACATGGCACCTCTTGACCATTTGTA
CCCAAGGTGCATTGACAGCCACCTATATTTCCTTTCTTTCACATTTCAGGACTCTTCTTACAAACTTCGCTTCGAGTGATAGAGGTATTCTTAAACCTTGCAAGTTAACT
GAATCCATGGTACTACCAACTGGCAAGATTCTAAGTAAACTCCAAAACTCTCTAAGCCGTCAGACACTATCTTTACCGGCGAACCCTCCTTCCGACAACCCTCCGACGAG
CTCTTCTTTCTTCTTCGGCGAACTCTCATCGGCGAACGGAGTTCGTAACAGTCGCAACAGAACTCGTGTTAACGTCTGCTCGTCTGCTCACGTTCACGTTTGCTCGTCTG
TTCGAAGTTCGTCAGCTGCCGCCGCTGCTTGCCACTCGTCCGTCGCTCGCTGCTCGTCAGTTGCCGCCGCGCGCAACTCGCCGCTTGTTTGCCTCCGCCACTCGCCGCTC
GGCTGCTCATCAGATTTTTTTTCCGCTGGTTTGCTCGTTTTTTGTTGCTCAGTTTTTTGGTCGGCTTGTTTTTCCAGATTTGGTTGTTTGGTTGTTCAAGGCTTTCAACG
ATTTCAAATGGCGAAACACGGGTTAGCAAGTGTTAGTTATGAAAACTCCTCTCAACCTCACAATTCCTCTCCTCAAATTGTGACTAATCATAAACTTCAAGGTCACAATT
TTCTTCAATCGAATCAATCAGTCTTCATATATATCTGTGGCCGTGGTAAAGATGGTCATCTAACTGGTGATACCCCTGCACCGGATGCTAAAGACCCGAAGTTTCGATCA
TGGAAAACTGATGATCATTTGGTCATGTCTTGGCTACTAAACTCTATGACTCCAGAGGTTGGAGAGAACTTTCTATTGTTCAAAACAGCAAAAGAGATATGGGATGCAAC
CCGTGACACCTATTCCAGCTCAGAGAACTCATCTGCTCTATTGGCTATTGAAACCCAGTTGTATGATTTACGTCAAGGAGACCTTAATGCTACTCAGTACTTTAATCTTC
TTGTTCGAAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGTCTTCATAACCTGCAAAATCTATTGGTTGGGAGGAAAGAGAAAGGAGGTGAGTACAACCATGGGCTTTAATGCTCTCTACGGAGCTCGACATTCGATCCTTGA
CCATTTGTACTCAAGGCCTCGAGCTGTGCATCGTCCCATGACCATTTGTATCCAAGGTCTCGATGCTCCTCTTATTCCAATGGTGACATGGCACCTCTTGACCATTTGTA
CCCAAGGTGCATTGACAGCCACCTATATTTCCTTTCTTTCACATTTCAGGACTCTTCTTACAAACTTCGCTTCGAGTGATAGAGGTATTCTTAAACCTTGCAAGTTAACT
GAATCCATGGTACTACCAACTGGCAAGATTCTAAGTAAACTCCAAAACTCTCTAAGCCGTCAGACACTATCTTTACCGGCGAACCCTCCTTCCGACAACCCTCCGACGAG
CTCTTCTTTCTTCTTCGGCGAACTCTCATCGGCGAACGGAGTTCGTAACAGTCGCAACAGAACTCGTGTTAACGTCTGCTCGTCTGCTCACGTTCACGTTTGCTCGTCTG
TTCGAAGTTCGTCAGCTGCCGCCGCTGCTTGCCACTCGTCCGTCGCTCGCTGCTCGTCAGTTGCCGCCGCGCGCAACTCGCCGCTTGTTTGCCTCCGCCACTCGCCGCTC
GGCTGCTCATCAGATTTTTTTTCCGCTGGTTTGCTCGTTTTTTGTTGCTCAGTTTTTTGGTCGGCTTGTTTTTCCAGATTTGGTTGTTTGGTTGTTCAAGGCTTTCAACG
ATTTCAAATGGCGAAACACGGGTTAGCAAGTGTTAGTTATGAAAACTCCTCTCAACCTCACAATTCCTCTCCTCAAATTGTGACTAATCATAAACTTCAAGGTCACAATT
TTCTTCAATCGAATCAATCAGTCTTCATATATATCTGTGGCCGTGGTAAAGATGGTCATCTAACTGGTGATACCCCTGCACCGGATGCTAAAGACCCGAAGTTTCGATCA
TGGAAAACTGATGATCATTTGGTCATGTCTTGGCTACTAAACTCTATGACTCCAGAGGTTGGAGAGAACTTTCTATTGTTCAAAACAGCAAAAGAGATATGGGATGCAAC
CCGTGACACCTATTCCAGCTCAGAGAACTCATCTGCTCTATTGGCTATTGAAACCCAGTTGTATGATTTACGTCAAGGAGACCTTAATGCTACTCAGTACTTTAATCTTC
TTGTTCGAAACTGA
Protein sequenceShow/hide protein sequence
MTVFITCKIYWLGGKRKEVSTTMGFNALYGARHSILDHLYSRPRAVHRPMTICIQGLDAPLIPMVTWHLLTICTQGALTATYISFLSHFRTLLTNFASSDRGILKPCKLT
ESMVLPTGKILSKLQNSLSRQTLSLPANPPSDNPPTSSSFFFGELSSANGVRNSRNRTRVNVCSSAHVHVCSSVRSSSAAAAACHSSVARCSSVAAARNSPLVCLRHSPL
GCSSDFFSAGLLVFCCSVFWSACFSRFGCLVVQGFQRFQMAKHGLASVSYENSSQPHNSSPQIVTNHKLQGHNFLQSNQSVFIYICGRGKDGHLTGDTPAPDAKDPKFRS
WKTDDHLVMSWLLNSMTPEVGENFLLFKTAKEIWDATRDTYSSSENSSALLAIETQLYDLRQGDLNATQYFNLLVRN