; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036320 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036320
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:43958463..43962295
RNA-Seq ExpressionLag0036320
SyntenyLag0036320
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS33695.1 hypothetical protein Acr_00g0030110 [Actinidia rufa]5.7e-2135.43Show/hide
Query:  LNLSTSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNS
        L  + S+  +  L++ LQ I+K G++   Y+ K + + + L++IGEP+++ DH+ Y L GLG +YNPFVTSI ++   P++E+V +LLLSYD RLE+ ++
Subjt:  LNLSTSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNS

Query:  VDQLNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC
         D L+++QANL  L + +      S  S P+   +  P+    N   +PN           P  P   + +CQIC
Subjt:  VDQLNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]9.7e-3757.93Show/hide
Query:  QLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQANLTRLH
        ++Q+++K G+SV+QYLAK+KEI  KLS+IGEPIS KDHISYI+EGLG EYN FVTSI NR D+ TLEDV TLLL+YDYRLEK NSVDQLN +QAN+  L 
Subjt:  QLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQANLTRLH

Query:  FNQNSNPRRSQR--SSPSQQQFQCPQNSGSNILGNPNQ-FQVRWQKSNLPKFPTGAKVQCQICW
         N+ S   R+ R  S  S + F      G  +LG PNQ  Q  W  S   + P   KVQCQIC+
Subjt:  FNQNSNPRRSQR--SSPSQQQFQCPQNSGSNILGNPNQ-FQVRWQKSNLPKFPTGAKVQCQICW

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.1e-3548.59Show/hide
Query:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ
        S +  R++ LK++LQ +RK G SV+QYLAK+KEIADK +A+GEP+S++DH++++L+GLG+EYN FVTSIHNR D P+LEDV +LLL+Y+ RL+K N+VDQ
Subjt:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ

Query:  LNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQN-----SGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC
        LN  QANL  L    NS     + S P+  +   P +        +ILG P     +W     P  P+ +K+QCQIC
Subjt:  LNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQN-----SGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]2.8e-2849.66Show/hide
Query:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ
        S+S   ++   SQLQKI+K G++V+QYLA++K++ D  +AIGEP+S++DH+SYILEGLG+EYNPFV+SIHNR + P++ DV  LL++YD RLEK  + D 
Subjt:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ

Query:  LNNIQANLTRLHFN-QNSNPRRSQRSSPSQQQFQCPQNSGSNILGNP
        L  IQAN+  L  N QN +P+  Q +  S +       S  +IL NP
Subjt:  LNNIQANLTRLHFN-QNSNPRRSQRSSPSQQQFQCPQNSGSNILGNP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]2.8e-2846.71Show/hide
Query:  LSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQAN
        +SLK++LQKIRK  +S++QYL+++K++ADK S +GE IS++DH+++IL+GLG+EYN FVTSI N VD  ++EDV +LLLSY+ +LEK N++D LN  QA 
Subjt:  LSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQAN

Query:  LTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNIL---GNPNQFQVRWQKSNLPKFPTGAKVQCQI
        L++L F  NS  R + R  P+      P  + S IL    N    +  + K   P  P  +K QCQI
Subjt:  LTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNIL---GNPNQFQVRWQKSNLPKFPTGAKVQCQI

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein6.2e-2143.14Show/hide
Query:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ
        S+S  ++  L+++LQ +RK G++  +Y+ K K I + L+A+GEP+S KDH+ Y+  GL  EYN FVTSI  R D   LE++ +LLLSY++RLE  N+  Q
Subjt:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ

Query:  LNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNP---NQFQ
        L+++QANL   H N N  P R   S+P     Q  QN       +P   NQFQ
Subjt:  LNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNP---NQFQ

A0A6J1D6N7 uncharacterized protein LOC1110174384.7e-3757.93Show/hide
Query:  QLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQANLTRLH
        ++Q+++K G+SV+QYLAK+KEI  KLS+IGEPIS KDHISYI+EGLG EYN FVTSI NR D+ TLEDV TLLL+YDYRLEK NSVDQLN +QAN+  L 
Subjt:  QLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQANLTRLH

Query:  FNQNSNPRRSQR--SSPSQQQFQCPQNSGSNILGNPNQ-FQVRWQKSNLPKFPTGAKVQCQICW
         N+ S   R+ R  S  S + F      G  +LG PNQ  Q  W  S   + P   KVQCQIC+
Subjt:  FNQNSNPRRSQR--SSPSQQQFQCPQNSGSNILGNPNQ-FQVRWQKSNLPKFPTGAKVQCQICW

A0A6J1DQX7 uncharacterized protein LOC1110223155.2e-3648.59Show/hide
Query:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ
        S +  R++ LK++LQ +RK G SV+QYLAK+KEIADK +A+GEP+S++DH++++L+GLG+EYN FVTSIHNR D P+LEDV +LLL+Y+ RL+K N+VDQ
Subjt:  STSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQ

Query:  LNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQN-----SGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC
        LN  QANL  L    NS     + S P+  +   P +        +ILG P     +W     P  P+ +K+QCQIC
Subjt:  LNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQN-----SGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC

A0A7J0DER3 Uncharacterized protein2.8e-2135.43Show/hide
Query:  LNLSTSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNS
        L  + S+  +  L++ LQ I+K G++   Y+ K + + + L++IGEP+++ DH+ Y L GLG +YNPFVTSI ++   P++E+V +LLLSYD RLE+ ++
Subjt:  LNLSTSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNS

Query:  VDQLNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC
         D L+++QANL  L + +      S  S P+   +  P+    N   +PN           P  P   + +CQIC
Subjt:  VDQLNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC

A0A7J0E8R3 Uncharacterized protein2.8e-2135.43Show/hide
Query:  LNLSTSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNS
        L  + S+  +  L++ LQ I+K G++   Y+ K + + + L++IGEP+++ DH+ Y L GLG +YNPFVTSI ++   P++E+V +LLLSYD RLE+ ++
Subjt:  LNLSTSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNS

Query:  VDQLNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC
         D L+++QANL  L + +      S  S P+   +  P+    N   +PN           P  P   + +CQIC
Subjt:  VDQLNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQIC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)5.5e-0625.9Show/hide
Query:  RVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQ
        R L L S+L+    G + V  Y  K+K++AD L  +  P++ ++ + Y+L GL  +++  +  I +R   P+ +D  T+L   + RL++         I+
Subjt:  RVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAEYNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQ

Query:  ANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILG
         N T +  + +S       + P        Q SG N +G
Subjt:  ANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAACGACGGCAGCGACGACGACAAGGACGACGTCTCCGACAAGGGCTTCGGTGCCGACGACGACAGTTGGTGAATGTGAAGAAGATGAAGTTGAAGGTGAAGCCTC
TTCCTCCTCAAACCAAGCCTCACCGGAGCTGCGCATGCCTCGTCGGGAACGTCATTACTGTCGTCGAACCTATCTGCACAAGAACGCTGGTGTTTCAGATCTGGAAAATC
GAGATGTCAACCGAGCTACGAAACCGCTGTCGGCGACGACGCAACCTCCTCAAACAAACCCTGATCCCGTCGACTTCTCATCAATTCCAGTGACCATTCTGGCGCCGAAA
AGTTCCCCTTTTCAAGTCTCTCGTCTGTGCACGCTCACCGCCATTGCCTCATCTTTTCTCCGATCCGCCACACTGCCTCTTTTCTTTGGTTCCGTCTTGAAACGAAAGGG
AAATCCCCTTCGTTTTAGATCTGGAACGAAACCCAGTGGATTTTGTTCCAGATCTAGAACAAACGAGATTTCCCCTTCGTTCCAAGACGAAAACGAAGGGGAAATCTCCT
TTGTTTCAAAAGCAAAGAACAAGGAAGAGAAAGAAGAGAAGAACAAGATTAAGGAAGTTGGGATTAGGGTGATGGTTGCAAGAAATGGAAATGAGTCATCTTCTTCAACC
TCCACACTGGCTGGATTACCTTGCCAAAATCCATCAGTTCCACCGGTTTCGAACCCAAATGCTCGGTTGCCGGCATTGGACAACTTCCCACTGCTTCAATTTTTCCTTCT
ACACCTCCTGTGTTCACAACGCCACAACCTCAATTTGTCCACCTCCTATACCCGCGTTTTAAGCCTTAAATCTCAGTTGCAAAAGATTCGAAAAGGAGGTATCTCAGTCA
CTCAATATTTGGCAAAATTGAAGGAAATAGCTGATAAATTGTCTGCTATTGGGGAACCCATATCACACAAGGATCATATCTCCTATATCTTGGAAGGTCTTGGTGCTGAG
TATAATCCTTTTGTAACTTCCATTCATAATCGGGTTGATATACCAACATTAGAAGATGTTGGAACCTTGTTGTTGAGCTATGATTATCGCCTTGAGAAACATAATTCCGT
TGATCAATTGAACAACATTCAAGCAAATCTGACTCGTCTCCATTTTAATCAAAATTCCAACCCCCGTCGGTCACAAAGGTCTTCACCCTCTCAGCAACAATTTCAATGCC
CTCAAAATTCGGGGTCCAACATTCTTGGCAATCCAAATCAATTTCAAGTGCGTTGGCAAAAGAGTAATCTGCCAAAATTTCCAACCGGTGCAAAAGTTCAGTGTCAAATC
TGTTGGGGTTGGTGCCCTAAAACTCAAAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTAACGACGGCAGCGACGACGACAAGGACGACGTCTCCGACAAGGGCTTCGGTGCCGACGACGACAGTTGGTGAATGTGAAGAAGATGAAGTTGAAGGTGAAGCCTC
TTCCTCCTCAAACCAAGCCTCACCGGAGCTGCGCATGCCTCGTCGGGAACGTCATTACTGTCGTCGAACCTATCTGCACAAGAACGCTGGTGTTTCAGATCTGGAAAATC
GAGATGTCAACCGAGCTACGAAACCGCTGTCGGCGACGACGCAACCTCCTCAAACAAACCCTGATCCCGTCGACTTCTCATCAATTCCAGTGACCATTCTGGCGCCGAAA
AGTTCCCCTTTTCAAGTCTCTCGTCTGTGCACGCTCACCGCCATTGCCTCATCTTTTCTCCGATCCGCCACACTGCCTCTTTTCTTTGGTTCCGTCTTGAAACGAAAGGG
AAATCCCCTTCGTTTTAGATCTGGAACGAAACCCAGTGGATTTTGTTCCAGATCTAGAACAAACGAGATTTCCCCTTCGTTCCAAGACGAAAACGAAGGGGAAATCTCCT
TTGTTTCAAAAGCAAAGAACAAGGAAGAGAAAGAAGAGAAGAACAAGATTAAGGAAGTTGGGATTAGGGTGATGGTTGCAAGAAATGGAAATGAGTCATCTTCTTCAACC
TCCACACTGGCTGGATTACCTTGCCAAAATCCATCAGTTCCACCGGTTTCGAACCCAAATGCTCGGTTGCCGGCATTGGACAACTTCCCACTGCTTCAATTTTTCCTTCT
ACACCTCCTGTGTTCACAACGCCACAACCTCAATTTGTCCACCTCCTATACCCGCGTTTTAAGCCTTAAATCTCAGTTGCAAAAGATTCGAAAAGGAGGTATCTCAGTCA
CTCAATATTTGGCAAAATTGAAGGAAATAGCTGATAAATTGTCTGCTATTGGGGAACCCATATCACACAAGGATCATATCTCCTATATCTTGGAAGGTCTTGGTGCTGAG
TATAATCCTTTTGTAACTTCCATTCATAATCGGGTTGATATACCAACATTAGAAGATGTTGGAACCTTGTTGTTGAGCTATGATTATCGCCTTGAGAAACATAATTCCGT
TGATCAATTGAACAACATTCAAGCAAATCTGACTCGTCTCCATTTTAATCAAAATTCCAACCCCCGTCGGTCACAAAGGTCTTCACCCTCTCAGCAACAATTTCAATGCC
CTCAAAATTCGGGGTCCAACATTCTTGGCAATCCAAATCAATTTCAAGTGCGTTGGCAAAAGAGTAATCTGCCAAAATTTCCAACCGGTGCAAAAGTTCAGTGTCAAATC
TGTTGGGGTTGGTGCCCTAAAACTCAAAGATAG
Protein sequenceShow/hide protein sequence
MVTTAATTTRTTSPTRASVPTTTVGECEEDEVEGEASSSSNQASPELRMPRRERHYCRRTYLHKNAGVSDLENRDVNRATKPLSATTQPPQTNPDPVDFSSIPVTILAPK
SSPFQVSRLCTLTAIASSFLRSATLPLFFGSVLKRKGNPLRFRSGTKPSGFCSRSRTNEISPSFQDENEGEISFVSKAKNKEEKEEKNKIKEVGIRVMVARNGNESSSST
STLAGLPCQNPSVPPVSNPNARLPALDNFPLLQFFLLHLLCSQRHNLNLSTSYTRVLSLKSQLQKIRKGGISVTQYLAKLKEIADKLSAIGEPISHKDHISYILEGLGAE
YNPFVTSIHNRVDIPTLEDVGTLLLSYDYRLEKHNSVDQLNNIQANLTRLHFNQNSNPRRSQRSSPSQQQFQCPQNSGSNILGNPNQFQVRWQKSNLPKFPTGAKVQCQI
CWGWCPKTQR