CuGenDBv2

Lag0005075 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005075
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:10278141..10278698
RNA-Seq Expression: Lag0005075
Synteny: Lag0005075
Gene Ontology terms: NA
InterPro domains: NA
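
The genome location is written as chromosome:start..end. A minimal sketch of reading that notation follows, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the helper name parse_location is hypothetical and not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr6:10278141..10278698")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(chrom, span_length)  # chr6 558
```

Under that assumption the span is 558 bp, or 186 codons, which is consistent with the 185-residue protein listed in the Sequences section plus one stop codon.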


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e value: 7.0e-42; % identity: 53.45
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI   GNKIS VKL +D FL+WK QI  AL ++DL +F+  +S  P K + S E S+       NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW+ L  IF+SR  A  M+ KNKL  ++KGSM L EYF +I +C+DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]; e value: 1.2e-38; % identity: 51.81
Query:  IINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAK
        I   GNKIS VKL++DNFL+WK QI  AL ++DL +F   +   P K + S   S+       NP+Y  WK+ + LIS WLLGSM+E +L Q++HCKSAK
Subjt:  IINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAK

Query:  EIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        EIW  L  IF+SR  A  M+ KNKL  ++KGSM L EYF +I++C+DALA++ K V ++DHILYIL
Subjt:  EIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]; e value: 5.6e-39; % identity: 51.72
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI    NKIS VKL++DNFL+WK QI  AL ++DL +F+  +S  P K +    S+  S     NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW  L  IF+SR  A  M+ KNKL  ++K SM L EYF +I+  +DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e value: 7.0e-42; % identity: 53.45
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI   GNKIS VKL +D FL+WK QI  AL ++DL +F+  +S  P K + S E S+       NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW+ L  IF+SR  A  M+ KNKL  ++KGSM L EYF +I +C+DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

TYK18917.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]; e value: 5.6e-39; % identity: 51.72
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI    NKIS VKL++DNFL+WK QI  AL ++DL +F+  +S  P K +    S+  S     NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW  L  IF+SR  A  M+ KNKL  ++K SM L EYF +I+  +DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 3.4e-42; % identity: 53.45
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI   GNKIS VKL +D FL+WK QI  AL ++DL +F+  +S  P K + S E S+       NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW+ L  IF+SR  A  M+ KNKL  ++KGSM L EYF +I +C+DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

A0A5A7UB21 Keratin, type II cytoskeletal 1-like; e value: 6.0e-39; % identity: 51.81
Query:  IINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAK
        I   GNKIS VKL++DNFL+WK QI  AL ++DL +F   +   P K + S   S+       NP+Y  WK+ + LIS WLLGSM+E +L Q++HCKSAK
Subjt:  IINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAK

Query:  EIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        EIW  L  IF+SR  A  M+ KNKL  ++KGSM L EYF +I++C+DALA++ K V ++DHILYIL
Subjt:  EIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like; e value: 2.7e-39; % identity: 51.72
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI    NKIS VKL++DNFL+WK QI  AL ++DL +F+  +S  P K +    S+  S     NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW  L  IF+SR  A  M+ KNKL  ++K SM L EYF +I+  +DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 3.4e-42; % identity: 53.45
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI   GNKIS VKL +D FL+WK QI  AL ++DL +F+  +S  P K + S E S+       NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRN---NPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW+ L  IF+SR  A  M+ KNKL  ++KGSM L EYF +I +C+DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like; e value: 2.7e-39; % identity: 51.72
Query:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ
        ++SS  NQI    NKIS VKL++DNFL+WK QI  AL ++DL +F+  +S  P K +    S+  S     NP Y  WK+QD LIS WLLGSM+E +L Q
Subjt:  DSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQI---PSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQ

Query:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        ++HCKSAKEIW  L  IF+SR  A  M+ KNKL  ++K SM L EYF +I+  +DALA++ K V ++DHILYIL
Subjt:  VIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 9.6e-18; % identity: 28.48
Query:  NKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAKEIWKCLLL
        N  +  KLT+ N+L+W  Q+    + ++L  F++  + +PP  I +       R NPDY +WK+QD LI   +LG+++ S+   V    +A +IW+ L  
Subjt:  NKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAKEIWKCLLL

Query:  IFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        I+ + +  H+ +++ +L+   KG+  +++Y   +    D LA +GK +D ++ +  +L
Subjt:  IFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 1.3e-11; % identity: 25.95
Query:  NKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAKEIWKCLLL
        N  +  KLT+ N+L+W  Q+    + ++L  F++  + +PP  I +     V R NPDY +W++QD LI   +LG+++ S+   V    +A +IW+ L  
Subjt:  NKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQVIHCKSAKEIWKCLLL

Query:  IFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        I+ + +  H+ +++   +                    D LA +GK +D ++ +  +L
Subjt:  IFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archaea - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink); e value: 7.6e-10; % identity: 27.88
Query:  MDQTNTEVQITQDSSSS---TNQIINPGN-KISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRNNPDYAKWKQQDSLISF
        M +T   V  T D  S       I +P +  I  +    DN++ WK++    L       FI  D  + PK  P          +P Y  W+Q ++++ +
Subjt:  MDQTNTEVQITQDSSSS---TNQIINPGN-KISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRNNPDYAKWKQQDSLISF

Query:  WLLGSMTESLLEQVIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKK
        WL+ SMT+ LLE V++ ++A ++W+ L  +F       I +++ +L TL++G   + EYF ++ K
Subjt:  WLLGSMTESLLEQVIHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value: 1.1e-08; % identity: 26.26
Query:  KWKQQDSLISFWLLGSMTESLLEQVIHCK-SAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
        +WK++D L+  W+ G++T+SLL+ +I    +A+++W  L  +F     A  ++ +N+L+T     + ++EY  ++K   D L  V   +     ++++L
Subjt:  KWKQQDSLISFWLLGSMTESLLEQVIHCK-SAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYIL
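
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a % identity figure relates to that line, the snippet below counts identities over the number of aligned columns. The function name alignment_stats is hypothetical, and the input is only a 21-column fragment of the first GenBank hit, so the result is illustrative and is not the reported 53.45.

```python
def alignment_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block.

    The midline is padded to the query length because trailing
    mismatches are spaces that are easily lost when text is copied.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Fragment of the first GenBank hit (query line and its midline).
query = "VIHCKSAKEIWKCLLLIFNSR"
midline = "++HCKSAKEIW+ L  IF+SR"
identities, positives, columns = alignment_stats(query, midline)
print(identities, positives, columns)
print(round(100 * identities / columns, 2))
```

To reproduce a full-hit figure, the counts from every block of that hit would be summed before dividing, with gap columns included in the column total.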


Sequences
CDS sequence
ATGGATCAAACAAATACGGAAGTTCAGATTACTCAAGATTCGAGTAGTTCAACAAATCAGATCATAAATCCGGGTAACAAAATATCCACGGTTAAACTCACAAATGATAA
TTTTCTTGTATGGAAAGTCCAGATTGAGTTTGCCCTAAACAGTCATGACCTTGGAGATTTCATCAATAAAGACTCTGTCATTCCTCCAAAACAAATTCCGTCTGCTGAAG
GATCTACCGTCATGAGGAACAATCCAGATTATGCGAAGTGGAAACAACAAGATAGTCTAATATCTTTTTGGTTATTGGGATCAATGACTGAGAGCTTACTTGAACAAGTC
ATTCACTGTAAGTCCGCGAAAGAGATATGGAAGTGTCTATTGTTAATCTTCAATTCCAGAAATAGAGCACATATTATGAGAATGAAGAATAAACTTCAGACTCTACAGAA
AGGATCAATGCTATTAAATGAATATTTTGCTCAGATAAAGAAGTGTATCGATGCTCTCGCGGCCGTAGGAAAAGAAGTTGATGCTGAAGATCATATCTTGTACATATTAT
GGTCTTGA
mRNA sequence
ATGGATCAAACAAATACGGAAGTTCAGATTACTCAAGATTCGAGTAGTTCAACAAATCAGATCATAAATCCGGGTAACAAAATATCCACGGTTAAACTCACAAATGATAA
TTTTCTTGTATGGAAAGTCCAGATTGAGTTTGCCCTAAACAGTCATGACCTTGGAGATTTCATCAATAAAGACTCTGTCATTCCTCCAAAACAAATTCCGTCTGCTGAAG
GATCTACCGTCATGAGGAACAATCCAGATTATGCGAAGTGGAAACAACAAGATAGTCTAATATCTTTTTGGTTATTGGGATCAATGACTGAGAGCTTACTTGAACAAGTC
ATTCACTGTAAGTCCGCGAAAGAGATATGGAAGTGTCTATTGTTAATCTTCAATTCCAGAAATAGAGCACATATTATGAGAATGAAGAATAAACTTCAGACTCTACAGAA
AGGATCAATGCTATTAAATGAATATTTTGCTCAGATAAAGAAGTGTATCGATGCTCTCGCGGCCGTAGGAAAAGAAGTTGATGCTGAAGATCATATCTTGTACATATTAT
GGTCTTGA
Protein sequence
MDQTNTEVQITQDSSSSTNQIINPGNKISTVKLTNDNFLVWKVQIEFALNSHDLGDFINKDSVIPPKQIPSAEGSTVMRNNPDYAKWKQQDSLISFWLLGSMTESLLEQV
IHCKSAKEIWKCLLLIFNSRNRAHIMRMKNKLQTLQKGSMLLNEYFAQIKKCIDALAAVGKEVDAEDHILYILWS
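
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code; the translate function and the table construction are illustrative and not the database's own tooling. Only the first 27 and last 27 nucleotides of the CDS are used here, to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGATCAAACAAATACGGAAGTTCAG"  # first 9 codons of the CDS
cds_end = "CATATCTTGTACATATTATGGTCTTGA"    # last 9 codons, including TGA
print(translate(cds_start))  # MDQTNTEVQ
print(translate(cds_end))    # HILYILWS
```

Both fragments agree with the start (MDQTNTEVQ) and end (HILYILWS) of the listed protein, and the final TGA codon is the stop.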