; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000240 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000240
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:1941009..1941632
RNA-Seq ExpressionLag0000240
SyntenyLag0000240
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.7e-2837Show/hide
Query:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL
        +++A S I++     +K S  KL+++ FL+WKFQ+   L  + L+ F++ +S+PPSK+L S + SS +    PNP Y  W +QD LI++ LL SM+  +L
Subjt:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL

Query:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ
           L  +  K      Q     +++A+ ++ K KL   KKG+  L+EYF KI   VD+LA     +S  DH+LYIL GLG +Y   +SV++     P +Q
Subjt:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]8.5e-2838.42Show/hide
Query:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPA--RIPNPEYDYWIQQDNLITARLLNSMNLSLL
        +++A S I++     +K S  KL ++NFL+WKFQ+   L  + L+ F++ +S+PPSK+L S   SS +  R PNP Y  W +QD LI++ LL SM+  +L
Subjt:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPA--RIPNPEYDYWIQQDNLITARLLNSMNLSLL

Query:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL
           L  +  K      Q     +++A+ ++ K KL   KK +  L+EYF KI++ VD+LA     +S  DH+LYIL GLG +Y   +SV+
Subjt:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.7e-2837Show/hide
Query:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL
        +++A S I++     +K S  KL+++ FL+WKFQ+   L  + L+ F++ +S+PPSK+L S + SS +    PNP Y  W +QD LI++ LL SM+  +L
Subjt:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL

Query:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ
           L  +  K      Q     +++A+ ++ K KL   KKG+  L+EYF KI   VD+LA     +S  DH+LYIL GLG +Y   +SV++     P +Q
Subjt:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.9e-3644.29Show/hide
Query:  MSSFLSERSSSDA-ISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSA-DESSPARI-PNPEYDYWIQQDNLITAR
        M+S  S R+S  A I Q SK INP SK S  +L+++N L+WKFQ+   L+G+GL+ +ID + D P++F+ +  DESS + +  NP Y  WI+QD LI+A 
Subjt:  MSSFLSERSSSDA-ISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSA-DESSPARI-PNPEYDYWIQQDNLITAR

Query:  LLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL
        LL SMN  +L   L  +  +    + +     + +ARV++LK KL   KKG   L++YF KIKNLVDSLA+A   +S  DH+++IL GLGPE+D  +SV+
Subjt:  LLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL

Query:  TGDDTFPPLQ
        T  +    LQ
Subjt:  TGDDTFPPLQ

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]1.1e-3046.25Show/hide
Query:  KFQVSITLRGHGLQKFIDPDSDPPSKFLNSAD--ESSPARIPNPEYDYWIQQDNLITARLLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLEL
        KFQV   ++GHGL+++ID D +PPS+F+ + D   SS  + PNPEY +WI+QD LI+  LL SM+  +L   L  R+ K    + + T   +++ARV++L
Subjt:  KFQVSITLRGHGLQKFIDPDSDPPSKFLNSAD--ESSPARIPNPEYDYWIQQDNLITARLLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLEL

Query:  KTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLT
        K+KL   KKG+  L+ YF KIKNLVDSLA A   +   DH+++IL  LGPE+D  VSV++
Subjt:  KTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLT

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-948.2e-2937Show/hide
Query:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL
        +++A S I++     +K S  KL+++ FL+WKFQ+   L  + L+ F++ +S+PPSK+L S + SS +    PNP Y  W +QD LI++ LL SM+  +L
Subjt:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL

Query:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ
           L  +  K      Q     +++A+ ++ K KL   KKG+  L+EYF KI   VD+LA     +S  DH+LYIL GLG +Y   +SV++     P +Q
Subjt:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like4.1e-2838.42Show/hide
Query:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPA--RIPNPEYDYWIQQDNLITARLLNSMNLSLL
        +++A S I++     +K S  KL ++NFL+WKFQ+   L  + L+ F++ +S+PPSK+L S   SS +  R PNP Y  W +QD LI++ LL SM+  +L
Subjt:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPA--RIPNPEYDYWIQQDNLITARLLNSMNLSLL

Query:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL
           L  +  K      Q     +++A+ ++ K KL   KK +  L+EYF KI++ VD+LA     +S  DH+LYIL GLG +Y   +SV+
Subjt:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-948.2e-2937Show/hide
Query:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL
        +++A S I++     +K S  KL+++ FL+WKFQ+   L  + L+ F++ +S+PPSK+L S + SS +    PNP Y  W +QD LI++ LL SM+  +L
Subjt:  SSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPAR--IPNPEYDYWIQQDNLITARLLNSMNLSLL

Query:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ
           L  +  K      Q     +++A+ ++ K KL   KKG+  L+EYF KI   VD+LA     +S  DH+LYIL GLG +Y   +SV++     P +Q
Subjt:  LNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ

A0A6J1DLT9 uncharacterized protein LOC1110217572.4e-3644.29Show/hide
Query:  MSSFLSERSSSDA-ISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSA-DESSPARI-PNPEYDYWIQQDNLITAR
        M+S  S R+S  A I Q SK INP SK S  +L+++N L+WKFQ+   L+G+GL+ +ID + D P++F+ +  DESS + +  NP Y  WI+QD LI+A 
Subjt:  MSSFLSERSSSDA-ISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSA-DESSPARI-PNPEYDYWIQQDNLITAR

Query:  LLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL
        LL SMN  +L   L  +  +    + +     + +ARV++LK KL   KKG   L++YF KIKNLVDSLA+A   +S  DH+++IL GLGPE+D  +SV+
Subjt:  LLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVL

Query:  TGDDTFPPLQ
        T  +    LQ
Subjt:  TGDDTFPPLQ

A0A6J1DSS1 uncharacterized protein LOC1110235865.2e-3146.25Show/hide
Query:  KFQVSITLRGHGLQKFIDPDSDPPSKFLNSAD--ESSPARIPNPEYDYWIQQDNLITARLLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLEL
        KFQV   ++GHGL+++ID D +PPS+F+ + D   SS  + PNPEY +WI+QD LI+  LL SM+  +L   L  R+ K    + + T   +++ARV++L
Subjt:  KFQVSITLRGHGLQKFIDPDSDPPSKFLNSAD--ESSPARIPNPEYDYWIQQDNLITARLLNSMNLSLLLNFLIARLPKRYGLIFQITSHRKHMARVLEL

Query:  KTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLT
        K+KL   KKG+  L+ YF KIKNLVDSLA A   +   DH+++IL  LGPE+D  VSV++
Subjt:  KTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.3e-0633.33Show/hide
Query:  ARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFP
        AR L L ++L T   G   + +Y+ K+K L DSL   +  ++  + V+Y+L GL P++D  ++V+     FP
Subjt:  ARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTTTTTTGTCGGAGAGAAGTTCTTCAGATGCGATTTCTCAAATTTCCAAATTTATCAATCCTAGGAGCAAGACTTCTACGCCAAAATTAGATGAAGAGAATTT
CTTGATGTGGAAATTTCAAGTCTCGATCACGCTTCGCGGCCATGGACTTCAGAAATTTATTGATCCCGATAGTGATCCACCGTCAAAATTTCTGAACTCAGCTGATGAAT
CCTCTCCTGCTCGCATTCCTAATCCCGAATACGACTACTGGATTCAACAGGACAATCTCATTACAGCAAGGCTTCTCAATTCCATGAATCTGTCATTGTTGCTGAACTTC
TTGATTGCAAGACTTCCAAAGAGGTATGGTCTCATCTTTCAAATCACTTCTCATCGAAAACACATGGCTAGGGTTCTTGAATTGAAAACGAAACTTGGAACAACCAAGAA
AGGTACTTCTGGTCTGCAAGAATATTTTACAAAAATAAAGAATTTGGTTGATTCTTTGGCGGTAGCAGAGCATTCAATTTCACACAGTGATCATGTCCTCTACATCCTAG
GGGGTCTTGGTCCTGAATATGACCCTACTGTCTCTGTTCTAACTGGAGATGACACATTTCCTCCTTTGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTTTTTTGTCGGAGAGAAGTTCTTCAGATGCGATTTCTCAAATTTCCAAATTTATCAATCCTAGGAGCAAGACTTCTACGCCAAAATTAGATGAAGAGAATTT
CTTGATGTGGAAATTTCAAGTCTCGATCACGCTTCGCGGCCATGGACTTCAGAAATTTATTGATCCCGATAGTGATCCACCGTCAAAATTTCTGAACTCAGCTGATGAAT
CCTCTCCTGCTCGCATTCCTAATCCCGAATACGACTACTGGATTCAACAGGACAATCTCATTACAGCAAGGCTTCTCAATTCCATGAATCTGTCATTGTTGCTGAACTTC
TTGATTGCAAGACTTCCAAAGAGGTATGGTCTCATCTTTCAAATCACTTCTCATCGAAAACACATGGCTAGGGTTCTTGAATTGAAAACGAAACTTGGAACAACCAAGAA
AGGTACTTCTGGTCTGCAAGAATATTTTACAAAAATAAAGAATTTGGTTGATTCTTTGGCGGTAGCAGAGCATTCAATTTCACACAGTGATCATGTCCTCTACATCCTAG
GGGGTCTTGGTCCTGAATATGACCCTACTGTCTCTGTTCTAACTGGAGATGACACATTTCCTCCTTTGCAATGA
Protein sequenceShow/hide protein sequence
MSSFLSERSSSDAISQISKFINPRSKTSTPKLDEENFLMWKFQVSITLRGHGLQKFIDPDSDPPSKFLNSADESSPARIPNPEYDYWIQQDNLITARLLNSMNLSLLLNF
LIARLPKRYGLIFQITSHRKHMARVLELKTKLGTTKKGTSGLQEYFTKIKNLVDSLAVAEHSISHSDHVLYILGGLGPEYDPTVSVLTGDDTFPPLQ