; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040103 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040103
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionKeratin, type II cytoskeletal 1-like
Genome locationchr13:2200966..2201388
RNA-Seq ExpressionLag0040103
SyntenyLag0040103
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.8e-3452.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        ++D+ FLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNPAYK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q M+ K+KL NI+KG     EY  KI +CVDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]1.6e-3351.41Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTE--TIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        +SD+NFLLWKFQIL A E +DL++    +  PP++  T   S + S    PNP YK+WK+ ++L+S W++GSMS+ IL Q++HCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTE--TIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q M+ K+KL NI+KG  S  EY  KI++CVDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]4.5e-3352.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        +SD+NFLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNP YK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q MK K+KL NI+K      EY  KI+  VDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.8e-3452.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        ++D+ FLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNPAYK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q M+ K+KL NI+KG     EY  KI +CVDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

TYK18917.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]4.5e-3352.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        +SD+NFLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNP YK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q MK K+KL NI+K      EY  KI+  VDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-3552.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        ++D+ FLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNPAYK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q M+ K+KL NI+KG     EY  KI +CVDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

A0A5A7UB21 Keratin, type II cytoskeletal 1-like7.5e-3451.41Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTE--TIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        +SD+NFLLWKFQIL A E +DL++    +  PP++  T   S + S    PNP YK+WK+ ++L+S W++GSMS+ IL Q++HCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTE--TIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q M+ K+KL NI+KG  S  EY  KI++CVDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like2.2e-3352.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        +SD+NFLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNP YK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q MK K+KL NI+K      EY  KI+  VDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-3552.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        ++D+ FLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNPAYK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q M+ K+KL NI+KG     EY  KI +CVDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like2.2e-3352.11Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS
        +SD+NFLLWKFQIL A E +DL++ +  +  PP++ +    S +AS    PNP YK+WK++D+L+SSW++GSMS+ IL Q+LHCK+AKEIW  L  IF+S
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETI--QVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNS

Query:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
        R+L Q MK K+KL NI+K      EY  KI+  VDAL++I K
Subjt:  RHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-1125.71Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNSRH
        ++  N+L+W  Q+   F+G++L   +      P  TI          + NP Y  WK++DKL+ S ++G++S S+   V    TA +IW  L +I+ +  
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNSRH

Query:  LTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK
           + +++++L+   KG  + ++Y+  +    D L+ + K
Subjt:  LTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAIRK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-0830.84Show/hide
Query:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIF---N
        ++  N+L+W  Q+   F+G++L   +      P  TI        V + NP Y  W+++DKL+ S I+G++S S+   V    TA +IW  L +I+   +
Subjt:  MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIF---N

Query:  SRHLTQI
          H+TQ+
Subjt:  SRHLTQI

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.4e-0829.21Show/hide
Query:  TVSKP---NPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNSRHLTQIMKIKSKLQNIQKGGSSTNEYISKIKK
        T+ KP   +P Y+ W++ + ++  W++ SM+  +LE V++ +TA ++W  L ++F      +I +++ +L  +++GG S  EY  K+ K
Subjt:  TVSKP---NPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNSRHLTQIMKIKSKLQNIQKGGSSTNEYISKIKK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.1e-0527.66Show/hide
Query:  SKPNP-AYKLWKKEDKLMSSWIVGSMSKSILEQVLHCK-TAKEIWSYLLQIFNSRHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAI
        S P P   K WK+ D L+  WI G+++ S+L+ ++    TA+++W  L  +F      + ++ +++L+       S +EY  K+K   D L+ +
Subjt:  SKPNP-AYKLWKKEDKLMSSWIVGSMSKSILEQVLHCK-TAKEIWSYLLQIFNSRHLTQIMKIKSKLQNIQKGGSSTNEYISKIKKCVDALSAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGATGAAAATTTTCTTCTCTGGAAGTTCCAAATTCTTATCGCATTTGAAGGCCATGATCTAGACGATCATATCAGTGACGATTGCGTACCACCTACTGAGACGAT
TCAAGTAAGTGAGAATGCTTCGACGGTCAGTAAACCTAACCCTGCTTATAAACTATGGAAAAAGGAAGATAAGTTAATGTCATCTTGGATCGTTGGGTCTATGTCTAAAT
CCATTTTAGAGCAAGTTCTTCACTGTAAAACGGCAAAAGAAATTTGGTCCTATTTGCTTCAGATTTTTAATTCAAGGCATCTTACTCAAATTATGAAAATTAAGTCAAAA
CTTCAAAACATTCAGAAAGGAGGGTCTTCTACGAATGAATATATTTCTAAAATCAAGAAATGTGTAGATGCTTTGTCTGCAATAAGAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCGATGAAAATTTTCTTCTCTGGAAGTTCCAAATTCTTATCGCATTTGAAGGCCATGATCTAGACGATCATATCAGTGACGATTGCGTACCACCTACTGAGACGAT
TCAAGTAAGTGAGAATGCTTCGACGGTCAGTAAACCTAACCCTGCTTATAAACTATGGAAAAAGGAAGATAAGTTAATGTCATCTTGGATCGTTGGGTCTATGTCTAAAT
CCATTTTAGAGCAAGTTCTTCACTGTAAAACGGCAAAAGAAATTTGGTCCTATTTGCTTCAGATTTTTAATTCAAGGCATCTTACTCAAATTATGAAAATTAAGTCAAAA
CTTCAAAACATTCAGAAAGGAGGGTCTTCTACGAATGAATATATTTCTAAAATCAAGAAATGTGTAGATGCTTTGTCTGCAATAAGAAAATAG
Protein sequenceShow/hide protein sequence
MSDENFLLWKFQILIAFEGHDLDDHISDDCVPPTETIQVSENASTVSKPNPAYKLWKKEDKLMSSWIVGSMSKSILEQVLHCKTAKEIWSYLLQIFNSRHLTQIMKIKSK
LQNIQKGGSSTNEYISKIKKCVDALSAIRK