; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008967 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008967
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionKeratin, type II cytoskeletal 1-like
Genome locationchr9:33140037..33140790
RNA-Seq ExpressionLag0008967
SyntenyLag0008967
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.5e-3033.33Show/hide
Query:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL
        +SS+    +++A+S I +    G+  S +KL+++ FL+WKFQ+   L  + L+ F++ + +PPSK+L + + S  S+T  PNP Y  W RQD LI++WLL
Subjt:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL

Query:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG
         SM+  I+ ++L CK++KE+W  L   FSS+++A+ ++ K KL   KKG                                    GLG +Y   +SV++ 
Subjt:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG

Query:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL
            P +Q + S+LLTQES+    + + + S+  LPSVN+
Subjt:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]8.7e-2833.49Show/hide
Query:  GSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLLNSMNTAIVAELLDCKTSKEVWS
        G+  S +KL ++NFL+WKFQ+   L  + L+ F + + +PPSK+LT+   S  S+TR PNPEY  W R + LI+ WLL SM+  I+ +++ CK++KE+W 
Subjt:  GSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLLNSMNTAIVAELLDCKTSKEVWS

Query:  HLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTGDDTFPPLQRIYSMLLTQESRIQ
         L   FSS+++A+ ++ K KL   KKG                                    GLG +Y   +S+++     P +Q + S+LLTQES+  
Subjt:  HLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTGDDTFPPLQRIYSMLLTQESRIQ

Query:  RHSTTSVNSDGTLPSVNL
          + + + S+  LP V +
Subjt:  RHSTTSVNSDGTLPSVNL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.5e-3033.33Show/hide
Query:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL
        +SS+    +++A+S I +    G+  S +KL+++ FL+WKFQ+   L  + L+ F++ + +PPSK+L + + S  S+T  PNP Y  W RQD LI++WLL
Subjt:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL

Query:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG
         SM+  I+ ++L CK++KE+W  L   FSS+++A+ ++ K KL   KKG                                    GLG +Y   +SV++ 
Subjt:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG

Query:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL
            P +Q + S+LLTQES+    + + + S+  LPSVN+
Subjt:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.6e-3741.15Show/hide
Query:  MSSSVSERSSSDAN-SQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFL-TAVDESSSTRI-PNPEYDYWIRQDNLITAW
        M+S  S R+S  A   Q  K IN GS  S ++L+++N L+WKFQ+   L+G+GL+ +ID + D P++F+ T  DESSS+ +  NP Y  WI+QD LI+AW
Subjt:  MSSSVSERSSSDAN-SQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFL-TAVDESSSTRI-PNPEYDYWIRQDNLITAW

Query:  LLNSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVL
        LL SMN  I++++LDCK+++E+W+ L   F+S+ +ARV++LK KL   KKG                                    GLGPE+D  +SV+
Subjt:  LLNSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVL

Query:  TGDDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNLT
        T  +    LQ + S+LL QE R +R+    +NSDG+LPSVNLT
Subjt:  TGDDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNLT

XP_022159146.1 uncharacterized protein LOC111025572 [Momordica charantia]1.8e-2550Show/hide
Query:  MSSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLN
        M+ S SE  S+  N Q    +N G+  ST+KL++ENFL+W+ Q++  L+GHGL KFIDP+   PS+F+ + DESSS   PNPE+  W RQD LIT+WLL 
Subjt:  MSSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLN

Query:  SMNTAIVAELLDCKTSKEVW
        SM+  I++++L+C+T++EVW
Subjt:  SMNTAIVAELLDCKTSKEVW

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-3033.33Show/hide
Query:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL
        +SS+    +++A+S I +    G+  S +KL+++ FL+WKFQ+   L  + L+ F++ + +PPSK+L + + S  S+T  PNP Y  W RQD LI++WLL
Subjt:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL

Query:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG
         SM+  I+ ++L CK++KE+W  L   FSS+++A+ ++ K KL   KKG                                    GLG +Y   +SV++ 
Subjt:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG

Query:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL
            P +Q + S+LLTQES+    + + + S+  LPSVN+
Subjt:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL

A0A5A7UB21 Keratin, type II cytoskeletal 1-like4.2e-2833.49Show/hide
Query:  GSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLLNSMNTAIVAELLDCKTSKEVWS
        G+  S +KL ++NFL+WKFQ+   L  + L+ F + + +PPSK+LT+   S  S+TR PNPEY  W R + LI+ WLL SM+  I+ +++ CK++KE+W 
Subjt:  GSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLLNSMNTAIVAELLDCKTSKEVWS

Query:  HLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTGDDTFPPLQRIYSMLLTQESRIQ
         L   FSS+++A+ ++ K KL   KKG                                    GLG +Y   +S+++     P +Q + S+LLTQES+  
Subjt:  HLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTGDDTFPPLQRIYSMLLTQESRIQ

Query:  RHSTTSVNSDGTLPSVNL
          + + + S+  LP V +
Subjt:  RHSTTSVNSDGTLPSVNL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-3033.33Show/hide
Query:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL
        +SS+    +++A+S I +    G+  S +KL+++ FL+WKFQ+   L  + L+ F++ + +PPSK+L + + S  S+T  PNP Y  W RQD LI++WLL
Subjt:  SSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDES--SSTRIPNPEYDYWIRQDNLITAWLL

Query:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG
         SM+  I+ ++L CK++KE+W  L   FSS+++A+ ++ K KL   KKG                                    GLG +Y   +SV++ 
Subjt:  NSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVLTG

Query:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL
            P +Q + S+LLTQES+    + + + S+  LPSVN+
Subjt:  DDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNL

A0A6J1DLT9 uncharacterized protein LOC1110217572.2e-3741.15Show/hide
Query:  MSSSVSERSSSDAN-SQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFL-TAVDESSSTRI-PNPEYDYWIRQDNLITAW
        M+S  S R+S  A   Q  K IN GS  S ++L+++N L+WKFQ+   L+G+GL+ +ID + D P++F+ T  DESSS+ +  NP Y  WI+QD LI+AW
Subjt:  MSSSVSERSSSDAN-SQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFL-TAVDESSSTRI-PNPEYDYWIRQDNLITAW

Query:  LLNSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVL
        LL SMN  I++++LDCK+++E+W+ L   F+S+ +ARV++LK KL   KKG                                    GLGPE+D  +SV+
Subjt:  LLNSMNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKG------------------------------------GLGPEYDPTVSVL

Query:  TGDDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNLT
        T  +    LQ + S+LL QE R +R+    +NSDG+LPSVNLT
Subjt:  TGDDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNLT

A0A6J1E314 uncharacterized protein LOC1110255728.8e-2650Show/hide
Query:  MSSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLN
        M+ S SE  S+  N Q    +N G+  ST+KL++ENFL+W+ Q++  L+GHGL KFIDP+   PS+F+ + DESSS   PNPE+  W RQD LIT+WLL 
Subjt:  MSSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLN

Query:  SMNTAIVAELLDCKTSKEVW
        SM+  I++++L+C+T++EVW
Subjt:  SMNTAIVAELLDCKTSKEVW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.1e-0727.03Show/hide
Query:  SVSERSSSDANSQIPKFINLGSMTSTLKL--DEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLNS
        SVS  S  D+   +P  I+  S  S  KL  DE+N++ WK +    LR      FID     P  F             +P Y  W + + ++  WL+NS
Subjt:  SVSERSSSDANSQIPKFINLGSMTSTLKL--DEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLNS

Query:  MNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKGG
        M   ++  ++  +T+ ++W  L   F      ++ +L+ +L T ++GG
Subjt:  MNTAIVAELLDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKGG

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.2e-0522.37Show/hide
Query:  TLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLNSMNTAIVAELLDCK-TSKEVWSHLSNRF
        TL L++ N+ +W+          G+   ID                SST  P  E   W  +D L+  W+  ++  +++  ++    T++++W  L N F
Subjt:  TLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLNSMNTAIVAELLDCK-TSKEVWSHLSNRF

Query:  SSKHMARVLELKTKLGTTK------------------------------------KGGLGPEYDPTVSVLTGDDTFPPLQRIYSMLLTQESRIQRHSTTS
             AR L+ + +L TT                                       GL  +YD  ++V+     FP      SMLL +ESR+   S +S
Subjt:  SSKHMARVLELKTKLGTTK------------------------------------KGGLGPEYDPTVSVLTGDDTFPPLQRIYSMLLTQESRIQRHSTTS

Query:  VNSDGTLPSVNLTQSKVPQ
        + S    PS++     VP+
Subjt:  VNSDGTLPSVNLTQSKVPQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTTCTGTGTCTGAGAGAAGTTCTTCAGATGCGAATTCCCAAATTCCCAAATTCATCAATCTTGGGAGCATGACTTCTACGCTGAAACTAGATGAAGAGAACTT
TCTAATGTGGAAATTCCAAGTCTCGATCACGCTTCGCGGCCATGGACTTCAGAAATTTATTGATCCTGATGGTGATCCACCGTCAAAATTTCTGACCGCTGTTGATGAAT
CCTCTTCTACTCGCATTCCTAATCCCGAATACGACTACTGGATTCGACAGGACAATCTCATTACAGCATGGCTTCTCAATTCCATGAATACAGCCATTGTTGCTGAACTT
CTTGATTGCAAGACTTCTAAAGAGGTATGGTCTCATCTTTCCAATCGTTTCTCATCGAAACACATGGCTAGAGTTCTTGAATTGAAGACGAAACTTGGAACAACCAAGAA
AGGGGGTCTTGGTCCTGAATATGATCCCACTGTTTCTGTTCTGACTGGAGACGACACATTTCCACCCTTGCAACGAATCTATTCTATGTTACTTACTCAAGAAAGCAGGA
TCCAAAGGCATTCGACAACTTCAGTTAATTCTGATGGCACACTACCTTCAGTAAATTTGACTCAATCGAAGGTTCCACAATCAACTCTGCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTTCTGTGTCTGAGAGAAGTTCTTCAGATGCGAATTCCCAAATTCCCAAATTCATCAATCTTGGGAGCATGACTTCTACGCTGAAACTAGATGAAGAGAACTT
TCTAATGTGGAAATTCCAAGTCTCGATCACGCTTCGCGGCCATGGACTTCAGAAATTTATTGATCCTGATGGTGATCCACCGTCAAAATTTCTGACCGCTGTTGATGAAT
CCTCTTCTACTCGCATTCCTAATCCCGAATACGACTACTGGATTCGACAGGACAATCTCATTACAGCATGGCTTCTCAATTCCATGAATACAGCCATTGTTGCTGAACTT
CTTGATTGCAAGACTTCTAAAGAGGTATGGTCTCATCTTTCCAATCGTTTCTCATCGAAACACATGGCTAGAGTTCTTGAATTGAAGACGAAACTTGGAACAACCAAGAA
AGGGGGTCTTGGTCCTGAATATGATCCCACTGTTTCTGTTCTGACTGGAGACGACACATTTCCACCCTTGCAACGAATCTATTCTATGTTACTTACTCAAGAAAGCAGGA
TCCAAAGGCATTCGACAACTTCAGTTAATTCTGATGGCACACTACCTTCAGTAAATTTGACTCAATCGAAGGTTCCACAATCAACTCTGCTATGA
Protein sequenceShow/hide protein sequence
MSSSVSERSSSDANSQIPKFINLGSMTSTLKLDEENFLMWKFQVSITLRGHGLQKFIDPDGDPPSKFLTAVDESSSTRIPNPEYDYWIRQDNLITAWLLNSMNTAIVAEL
LDCKTSKEVWSHLSNRFSSKHMARVLELKTKLGTTKKGGLGPEYDPTVSVLTGDDTFPPLQRIYSMLLTQESRIQRHSTTSVNSDGTLPSVNLTQSKVPQSTLL