; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G000480 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G000480
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionhAT transposon superfamily
Genome locationchr09:412175..416907
RNA-Seq ExpressionLsi09G000480
SyntenyLsi09G000480
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008422 - beta-glucosidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042569.1 HAT transposon superfamily isoform 2 [Cucumis melo var. makuwa]0.0e+0097.65Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        NDLVYINYNLKLARQMRTKP ESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

KGN49576.2 hypothetical protein Csa_000026 [Cucumis sativus]0.0e+0093.26Show/hide
Query:  DLLLQFDFAAPLPVERELRAKGVGI---FQMVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATR
        +L+L FD +         +  GV +   + ++VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATR
Subjt:  DLLLQFDFAAPLPVERELRAKGVGI---FQMVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATR

Query:  EEIKETSSGKKQKLAEVKTVENAPSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT
        EEIKE S+GKKQKLAEVKTVE+ PS+SMCKSVVS+E PSP+AKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT
Subjt:  EEIKETSSGKKQKLAEVKTVENAPSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT

Query:  APSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIM
         PSAETLKTTWLERIKTEVSLQSKDIEKEW TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCL DLFDSVIQDFGHENVVQIIM
Subjt:  APSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIM

Query:  DSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSR
        DSSLNYSG ANHILQTYGTIFVSPCASQCLN+ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSR
Subjt:  DSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSR

Query:  LKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQ
        LKHMFNSP+YTTN Y+NKPQSISC+AIIEDNDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQ
Subjt:  LKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQ

Query:  LHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILS
        LHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILS
Subjt:  LHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILS

Query:  QVCSTFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNA
        QVCSTFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDG DLNTRQFNA
Subjt:  QVCSTFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNA

Query:  AMFGASDHIFNL
        AMFGA+DHIFNL
Subjt:  AMFGASDHIFNL

XP_004145979.2 uncharacterized protein LOC101215128 isoform X1 [Cucumis sativus]0.0e+0096.62Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE S+GKKQKLAEVKTVE+ PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVS+E PSP+AKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCL DLFDSVIQDFGHENVVQIIMDSSLNYSG ANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDG DLNTRQFNAAMFGA+DHIFNL
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

XP_008437565.1 PREDICTED: uncharacterized protein LOC103482941 [Cucumis melo]0.0e+0097.65Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

XP_038874524.1 uncharacterized protein LOC120067148 isoform X1 [Benincasa hispida]0.0e+0098.97Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        V+SMEAPSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDS LNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFER+WSMFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        NDLVYINYNL+LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

TrEMBL top hitse value%identityAlignment
A0A1S4DSA2 uncharacterized protein LOC1034829410.0e+0097.65Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A5A7TMH8 HAT transposon superfamily isoform 20.0e+0097.65Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        NDLVYINYNLKLARQMRTKP ESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A6J1E643 uncharacterized protein LOC111430305 isoform X10.0e+0096.19Show/hide
Query:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK
        ++VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQK+AEVKT+ENAPSMS CK
Subjt:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK

Query:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW
        SVVSMEAPSPIAKVFPT TPMAPPSL NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLK+TWLERIKTEVSLQSKDIEKEW
Subjt:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW

Query:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL
        ATTGCTIIVDTWTDNKSRALINFLVSSPS+TFFHKS+DAS YFKNTKCLADLFDSVIQDFGHENVVQIIMDSS NY+GIANHILQTYGTIFVSPCASQCL
Subjt:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL

Query:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED
        N+ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPY+NKPQSISC+AIIED
Subjt:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED

Query:  NDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKE
        NDFWRAVEECVAISEPFLRVLREV GGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYN EIKFLTSIKE
Subjt:  NDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKE

Query:  DFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKET
        DFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWS FQQIHSEKRNKIDKET
Subjt:  DFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKET

Query:  LNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        LNDLVYINYNLKLARQM+TKPLESDPIQFDDIDMTSEWVEESEN SPTQWLDRFG SLDGGDLNTRQFNAA+F ASDHIFNL
Subjt:  LNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A6J1E893 uncharacterized protein LOC111430305 isoform X20.0e+0096.33Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQK+AEVKT+ENAPSMS CKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSMEAPSPIAKVFPT TPMAPPSL NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLK+TWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPS+TFFHKS+DAS YFKNTKCLADLFDSVIQDFGHENVVQIIMDSS NY+GIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPY+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREV GGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYN EIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWS FQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        NDLVYINYNLKLARQM+TKPLESDPIQFDDIDMTSEWVEESEN SPTQWLDRFG SLDGGDLNTRQFNAA+F ASDHIFNL
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A6J1IDC4 uncharacterized protein LOC111471543 isoform X10.0e+0096.19Show/hide
Query:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK
        ++VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE KETSSGKKQK+AEVKTVENAPSMS CK
Subjt:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK

Query:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW
        SVVSMEAPSPIAKVFPT TPMAPPSL NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLK+TWLERIKTEVSLQSKDIEKEW
Subjt:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW

Query:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL
        ATTGCTIIVDTWTDNKSRALINFLVSSPS+TFFHKS+DAS YFKNTKCLADLFDSVIQDFGHENVVQIIMDSS NY+GIANHILQTYGTIFVSPCASQCL
Subjt:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL

Query:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED
        N+ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPY+NKPQSISC+AIIED
Subjt:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED

Query:  NDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKE
        NDFWRAVEECVAISEPFLRVLREV GGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYN EIKFLTSIKE
Subjt:  NDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKE

Query:  DFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKET
        DFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWS FQQIHSEKRNKIDKET
Subjt:  DFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKET

Query:  LNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
        LNDLVYINYNLKLARQM+TKPLESDPIQFDDIDMTSEWVEESEN SPTQWLDRFG SLDGGDLNTRQFNAA+F ASDHIFNL
Subjt:  LNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily1.1e-27970.38Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFC RVLNGGISRLKHHLSRLPS+GVNPC+KVRDDV+DRVR+IL+ +++   T+  K             P +S    
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
            +AP+    VFP++ P A       + AE+SI+LFFFENK+DF++ARS SY  M+DA+ KCGPGF APS    KT WL+R+K+++SLQ KD EKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTII + WTDNKSRALINF VSSPSR FFHKS+DAS+YFKN+KCLADLFDSVIQD G E++VQIIMD+S  Y+GI+NH+LQ Y TIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
         ILEEFSKVDWVN+CI QAQ ISKF+YN+S +LDL+R+ TG Q++IR+G+++ VS+FLSLQS++KQ++RLKHMFN PEYTTN  +NKPQSISC+ I+EDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED
        DFWRAVEE VAISEP L+VLREV  GKPAVG IYELM++AKESIRTYYIMDE K K F DIVD  W + LHSPLHAAAAFLNPSIQYNPEIKFLTS+KED
Subjt:  DFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL
        FF VLEKLLP  ++RRDITNQIFTFT+A GMFGC+LAMEARD+VSP LWWEQFGDSAPVLQRVAIRILSQVCS ++ ER WS FQQ+H E+RNKID+E L
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDH-IFNL
        N L Y+N NLKL R +    LE+DPI  +DIDM SEWVEE+EN SP QWLDRFG++LDGGDLNTRQF  A+F A+DH IF L
Subjt:  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDH-IFNL

AT3G22220.1 hAT transposon superfamily5.8e-8232.76Show/hide
Query:  PPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN
        P S    +    ++  F F+   DF  A S + Q  IDAI   G G + P+ E L+   L+    EV  +  + +  W  TGC+++V     N+   ++ 
Subjt:  PPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN

Query:  FLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQT
        FLV  P +  F KS+DAS    +   L +L   V+++ G  NVVQ+I     +Y+     ++  Y +++  PCA+ C++ +LEEF K+DW+   I QA+T
Subjt:  FLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQT

Query:  ISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLR
        +++ +YN S +L+LMR+FT   ++++   +   ++F ++  I   +  L+ M  S E+    YS +   ++    I D DFW+A+     I+ P LRVLR
Subjt:  ISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLR

Query:  EVCG-GKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN
         VC   KPA+G +Y  M RAKE+I+T     E +   +  I+DR W   L  PL+AA  +LNP   Y+ + +  + I     + +EKL+P   ++  +  
Subjt:  EVCG-GKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN

Query:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTK
         I ++  A G+FG +LA+ ARDT+ P  WW  +G+S   L R AIRILSQ C S+    R+ +   QI+ E +N I+++ LNDLV++ YN++L R     
Subjt:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTK

Query:  PLES--DPIQFDDIDMTSEWVEESE
          +   DP+   ++++  +WV  ++
Subjt:  PLES--DPIQFDDIDMTSEWVEESE

AT3G22220.2 hAT transposon superfamily5.8e-8232.76Show/hide
Query:  PPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN
        P S    +    ++  F F+   DF  A S + Q  IDAI   G G + P+ E L+   L+    EV  +  + +  W  TGC+++V     N+   ++ 
Subjt:  PPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN

Query:  FLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQT
        FLV  P +  F KS+DAS    +   L +L   V+++ G  NVVQ+I     +Y+     ++  Y +++  PCA+ C++ +LEEF K+DW+   I QA+T
Subjt:  FLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQT

Query:  ISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLR
        +++ +YN S +L+LMR+FT   ++++   +   ++F ++  I   +  L+ M  S E+    YS +   ++    I D DFW+A+     I+ P LRVLR
Subjt:  ISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLR

Query:  EVCG-GKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN
         VC   KPA+G +Y  M RAKE+I+T     E +   +  I+DR W   L  PL+AA  +LNP   Y+ + +  + I     + +EKL+P   ++  +  
Subjt:  EVCG-GKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN

Query:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTK
         I ++  A G+FG +LA+ ARDT+ P  WW  +G+S   L R AIRILSQ C S+    R+ +   QI+ E +N I+++ LNDLV++ YN++L R     
Subjt:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTK

Query:  PLES--DPIQFDDIDMTSEWVEESE
          +   DP+   ++++  +WV  ++
Subjt:  PLES--DPIQFDDIDMTSEWVEESE

AT4G15020.1 hAT transposon superfamily2.8e-8429.68Show/hide
Query:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----
        +++D  W++ E     D  +++C +C ++   GGI+R+K HL+    +G   C +V +DV   ++  +     R+  +  SS +   +A +  +E     
Subjt:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----

Query:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI
                                              NA    S S    ++  +  + I     +   +  PS  + EN    +I  F F    DF  
Subjt:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI

Query:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL
          S ++Q MIDAI   G G +AP+ + L+   L+    E++ +  + +  W  TGC+I+V+    +K   ++NFLV  P +  F KS+DAS    +   L
Subjt:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL

Query:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT
         +L   ++++ G  NVVQ+I      Y      ++  Y +++  PCA+ C++ +LEEF K+ W++  I QAQ I++F+YN S +L+LM +FT   +++  
Subjt:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT

Query:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLREVCGGK-PAVGCIYELMTRAKESIRTY
          S   ++F +L  I + +S L+ M  S E+    YS +P  +  +  + D  FW+AV     ++ P LR LR VC  K PA+G +Y  + RAK++I+T+
Subjt:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLREVCGGK-PAVGCIYELMTRAKESIRTY

Query:  YIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPW
         +  E     +  I+DR W  Q H PL AA  FLNP + YN   +  + +     + +E+L+P  +++  I  ++ ++  A G+FG +LA+ ARDT+ P 
Subjt:  YIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPW

Query:  LWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLES--DPIQFDDIDMTSEWV
         WW  +G+S   L R AIRILSQ C S+ S  R+    + I+  K N I+++ L+DLV++ YN++L RQ+     +   DP+  + ID+  EWV
Subjt:  LWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLES--DPIQFDDIDMTSEWV

AT4G15020.2 hAT transposon superfamily2.8e-8429.68Show/hide
Query:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----
        +++D  W++ E     D  +++C +C ++   GGI+R+K HL+    +G   C +V +DV   ++  +     R+  +  SS +   +A +  +E     
Subjt:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----

Query:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI
                                              NA    S S    ++  +  + I     +   +  PS  + EN    +I  F F    DF  
Subjt:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI

Query:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL
          S ++Q MIDAI   G G +AP+ + L+   L+    E++ +  + +  W  TGC+I+V+    +K   ++NFLV  P +  F KS+DAS    +   L
Subjt:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL

Query:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT
         +L   ++++ G  NVVQ+I      Y      ++  Y +++  PCA+ C++ +LEEF K+ W++  I QAQ I++F+YN S +L+LM +FT   +++  
Subjt:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT

Query:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLREVCGGK-PAVGCIYELMTRAKESIRTY
          S   ++F +L  I + +S L+ M  S E+    YS +P  +  +  + D  FW+AV     ++ P LR LR VC  K PA+G +Y  + RAK++I+T+
Subjt:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLREVCGGK-PAVGCIYELMTRAKESIRTY

Query:  YIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPW
         +  E     +  I+DR W  Q H PL AA  FLNP + YN   +  + +     + +E+L+P  +++  I  ++ ++  A G+FG +LA+ ARDT+ P 
Subjt:  YIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPW

Query:  LWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLES--DPIQFDDIDMTSEWV
         WW  +G+S   L R AIRILSQ C S+ S  R+    + I+  K N I+++ L+DLV++ YN++L RQ+     +   DP+  + ID+  EWV
Subjt:  LWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLES--DPIQFDDIDMTSEWV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGGGATTTGCTTTTACAGTTCGATTTTGCTGCTCCTTTGCCTGTCGAGAGAGAATTGAGGGCAAAGGGTGTTGGGATTTTTCAAATGGTGGTCCGTGAGAAAGA
TATTTGTTGGGAATATGCCGAGAAATTAGATGGTAACAAGGTGAAGTGCAAATTTTGTCTGAGAGTTTTGAATGGTGGGATTAGTAGATTGAAGCATCATTTATCTCGAT
TACCGAGTAGAGGTGTAAATCCTTGTAGTAAAGTGAGGGACGATGTTTCTGATAGAGTGAGAGCCATACTAGCAACTAGAGAGGAGATCAAGGAAACATCCAGTGGGAAA
AAGCAGAAGCTAGCTGAAGTCAAGACGGTTGAAAATGCACCATCGATGTCAATGTGTAAATCTGTTGTTTCAATGGAGGCCCCATCACCAATTGCCAAAGTTTTTCCAAC
TGCTACTCCCATGGCTCCCCCGTCACTACACAACCATGAAAATGCTGAGAAAAGCATTGCTTTATTCTTTTTTGAGAATAAGCTAGATTTTAGTATAGCTAGATCTTCAT
CCTATCAGCTAATGATCGATGCAATAGGGAAATGTGGCCCTGGATTTACAGCCCCTTCTGCTGAAACTCTGAAGACTACTTGGTTGGAGAGGATCAAAACTGAAGTGAGC
CTTCAGTCAAAGGATATTGAGAAAGAGTGGGCTACCACCGGCTGCACAATCATTGTAGACACATGGACTGACAATAAATCAAGAGCTTTGATTAACTTTTTGGTTTCATC
CCCATCCCGGACCTTTTTTCACAAATCCATCGATGCATCTACATATTTCAAGAACACAAAGTGCCTTGCTGATTTATTTGATTCCGTCATTCAAGATTTCGGCCATGAAA
ATGTAGTGCAGATTATCATGGACAGTAGTTTGAATTATTCAGGTATTGCAAATCATATCCTTCAGACTTACGGGACTATATTTGTGTCTCCCTGTGCTTCACAGTGTCTG
AATGCAATTTTGGAGGAATTTTCAAAGGTAGATTGGGTAAACAGATGTATCCTGCAAGCACAAACCATATCAAAATTTCTATATAATAGTTCCTCACTGCTTGACCTGAT
GCGAAGGTTCACTGGCAGTCAAGAACTCATTCGGACTGGGATATCGAAACCCGTATCGAGTTTCCTGTCTTTGCAATCTATTCTGAAGCAAAGGTCAAGACTGAAGCATA
TGTTCAACAGCCCTGAATACACCACAAATCCTTATTCAAATAAACCACAGAGCATTTCTTGTCTTGCCATTATAGAAGATAATGATTTCTGGAGGGCAGTGGAAGAATGT
GTGGCAATATCAGAGCCTTTCCTAAGAGTCTTGAGAGAAGTGTGTGGGGGTAAACCTGCTGTGGGATGTATTTATGAGTTAATGACTAGAGCAAAAGAATCAATAAGAAC
GTACTATATCATGGATGAGATCAAGTGCAAGACGTTTCTCGATATCGTTGACAGGAAGTGGCGAGACCAACTTCATTCCCCGCTTCATGCAGCAGCTGCATTTTTGAATC
CGAGTATTCAGTATAATCCAGAAATAAAGTTCCTTACTTCCATTAAAGAAGATTTCTTTAATGTTTTGGAGAAATTACTCCCCTTGCCAGAGATGAGACGGGATATTACC
AATCAAATATTTACTTTCACAAAGGCGAATGGGATGTTTGGATGCAGTTTAGCAATGGAAGCAAGAGATACAGTTTCGCCTTGGCTTTGGTGGGAACAGTTTGGTGACTC
TGCGCCCGTGTTACAACGAGTCGCAATACGGATTCTCAGTCAAGTTTGTAGTACGTTCTCCTTCGAGCGGCATTGGAGCATGTTTCAGCAAATTCACTCTGAAAAACGTA
ATAAAATAGACAAGGAAACGTTGAATGACCTCGTCTACATAAACTACAATCTCAAGTTGGCTAGACAGATGAGAACAAAACCCCTGGAATCTGACCCTATTCAGTTTGAC
GACATTGATATGACTTCGGAGTGGGTAGAGGAGAGCGAAAACCAAAGCCCGACGCAGTGGCTCGACAGATTTGGTTCTTCTTTGGATGGGGGCGACTTGAATACCAGACA
GTTCAATGCTGCCATGTTTGGTGCAAGTGACCACATATTTAATCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGGGATTTGCTTTTACAGTTCGATTTTGCTGCTCCTTTGCCTGTCGAGAGAGAATTGAGGGCAAAGGGTGTTGGGATTTTTCAAATGGTGGTCCGTGAGAAAGA
TATTTGTTGGGAATATGCCGAGAAATTAGATGGTAACAAGGTGAAGTGCAAATTTTGTCTGAGAGTTTTGAATGGTGGGATTAGTAGATTGAAGCATCATTTATCTCGAT
TACCGAGTAGAGGTGTAAATCCTTGTAGTAAAGTGAGGGACGATGTTTCTGATAGAGTGAGAGCCATACTAGCAACTAGAGAGGAGATCAAGGAAACATCCAGTGGGAAA
AAGCAGAAGCTAGCTGAAGTCAAGACGGTTGAAAATGCACCATCGATGTCAATGTGTAAATCTGTTGTTTCAATGGAGGCCCCATCACCAATTGCCAAAGTTTTTCCAAC
TGCTACTCCCATGGCTCCCCCGTCACTACACAACCATGAAAATGCTGAGAAAAGCATTGCTTTATTCTTTTTTGAGAATAAGCTAGATTTTAGTATAGCTAGATCTTCAT
CCTATCAGCTAATGATCGATGCAATAGGGAAATGTGGCCCTGGATTTACAGCCCCTTCTGCTGAAACTCTGAAGACTACTTGGTTGGAGAGGATCAAAACTGAAGTGAGC
CTTCAGTCAAAGGATATTGAGAAAGAGTGGGCTACCACCGGCTGCACAATCATTGTAGACACATGGACTGACAATAAATCAAGAGCTTTGATTAACTTTTTGGTTTCATC
CCCATCCCGGACCTTTTTTCACAAATCCATCGATGCATCTACATATTTCAAGAACACAAAGTGCCTTGCTGATTTATTTGATTCCGTCATTCAAGATTTCGGCCATGAAA
ATGTAGTGCAGATTATCATGGACAGTAGTTTGAATTATTCAGGTATTGCAAATCATATCCTTCAGACTTACGGGACTATATTTGTGTCTCCCTGTGCTTCACAGTGTCTG
AATGCAATTTTGGAGGAATTTTCAAAGGTAGATTGGGTAAACAGATGTATCCTGCAAGCACAAACCATATCAAAATTTCTATATAATAGTTCCTCACTGCTTGACCTGAT
GCGAAGGTTCACTGGCAGTCAAGAACTCATTCGGACTGGGATATCGAAACCCGTATCGAGTTTCCTGTCTTTGCAATCTATTCTGAAGCAAAGGTCAAGACTGAAGCATA
TGTTCAACAGCCCTGAATACACCACAAATCCTTATTCAAATAAACCACAGAGCATTTCTTGTCTTGCCATTATAGAAGATAATGATTTCTGGAGGGCAGTGGAAGAATGT
GTGGCAATATCAGAGCCTTTCCTAAGAGTCTTGAGAGAAGTGTGTGGGGGTAAACCTGCTGTGGGATGTATTTATGAGTTAATGACTAGAGCAAAAGAATCAATAAGAAC
GTACTATATCATGGATGAGATCAAGTGCAAGACGTTTCTCGATATCGTTGACAGGAAGTGGCGAGACCAACTTCATTCCCCGCTTCATGCAGCAGCTGCATTTTTGAATC
CGAGTATTCAGTATAATCCAGAAATAAAGTTCCTTACTTCCATTAAAGAAGATTTCTTTAATGTTTTGGAGAAATTACTCCCCTTGCCAGAGATGAGACGGGATATTACC
AATCAAATATTTACTTTCACAAAGGCGAATGGGATGTTTGGATGCAGTTTAGCAATGGAAGCAAGAGATACAGTTTCGCCTTGGCTTTGGTGGGAACAGTTTGGTGACTC
TGCGCCCGTGTTACAACGAGTCGCAATACGGATTCTCAGTCAAGTTTGTAGTACGTTCTCCTTCGAGCGGCATTGGAGCATGTTTCAGCAAATTCACTCTGAAAAACGTA
ATAAAATAGACAAGGAAACGTTGAATGACCTCGTCTACATAAACTACAATCTCAAGTTGGCTAGACAGATGAGAACAAAACCCCTGGAATCTGACCCTATTCAGTTTGAC
GACATTGATATGACTTCGGAGTGGGTAGAGGAGAGCGAAAACCAAAGCCCGACGCAGTGGCTCGACAGATTTGGTTCTTCTTTGGATGGGGGCGACTTGAATACCAGACA
GTTCAATGCTGCCATGTTTGGTGCAAGTGACCACATATTTAATCTGTGAGAGCATTTTAACTTGCAACATCAAATTGTTTGTTTGTAATCCTCTCCAGTTCTTAGGCCTT
GTATGTATTCTCTTATTAGAGGCTTCATTTTTTTTTTTCTTTTTCTCTTCCTTTTGTTGTATTGTAGCATGAAATTGCTCCTTGCTTTTTGTATATAAAATATAAACAAA
GCTGAGGAGCCTTTGATGGTTTATTCTATTTCTAAGACAGATTGCTCAATTGAATGTAACCTTAGGCCTCACATTGTTGGGATTTTTAATGCGTTGACAATGGAATGAGA
AAATAATTCAAGAGCAA
Protein sequenceShow/hide protein sequence
MGWDLLLQFDFAAPLPVERELRAKGVGIFQMVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGK
KQKLAEVKTVENAPSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVS
LQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL
NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEEC
VAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDIT
NQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLESDPIQFD
DIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL