CuGenDBv2

Carg26223 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg26223
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: hAT transposon superfamily
Genome location: Carg_Chr15:7203962..7208193
RNA-Seq Expression: Carg26223
Synteny: Carg26223
Gene Ontology terms:
GO:0005975 - carbohydrate metabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008422 - beta-glucosidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily
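
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the span can be split and measured as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr15:7203962..7208193")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4232 bp on Carg_Chr15; with a 0-based, half-open convention the length would instead be 4231.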


Homology
GenBank top hits (e value, % identity, alignment)
XP_022922273.1 uncharacterized protein LOC111430305 isoform X1 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 100
Query:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
        MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
Subjt:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE

Query:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
        IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
Subjt:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP

Query:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
        SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
Subjt:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS

Query:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
        SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
Subjt:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK

Query:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
        HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
Subjt:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH

Query:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
        SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
Subjt:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV

Query:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
        CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
Subjt:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF

Query:  SASDHIFNL
        SASDHIFNL
Subjt:  SASDHIFNL

XP_022922274.1 uncharacterized protein LOC111430305 isoform X2 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 100
Query:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
        MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
Subjt:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS

Query:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
        VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN

Query:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
        SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
Subjt:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
        NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
Subjt:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL

XP_022973029.1 uncharacterized protein LOC111471543 isoform X1 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 99.58
Query:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
        MASQLL+TVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
Subjt:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE

Query:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
         KETSSGKKQKIAEVKT+ENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
Subjt:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP

Query:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
        SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
Subjt:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS

Query:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
        SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
Subjt:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK

Query:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
        HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
Subjt:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH

Query:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
        SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
Subjt:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV

Query:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
        CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
Subjt:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF

Query:  SASDHIFNL
        SASDHIFNL
Subjt:  SASDHIFNL
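
In these alignment blocks the unlabeled middle line marks each position: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of how a percent-identity figure can be derived from such a line is shown below. It assumes identity is the number of identical positions divided by the aligned length, which is the usual BLAST-style definition; the function name `percent_identity` is hypothetical, and the page does not document how its own values were computed.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned positions whose midline character is a residue letter.

    Letters mark identical positions; '+' (conservative substitution) and
    spaces (mismatch or gap) are not counted as identical.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example (not taken from the record): 3 identical positions out of 5.
print(percent_identity("MK+ Y", 5))
```

As a rough consistency check under the same assumption, three non-identical positions in a 709-residue alignment give 706/709, which rounds to 99.58%.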

XP_022973030.1 uncharacterized protein LOC111471543 isoform X2 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 99.71
Query:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
        MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE KETSSGKKQKIAEVKT+ENAPSMSTCKS
Subjt:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS

Query:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
        VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN

Query:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
        SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
Subjt:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
        NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
Subjt:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL

XP_023550627.1 uncharacterized protein LOC111808713 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 99.72
Query:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
        MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
Subjt:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE

Query:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
        IKETS GKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCG GFTGP
Subjt:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP

Query:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
        SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
Subjt:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS

Query:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
        SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
Subjt:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK

Query:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
        HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
Subjt:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH

Query:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
        SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
Subjt:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV

Query:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
        CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
Subjt:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF

Query:  SASDHIFNL
        SASDHIFNL
Subjt:  SASDHIFNL

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DSA2 uncharacterized protein LOC103482941 | e value: 0.0e+00 | % identity: 96.33
Query:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
        MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQK+AEVKT+EN PS+S CKS
Subjt:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS

Query:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPTVTPMAPPSL NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLK+TWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPS+TFFHKSVDAS YFKNTKCLADLFDSVIQDFGHENVVQIIMDSS NY+GIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN

Query:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
        SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN YANKPQSISCIAIIEDN
Subjt:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREV GGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYN EIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWS FQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFG-SLDGGDLNTRQFNAALFSASDHIFNL
        NDLVYINYNLKLARQM+TKPLESDPIQFDDIDMTSEWVEESEN SPTQWLDRFG SLDGGDLNTRQFNAA+F ASDHIFNL
Subjt:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFG-SLDGGDLNTRQFNAALFSASDHIFNL

A0A6J1E643 uncharacterized protein LOC111430305 isoform X1 | e value: 0.0e+00 | % identity: 100
Query:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
        MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
Subjt:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE

Query:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
        IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
Subjt:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP

Query:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
        SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
Subjt:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS

Query:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
        SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
Subjt:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK

Query:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
        HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
Subjt:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH

Query:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
        SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
Subjt:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV

Query:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
        CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
Subjt:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF

Query:  SASDHIFNL
        SASDHIFNL
Subjt:  SASDHIFNL

A0A6J1E893 uncharacterized protein LOC111430305 isoform X2 | e value: 0.0e+00 | % identity: 100
Query:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
        MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
Subjt:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS

Query:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
        VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN

Query:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
        SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
Subjt:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
        NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
Subjt:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL

A0A6J1IAB6 uncharacterized protein LOC111471543 isoform X2 | e value: 0.0e+00 | % identity: 99.71
Query:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
        MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE KETSSGKKQKIAEVKT+ENAPSMSTCKS
Subjt:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS

Query:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
        VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN

Query:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
        SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
Subjt:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
        DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
Subjt:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
        FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
        NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
Subjt:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL

A0A6J1IDC4 uncharacterized protein LOC111471543 isoform X1 | e value: 0.0e+00 | % identity: 99.58
Query:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
        MASQLL+TVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE
Subjt:  MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE

Query:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
         KETSSGKKQKIAEVKT+ENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP
Subjt:  IKETSSGKKQKIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGP

Query:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
        SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS
Subjt:  SAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDS

Query:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
        SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK
Subjt:  SFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLK

Query:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
        HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH
Subjt:  HMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLH

Query:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
        SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV
Subjt:  SPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQV

Query:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
        CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF
Subjt:  CSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALF

Query:  SASDHIFNL
        SASDHIFNL
Subjt:  SASDHIFNL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G79740.1 hAT transposon superfamily | e value: 2.1e-281 | % identity: 70.97
Query:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS
        MVREKDICWEYAEKLDGNKVKCKFC RVLNGGISRLKHHLSRLPS+GVNPC+KVRDDV+DRVR+IL+ +++   T+  K             P +S    
Subjt:  MVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKIAEVKTIENAPSMSTCKS

Query:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA
              P   A     V P +PP+    + AE+SI+LFFFENK+DF++ARS SY  M+DA+ KCGPGF  PS    K+ WL+R+K+++SLQ KD EKEW 
Subjt:  VVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN
        TTGCTII + WTDNKSRALINF VSSPS+ FFHKSVDAS+YFKN+KCLADLFDSVIQD G E++VQIIMD+SF YTGI+NH+LQ Y TIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLN

Query:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN
         ILEEFSKVDWVN+CI QAQ ISKF+YN+S +LDL+R+ TGGQ++IR+G+++ VS+FLSLQS++KQ++RLKHMFN PEYTTN   NKPQSISC+ I+EDN
Subjt:  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDN

Query:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED
        DFWRAVEE VAISEP L+VLREVS GKPAVG IYELM++AKESIRTYYIMDE K K F DIVD  W + LHSPLHAAAAFLNPSIQYN EIKFLTS+KED
Subjt:  DFWRAVEECVAISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKED

Query:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL
        FF VLEKLLP  ++RRDITNQIFTFT+A GMFGC+LAMEARD+VSP LWWEQFGDSAPVLQRVAIRILSQVCS ++ ER WSTFQQ+H E+RNKID+E L
Subjt:  FFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETL

Query:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFG-SLDGGDLNTRQFNAALFSASDH-IFNL
        N L Y+N NLKL R +    LE+DPI  +DIDM SEWVEE+ENPSP QWLDRFG +LDGGDLNTRQF  A+FSA+DH IF L
Subjt:  NDLVYINYNLKLARQMKTKPLESDPIQFDDIDMTSEWVEESENPSPTQWLDRFG-SLDGGDLNTRQFNAALFSASDH-IFNL

AT3G22220.1 hAT transposon superfamily | e value: 4.4e-82 | % identity: 32.95
Query:  PPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN
        P S    +    ++  F F+   DF  A S + Q  IDAI   G G + P+ E L+   L+    EV  +  + +  W  TGC+++V     N+   ++ 
Subjt:  PPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN

Query:  FLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQT
        FLV  P +  F KSVDAS    +   L +L   V+++ G  NVVQ+I     +Y      ++  Y +++  PCA+ C++ +LEEF K+DW+   I QA+T
Subjt:  FLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQT

Query:  ISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLR
        +++ +YN S +L+LMR+FT G ++++   +   ++F ++  I   +  L+ M  S E+    Y+ +   ++    I D DFW+A+     I+ P LRVLR
Subjt:  ISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLR

Query:  EV-SGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN
         V S  KPA+G +Y  M RAKE+I+T     E +   +  I+DR W   L  PL+AA  +LNP   Y+ + +  + I     + +EKL+P   ++  +  
Subjt:  EV-SGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN

Query:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTK
         I ++  A G+FG +LA+ ARDT+ P  WW  +G+S   L R AIRILSQ C S+    R+ ++  QI+ E +N I+++ LNDLV++ YN++L R     
Subjt:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTK

Query:  PLES--DPIQFDDIDMTSEWVEESE
          +   DP+   ++++  +WV  ++
Subjt:  PLES--DPIQFDDIDMTSEWVEESE

AT3G22220.2 hAT transposon superfamily | e value: 4.4e-82 | % identity: 32.95
Query:  PPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN
        P S    +    ++  F F+   DF  A S + Q  IDAI   G G + P+ E L+   L+    EV  +  + +  W  TGC+++V     N+   ++ 
Subjt:  PPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN

Query:  FLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQT
        FLV  P +  F KSVDAS    +   L +L   V+++ G  NVVQ+I     +Y      ++  Y +++  PCA+ C++ +LEEF K+DW+   I QA+T
Subjt:  FLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQT

Query:  ISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLR
        +++ +YN S +L+LMR+FT G ++++   +   ++F ++  I   +  L+ M  S E+    Y+ +   ++    I D DFW+A+     I+ P LRVLR
Subjt:  ISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLR

Query:  EV-SGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN
         V S  KPA+G +Y  M RAKE+I+T     E +   +  I+DR W   L  PL+AA  +LNP   Y+ + +  + I     + +EKL+P   ++  +  
Subjt:  EV-SGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITN

Query:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTK
         I ++  A G+FG +LA+ ARDT+ P  WW  +G+S   L R AIRILSQ C S+    R+ ++  QI+ E +N I+++ LNDLV++ YN++L R     
Subjt:  QIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTK

Query:  PLES--DPIQFDDIDMTSEWVEESE
          +   DP+   ++++  +WV  ++
Subjt:  PLES--DPIQFDDIDMTSEWVEESE

AT4G15020.1 hAT transposon superfamily | e value: 1.9e-85 | % identity: 29.99
Query:  LMVREKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKIAEVKTIE--
        L  +++D  W++ E     D  +++C +C ++   GGI+R+K HL+    +G   C +V +DV   ++  +     R+  +  SS +   +A +  IE  
Subjt:  LMVREKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKIAEVKTIE--

Query:  -----------------------------------------NA---PSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENA-EKSIALFFFENKLD
                                                 NA    S S    ++  +  + I     +V  +  PS  + EN    +I  F F    D
Subjt:  -----------------------------------------NA---PSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENA-EKSIALFFFENKLD

Query:  FSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNT
        F    S ++Q MIDAI   G G + P+ + L+   L+    E++ +  + +  W  TGC+I+V+    +K   ++NFLV  P +  F KSVDAS    + 
Subjt:  FSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNT

Query:  KCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQEL
          L +L   ++++ G  NVVQ+I      Y      ++  Y +++  PCA+ C++ +LEEF K+ W++  I QAQ I++F+YN S +L+LM +FT G ++
Subjt:  KCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQEL

Query:  IRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREV-SGGKPAVGCIYELMTRAKESI
        +    S   ++F +L  I + +S L+ M  S E+    Y+ +P  +   A + D  FW+AV     ++ P LR LR V S  +PA+G +Y  + RAK++I
Subjt:  IRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREV-SGGKPAVGCIYELMTRAKESI

Query:  RTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTV
        +T+ +  E     +  I+DR W  Q H PL AA  FLNP + YN+  +  + +     + +E+L+P  +++  I  ++ ++  A G+FG +LA+ ARDT+
Subjt:  RTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTV

Query:  SPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLES--DPIQFDDIDMTSEWV
         P  WW  +G+S   L R AIRILSQ C S+ S  R+    + I+  K N I+++ L+DLV++ YN++L RQ+     +   DP+  + ID+  EWV
Subjt:  SPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLES--DPIQFDDIDMTSEWV

AT4G15020.2 hAT transposon superfamily    e value: 1.9e-85    %identity: 29.99
Query:  LMVREKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKIAEVKTIE--
        L  +++D  W++ E     D  +++C +C ++   GGI+R+K HL+    +G   C +V +DV   ++  +     R+  +  SS +   +A +  IE  
Subjt:  LMVREKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKIAEVKTIE--

Query:  -----------------------------------------NA---PSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENA-EKSIALFFFENKLD
                                                 NA    S S    ++  +  + I     +V  +  PS  + EN    +I  F F    D
Subjt:  -----------------------------------------NA---PSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENA-EKSIALFFFENKLD

Query:  FSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNT
        F    S ++Q MIDAI   G G + P+ + L+   L+    E++ +  + +  W  TGC+I+V+    +K   ++NFLV  P +  F KSVDAS    + 
Subjt:  FSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNT

Query:  KCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQEL
          L +L   ++++ G  NVVQ+I      Y      ++  Y +++  PCA+ C++ +LEEF K+ W++  I QAQ I++F+YN S +L+LM +FT G ++
Subjt:  KCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQEL

Query:  IRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREV-SGGKPAVGCIYELMTRAKESI
        +    S   ++F +L  I + +S L+ M  S E+    Y+ +P  +   A + D  FW+AV     ++ P LR LR V S  +PA+G +Y  + RAK++I
Subjt:  IRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREV-SGGKPAVGCIYELMTRAKESI

Query:  RTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTV
        +T+ +  E     +  I+DR W  Q H PL AA  FLNP + YN+  +  + +     + +E+L+P  +++  I  ++ ++  A G+FG +LA+ ARDT+
Subjt:  RTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTV

Query:  SPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLES--DPIQFDDIDMTSEWV
         P  WW  +G+S   L R AIRILSQ C S+ S  R+    + I+  K N I+++ L+DLV++ YN++L RQ+     +   DP+  + ID+  EWV
Subjt:  SPWLWWEQFGDSAPVLQRVAIRILSQVC-STFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLES--DPIQFDDIDMTSEWV
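
The %identity column reported for each hit can be related to the alignment blocks with a small calculation. The sketch below is a minimal illustration only: it assumes identity is defined as identical aligned positions divided by total alignment length (gap columns included), and the function name `percent_identity` and the toy strings are hypothetical, not taken from this record. It expects the true gapped query and subject strings from the search output.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over a gapped pairwise alignment.

    Assumption: identity = identical columns / alignment length,
    where alignment length counts gap columns as well.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    # A column counts as identical only when both residues match
    # and neither side is a gap character.
    identical = sum(
        1 for q, s in zip(query, subject) if q == s and q != "-"
    )
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # Toy alignment (hypothetical): 4 identical columns out of 6.
    print(round(percent_identity("MKV-LA", "MRVGLA"), 2))
```

When an alignment is split across several Query/Subjt blocks, as above, the blocks would be concatenated in order before calling the function.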


Sequences
CDS sequence
ATGGCGTCTCAGCTGCTTGTTACTGTCCTTTTGGTTTGGTCCTCGGTTGGTTGGCTTGCCGGAACATCAGAACACATACACATCCTGATGGTCCGTGAGAAAGATATTTG
TTGGGAGTATGCTGAGAAATTAGATGGTAACAAGGTGAAGTGTAAATTTTGTCTTAGAGTTTTGAATGGTGGGATTAGTAGATTGAAGCATCATTTATCTCGATTACCGA
GTAGAGGTGTAAATCCGTGTAGTAAAGTGAGGGACGATGTTTCGGATAGAGTTAGAGCCATACTAGCAACTAGAGAGGAGATTAAGGAAACGTCTAGTGGGAAAAAGCAG
AAGATAGCTGAAGTCAAGACTATCGAAAATGCGCCATCAATGTCGACGTGTAAATCTGTTGTTTCAATGGAGGCCCCGTCTCCAATCGCCAAAGTTTTTCCAACGGTTAC
TCCCATGGCTCCCCCATCATTACTCAACCATGAAAATGCTGAGAAAAGCATTGCTTTGTTCTTTTTTGAGAATAAGCTAGACTTTAGTATAGCTAGATCTTCATCGTATC
AGCTAATGATCGATGCAATAGGGAAATGTGGCCCGGGATTTACGGGTCCTTCTGCCGAAACTTTGAAGAGTACATGGTTGGAAAGGATCAAAACTGAAGTGAGCCTTCAA
TCAAAGGATATTGAGAAAGAGTGGGCTACCACTGGCTGCACAATCATCGTAGACACGTGGACCGACAATAAATCAAGAGCTTTGATAAACTTTTTGGTTTCATCCCCATC
CCAGACCTTTTTTCACAAATCGGTCGATGCATCTGCATATTTCAAGAACACGAAATGCCTAGCGGATTTATTCGATTCCGTGATTCAAGATTTTGGACATGAAAACGTAG
TACAGATAATTATGGACAGTAGTTTCAACTATACAGGCATTGCTAATCATATCCTTCAGACTTATGGAACTATATTTGTGTCTCCTTGTGCTTCTCAGTGTCTGAATTCA
ATTTTGGAGGAATTTTCAAAGGTAGATTGGGTTAACAGATGTATCTTGCAAGCACAAACCATATCAAAGTTTCTATACAATAGTTCCTCATTGCTTGACCTGATGCGAAG
GTTCACGGGCGGTCAAGAACTCATTCGAACTGGGATATCGAAACCCGTATCGAGCTTCCTGTCTCTGCAATCTATTCTGAAGCAAAGGTCAAGACTGAAGCATATGTTCA
ACAGCCCTGAATACACTACAAATCCTTATGCAAATAAACCACAGAGCATTTCTTGTATTGCCATTATAGAAGATAATGATTTCTGGAGGGCAGTGGAAGAATGTGTAGCA
ATTTCAGAGCCTTTCCTAAGAGTATTAAGAGAAGTGTCTGGGGGTAAACCTGCTGTGGGATGTATATATGAGTTAATGACTAGAGCAAAAGAATCAATAAGAACTTACTA
TATAATGGATGAAATCAAGTGCAAGACGTTCCTCGATATCGTTGATAGAAAGTGGCGAGATCAACTTCATTCTCCGCTTCATGCAGCAGCTGCGTTTTTGAACCCGAGTA
TTCAGTACAATTCAGAAATAAAGTTCCTTACTTCCATTAAAGAAGACTTCTTTAATGTTTTGGAGAAATTACTCCCCTTGCCGGAGATGAGACGGGATATTACCAATCAA
ATATTTACTTTCACAAAGGCGAACGGGATGTTTGGATGCAGTTTAGCTATGGAAGCACGAGATACCGTTTCGCCTTGGCTTTGGTGGGAGCAGTTTGGTGACTCGGCGCC
CGTGTTACAACGAGTAGCAATACGGATTCTCAGTCAAGTTTGCAGTACTTTCTCTTTCGAAAGGCATTGGAGCACGTTTCAGCAAATTCACTCCGAAAAACGGAATAAGA
TCGACAAAGAAACTCTCAATGACCTCGTCTACATAAACTACAATCTCAAGTTAGCTAGACAGATGAAAACAAAACCCCTGGAATCTGATCCTATCCAGTTCGACGACATT
GATATGACTTCAGAGTGGGTAGAGGAAAGCGAAAACCCGAGCCCGACCCAGTGGCTCGACCGATTTGGTTCTTTGGATGGAGGTGACTTGAATACGAGACAGTTCAATGC
TGCCTTATTTAGTGCGAGTGACCACATATTTAACCTGTGA
mRNA sequence
CCACTTTTCCATTTCTTCTTTTCTTACTTTTTTCTGTGTTTTTCAGCTCTATTTTAGTTCATACCTCCCGATTTGCTTTATAATTCCAGTTTTCTGCTTCTTTCACTGTC
CAGAGAGAATTCAGTGGAAAAAGGGTGTTCCGATTTTTCAATTGATTCCAGTTTTCTGCTTCTTTGGCAGTCGCTTAGTTCTTTGTGGGATTGTTTAAGTTCTTGGATGA
TGATTCATATTGGAATTTTTGTTTTCGTTTTATGATCTTTTGTTACTTAAATTTCATGGATATGTGATTGCTAGTTTAATGTAAGGGGTTTCTGACTTGTTTGGCTTCTG
AGAAAATCGGTGGAGGGAAATGGGGTGTTCTAAAATGTAGAAGAGATTTTGATTTGATACTCATAGTTGTTTCTTCAGATGACTTGGTTTTGCTTTTAAAGTTTCTCTCT
CAAATACGAATGGTGTGTCTTTAAGTTCATACCAACACGTAAGTTAGTTTAACATTGCTGAGAGAGGCAGAGGCATTTTGGAAGATCCCTTTTCTCTAGGAGTTTCAGTC
CATCAAAATAAGGGACAAATTGATTTCGGGGAATACATATGTTTTTTTTCTAAATAATCATGTTCTGTTTCTCTCAAATTCACAAGCAATCAAACATGACACCAGCTGGT
TTGATAATGTTAAGGAAAGGTCTATGATGTTCCACATTTTAGATCCTTTCTGTCTTGTTTTATTCATCTATCTTTTGTTGTTTCTATGCTAATCATGAATTTTCTATCAT
GGCGTCTCAGCTGCTTGTTACTGTCCTTTTGGTTTGGTCCTCGGTTGGTTGGCTTGCCGGAACATCAGAACACATACACATCCTGATGGTCCGTGAGAAAGATATTTGTT
GGGAGTATGCTGAGAAATTAGATGGTAACAAGGTGAAGTGTAAATTTTGTCTTAGAGTTTTGAATGGTGGGATTAGTAGATTGAAGCATCATTTATCTCGATTACCGAGT
AGAGGTGTAAATCCGTGTAGTAAAGTGAGGGACGATGTTTCGGATAGAGTTAGAGCCATACTAGCAACTAGAGAGGAGATTAAGGAAACGTCTAGTGGGAAAAAGCAGAA
GATAGCTGAAGTCAAGACTATCGAAAATGCGCCATCAATGTCGACGTGTAAATCTGTTGTTTCAATGGAGGCCCCGTCTCCAATCGCCAAAGTTTTTCCAACGGTTACTC
CCATGGCTCCCCCATCATTACTCAACCATGAAAATGCTGAGAAAAGCATTGCTTTGTTCTTTTTTGAGAATAAGCTAGACTTTAGTATAGCTAGATCTTCATCGTATCAG
CTAATGATCGATGCAATAGGGAAATGTGGCCCGGGATTTACGGGTCCTTCTGCCGAAACTTTGAAGAGTACATGGTTGGAAAGGATCAAAACTGAAGTGAGCCTTCAATC
AAAGGATATTGAGAAAGAGTGGGCTACCACTGGCTGCACAATCATCGTAGACACGTGGACCGACAATAAATCAAGAGCTTTGATAAACTTTTTGGTTTCATCCCCATCCC
AGACCTTTTTTCACAAATCGGTCGATGCATCTGCATATTTCAAGAACACGAAATGCCTAGCGGATTTATTCGATTCCGTGATTCAAGATTTTGGACATGAAAACGTAGTA
CAGATAATTATGGACAGTAGTTTCAACTATACAGGCATTGCTAATCATATCCTTCAGACTTATGGAACTATATTTGTGTCTCCTTGTGCTTCTCAGTGTCTGAATTCAAT
TTTGGAGGAATTTTCAAAGGTAGATTGGGTTAACAGATGTATCTTGCAAGCACAAACCATATCAAAGTTTCTATACAATAGTTCCTCATTGCTTGACCTGATGCGAAGGT
TCACGGGCGGTCAAGAACTCATTCGAACTGGGATATCGAAACCCGTATCGAGCTTCCTGTCTCTGCAATCTATTCTGAAGCAAAGGTCAAGACTGAAGCATATGTTCAAC
AGCCCTGAATACACTACAAATCCTTATGCAAATAAACCACAGAGCATTTCTTGTATTGCCATTATAGAAGATAATGATTTCTGGAGGGCAGTGGAAGAATGTGTAGCAAT
TTCAGAGCCTTTCCTAAGAGTATTAAGAGAAGTGTCTGGGGGTAAACCTGCTGTGGGATGTATATATGAGTTAATGACTAGAGCAAAAGAATCAATAAGAACTTACTATA
TAATGGATGAAATCAAGTGCAAGACGTTCCTCGATATCGTTGATAGAAAGTGGCGAGATCAACTTCATTCTCCGCTTCATGCAGCAGCTGCGTTTTTGAACCCGAGTATT
CAGTACAATTCAGAAATAAAGTTCCTTACTTCCATTAAAGAAGACTTCTTTAATGTTTTGGAGAAATTACTCCCCTTGCCGGAGATGAGACGGGATATTACCAATCAAAT
ATTTACTTTCACAAAGGCGAACGGGATGTTTGGATGCAGTTTAGCTATGGAAGCACGAGATACCGTTTCGCCTTGGCTTTGGTGGGAGCAGTTTGGTGACTCGGCGCCCG
TGTTACAACGAGTAGCAATACGGATTCTCAGTCAAGTTTGCAGTACTTTCTCTTTCGAAAGGCATTGGAGCACGTTTCAGCAAATTCACTCCGAAAAACGGAATAAGATC
GACAAAGAAACTCTCAATGACCTCGTCTACATAAACTACAATCTCAAGTTAGCTAGACAGATGAAAACAAAACCCCTGGAATCTGATCCTATCCAGTTCGACGACATTGA
TATGACTTCAGAGTGGGTAGAGGAAAGCGAAAACCCGAGCCCGACCCAGTGGCTCGACCGATTTGGTTCTTTGGATGGAGGTGACTTGAATACGAGACAGTTCAATGCTG
CCTTATTTAGTGCGAGTGACCACATATTTAACCTGTGAGAGCGAGCGTTTGAACATGCTAGTAATCCTCTCCAGTTCTTAGGCATTGTATGTATTCTCTTGATAAGAGGC
TTCTTTTGCTGTTTTACTTCCTTTTGTTGTATTGTAGCATGAATTTGTTCCCTTGCTTTGTTGTATATAAAATATAAACTAAGCTTAGGTTATTATGCCTTCCATGGTTT
AATCATAGGCCTACTTTCACAAATTACCTTTCGAGTCCCGAGTTTTCAAGAAATGGGTTCGTTTGGTTTCGAAAGTTATACGATGTACCTTAAAATGTATTTGGGACATG
TAGAAATAAGTAAACTAGAACGATTTTTTTTTTTCTCGTGCCTCATCAGATGAA
Protein sequence
MASQLLVTVLLVWSSVGWLAGTSEHIHILMVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQ
KIAEVKTIENAPSMSTCKSVVSMEAPSPIAKVFPTVTPMAPPSLLNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKSTWLERIKTEVSLQ
SKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSQTFFHKSVDASAYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSFNYTGIANHILQTYGTIFVSPCASQCLNS
ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYANKPQSISCIAIIEDNDFWRAVEECVA
ISEPFLRVLREVSGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNSEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQ
IFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKTKPLESDPIQFDDI
DMTSEWVEESENPSPTQWLDRFGSLDGGDLNTRQFNAALFSASDHIFNL
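
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene and that the CDS is given in frame from the start codon; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code, built from the compact TCAG-ordered layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove line breaks and other whitespace left over from copy-paste.
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 bases of the CDS listed above.
    print(translate("ATGGCGTCTCAGCTGCTTGTTACTGTC"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence (after removing line breaks from both) is one way to confirm that the two listings agree.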