; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004277 (gene) of Snake gourd v1 genome

Gene IDTan0004277
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein SCAI-like
Genome locationLG01:13721636..13730093
RNA-Seq ExpressionTan0004277
SyntenyTan0004277
Gene Ontology termsGO:0009873 - ethylene-activated signaling pathway (biological process)
GO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domainsIPR022709 - Protein SCAI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031279.1 Protein SCAI [Cucurbita argyrosperma subsp. argyrosperma]1.7e-23895.84Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++SVAKTFRALVESA+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFR+KLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNR+YFE SKNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERL ALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVAT+SMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
        SFDSHQ SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSK+ITQNSRENCN+LPESCKSEK  SSDLYDEYLWFGHR N
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKA
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKA
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKA

XP_022136689.1 protein SCAI isoform X2 [Momordica charantia]2.3e-23593.36Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD DSVAKTFRALV+SANRKFARVQDVPAYGRVDNHHYFHKVFKA+MRLWK+QQEFR+KLVESGLNR EIGEIASRIGQLYFGHY+RTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL VSLFLNRTDTVQVLAERL ALVDDSKA F GTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
         FDSH SSLPFVARFHAKRVLKFRDAVLTSYHR+EVKFAEITLDTYRMLQCLEWEPGFF+QKHPVEPNENGATIDHSGASGIIDINLATD++DPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKS+++K+I QNSRENCNALPESCKSEK  SSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAG ++
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSK

XP_022942249.1 protein SCAI-like [Cucurbita moschata]4.2e-23795.6Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++SVAKTFRALVESA+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFR+KLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNR+YFE SKNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERL ALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVAT+SMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
        SFDSHQ SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSK+ITQNSRENCN+LPESCKSEK  SSDLYDEYLWFGHR N
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNS+AFK
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

XP_022979819.1 protein SCAI-like [Cucurbita maxima]8.5e-23895.83Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++SVAKTFRALVESA+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFR+KLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNR+YFE SKNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERL ALVDDSK AFRGTDFKEWRLVVQEIFCFMKVAT+SMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
        SFDSHQ SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSK+ITQNSRENCNALPESCKSEK  SSDLYDEYLWFGHR N
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

XP_023531718.1 protein SCAI-like [Cucurbita pepo subsp. pepo]1.5e-23795.83Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++SVAKTFRALVESA+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFR+KLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNR+YFE  KNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERL ALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVAT+SMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
        SFDSHQ SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSK+ITQNSRENCNALPESCKSEK  SSDLYDEYLWFGHR N
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

TrEMBL top hitse value%identityAlignment
A0A5A7TRF5 Protein SCAI1.1e-23391.89Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDS AKTFRA+VE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFR+KLVESGLNR EIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERL ALVDDSKA FR TDFKEWRLVVQEIFCFM VAT S NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
        +FDSH  SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQM S+GESRKS+++K+  QNSRENCNA  ESCK EK  SSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSKLFKLLPG
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSKL ++L G
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSKLFKLLPG

A0A6J1C520 protein SCAI isoform X21.1e-23593.36Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD DSVAKTFRALV+SANRKFARVQDVPAYGRVDNHHYFHKVFKA+MRLWK+QQEFR+KLVESGLNR EIGEIASRIGQLYFGHY+RTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL VSLFLNRTDTVQVLAERL ALVDDSKA F GTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
         FDSH SSLPFVARFHAKRVLKFRDAVLTSYHR+EVKFAEITLDTYRMLQCLEWEPGFF+QKHPVEPNENGATIDHSGASGIIDINLATD++DPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKS+++K+I QNSRENCNALPESCKSEK  SSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAG ++
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSK

A0A6J1C886 protein SCAI isoform X12.8e-23493.98Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD DSVAKTFRALV+SANRKFARVQDVPAYGRVDNHHYFHKVFKA+MRLWK+QQEFR+KLVESGLNR EIGEIASRIGQLYFGHY+RTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL VSLFLNRTDTVQVLAERL ALVDDSKA F GTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
         FDSH SSLPFVARFHAKRVLKFRDAVLTSYHR+EVKFAEITLDTYRMLQCLEWEPGFF+QKHPVEPNENGATIDHSGASGIIDINLATD++DPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKS+++K+I QNSRENCNALPESCKSEK  SSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

A0A6J1FQR5 protein SCAI-like2.0e-23795.6Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++SVAKTFRALVESA+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFR+KLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNR+YFE SKNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERL ALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVAT+SMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
        SFDSHQ SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSK+ITQNSRENCN+LPESCKSEK  SSDLYDEYLWFGHR N
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNS+AFK
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

A0A6J1IXN1 protein SCAI-like4.1e-23895.83Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++SVAKTFRALVESA+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFR+KLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA
        FYEAILNR+YFE SKNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERL ALVDDSK AFRGTDFKEWRLVVQEIFCFMKVAT+SMNVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSA

Query:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
        SFDSHQ SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP
Subjt:  SFDSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSK+ITQNSRENCNALPESCKSEK  SSDLYDEYLWFGHR N
Subjt:  KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

SwissProt top hitse value%identityAlignment
Q54YY1 Protein SCAI homolog1.8e-4432.75Show/hide
Query:  VAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVES---GLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYE
        + KTF  L+  + R F  ++D+P +GR     +F K F+ Y +LWK+QQ++RS L +    GL R EIGEIAS+IGQLY+ +Y+RTS+  +L E+Y+FYE
Subjt:  VAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVES---GLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYE

Query:  AILNRSYFEG-SKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKV---ATVS-MNVRP-LR
        AI  RSYF+  S +   D+    K+LR+YARF++V L LN+   V  L E L+  V+D    ++ +D +EW LV+QEIF F++    AT S  N  P L 
Subjt:  AILNRSYFEG-SKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKV---ATVS-MNVRP-LR

Query:  YSASFDSHQSSLPFVARFHAKRVLK---------------------FRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATI--
         S +  ++ S+       H    ++                      + A+L    +N++KF+EITLD +RM Q LE+EP    +++ ++  +    +  
Subjt:  YSASFDSHQSSLPFVARFHAKRVLK---------------------FRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATI--

Query:  -DHSGASGIIDINLATDMTDPSLPP-------------------------NPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQ-NSVNQ
             A+   + N  TD  + ++P                          NP K +LYRP+++ ++  ++   +EL  +  ML+Y+ A G   + N VNQ
Subjt:  -DHSGASGIIDINLATDMTDPSLPP-------------------------NPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQ-NSVNQ

Q8C8N2 Protein SCAI4.3e-5132.72Show/hide
Query:  FRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVES-GLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS
        F  L++ + + F  ++D+P YG+     YF + F  Y +LWK+QQ+ R  L    GL R +IGEIAS+IGQLY+ +Y+RTSE  +L EA+ FY AI  RS
Subjt:  FRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVES-GLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS

Query:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVS-MNVRPLRYSASFDSHQS
        Y+ + +K  R +L    K+LR+YARF++V L LN+ D V+ L + L   ++D    F   D  EW LV+QE+  F++   V  +N        S    ++
Subjt:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVS-MNVRPLRYSASFDSHQS

Query:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLP---------P
          P + +      L   DA++     N+VKF+E+T+D +RMLQ LE EP                            +NLA+ M  P +           
Subjt:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLP---------P

Query:  NPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHR
        NP K +LY+P+ + L   +A   +EL  +S++LIYLSA G       +    Y          ++T ++R+  N      +++  K              
Subjt:  NPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHR

Query:  GNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
             + L+PGD+ PFTR+P+F++VDS+NS A+K
Subjt:  GNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

Q8N9R8 Protein SCAI1.9e-5132.95Show/hide
Query:  FRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVES-GLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS
        F  L++ + + F  ++D+P YG+     YF + F  Y +LWK+QQ+ R  L    GL R +IGEIAS+IGQLY+ +Y+RTSE  +L EA+ FY AI  RS
Subjt:  FRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVES-GLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS

Query:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVS-MNVRPLRYSASFDSHQS
        Y+ + +K  R +L    K+LR+YARF++V L LN+ D V+ L + L   ++D    F   D  EW LV+QE+  F++   V  +N        S    ++
Subjt:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVS-MNVRPLRYSASFDSHQS

Query:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLP---------P
          P + +      L   DA++     N+VKF+E+T+D +RMLQ LE EP                            +NLA+ M  P +           
Subjt:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLP---------P

Query:  NPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHR
        NP K +LY+P+ + L   +A   +EL  +S++LIYLSA G       +    Y          ++T ++R+  N      +++  K              
Subjt:  NPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHR

Query:  GNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
             + L+PGD+ PFTR+P+F+IVDS+NS A+K
Subjt:  GNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

Arabidopsis top hitse value%identityAlignment
AT3G03570.1 Protein of unknown function (DUF3550/UPF0682)3.0e-10046.24Show/hide
Query:  DNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFY
        +N  +++ + +LV  A++KF++++D+P Y R    +YF KVFK Y +LWK+QQE R KLVE+GL R EIGEIASRI QLY+GHYMRTS+A +L E+YVFY
Subjt:  DNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFY

Query:  EAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSASF
        EAIL R YF+      +DL    K+LRF ARFL+V L L R + V  L ++   L+D+ K  F+ TDFKEW++V QEI  F+K  T  MN+RPLRYS   
Subjt:  EAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSASF

Query:  DSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPK
        D +  +        A R L+  DA+L+SY+ NEVK++E+TLD++RMLQCLEWEP G  YQ         GA +  +   G+  IN +  M DP+LPPNP+
Subjt:  DSHQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPK

Query:  KAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESR------KSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDE--YL
        KA+LYRPS+TH +AV+AT+CEEL    I+L+YLSA+GK  Q S + +++   +       +   S  I Q +  +    P S +S +  S D       L
Subjt:  KAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESR------KSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDE--YL

Query:  WFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
         FG  G  G + +YP D++PFTR+P+F+I+DS++S  FK
Subjt:  WFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK

AT4G40050.1 Protein of unknown function (DUF3550/UPF0682)2.8e-14661.63Show/hide
Query:  DSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEA
        + V+  FRALVE+A+RKFARV+D+PA+GR  + HYF KVFKAYM+LW YQQ  RSKLVESGLNR EIGEIASRIGQLYF  YMRTSEARFL+EA+VFYEA
Subjt:  DSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEA

Query:  ILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSASFDS
        IL RSYF+ ++   KDLGARFKELRFYARFLLVSL ++R   +  LA++L  LVD S + FR T+FKEWRLVVQEI  F++  T    +RPLRY A  DS
Subjt:  ILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSASFDS

Query:  HQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPKKA
        + +S  ++ARFHAK++ KFRDA+L SYHRNEVK+AE+TLDTYRM+QCLEWEP G FYQK PVE  ENG  +DH+  SG+ID+NLA DM DPSLPPNP+KA
Subjt:  HQSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPKKA

Query:  ILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHR-GNGG
        ILYRP+V+HL+AV+A +C+EL P+++ML+YLSA+G   + +V Q  +   S ++ +SKL+ + S+E  +   E   + K  S++ Y+ +LW G R G+ G
Subjt:  ILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHR-GNGG

Query:  PNVLYPGDIIPFTRRPVFLIVDSNNSHAFK
         N LYPGD+IPFTR+P+FLI+DS+ S AFK
Subjt:  PNVLYPGDIIPFTRRPVFLIVDSNNSHAFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGACAACGACTCTGTAGCGAAGACCTTCCGAGCTCTGGTCGAGAGCGCCAACCGGAAGTTCGCTAGGGTCCAAGACGTCCCGGCTTACGGCCGTGTGGACAACCA
CCACTATTTTCATAAGGTTTTCAAGGCTTATATGCGGCTCTGGAAGTACCAGCAGGAATTCCGCTCGAAGCTTGTTGAATCTGGTCTCAACCGCTCGGAAATTGGCGAGA
TCGCTAGCCGGATCGGTCAGCTTTACTTTGGGCATTATATGAGAACCAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTTTTCTATGAAGCGATTCTTAATCGAAGCTAT
TTTGAGGGATCGAAGAATTCGAGGAAGGATCTGGGAGCAAGGTTCAAAGAACTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGTTTTTGAACCGCACGGACACAGT
TCAGGTCCTTGCCGAACGATTGATGGCTCTGGTGGATGATAGCAAGGCCGCTTTTCGGGGTACTGACTTTAAAGAATGGAGGTTAGTTGTACAAGAGATTTTCTGCTTCA
TGAAAGTAGCAACAGTCTCAATGAATGTCAGACCTCTGCGTTACTCTGCTTCATTTGATTCCCATCAGTCATCCCTTCCATTTGTGGCTCGTTTCCATGCAAAGAGGGTT
CTTAAATTCCGAGATGCCGTTCTGACAAGCTACCACCGAAATGAGGTTAAATTTGCGGAAATTACTTTAGATACTTATAGAATGCTGCAATGTCTCGAATGGGAGCCTGG
TTTCTTCTACCAAAAGCATCCAGTTGAACCAAATGAAAATGGTGCTACCATTGATCATTCTGGGGCATCTGGAATAATTGATATTAACTTAGCGACCGATATGACAGATC
CATCTTTACCTCCAAATCCAAAAAAAGCTATCCTCTATCGGCCTTCAGTGACTCATTTGATAGCTGTCATGGCTACAGTTTGTGAGGAGCTCCTTCCAGACAGTATCATG
CTGATTTATCTATCAGCAGCAGGAAAATGTTGTCAAAACAGTGTCAATCAAATGGCAAGTTATGGGGAATCAAGAAAATCCGTGAGAAGTAAACTCATCACCCAGAACTC
ACGAGAAAATTGTAATGCTCTGCCTGAATCCTGTAAGAGTGAGAAGGGAAAATCAAGCGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAA
ACGTTCTATACCCTGGTGATATAATACCTTTTACACGGAGACCTGTTTTCTTGATAGTTGACAGTAACAACAGCCATGCATTCAAGGCAGGTTTGTCAAAACTTTTTAAA
CTGTTGCCAGGTCATCATGGTTTGCTTCACACTACTATATTGAAAATTAAATTTCCTCTTGATAGGTAG
mRNA sequenceShow/hide mRNA sequence
CCGGTTCAGTAAGCGACCGTTGGATCAGAACCTGCATGGCATGAATCAACGGCCACGATTCGAGATGACAAAATATAATGGCAGTGCAGAAAAAACACAACTCCCACTTC
TCTCATCCACTCTTCTTTCTCGGCCATAGCCGCCAAAATTGCTGCCATTTCCGGCCCCTCTCCGCCGCCTCTGCAACCCCCTTTTGCCGGACTTCAAATCCAAATTCAGT
TTCATACGACAAAGTAAAAGCGAAGGAGTTCGAGGAGGAAGAAGGACGGAAGAGAAAAAGTGGGTTTGATTTATGACGCCACAATAGAACACACGAGTCCCCAAGCAAAA
AAGTTCAACTACTTCTCTTTCTCTCTCTCTATCTGTTGCTCTGAATCGTATAAGCACGTACACATACTTGCGCGCGCGGCACTCTCTGAAGGAAGCTCACTTTCTTTCTT
TTTCTCTGATACTTCACTTTGCGGTTTCGATTGTTCTGTTTTTGTTCGATGAGAGCCGGAGATTGTGGCTGCCGGTCTGGGAATTGAGACCTCCGGAATGACCGACAACG
ACTCTGTAGCGAAGACCTTCCGAGCTCTGGTCGAGAGCGCCAACCGGAAGTTCGCTAGGGTCCAAGACGTCCCGGCTTACGGCCGTGTGGACAACCACCACTATTTTCAT
AAGGTTTTCAAGGCTTATATGCGGCTCTGGAAGTACCAGCAGGAATTCCGCTCGAAGCTTGTTGAATCTGGTCTCAACCGCTCGGAAATTGGCGAGATCGCTAGCCGGAT
CGGTCAGCTTTACTTTGGGCATTATATGAGAACCAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTTTTCTATGAAGCGATTCTTAATCGAAGCTATTTTGAGGGATCGA
AGAATTCGAGGAAGGATCTGGGAGCAAGGTTCAAAGAACTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGTTTTTGAACCGCACGGACACAGTTCAGGTCCTTGCC
GAACGATTGATGGCTCTGGTGGATGATAGCAAGGCCGCTTTTCGGGGTACTGACTTTAAAGAATGGAGGTTAGTTGTACAAGAGATTTTCTGCTTCATGAAAGTAGCAAC
AGTCTCAATGAATGTCAGACCTCTGCGTTACTCTGCTTCATTTGATTCCCATCAGTCATCCCTTCCATTTGTGGCTCGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAG
ATGCCGTTCTGACAAGCTACCACCGAAATGAGGTTAAATTTGCGGAAATTACTTTAGATACTTATAGAATGCTGCAATGTCTCGAATGGGAGCCTGGTTTCTTCTACCAA
AAGCATCCAGTTGAACCAAATGAAAATGGTGCTACCATTGATCATTCTGGGGCATCTGGAATAATTGATATTAACTTAGCGACCGATATGACAGATCCATCTTTACCTCC
AAATCCAAAAAAAGCTATCCTCTATCGGCCTTCAGTGACTCATTTGATAGCTGTCATGGCTACAGTTTGTGAGGAGCTCCTTCCAGACAGTATCATGCTGATTTATCTAT
CAGCAGCAGGAAAATGTTGTCAAAACAGTGTCAATCAAATGGCAAGTTATGGGGAATCAAGAAAATCCGTGAGAAGTAAACTCATCACCCAGAACTCACGAGAAAATTGT
AATGCTCTGCCTGAATCCTGTAAGAGTGAGAAGGGAAAATCAAGCGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAAACGTTCTATACCC
TGGTGATATAATACCTTTTACACGGAGACCTGTTTTCTTGATAGTTGACAGTAACAACAGCCATGCATTCAAGGCAGGTTTGTCAAAACTTTTTAAACTGTTGCCAGGTC
ATCATGGTTTGCTTCACACTACTATATTGAAAATTAAATTTCCTCTTGATAGGTAGATATTTCTCACTTCTCTTCAGTATCATCATATCAGGATACGTGAACGTGATTAT
CATGGCTTTGAGCATGAGATGAGATAGATATGACCGTCAGGTGCATTAGATGATCTGTGTTTGTTAGGATACCACCTTACACATAATCTTACACGTAATCATACTAAATA
CAAAGATTGAACTATATTGCAATATCATAAAAACTCTACAATATCTGTGACCTTTCGAGAGGGCTAAGTCTCTTCCACAAGCAATCCACTAAGGATTGACTTACCAAAAT
ATGACCCCCTTCCTACAAACCCTACCCCTTCTATTTATAATCAAGACTCTCCAGCTAATTACCCTTAATACCCTTACTAGTAACTACTATCCCCAAGATATCCCTATGTG
TACTCTCACAGTTTTCCTGTACAAAAGGACGTGTTTGAAGCCAAATATTGCTTGCAACTTTCATGTTGCGCTTTCAGTATGAATCTTTCCATTTACCCAAAAAAAAAAAA
AAATGCTAGTTCACCAAGATTTACTGAAAGTTTCATGTTTATGGACCTGTCTGGTCTAGAGTGAATTTAATGTGGTCTTTAAAGATTCAATGCTTCAAGAAGACGTCACC
TCAATTTGACAATTTTCCTGTGATGCGATTTGCTTATGTTATCAAATTTTATTTGACCAACCATCATAGAATCACCTCATTAACTGTTCCATATCTTACATATAAAGTGC
GATTCTCAGTCATGTTGTTAGTCTCAAGTCAATCTGACTACTCACGTTGAAACAATATTTGATTTTCAGGTTCTACATGGTGCAGAGAGAGGAGAGACTGCCGCCATACT
TCTTTCACCTTTGAGACCAGCATTCAAGAATCCCTTAAATGTTGATACAATTCAATCTGGAAGTCAGTTTACCTTTTTCTTGACTGCCCCTCTCCCTGCATTTTGCGAAA
TGGTTGGCCTGTCCTCAGCCAATTTGGATATAGATGTTTACAATGATGCTGAGACCATAATCTCCTCCACATTTTCCGAGTGGGAAATAATTCTTTGTACATCAACTAGC
TTAAATATCGTCTGGGCCCAAGTTTTGTCTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATATTCTGCCGAGCCGTGCTATCTTTCTTCAGTACTAAAGATGGCGA
CGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCTGTCGCCTCGAATTCTGGAGTTGTCTGCTCAGCAATTCGCCGTCTTGCGCAGCACCTTAACGTTGCTGACTTAT
TTAACTTCCACGAAGTATGATCACAATCTAGATATTGCGAAAACCCGAATTCGAGTTTGAAGTTGTCCTCAAAGGTCAGTTCAGAACCGAAACATAAACACCATTGGCCT
CTTGAGCGTTACAGTAGGATCACAAATTGGTGTTCTGAAGCTAAAATTTATGAAACACAAGTAGTGTGATGGTGGTTGCCTTCCATTTTAATGTTTATGAGCAACCAATT
GGTTTCTTAAGTTTCAAGGTGGGTTTACTGTTTAGTTAGATGAGAGTGTGATAATTTATTGTAGGTTTCTATTCGGCCATGAAGATTCATTTAGTTCCCTCTTTTCTTTC
TTTCTTTCTTTTTTTTTTTTGGCCAATTTCTTTTCATTTCTCTGTGAAAAAAGGGGAAAAAGAAACAGAAGTAGAGAAATGAAATGAGTTACTTGTAGATGCTGGCCAAT
GGGGAATGTATAATCGCTTTCTAGGACTCTTATACAAAAGTGTCAAATCTTGGGAAATTTGACTTCGTCTTTGTAACTTGAGTTTGTTCAATATTGGAATGTTATGAGAT
AGCATTTTCCAAGTCATGAAGTTCTTGTTTCCTATTATTAAATCATCTGTCCTATTTGTTTGCTTCAAAGGTTCAAGTGCTATGAAATCTT
Protein sequenceShow/hide protein sequence
MTDNDSVAKTFRALVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRSKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSY
FEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLMALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVATVSMNVRPLRYSASFDSHQSSLPFVARFHAKRV
LKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPKKAILYRPSVTHLIAVMATVCEELLPDSIM
LIYLSAAGKCCQNSVNQMASYGESRKSVRSKLITQNSRENCNALPESCKSEKGKSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAGLSKLFK
LLPGHHGLLHTTILKIKFPLDR