; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018879 (gene) of Snake gourd v1 genome

Gene IDTan0018879
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein HLB1-like isoform X2
Genome locationLG01:27434973..27450506
RNA-Seq ExpressionTan0018879
SyntenyTan0018879
Gene Ontology termsGO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015807.1 Protein HLB1 [Cucurbita argyrosperma subsp. argyrosperma]2.5e-27385.32Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ--------------------------------G
        MSPTPEEPNNLQNGIE EPHIS ES Q  E +SEPE  AD +P AEL+QERESESVN   D EP+S+                                 
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ--------------------------------G

Query:  KHLSESIQLQVVTDVTDPSFEEPKGTPIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAM
        K LSESIQLQV TDV DP FEEPKGT I SNGTENSQPALRKDEGSRTFTMRELLNGLK EDGND++NESEGE+PEANSGYSL+QDSPH PYSEQSRAAM
Subjt:  KHLSESIQLQVVTDVTDPSFEEPKGTPIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEATR CPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  VKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGS
        VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGR  APH DWKRSQFFLNHDVLQKL IGGEQ  TSP++LGRSGS
Subjt:  VKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGS

Query:  TLNCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        TLN DRT KVEIPDI+SVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+ G
Subjt:  TLNCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus]4.2e-27690.36Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEP-EPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS
        MSPTPEEPNNLQNGIE +PHISSES+Q  EPRS P EP  D+IP +EL++ERESESV N   DSEPES  K LSESI L VVT VTDPS EE K T  PS
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEP-EPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS

Query:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D++NESEGERPE NSGYSL+QDSPH PYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGR LAPHSDWKRSQFFLNHDVLQKL IGGEQI TSPS+LGRSGSTLN DRT KVEIPDI+SVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        AGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGKN+VLAGI+TG
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo]1.4e-27690.55Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSE-PEPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS
        MSPTPEEPNNLQNGIE +PHISSES+Q  EPRSE  EP AD+IP +EL+QERESESV N  ADSEPES  K LSESI L VVT VTDPS EE K T  P 
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSE-PEPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS

Query:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D +NESEGERPE NSG+SL+QDSPH PYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGR LAPHSDWKRSQFFLNHDVLQKL IGGEQI TSPS LGRSGSTLN DRT KVEIPDI+SVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        AGLCIDTIHGP+FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+TG
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima]6.5e-27789.84Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ----GKHLSESIQLQVVTDVTDPSFEEPKGTPI
        MSP PEEPNNLQNGIE EPHIS ES Q  E +SEPE  AD IP AEL+QERESESVN  ADSEP+S+     K LSESI+LQVVTDVTDP FEEPKGT I
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ----GKHLSESIQLQVVTDVTDPSFEEPKGTPI

Query:  PSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
         SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGND++NESEGE+PEANSGYSL+QDSPH PYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
Subjt:  PSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEATR CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKDVSPNELYSQSAIYIAAAHALKP+YS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGR  APH DWKRSQFFLNHDVLQKL IGGEQI TSP++LGRSGSTLN DRT KVEIPDI+SVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        GAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+ G
Subjt:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

XP_038876586.1 protein HLB1 [Benincasa hispida]5.3e-27991.26Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNE-EADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPSN
        MSPTPEEPNNLQNGIE +PHIS ES+QT EPRSEPEP ADAI  +EL QERESESVN   ADSEP S+ K L ESI LQV TDV DP FEE K T IPSN
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNE-EADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPSN

Query:  G-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASA
        G TENS+PALRKDEGSRTFTMRELLNGLKGEDGND++NESEGERPE N GYSL+QDSPH PYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAARRYASA
Subjt:  G-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASA

Query:  IERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS
        IERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEATR CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS
Subjt:  IERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS

Query:  PQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVY
        PQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVY
Subjt:  PQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVY

Query:  SSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPGA
        SSALRLVRSMLPLPYLKVGYLTAPPVGR LAPH DWKRSQFFLNHDVLQKL IGGEQI TSPS+LGRSGSTLN D T KVEIPDI+SVSACADLTLPPGA
Subjt:  SSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPGA

Query:  GLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        GLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+TG
Subjt:  GLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

TrEMBL top hitse value%identityAlignment
A0A0A0L688 Uncharacterized protein2.0e-27690.36Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEP-EPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS
        MSPTPEEPNNLQNGIE +PHISSES+Q  EPRS P EP  D+IP +EL++ERESESV N   DSEPES  K LSESI L VVT VTDPS EE K T  PS
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEP-EPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS

Query:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D++NESEGERPE NSGYSL+QDSPH PYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGR LAPHSDWKRSQFFLNHDVLQKL IGGEQI TSPS+LGRSGSTLN DRT KVEIPDI+SVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        AGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGKN+VLAGI+TG
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X17.0e-27790.55Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSE-PEPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS
        MSPTPEEPNNLQNGIE +PHISSES+Q  EPRSE  EP AD+IP +EL+QERESESV N  ADSEPES  K LSESI L VVT VTDPS EE K T  P 
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSE-PEPIADAIPKAELRQERESESV-NEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPS

Query:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D +NESEGERPE NSG+SL+QDSPH PYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGR LAPHSDWKRSQFFLNHDVLQKL IGGEQI TSPS LGRSGSTLN DRT KVEIPDI+SVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        AGLCIDTIHGP+FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+TG
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

A0A6J1EA05 protein HLB1-like6.1e-27385.15Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ--------------------------------G
        MSPTPEEPNNLQNGIE EPHIS ES Q  E +SEPE  AD +P AEL+QERE ESVN   D EP+S+                                 
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ--------------------------------G

Query:  KHLSESIQLQVVTDVTDPSFEEPKGTPIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAM
        K LSESIQLQV TDV DP FEEPKGT I SNGTENSQPALRKDEGSRTFTMRELLNGLK EDGND++NESEGE+PEANSGYSL+QDSPH PYSEQSRAAM
Subjt:  KHLSESIQLQVVTDVTDPSFEEPKGTPIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEATR CPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  VKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGS
        VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGR  APH DWKRSQFFLNHDVLQKL IGGEQ  TSP++LGRSGS
Subjt:  VKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGS

Query:  TLNCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        TLN DRT KVEIPDI+SVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+ G
Subjt:  TLNCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

A0A6J1HJU5 protein HLB1-like isoform X23.1e-27789.84Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ----GKHLSESIQLQVVTDVTDPSFEEPKGTPI
        MSP PEEPNNLQNGIE EPHIS ES Q  E +SEPE  AD IP AEL+QERESESVN  ADSEP+S+     K LSESI+LQVVTDVTDP FEEPKGT I
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQ----GKHLSESIQLQVVTDVTDPSFEEPKGTPI

Query:  PSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
         SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGND++NESEGE+PEANSGYSL+QDSPH PYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
Subjt:  PSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEATR CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKDVSPNELYSQSAIYIAAAHALKP+YS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGR  APH DWKRSQFFLNHDVLQKL IGGEQI TSP++LGRSGSTLN DRT KVEIPDI+SVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        GAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+ G
Subjt:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

A0A6J1HL68 protein HLB1-like isoform X18.0e-27385.15Show/hide
Query:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEP--------------------------------ESQG
        MSP PEEPNNLQNGIE EPHIS ES Q  E +SEPE  AD +P AEL+QERESESVN  AD EP                                +S  
Subjt:  MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEP--------------------------------ESQG

Query:  KHLSESIQLQVVTDVTDPSFEEPKGTPIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAM
        K LSESI+LQVVTDVTDP FEEPKGT I SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGND++NESEGE+PEANSGYSL+QDSPH PYSEQSRAAM
Subjt:  KHLSESIQLQVVTDVTDPSFEEPKGTPIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEAC+KYDEATR CPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        KMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  VKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGS
        VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGR  APH DWKRSQFFLNHDVLQKL IGGEQI TSP++LGRSGS
Subjt:  VKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHDVLQKLKIGGEQIHTSPSVLGRSGS

Query:  TLNCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
        TLN DRT KVEIPDI+SVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGI+ G
Subjt:  TLNCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

SwissProt top hitse value%identityAlignment
Q9FHY8 Protein HLB15.8e-18061.18Show/hide
Query:  MSPTPEEPNNLQNG-----------------IETEPHISSESEQTD---EPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQGKHLSESIQLQVV
        M+ T EEP  LQNG                 ++TEP ++ E  + +    P      + DA P+    + +  E      D++PE     +       VV
Subjt:  MSPTPEEPNNLQNG-----------------IETEPHISSESEQTD---EPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQGKHLSESIQLQVV

Query:  T----DVTDPSFEEPKGTPIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELI
        T    D+TD          IP   TE  Q +      + D+G++TFTMRELL+ LK E         EG+    +S    S++S   P   ++  AM+LI
Subjt:  T----DVTDPSFEEPKGTPIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMR
        N +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEAC+KYDEATR CPTL+DA+YNWAIAISDRAK+R
Subjt:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMR

Query:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD

Query:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHD-VLQKLKIGGEQIHTSPSVLGRSGSTL
        + P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG +LAPHSDWKR++F LNH+ +LQ LK    ++  + S    + ST 
Subjt:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHD-VLQKLKIGGEQIHTSPSVLGRSGSTL

Query:  NCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
           +T KV I +I+SV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LDGWLDAIRLVYTIYARGK+DVLAGI+TG
Subjt:  NCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG

Arabidopsis top hitse value%identityAlignment
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-18161.18Show/hide
Query:  MSPTPEEPNNLQNG-----------------IETEPHISSESEQTD---EPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQGKHLSESIQLQVV
        M+ T EEP  LQNG                 ++TEP ++ E  + +    P      + DA P+    + +  E      D++PE     +       VV
Subjt:  MSPTPEEPNNLQNG-----------------IETEPHISSESEQTD---EPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQGKHLSESIQLQVV

Query:  T----DVTDPSFEEPKGTPIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELI
        T    D+TD          IP   TE  Q +      + D+G++TFTMRELL+ LK E         EG+    +S    S++S   P   ++  AM+LI
Subjt:  T----DVTDPSFEEPKGTPIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMR
        N +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEAC+KYDEATR CPTL+DA+YNWAIAISDRAK+R
Subjt:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMR

Query:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD

Query:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHD-VLQKLKIGGEQIHTSPSVLGRSGSTL
        + P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG +LAPHSDWKR++F LNH+ +LQ LK    ++  + S    + ST 
Subjt:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFFLNHD-VLQKLKIGGEQIHTSPSVLGRSGSTL

Query:  NCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG
           +T KV I +I+SV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LDGWLDAIRLVYTIYARGK+DVLAGI+TG
Subjt:  NCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAACCGAACCACACATTTCTTCAGAATCAGAGCAAACTGATGAACCCAGATCAGAGCCGGAACC
CATAGCAGATGCAATTCCCAAAGCCGAATTACGCCAAGAACGCGAATCAGAATCAGTCAATGAAGAAGCAGATTCGGAGCCGGAGTCTCAAGGGAAACATTTATCGGAGT
CAATCCAATTACAAGTAGTGACGGATGTTACAGATCCGAGTTTTGAAGAGCCGAAAGGAACCCCGATCCCGTCCAACGGCACCGAGAACTCGCAACCTGCGCTGCGTAAA
GACGAAGGAAGCCGGACGTTTACAATGAGAGAGTTGCTGAATGGATTGAAAGGTGAAGATGGTAACGACAATGTTAACGAATCTGAAGGCGAGAGGCCTGAGGCGAACTC
TGGTTACAGTCTTAGTCAAGATAGCCCACATCATCCGTATTCTGAACAGAGCAGAGCTGCCATGGAATTGATCAACAGTGTTACAGGTGTTGATGAAGAGGGCCGTTCTC
GTCAACGGATTCTTACATTCGCTGCTAGAAGATATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTTTACAATTGGGCTTTGGTCCTCCAGGAGAGTGCA
GACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCATTGCTCGAGGAGGCTTGTAGAAAGTATGATGAGGCTACCCGTTTTTGCCCAACACTTCATGATGCTTT
TTATAATTGGGCAATTGCAATTTCTGATCGGGCCAAAATGCGTGGCCGTACAAAGGAAGCCGAAGAACTGTGGAAGCAGGCTACCAAGAATTATGAAAAAGCTGTCCAAC
TCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGGCTTGCTCTTCAGGAACTCAGTGCAATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAAAACAGCTATCAGT
AAGTTTCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACAGG
CAACGTTAAGGATGTTTCCCCCAATGAGTTGTACAGCCAATCTGCAATTTATATTGCGGCTGCTCATGCTCTAAAACCAAATTACTCCGTTTACAGCAGTGCCTTGCGGT
TGGTTCGTTCAATGCTGCCGTTACCCTATCTAAAAGTTGGATACCTGACTGCACCCCCTGTGGGGAGAGCCCTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTT
CTAAATCATGATGTATTGCAAAAGCTTAAAATAGGGGGGGAACAAATACATACATCCCCTAGTGTTTTAGGAAGATCTGGAAGTACCTTGAATTGCGATAGGACAACCAA
AGTAGAAATTCCAGATATTATCTCTGTATCAGCATGTGCAGATCTAACTTTACCGCCTGGTGCTGGACTCTGCATTGACACAATCCATGGGCCAGTTTTCTTGGTTGCTG
ACTCGTGGGATGCGCTCGATGGATGGCTCGATGCAATTAGATTAGTTTACACAATCTATGCCCGAGGCAAGAACGACGTTCTGGCAGGCATCGTAACGGGTTGA
mRNA sequenceShow/hide mRNA sequence
AAGAACTCAAAACATATATCTAGAAAAAAAAAAAAAAAACTTAATCAGTGATCAAATTGAAAATTCAGCCTTTTGATTATTATCATTATTATTTGAGGCAATATTTTGGA
TTTACAGTCTAAGCCACTTGTGTGTCACTCATCCATGGCGGAGTAATTACACGAGGGGTTTGGAATTCTGTAATTTTGTAATTGAGCATTTTCTCTTTCGTCTTTCAGTT
TCGTTCTTAGCGGATCTTCAATCTGCTCGTTTTCCGGAACCTTGCTTCTATCCCATTCTCTTCCTGTTCGCCACTTCACCATGTCGCCTACTCCCGAGGAACCTAATAAT
TTGCAGAACGGAATCGAAACCGAACCACACATTTCTTCAGAATCAGAGCAAACTGATGAACCCAGATCAGAGCCGGAACCCATAGCAGATGCAATTCCCAAAGCCGAATT
ACGCCAAGAACGCGAATCAGAATCAGTCAATGAAGAAGCAGATTCGGAGCCGGAGTCTCAAGGGAAACATTTATCGGAGTCAATCCAATTACAAGTAGTGACGGATGTTA
CAGATCCGAGTTTTGAAGAGCCGAAAGGAACCCCGATCCCGTCCAACGGCACCGAGAACTCGCAACCTGCGCTGCGTAAAGACGAAGGAAGCCGGACGTTTACAATGAGA
GAGTTGCTGAATGGATTGAAAGGTGAAGATGGTAACGACAATGTTAACGAATCTGAAGGCGAGAGGCCTGAGGCGAACTCTGGTTACAGTCTTAGTCAAGATAGCCCACA
TCATCCGTATTCTGAACAGAGCAGAGCTGCCATGGAATTGATCAACAGTGTTACAGGTGTTGATGAAGAGGGCCGTTCTCGTCAACGGATTCTTACATTCGCTGCTAGAA
GATATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTTTACAATTGGGCTTTGGTCCTCCAGGAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCT
TCTAAAGATGCATTGCTCGAGGAGGCTTGTAGAAAGTATGATGAGGCTACCCGTTTTTGCCCAACACTTCATGATGCTTTTTATAATTGGGCAATTGCAATTTCTGATCG
GGCCAAAATGCGTGGCCGTACAAAGGAAGCCGAAGAACTGTGGAAGCAGGCTACCAAGAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTAAATA
ATTGGGGGCTTGCTCTTCAGGAACTCAGTGCAATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAAAACAGCTATCAGTAAGTTTCGTGCTGCTATACAGTTGCAATTT
GATTTTCATCGAGCAATCTACAACCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACAGGCAACGTTAAGGATGTTTCCCCCAATGAGTT
GTACAGCCAATCTGCAATTTATATTGCGGCTGCTCATGCTCTAAAACCAAATTACTCCGTTTACAGCAGTGCCTTGCGGTTGGTTCGTTCAATGCTGCCGTTACCCTATC
TAAAAGTTGGATACCTGACTGCACCCCCTGTGGGGAGAGCCCTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAAA
ATAGGGGGGGAACAAATACATACATCCCCTAGTGTTTTAGGAAGATCTGGAAGTACCTTGAATTGCGATAGGACAACCAAAGTAGAAATTCCAGATATTATCTCTGTATC
AGCATGTGCAGATCTAACTTTACCGCCTGGTGCTGGACTCTGCATTGACACAATCCATGGGCCAGTTTTCTTGGTTGCTGACTCGTGGGATGCGCTCGATGGATGGCTCG
ATGCAATTAGATTAGTTTACACAATCTATGCCCGAGGCAAGAACGACGTTCTGGCAGGCATCGTAACGGGTTGATTATTACCAAGTATGCAAATGTATTATTGATATTAC
CTTGATCCATGTTTATATTATGCTTACTCACAGTAGATTGAGTATTGATTTCTCCAAATTGAAACACAAAATTTTGGGGTGCTTTCCAATGGTGAAGAATACATGTTTTT
CTAGTCAGATTTCCTTTTCATTTGTGATAATCTATGTACATGTCAGTTTGTTAACAGCACGCACAAG
Protein sequenceShow/hide protein sequence
MSPTPEEPNNLQNGIETEPHISSESEQTDEPRSEPEPIADAIPKAELRQERESESVNEEADSEPESQGKHLSESIQLQVVTDVTDPSFEEPKGTPIPSNGTENSQPALRK
DEGSRTFTMRELLNGLKGEDGNDNVNESEGERPEANSGYSLSQDSPHHPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESA
DNVSPDSTSPSKDALLEEACRKYDEATRFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAIS
KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRALAPHSDWKRSQFF
LNHDVLQKLKIGGEQIHTSPSVLGRSGSTLNCDRTTKVEIPDIISVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIVTG