; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g29570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g29570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein HLB1
Genome locationchr10:22349826..22357385
RNA-Seq ExpressionMoc10g29570
SyntenyMoc10g29570
Gene Ontology termsGO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus]2.6e-25485.35Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS
        MSPTPEEP NLQNG + QPHI SES Q  E  S PE   V +IP    Q++RESESV   ++  P SEPES R+Q SESI L VVT  TDP   + +E S
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS

Query:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
         PSNG  +N  PALRKDEGSRTFTMRELLNGLKG+DG+DS+NESEGERPE NSG+S  QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
Subjt:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR

Query:  YASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQL
        YASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQATKNYEKAVQL
Subjt:  YASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQL

Query:  NWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPN
        NWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KDVSPN+LYSQSAIYIAAAHALKPN
Subjt:  NWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPN

Query:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTL
        YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS        LNG+RTIKVEIPDIVSVSACADLTL
Subjt:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTL

Query:  PPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        PPGAGLCIDTIHGP+FLVAD+WD LDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  PPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

XP_022145328.1 protein HLB1 [Momordica charantia]4.4e-302100Show/hide
Query:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
        MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
Subjt:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA

Query:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIER
        DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIER
Subjt:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIER

Query:  NAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQA
        NAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQA
Subjt:  NAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQA

Query:  LNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSA
        LNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSA
Subjt:  LNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSA

Query:  LRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV
        LRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV
Subjt:  LRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV

Query:  FLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        FLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
Subjt:  FLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

XP_022965251.1 protein HLB1-like isoform X1 [Cucurbita maxima]8.9e-25581.35Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    +P                                QQ+RESESV+  AD+EP+SE +S R
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR

Query:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAM
        +Q SESI+LQVVTD TDPR  +P+  SI SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG+S  QDSPHQPYSEQSRAAM
Subjt:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        K+RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------
         KDVSPN+LYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP        
Subjt:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------

Query:  SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        +LNG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Subjt:  SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima]3.9e-25885.66Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    IP    QQ+RESESV+  AD+EP+SE +S R+Q SESI+LQVVTD TDPR  +P+  SI
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI

Query:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
         SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG+S  QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
Subjt:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQAT+NYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG  KDVSPN+LYSQSAIYIAAAHALKP+YS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP        +LNG+RT+KVEIPDIVSVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        GAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Subjt:  GAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

XP_038876586.1 protein HLB1 [Benincasa hispida]2.6e-25485.51Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATI----PQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI
        MSPTPEEP NLQNG + QPHI  ES QT E  S+PE     I      Q+RESESV+   +    SEP SRR+Q  ESI LQV TD  DPR  + +E SI
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATI----PQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI

Query:  PSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRY
        PSNG  +NS PALRKDEGSRTFTMRELLNGLKG+DGNDS+NESEGERPE N G+S  QDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAARRY
Subjt:  PSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRY

Query:  ASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLN
        ASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQATKNYEKAVQLN
Subjt:  ASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLN

Query:  WNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNY
        WNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN KDVSPN+LYSQSAIYIAAAHALKPNY
Subjt:  WNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNY

Query:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTLP
        SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPH DWKRSQFFLNHDVLQKLNIGGEQIQTSPS        LNG+ TIKVEIPDIVSVSACADLTLP
Subjt:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        PGAGLCIDTIHGPVFLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  PGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

TrEMBL top hitse value%identityAlignment
A0A0A0L688 Uncharacterized protein1.3e-25485.35Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS
        MSPTPEEP NLQNG + QPHI SES Q  E  S PE   V +IP    Q++RESESV   ++  P SEPES R+Q SESI L VVT  TDP   + +E S
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEAS

Query:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
         PSNG  +N  PALRKDEGSRTFTMRELLNGLKG+DG+DS+NESEGERPE NSG+S  QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
Subjt:  IPSNG-ADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR

Query:  YASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQL
        YASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQATKNYEKAVQL
Subjt:  YASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQL

Query:  NWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPN
        NWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KDVSPN+LYSQSAIYIAAAHALKPN
Subjt:  NWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPN

Query:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTL
        YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS        LNG+RTIKVEIPDIVSVSACADLTL
Subjt:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTL

Query:  PPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        PPGAGLCIDTIHGP+FLVAD+WD LDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  PPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

A0A6J1CUX3 protein HLB12.1e-302100Show/hide
Query:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
        MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA
Subjt:  MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGA

Query:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIER
        DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIER
Subjt:  DNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIER

Query:  NAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQA
        NAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQA
Subjt:  NAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQA

Query:  LNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSA
        LNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSA
Subjt:  LNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSA

Query:  LRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV
        LRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV
Subjt:  LRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV

Query:  FLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        FLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
Subjt:  FLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

A0A6J1EA05 protein HLB1-like6.2e-25481.17Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR
        MSPTPEEP NLQNG + +PHI  ES Q  ES S+PES    +P                                QQ+RESESV+  AD+E +SE +S R
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR

Query:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAM
        +Q SESIQLQV TD  DPR  +P+  SI SNG +NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG+S  QDSPHQPYSEQSRAAM
Subjt:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        K+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------
         KDVSPN+LYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP        
Subjt:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------

Query:  SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        +LNG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Subjt:  SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

A0A6J1HJU5 protein HLB1-like isoform X21.9e-25885.66Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    IP    QQ+RESESV+  AD+EP+SE +S R+Q SESI+LQVVTD TDPR  +P+  SI
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASI

Query:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
         SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG+S  QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
Subjt:  PSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQAT+NYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG  KDVSPN+LYSQSAIYIAAAHALKP+YS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP        +LNG+RT+KVEIPDIVSVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------SLNGERTIKVEIPDIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        GAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Subjt:  GAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

A0A6J1HL68 protein HLB1-like isoform X14.3e-25581.35Show/hide
Query:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR
        MSP PEEP NLQNG + +PHI  ES Q  ES S+PES    +P                                QQ+RESESV+  AD+EP+SE +S R
Subjt:  MSPTPEEP-NLQNGNQTQPHIPSESQQTEESGSDPESRVATIP--------------------------------QQQRESESVDEEADAEPRSEPESRR

Query:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAM
        +Q SESI+LQVVTD TDPR  +P+  SI SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG+S  QDSPHQPYSEQSRAAM
Subjt:  EQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV  DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        K+RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------
         KDVSPN+LYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP        
Subjt:  AKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSP--------

Query:  SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        +LNG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Subjt:  SLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

SwissProt top hitse value%identityAlignment
Q9FHY8 Protein HLB12.0e-17762.28Show/hide
Query:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV
        M+ T EEP LQNG         Q  IP    QTE   +G  PE      P+  Q   +++  EE  +E + E            E++ E   E +Q  VV
Subjt:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV

Query:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELI
        TD        D   G  EE  I S     +++   L+K D+G++TFTMRELL+ LK ++G+ + +         +S   F+++S  QP   ++  AM+LI
Subjt:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIR
        N +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNV  DS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDRAKIR
Subjt:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIR

Query:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKD

Query:  VSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKL---------NIGGEQIQTSPS
        + P +LYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L         N+ G+    S +
Subjt:  VSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKL---------NIGGEQIQTSPS

Query:  LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        +   +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVAD+W++LDGWLDAIRLVYTIYARGK+DVLAGII G
Subjt:  LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG

Arabidopsis top hitse value%identityAlignment
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-17862.28Show/hide
Query:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV
        M+ T EEP LQNG         Q  IP    QTE   +G  PE      P+  Q   +++  EE  +E + E            E++ E   E +Q  VV
Subjt:  MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESESVDEEADAEPRSE-----------PESRREQSSESIQLQVV

Query:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELI
        TD        D   G  EE  I S     +++   L+K D+G++TFTMRELL+ LK ++G+ + +         +S   F+++S  QP   ++  AM+LI
Subjt:  TDA------TDPRSGDPEEASIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIR
        N +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNV  DS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDRAKIR
Subjt:  NSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIR

Query:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKD

Query:  VSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKL---------NIGGEQIQTSPS
        + P +LYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L         N+ G+    S +
Subjt:  VSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKL---------NIGGEQIQTSPS

Query:  LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
        +   +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVAD+W++LDGWLDAIRLVYTIYARGK+DVLAGII G
Subjt:  LNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCTACTCCCGAGGAGCCTAATTTGCAGAACGGAAACCAAACCCAACCGCACATTCCGTCAGAATCACAGCAAACTGAAGAATCCGGATCGGACCCAGAATCCAG
AGTTGCCACAATTCCCCAACAGCAACGCGAATCAGAATCGGTTGATGAAGAAGCAGATGCGGAGCCTCGATCGGAGCCGGAGTCTCGGAGGGAACAGTCGTCGGAGTCCA
TCCAGTTGCAGGTGGTGACGGATGCCACAGATCCCAGGTCCGGTGATCCCGAGGAAGCCTCGATCCCGTCCAACGGCGCCGACAACTCGCATCCCGCCCTGCGGAAGGAC
GAAGGAAGCCGGACGTTCACCATGAGGGAGCTGCTGAATGGATTGAAAGGTGATGATGGTAACGACAGCGTTAATGAATCGGAAGGCGAGAGGCCCGAGGCTAACTCCGG
TCACAGTTTTACTCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAATAGTGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGCC
AACGCATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTAGTCCTCCAGGAGAGTGCAGAT
AATGTTGGTGCAGATTCCTCTTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTTTA
TAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATTCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCTACCAAAAACTATGAAAAAGCTGTTCAACTCA
ATTGGAACAGTCCCCAGGCGCTAAATAATTGGGGACTCGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAG
TTTCGTGCAGCAATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTAAGGACCGGAGGAACAGGAAA
TGCTAAGGATGTTTCCCCTAATGACTTGTACAGCCAATCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCCTGCGGTTGG
TTCGTTCAATGCTGCCGTTGCCGTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTA
AATCATGATGTATTGCAAAAGCTTAACATAGGGGGGGAGCAAATACAAACATCACCTAGCTTGAATGGCGAGAGGACAATCAAAGTAGAAATTCCAGATATTGTCTCAGT
ATCAGCATGCGCAGATCTAACTCTACCACCCGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTCGCTGACACGTGGGACGCGCTTGATGGATGGC
TTGATGCAATTAGATTAGTTTACACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATCATAGCAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCTACTCCCGAGGAGCCTAATTTGCAGAACGGAAACCAAACCCAACCGCACATTCCGTCAGAATCACAGCAAACTGAAGAATCCGGATCGGACCCAGAATCCAG
AGTTGCCACAATTCCCCAACAGCAACGCGAATCAGAATCGGTTGATGAAGAAGCAGATGCGGAGCCTCGATCGGAGCCGGAGTCTCGGAGGGAACAGTCGTCGGAGTCCA
TCCAGTTGCAGGTGGTGACGGATGCCACAGATCCCAGGTCCGGTGATCCCGAGGAAGCCTCGATCCCGTCCAACGGCGCCGACAACTCGCATCCCGCCCTGCGGAAGGAC
GAAGGAAGCCGGACGTTCACCATGAGGGAGCTGCTGAATGGATTGAAAGGTGATGATGGTAACGACAGCGTTAATGAATCGGAAGGCGAGAGGCCCGAGGCTAACTCCGG
TCACAGTTTTACTCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAATAGTGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGCC
AACGCATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTAGTCCTCCAGGAGAGTGCAGAT
AATGTTGGTGCAGATTCCTCTTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTTTA
TAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATTCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCTACCAAAAACTATGAAAAAGCTGTTCAACTCA
ATTGGAACAGTCCCCAGGCGCTAAATAATTGGGGACTCGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAG
TTTCGTGCAGCAATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTAAGGACCGGAGGAACAGGAAA
TGCTAAGGATGTTTCCCCTAATGACTTGTACAGCCAATCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCCTGCGGTTGG
TTCGTTCAATGCTGCCGTTGCCGTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTA
AATCATGATGTATTGCAAAAGCTTAACATAGGGGGGGAGCAAATACAAACATCACCTAGCTTGAATGGCGAGAGGACAATCAAAGTAGAAATTCCAGATATTGTCTCAGT
ATCAGCATGCGCAGATCTAACTCTACCACCCGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTCGCTGACACGTGGGACGCGCTTGATGGATGGC
TTGATGCAATTAGATTAGTTTACACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATCATAGCAGGCTGA
Protein sequenceShow/hide protein sequence
MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKD
EGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESAD
NVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISK
FRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFL
NHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG