; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014698 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014698
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNeuronal PAS domain protein
Genome locationChr02:18117244..18121628
RNA-Seq ExpressionHG10014698
SyntenyHG10014698
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030639.1 hypothetical protein SDJN02_04676, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0087.17Show/hide
Query:  MSSLLLNCVHDVLYYGSNQRQNSSHNFLKLDITSNSKEIFNLAFLTLIFLICIYEAPVDLRSNCLMTLKHHLANSTSRQISKVLMKLLGPNLEQQWMRSL
        MS+LLLNCVHDVLYYGSNQR+NSSH  LKLDITS+S+EIFNLAFLTLIFLICIYEAP DLRSNCLMTLKHHLANSTSRQISKVLMKLLG NLE+QWMRS+
Subjt:  MSSLLLNCVHDVLYYGSNQRQNSSHNFLKLDITSNSKEIFNLAFLTLIFLICIYEAPVDLRSNCLMTLKHHLANSTSRQISKVLMKLLGPNLEQQWMRSL

Query:  NLAITNWILELKAAGSTLKTPSPLFSYSFLTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYRAVVREKWIDLRVHVDNIRCDI
        NLAITNW+LELKA G TLKTPSPL+SYSF T+GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+ VVR+KWID+RVHVDNIRCDI
Subjt:  NLAITNWILELKAAGSTLKTPSPLFSYSFLTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYRAVVREKWIDLRVHVDNIRCDI

Query:  IRLVNETLLSERGVGGSEKHFPSRISLQLTPTFHTNIMSVSVSKSSSNPQIDIGTEKTFEAGFESATPYPGLKLAVGETVTVSMKPWKFEQLVHGNAATL
        +RLVNETLLSERGVGGSEKHFPSRISLQLTPT HTNIMSVSVSKSS+NP+I++GTE+TFEAGFE +TPYPGLKL+VGET  VS+KPWKFEQ VHGNAA L
Subjt:  IRLVNETLLSERGVGGSEKHFPSRISLQLTPTFHTNIMSVSVSKSSSNPQIDIGTEKTFEAGFESATPYPGLKLAVGETVTVSMKPWKFEQLVHGNAATL

Query:  NWYLHDSSDGKEVASTKPSKLALINPKSWFRDRYSSANRPFNRQGGVIFAGDEYGDSVWWKIDGKARGKTMEW-EIRGWIWNLPPLSQWKTTSISTSICS
        NWYLHDSSDGKEVASTKPSKL LINPK+WFRDRYSSA+RPFN+QGG+IFAGDEYG++VWWKIDGKARGKTM++ ++  WI NLPPLSQWKTTSISTSICS
Subjt:  NWYLHDSSDGKEVASTKPSKLALINPKSWFRDRYSSANRPFNRQGGVIFAGDEYGDSVWWKIDGKARGKTMEW-EIRGWIWNLPPLSQWKTTSISTSICS

Query:  SSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSNQQKNFSFSLLKLDITFNSKEIFNLAF
        SSS+NSSL+VVAAKSLHS TITLSVIADFSLPISLW+SEPLK STKSS L DDQESISSLL NC+RDVLHYGS+QQKNFSF  LKL+ITFN KEIFN+ F
Subjt:  SSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSNQQKNFSFSLLKLDITFNSKEIFNLAF

Query:  LTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTLKTPSPLFSYSFSTHGLWKVQLYCPVI
        L L+FLICIYEAPT LRLDCLT LKYHL N  SRQ SKMLMKLLGSN+EEQWMRSINLAITNWI+ELKANSC LKTPSPLFSYSFSTHGLWKVQLYCPVI
Subjt:  LTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTLKTPSPLFSYSFSTHGLWKVQLYCPVI

Query:  ATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSEKHFPSRISLQLTPTLQTNIISVSVSK
        A D IENS +PSTDERLQ SLNYHQLEG+LQFNYKAEV EKWINLRVHVDNIRCNII LVND L+SKRGVG SEK+FPSRISLQLTPTLQTNI+SVSVSK
Subjt:  ATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSEKHFPSRISLQLTPTLQTNIISVSVSK

Query:  SSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKPSRFALINPRAWFRDRYSSAFRPFNKQ
        SSDNP IEVGTEKTLEAGFE  NPYPG+KLAVGET TASL+PWKFEQ VYGNTGILNWYLHDSSDGKEVASRKPS+ ALINPRAWFRDRYSSAFRPFN+Q
Subjt:  SSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKPSRFALINPRAWFRDRYSSAFRPFNKQ

Query:  GGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        GGVIFAGDEYGE + WKI+  AR KT+EWEIRGWIWLTYWPNKH TFYTETRRLEFKE+LH+SIP
Subjt:  GGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

XP_008456895.1 PREDICTED: uncharacterized protein LOC103496710 [Cucumis melo]1.3e-25590.02Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWKTTSIST ICSSSSTNSSL+VVAAKSLHSPTITLSVIADFSLPISLWTSEPLK +TKSS L DDQESISSLL NCVRDVLHYGSN
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQ  F+ S  KL+ITF SKEIFNL FLTLIFLICIYEAPT LRLD LT +KYHLANCWSRQTSK+ MKLLGSNLEEQWMRSINLAITNWILELKAN CTL
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYS+STHGLWKVQLYCPVIA D IENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRC+IIQLVNDTLMSKRGVGRSE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        KHFPSRISLQ+TP +QTNIISVSVSKSSDNP IEVGTEK+LEAGFEGQNPYPGIKLAVGETATASL+PWKFEQ+VYGNTGILNWYLHDSSDGKEVA RKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        SRFALINPRAWFRDRY+SAFRPFNKQGGVIFA DEYG+ ICWKIEREAR KTMEWEIRGWIWLTYWPNKH TFYTETRRLE KE+LH SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

XP_011655022.1 uncharacterized protein LOC105435469 [Cucumis sativus]1.7e-25589.41Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWK+TSIST ICSSSSTNSSL+VVAAKSLHSPTITLSVIADFSLPISLW SEPLK STKSS L DDQE++ SLL NCVRDVLHYGSN
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQ  F+ S  KL+ITFN KEIFNLAFLTLIFLICIYEAPT LRLD LT +KYHLANCWSRQTSK+ MKLLGSNLEEQWMRSINLAITNWILELKAN CTL
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYS+STHGLWKVQLYCPVIA D IENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRC +IQLVNDTLMSKRGVGRSE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        KHFPS+ISLQ+TPT+QTNIISVSVSKSS NP IEVGTEKTLEAGFEGQNPYP IKLAVGETATASLRPWKFEQ+V+GNTGILNWYLHDSSDGKEVA RKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        SRFALINPRAWFRDRY+SAFRPFNKQGGVIFAGDEYG+ ICWKIERE RGKTM+WEIRGWIWLTYWPNKH TFYTETRRLEFKEILH SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

XP_023544488.1 uncharacterized protein LOC111804047 [Cucurbita pepo subsp. pepo]8.4e-24786.76Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWKTTSISTSICSSSS+NSSL+VVAAKSLHS TITLSVIADFSLPISLW+SEPLK STKSS L DDQESISSLL NC+RDVLHYGS+
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQKNFSF  LKL+ITFN KEIFN+ FL L+FLICIYEAPTGLRLDCLT LKYHL N  SRQ SKMLMKLLGSN+EEQWMRSINLAITNWI+ELKANSC L
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYSFSTHGLWKVQLYCPVIA D IENS +PSTDERLQ SLNYHQLEGVLQFNYKAEV EKWINLRVHVDNIRCNII LVND L+SKRGVG SE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        K+FPSRISLQLTPTLQTNI+SVSVSKSSDNP IEVG EKTLEAGFE  NPYPG+KLAVGET TASL+PWKFEQ VYGNTGILNWYLHDSSDGKEVASRKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        S+ ALINPRAWFRDRYSSAFRPFN+QGGVIFAGDEYGE + WKI+  AR KT+EWEIRGWIWLTYWPNKH TFYTETRRLEFKE+LH+SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

XP_038892408.1 uncharacterized protein LOC120081521 isoform X1 [Benincasa hispida]9.9e-26492.26Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWK TSIST I SSS TNSSLDVVAAKSLHSP ITLSVIADFSLPISLWTSEPLK +TKSS L DDQESISSLL NCV DVLHYGSN
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQ NF+FS LKL+ITFNSKEIFNL FLTLIFLICIYEAPTGLRLDCLT LKYHLANCWSRQTSK+LMKLLGSNLEEQWMRSINLAITNWILELKAN CTL
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYSFSTHGLWKVQLYCPVIA D+IENSSSPS DERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        KHFPSRISLQ+TPTLQTNI S+SVSKSSDNP IEVGTEKTLEAGFEGQNPYPGIKL VGETATASL+PWKFEQ+VYGNTGILNWYLHDSSDGKEVA RKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        SRFALINPRAWFRDRYSSA RPFNKQGGVIFAGDEYGERICWKIEREARGK+MEWEIRGWIWLTYWPNKH TFYTETRRLEFKEILHLSIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

TrEMBL top hitse value%identityAlignment
A0A0A0KQL8 Uncharacterized protein8.2e-25689.41Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWK+TSIST ICSSSSTNSSL+VVAAKSLHSPTITLSVIADFSLPISLW SEPLK STKSS L DDQE++ SLL NCVRDVLHYGSN
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQ  F+ S  KL+ITFN KEIFNLAFLTLIFLICIYEAPT LRLD LT +KYHLANCWSRQTSK+ MKLLGSNLEEQWMRSINLAITNWILELKAN CTL
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYS+STHGLWKVQLYCPVIA D IENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRC +IQLVNDTLMSKRGVGRSE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        KHFPS+ISLQ+TPT+QTNIISVSVSKSS NP IEVGTEKTLEAGFEGQNPYP IKLAVGETATASLRPWKFEQ+V+GNTGILNWYLHDSSDGKEVA RKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        SRFALINPRAWFRDRY+SAFRPFNKQGGVIFAGDEYG+ ICWKIERE RGKTM+WEIRGWIWLTYWPNKH TFYTETRRLEFKEILH SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

A0A1S3C4X7 uncharacterized protein LOC1034967106.3e-25690.02Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWKTTSIST ICSSSSTNSSL+VVAAKSLHSPTITLSVIADFSLPISLWTSEPLK +TKSS L DDQESISSLL NCVRDVLHYGSN
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQ  F+ S  KL+ITF SKEIFNL FLTLIFLICIYEAPT LRLD LT +KYHLANCWSRQTSK+ MKLLGSNLEEQWMRSINLAITNWILELKAN CTL
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYS+STHGLWKVQLYCPVIA D IENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRC+IIQLVNDTLMSKRGVGRSE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        KHFPSRISLQ+TP +QTNIISVSVSKSSDNP IEVGTEK+LEAGFEGQNPYPGIKLAVGETATASL+PWKFEQ+VYGNTGILNWYLHDSSDGKEVA RKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        SRFALINPRAWFRDRY+SAFRPFNKQGGVIFA DEYG+ ICWKIEREAR KTMEWEIRGWIWLTYWPNKH TFYTETRRLE KE+LH SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

A0A5D3DQP4 Uncharacterized protein6.3e-25690.02Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWKTTSIST ICSSSSTNSSL+VVAAKSLHSPTITLSVIADFSLPISLWTSEPLK +TKSS L DDQESISSLL NCVRDVLHYGSN
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQ  F+ S  KL+ITF SKEIFNL FLTLIFLICIYEAPT LRLD LT +KYHLANCWSRQTSK+ MKLLGSNLEEQWMRSINLAITNWILELKAN CTL
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYS+STHGLWKVQLYCPVIA D IENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRC+IIQLVNDTLMSKRGVGRSE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        KHFPSRISLQ+TP +QTNIISVSVSKSSDNP IEVGTEK+LEAGFEGQNPYPGIKLAVGETATASL+PWKFEQ+VYGNTGILNWYLHDSSDGKEVA RKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        SRFALINPRAWFRDRY+SAFRPFNKQGGVIFA DEYG+ ICWKIEREAR KTMEWEIRGWIWLTYWPNKH TFYTETRRLE KE+LH SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

A0A6J1FT79 uncharacterized protein LOC1114470804.5e-24686.35Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLSQWKTTSISTSICSSSS+NSSL+VVAAKSLHS TITLSVIADFSLPISLW+SEPLK STKSS L DDQESISSLL NC+RDVLHYGS+
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQKNFSF  LKL+ITFN KEIFN+ FL L+FLICIYEAPT LRLDCLT LKYHL N  SRQ SKMLMKLLGSN+EEQWMRSINLAITNWI+ELKANSC L
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFSYSFSTHGLWKVQLYCPVIA D IENS +PSTDERLQ SLNYHQLEGVLQFNYKAEV EKWINLRVHVDNIRCNII LVND L+SKRGVG SE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        K+FPSR+SLQLTPTLQTNI+SVSVSKSSDNP IEVGTEKTLEAGFE  NPYPG+KLAVGET TASL+PWKFEQ VYGNTGILNWYLHDSSDGKEVASRKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        S+ ALINPRAWFRDRYSSA RPFN+QGGVIFAGDEYGE + WKI+  AR KT+EWEIRGWIWLTYWPNKH TFYTETRRLEFKE+LH+SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

A0A6J1K458 uncharacterized protein LOC1114910172.7e-24385.54Show/hide
Query:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        ++  WI NLPPLS+WKTTSISTSICSSSS+NSSL+VVAAKSLHS TITLSVIADFSLPISLW+SEPLK STKSS L DDQESISSLL NC+RDVLHYGS+
Subjt:  EIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL
        QQKNFSF  LKL+ITFN KEIFN+ FL L+FLICIYEAPTGLRLDCLT LKYHL N  SRQ SKMLMKLLGSN+EEQWMRSINLAITNWI+ELKANSC L
Subjt:  QQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTL

Query:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE
        KTPSPLFS SFSTHG WKVQLYCPVIA D+IENS +PST+ERLQ SLNYHQLEGVLQFNYKAEV EKWINLRVHVDNIRCNII LVND L+SKRGVG SE
Subjt:  KTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSE

Query:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP
        K+FPSRISLQLTPTLQTNI+SVSVSKSSDNP IEVGT+KTLEAGFE  NPYPG+KLAVGET TASL+PWKFEQ VYGNTGILNWYLHDSSDGKEVASRKP
Subjt:  KHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKP

Query:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        S+ ALINPRAWFRDRYSSAFRPFN+QGGVIFAGDE GE + WKI+  AR KT+EWEIRGWIWLTYWPNKH TFY ETRRLEFKE+LH+SIP
Subjt:  SRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15020.1 unknown protein1.2e-4125.79Show/hide
Query:  WIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSL----HSPTITLSVIAD-FSL--PISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHY
        WI  LP   ++  +        +     S+ + A ++L     S ++T +V+A+ F+L    ++W S    +S++   L    + +  L+        H 
Subjt:  WIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSL----HSPTITLSVIAD-FSL--PISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHY

Query:  GS---NQQKNFSFSLLKLDITFNSKE----IFNLAFLTLIFLICIYEAPT---GLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAIT
        G+    +Q     S +   +  +S E    +FNL  LT +F +C+++AP+         L     +   C      +  +  LG + E   +R+ + A++
Subjt:  GS---NQQKNFSFSLLKLDITFNSKE----IFNLAFLTLIFLICIYEAPT---GLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAIT

Query:  NWI---------LELKANSCTLKTPSPL-FSYSFSTHGLWKVQLYCPVIATDQIENSSS---------PSTDER---LQFSLNYHQLEGVLQFNYKAEVH
         W+         L LK  S +L     L FSY+   HGLW ++ Y P+++ +   NSS+         P  + +   L+++L++ Q E ++QF Y  + +
Subjt:  NWI---------LELKANSCTLKTPSPL-FSYSFSTHGLWKVQLYCPVIATDQIENSSS---------PSTDER---LQFSLNYHQLEGVLQFNYKAEVH

Query:  EKWINLRVHVDNIRCNIIQLVNDTLMSKRGVG---------RSEKHFPSRISLQLTPTLQTNIIS-VSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIK
        E +I +   VDNIR ++ +L       K GVG           E++FPSR+ + L P L ++ +S +S+ +S+ N   ++   + L+  F      P +K
Subjt:  EKWINLRVHVDNIRCNIIQLVNDTLMSKRGVG---------RSEKHFPSRISLQLTPTLQTNIIS-VSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIK

Query:  LAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKPSRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEW
                  ++ W+ EQ   GN  + +  L+D   G+EV + KP            +         F K GG++F  DEYG+ + W++ RE  G  ++W
Subjt:  LAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKPSRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEW

Query:  EIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSI
         + G IWLTYWPNK NT + ETR +E+ + + L +
Subjt:  EIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSI

AT2G40390.1 unknown protein3.6e-15555.12Show/hide
Query:  WIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPT-ITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSNQQK
        W+  LPPLS WK   +S  ICS +S++ SL+    ++  SP   T S++A+F  PI+L+ S+  +  + +S    ++  IS+LL   V  VL+Y + ++ 
Subjt:  WIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPT-ITLSVIADFSLPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSNQQK

Query:  NFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTLKTP
          S  L  L  T N K++FNLAF T +FLICIYEAPT LR  CL  +K  L  C SRQ SK+LM  LGSNLEEQWMRS+NLAITNWI+E+KA    LK+P
Subjt:  NFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKANSCTLKTP

Query:  SPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSEKHF
        SPLFSY+FST GLWKV +YCPV+A  ++E+ +S   DERL FSLNYHQLEGV+Q N++  V EKW N+ V++DN+RC+II+LVN+ L+S+RG+G  EKHF
Subjt:  SPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRGVGRSEKHF

Query:  PSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKPSRF
        PSRISLQLTPT Q+NI+ VSV KSS+NP  E   EK +EA  +  N + G+K++  ET T S++PWKFE+ V+G +  L W+LHD  DG+EV+S KPS+ 
Subjt:  PSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVASRKPSRF

Query:  ALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP
        +++NPRAWF++RYSSAFRPF KQGGV+FAGD YG+ + WK+++ A GK ME+E++G +WLTYWPNKH+TFY++TR+LEFKE+L+L++P
Subjt:  ALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP

AT5G64190.1 unknown protein2.5e-14052.42Show/hide
Query:  WIWNLPPLSQWKTTSISTSICSSSS--TNSSLDVVAAKSLHSPTITLSVIADFS--LPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN
        WI N+P +++W+TTS+   IC S+S   NS+L++ A KS     +T S+I   +   P+ LWT++       +S    D+ +I SLL N V  +L Y SN
Subjt:  WIWNLPPLSQWKTTSISTSICSSSS--TNSSLDVVAAKSLHSPTITLSVIADFS--LPISLWTSEPLKISTKSSILRDDQESISSLLHNCVRDVLHYGSN

Query:  QQKNFSFSLLKLDITFNS-----KEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKA
             ++S +K+  + +S     K+I N   LTL F++C+YEAP  LR +CL  LK HL  C +R+ +  LMKLLGSNLEEQWMR++NLA TNWI+E + 
Subjt:  QQKNFSFSLLKLDITFNS-----KEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQWMRSINLAITNWILELKA

Query:  NSCTLKTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRG
        +  T  T +PLFSY+ S +GLWKVQLYCPV A  ++E SS+P+ D RL FSL ++QLEGV+QFN+K  V + WI++ V +DNIR ++I+LVN+ LMS+RG
Subjt:  NSCTLKTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVNDTLMSKRG

Query:  VGRSEKHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSS-DGKE
         G  EKHFPSRISLQLTPTLQT+ ISVSVSKSS+NP  E   E+++E  F+  N   G+++A  E +T ++ PWK EQ V G T  LNW L+DSS  G+E
Subjt:  VGRSEKHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSS-DGKE

Query:  VASRKPSRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSI
        V S KPSRF++++PR+WF+DRY+ A+R F ++GGVIFAGDEYGE + WKI + A G TMEWEI+G+IWLTYWPNK+ TFY ETRRLEF ++L+L+I
Subjt:  VASRKPSRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGTCTCTTGCTTAATTGTGTTCATGATGTTCTTTATTATGGCTCAAACCAAAGACAGAATTCTAGCCATAACTTTCTTAAACTTGACATCACTTCCAACTCAAA
AGAAATCTTCAATCTCGCTTTTCTTACCCTCATATTCCTAATTTGCATATATGAAGCTCCGGTTGATCTCCGTTCGAATTGTCTCATGACTCTCAAGCATCATTTGGCAA
ATTCTACGTCCAGGCAGATATCAAAGGTGCTTATGAAACTGCTGGGGCCTAATCTAGAACAGCAATGGATGAGGTCCTTAAACCTTGCAATCACCAACTGGATATTGGAG
CTCAAGGCCGCTGGCAGTACTTTAAAAACACCCTCACCTTTGTTCTCTTATTCATTTTTGACATATGGGCTGTGGAAAGTTCAACTTTATTGTCCTATCATTGCAATGGA
TAATATTGAGAACTCAAGTAATCCTTCAACCGACGAAAGATTGCAGTTCTCTTTAAATTATCACCAGCTTGAAGGGGTTTTGCAGTTCAATTACAGGGCTGTGGTTCGAG
AAAAGTGGATTGATCTGAGGGTGCACGTTGATAACATAAGGTGTGACATAATCCGGCTTGTGAATGAGACTCTCTTATCCGAGAGAGGAGTTGGTGGATCAGAAAAGCAT
TTTCCATCACGGATTTCACTGCAACTCACTCCAACTTTCCACACAAATATCATGAGTGTCTCAGTAAGCAAATCCTCAAGTAACCCCCAAATCGATATTGGAACTGAAAA
AACCTTTGAGGCTGGTTTTGAATCTGCAACACCATACCCAGGCCTCAAATTAGCAGTAGGGGAGACTGTAACGGTGAGCATGAAGCCATGGAAATTTGAGCAGTTAGTCC
ACGGCAATGCTGCAACCCTTAACTGGTACCTTCACGACAGTTCAGATGGGAAAGAGGTGGCCTCCACCAAGCCATCAAAACTTGCACTCATAAACCCTAAATCTTGGTTT
CGAGACCGTTACTCAAGCGCTAACAGACCTTTCAACAGACAGGGAGGGGTTATATTCGCAGGAGATGAGTATGGAGACAGCGTGTGGTGGAAGATTGATGGAAAGGCAAG
AGGAAAAACTATGGAGTGGGAAATTAGAGGTTGGATCTGGAACCTGCCACCACTTTCTCAATGGAAAACAACTTCCATCTCAACATCCATATGCTCTTCAAGCTCAACAA
ACTCCTCTCTGGATGTTGTTGCAGCCAAAAGCCTTCATTCCCCAACCATTACTTTATCGGTTATTGCAGATTTCAGCCTTCCTATCTCTCTTTGGACTTCAGAACCCTTG
AAGATCAGCACCAAATCCTCAATTTTACGAGATGACCAAGAAAGCATATCCAGTCTCTTGCATAATTGTGTTCGTGATGTTCTTCATTATGGCTCAAACCAACAAAAGAA
TTTTAGCTTTAGTCTCCTCAAACTCGACATTACTTTCAACTCTAAAGAAATTTTCAATCTTGCATTTCTTACCCTCATATTCCTCATTTGCATCTACGAAGCTCCAACAG
GTCTACGTTTGGATTGTCTTACGAATCTCAAATACCATTTAGCAAATTGTTGGTCAAGACAGACATCAAAGATGCTTATGAAACTGTTGGGGTCTAATCTTGAAGAGCAA
TGGATGAGGTCCATAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAGGCCAACAGCTGCACCCTCAAAACACCCTCACCTTTGTTCTCTTACTCATTTTCAACACA
TGGGTTATGGAAAGTTCAACTGTATTGCCCTGTCATTGCAACGGATCAAATTGAGAACTCAAGCAGTCCTTCAACTGATGAAAGACTGCAATTCTCTCTAAATTATCACC
AGCTAGAAGGGGTTCTGCAGTTCAATTACAAGGCTGAGGTTCATGAAAAGTGGATTAATCTGAGAGTTCACGTTGACAACATAAGGTGCAACATCATCCAACTCGTGAAC
GATACGCTCATGTCGAAACGAGGAGTCGGAAGATCCGAAAAGCACTTCCCATCACGAATCTCACTGCAACTCACACCAACTCTACAAACAAACATAATAAGCGTCTCCGT
AAGTAAGTCATCAGACAACCCTACAATAGAAGTCGGAACTGAAAAAACCCTAGAAGCAGGATTCGAAGGCCAAAACCCTTACCCAGGCATAAAATTAGCAGTCGGAGAGA
CCGCAACTGCAAGCTTGAGGCCATGGAAGTTCGAGCAGCTGGTGTACGGCAACACCGGAATCCTAAACTGGTACCTCCACGACAGTTCCGACGGCAAAGAGGTGGCATCG
AGGAAACCATCAAGATTTGCGCTGATAAACCCTAGAGCTTGGTTCCGGGACCGATACTCGAGCGCTTTCCGGCCATTCAACAAACAGGGAGGGGTGATATTCGCGGGAGA
TGAGTATGGAGAGAGAATTTGTTGGAAGATTGAGAGAGAAGCGAGAGGGAAAACCATGGAATGGGAGATCAGAGGTTGGATTTGGTTAACGTATTGGCCAAACAAACACA
ATACGTTTTACACTGAAACTCGAAGGCTGGAGTTCAAGGAGATTCTCCATCTTTCAATTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGTCTCTTGCTTAATTGTGTTCATGATGTTCTTTATTATGGCTCAAACCAAAGACAGAATTCTAGCCATAACTTTCTTAAACTTGACATCACTTCCAACTCAAA
AGAAATCTTCAATCTCGCTTTTCTTACCCTCATATTCCTAATTTGCATATATGAAGCTCCGGTTGATCTCCGTTCGAATTGTCTCATGACTCTCAAGCATCATTTGGCAA
ATTCTACGTCCAGGCAGATATCAAAGGTGCTTATGAAACTGCTGGGGCCTAATCTAGAACAGCAATGGATGAGGTCCTTAAACCTTGCAATCACCAACTGGATATTGGAG
CTCAAGGCCGCTGGCAGTACTTTAAAAACACCCTCACCTTTGTTCTCTTATTCATTTTTGACATATGGGCTGTGGAAAGTTCAACTTTATTGTCCTATCATTGCAATGGA
TAATATTGAGAACTCAAGTAATCCTTCAACCGACGAAAGATTGCAGTTCTCTTTAAATTATCACCAGCTTGAAGGGGTTTTGCAGTTCAATTACAGGGCTGTGGTTCGAG
AAAAGTGGATTGATCTGAGGGTGCACGTTGATAACATAAGGTGTGACATAATCCGGCTTGTGAATGAGACTCTCTTATCCGAGAGAGGAGTTGGTGGATCAGAAAAGCAT
TTTCCATCACGGATTTCACTGCAACTCACTCCAACTTTCCACACAAATATCATGAGTGTCTCAGTAAGCAAATCCTCAAGTAACCCCCAAATCGATATTGGAACTGAAAA
AACCTTTGAGGCTGGTTTTGAATCTGCAACACCATACCCAGGCCTCAAATTAGCAGTAGGGGAGACTGTAACGGTGAGCATGAAGCCATGGAAATTTGAGCAGTTAGTCC
ACGGCAATGCTGCAACCCTTAACTGGTACCTTCACGACAGTTCAGATGGGAAAGAGGTGGCCTCCACCAAGCCATCAAAACTTGCACTCATAAACCCTAAATCTTGGTTT
CGAGACCGTTACTCAAGCGCTAACAGACCTTTCAACAGACAGGGAGGGGTTATATTCGCAGGAGATGAGTATGGAGACAGCGTGTGGTGGAAGATTGATGGAAAGGCAAG
AGGAAAAACTATGGAGTGGGAAATTAGAGGTTGGATCTGGAACCTGCCACCACTTTCTCAATGGAAAACAACTTCCATCTCAACATCCATATGCTCTTCAAGCTCAACAA
ACTCCTCTCTGGATGTTGTTGCAGCCAAAAGCCTTCATTCCCCAACCATTACTTTATCGGTTATTGCAGATTTCAGCCTTCCTATCTCTCTTTGGACTTCAGAACCCTTG
AAGATCAGCACCAAATCCTCAATTTTACGAGATGACCAAGAAAGCATATCCAGTCTCTTGCATAATTGTGTTCGTGATGTTCTTCATTATGGCTCAAACCAACAAAAGAA
TTTTAGCTTTAGTCTCCTCAAACTCGACATTACTTTCAACTCTAAAGAAATTTTCAATCTTGCATTTCTTACCCTCATATTCCTCATTTGCATCTACGAAGCTCCAACAG
GTCTACGTTTGGATTGTCTTACGAATCTCAAATACCATTTAGCAAATTGTTGGTCAAGACAGACATCAAAGATGCTTATGAAACTGTTGGGGTCTAATCTTGAAGAGCAA
TGGATGAGGTCCATAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAGGCCAACAGCTGCACCCTCAAAACACCCTCACCTTTGTTCTCTTACTCATTTTCAACACA
TGGGTTATGGAAAGTTCAACTGTATTGCCCTGTCATTGCAACGGATCAAATTGAGAACTCAAGCAGTCCTTCAACTGATGAAAGACTGCAATTCTCTCTAAATTATCACC
AGCTAGAAGGGGTTCTGCAGTTCAATTACAAGGCTGAGGTTCATGAAAAGTGGATTAATCTGAGAGTTCACGTTGACAACATAAGGTGCAACATCATCCAACTCGTGAAC
GATACGCTCATGTCGAAACGAGGAGTCGGAAGATCCGAAAAGCACTTCCCATCACGAATCTCACTGCAACTCACACCAACTCTACAAACAAACATAATAAGCGTCTCCGT
AAGTAAGTCATCAGACAACCCTACAATAGAAGTCGGAACTGAAAAAACCCTAGAAGCAGGATTCGAAGGCCAAAACCCTTACCCAGGCATAAAATTAGCAGTCGGAGAGA
CCGCAACTGCAAGCTTGAGGCCATGGAAGTTCGAGCAGCTGGTGTACGGCAACACCGGAATCCTAAACTGGTACCTCCACGACAGTTCCGACGGCAAAGAGGTGGCATCG
AGGAAACCATCAAGATTTGCGCTGATAAACCCTAGAGCTTGGTTCCGGGACCGATACTCGAGCGCTTTCCGGCCATTCAACAAACAGGGAGGGGTGATATTCGCGGGAGA
TGAGTATGGAGAGAGAATTTGTTGGAAGATTGAGAGAGAAGCGAGAGGGAAAACCATGGAATGGGAGATCAGAGGTTGGATTTGGTTAACGTATTGGCCAAACAAACACA
ATACGTTTTACACTGAAACTCGAAGGCTGGAGTTCAAGGAGATTCTCCATCTTTCAATTCCTTGA
Protein sequenceShow/hide protein sequence
MSSLLLNCVHDVLYYGSNQRQNSSHNFLKLDITSNSKEIFNLAFLTLIFLICIYEAPVDLRSNCLMTLKHHLANSTSRQISKVLMKLLGPNLEQQWMRSLNLAITNWILE
LKAAGSTLKTPSPLFSYSFLTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYRAVVREKWIDLRVHVDNIRCDIIRLVNETLLSERGVGGSEKH
FPSRISLQLTPTFHTNIMSVSVSKSSSNPQIDIGTEKTFEAGFESATPYPGLKLAVGETVTVSMKPWKFEQLVHGNAATLNWYLHDSSDGKEVASTKPSKLALINPKSWF
RDRYSSANRPFNRQGGVIFAGDEYGDSVWWKIDGKARGKTMEWEIRGWIWNLPPLSQWKTTSISTSICSSSSTNSSLDVVAAKSLHSPTITLSVIADFSLPISLWTSEPL
KISTKSSILRDDQESISSLLHNCVRDVLHYGSNQQKNFSFSLLKLDITFNSKEIFNLAFLTLIFLICIYEAPTGLRLDCLTNLKYHLANCWSRQTSKMLMKLLGSNLEEQ
WMRSINLAITNWILELKANSCTLKTPSPLFSYSFSTHGLWKVQLYCPVIATDQIENSSSPSTDERLQFSLNYHQLEGVLQFNYKAEVHEKWINLRVHVDNIRCNIIQLVN
DTLMSKRGVGRSEKHFPSRISLQLTPTLQTNIISVSVSKSSDNPTIEVGTEKTLEAGFEGQNPYPGIKLAVGETATASLRPWKFEQLVYGNTGILNWYLHDSSDGKEVAS
RKPSRFALINPRAWFRDRYSSAFRPFNKQGGVIFAGDEYGERICWKIEREARGKTMEWEIRGWIWLTYWPNKHNTFYTETRRLEFKEILHLSIP