; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020506 (gene) of Snake gourd v1 genome

Gene IDTan0020506
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionARM repeat superfamily protein
Genome locationLG05:84210685..84212929
RNA-Seq ExpressionTan0020506
SyntenyTan0020506
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019647.1 U-box domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma]4.3e-29592.07Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+P ET PT SPDFSLDKPTLRQVILLISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVI+TA EC DLA RC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVK+IIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLH ISGFDSYK VLVGNGVIAPLIRV+E GSE GKNIA RCL+KFTENS NAWSVSAHGGVTALLKICSNADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELI PACGVLSNLV VEEIKRFMIEEGAI TFIRLARS+DEAVQINSIVFLQNIAYGD+SVNK LVK+GG+RALVRVLDPKSSSS+KTLEVA+RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVNILM+YGF+DNLLYFLRNGEVSLQEVALKVA RL GTS+E KKAMGDGGFMPEFVKFLGAKSFEVREMAAEALS +VMIPKNRKRF+QDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRS+LNGIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

XP_022140205.1 uncharacterized protein LOC111010930 [Momordica charantia]3.1e-30193.79Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+PAETDPTNSPDFS++KPTLRQVI  ISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAIS LI KVI+TATECHDLARRC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA IEDEKYVKVIIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        +GEIVNLLVNFLGS ETELQEAAL+VLHIISGFDSYKAVLVGNG+IAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICS++DSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
        TELIGPACGVLSNLV VEEIKRFMIEEGAI TFIR+ARS+DEAVQINSIVFLQNIAYGD+SVN LLVKEGGVRALVRVLDPKSSSSSKTLEVA+RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVNILM+YGF+D+LL+FLRNGEVSLQEVALKVAVRL GTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALS MVMIPKNRKRF QD+RN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        V MLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLL+GIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

XP_022927631.1 uncharacterized protein LOC111434400 [Cucurbita moschata]1.1e-29592.24Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+P ET PT SPDFSLDKPTLRQVILLISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVI+TATEC DLA RC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVK+IIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLH ISGFDSYK VLVGNGVIAPLIRV+E GSE GKNIA RCL+KFTENS NAWSVSAHGGVTALLKICSNADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELI PACGVLSNLV VEEIKRFMIEEGAI TFIRLARS+DEAVQINSIVFLQNIAYGD+SVNK LVK+GG+RALVRVLDPKSSSS+KTLEVA+RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVNILM+YGF+DNLLYFLRNGEVSLQEVALKVA RL GTS+E KKAMGDGGFMPEFVKFLGAKSFEVREMAAEALS +VMIPKNRKRF+QDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRS+LNGIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

XP_023519872.1 uncharacterized protein LOC111783203 [Cucurbita pepo subsp. pepo]7.8e-29792.59Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+P ET PT SPDFSLDKPTLRQVILLIS+LISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVI+TATEC DLA RC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVK+IIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLHIISGFDSYK VLVGNGVIAPLIRV+E GSE GKNIA RCL+KFTENS NAWSVSAHGGVTALLKICSNADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELI PACGVLSNLV VEEIKRFMIEEGAI TFIRLARS+DEAVQINSIVFLQNIAYGD+SVNK LVK+GG+RALVRVLDPKSSSS+KTLEVA+RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVNILM+YGF+DNLLYFLRNGEVSLQEVALKVA RL GTSEE KKAMGDGGFMPEFVKFLGAKSFEVREMAAEALS +VMIPKNRKRF+QDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRS+LNGIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

XP_038895092.1 uncharacterized protein LOC120083411 [Benincasa hispida]5.4e-29893.45Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREE+GP+ AETDPT SPDF L+ PTLRQVILLISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAISDLI KVI TATEC+DLARRC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVKVIIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELIGPACGVLSNLVAVEEIKRFMIEEGAI TFIRLARSRDEAVQINSIVFLQNIAYGD+SV KLLVK+GG+RALVRV+DPKSSSSSKTLEV +RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVN L++YGF+DNLLYFLR+GEVSLQEVALKVAVRL G SEEAKKAMGDGGFMPEFV+FLGAKSFEVREMAAEALS MVMIPKNRKRFAQDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        VE LLQMLDTEEGNSGNKRFL SILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

TrEMBL top hitse value%identityAlignment
A0A1S3B433 uncharacterized protein LOC1034859493.5e-29592.59Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+ AETDP  SPDFSL+ PTLRQVILLISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVI TATEC+DLARRC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVKVIIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELI PACGVL NLV VEEIKRFMIEE AI TFI LA+SRDEAVQINSIVFLQNIAYGD+SVNKLLVKEGG+RALVRV+DPKSSSSSKTLEV +RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSIS VN L++YGF+DNLLYFLR+GEVSLQEVALKVAVRL GTSEEAKKAMGDGGFMPEF+KFLGAKSFEVREMAAEALS MV IPKNRKRFAQDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        +EMLLQMLD EEGNSGNKRFL SILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIW+S
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

A0A5A7UPA9 Vacuolar protein 83.5e-29592.59Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+ AETDP  SPDFSL+ PTLRQVILLISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVI TATEC+DLARRC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVKVIIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELI PACGVL NLV VEEIKRFMIEE AI TFI LA+SRDEAVQINSIVFLQNIAYGD+SVNKLLVKEGG+RALVRV+DPKSSSSSKTLEV +RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSIS VN L++YGF+DNLLYFLR+GEVSLQEVALKVAVRL GTSEEAKKAMGDGGFMPEF+KFLGAKSFEVREMAAEALS MV IPKNRKRFAQDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        +EMLLQMLD EEGNSGNKRFL SILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIW+S
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

A0A6J1CEG2 uncharacterized protein LOC1110109301.5e-30193.79Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+PAETDPTNSPDFS++KPTLRQVI  ISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAIS LI KVI+TATECHDLARRC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA IEDEKYVKVIIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        +GEIVNLLVNFLGS ETELQEAAL+VLHIISGFDSYKAVLVGNG+IAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICS++DSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
        TELIGPACGVLSNLV VEEIKRFMIEEGAI TFIR+ARS+DEAVQINSIVFLQNIAYGD+SVN LLVKEGGVRALVRVLDPKSSSSSKTLEVA+RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVNILM+YGF+D+LL+FLRNGEVSLQEVALKVAVRL GTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALS MVMIPKNRKRF QD+RN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        V MLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLL+GIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

A0A6J1EI74 uncharacterized protein LOC1114344005.4e-29692.24Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREERGP+P ET PT SPDFSLDKPTLRQVILLISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVI+TATEC DLA RC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVK+IIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLH ISGFDSYK VLVGNGVIAPLIRV+E GSE GKNIA RCL+KFTENS NAWSVSAHGGVTALLKICSNADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELI PACGVLSNLV VEEIKRFMIEEGAI TFIRLARS+DEAVQINSIVFLQNIAYGD+SVNK LVK+GG+RALVRVLDPKSSSS+KTLEVA+RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVNILM+YGF+DNLLYFLRNGEVSLQEVALKVA RL GTS+E KKAMGDGGFMPEFVKFLGAKSFEVREMAAEALS +VMIPKNRKRF+QDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRS+LNGIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

A0A6J1KJB3 uncharacterized protein LOC1114951324.6e-29592.07Show/hide
Query:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC
        MREE+GP+PAET PT SPDFSLDKPTLRQVILLISSLISLSHSVKVF+SKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVI TATEC DLA RC
Subjt:  MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRC

Query:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE
        VDLSFSGKLLMQSDLDVICAKF+RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDM+FYVRDIVTRMKIGCSDLKRQALVNLLAA  EDEKYVK+IIE
Subjt:  VDLSFSGKLLMQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIE

Query:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK
        IGEIVNLLVNFLGS ETELQEAAL+VLHIISGFDSYKAVLVGNGVIAPLIRV+E GSE GKNIA RCL+KFTENS NAWSVSAHGGVTALLKICS ADSK
Subjt:  IGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSK

Query:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL
         ELI PACGVLSNLV VEEIKRFMIEEGAI TFIRLARS+DEAVQ+NSIVFLQNIAYGD+SVNK LVK+GG+RALVRVLDPKSSSS+KTLEVA+RAIENL
Subjt:  TELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENL

Query:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN
         FSSISYVNILM+YGF+DNLLYFLRNGEVSLQEVALKVA RL GTSE+ KKAMGDGGFMPEFVKFLGAKSFEVREMAAEALS +VMIPKNRKRF+QDNRN
Subjt:  SFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRN

Query:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        VEMLLQML TEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRS+LNGIWHS
Subjt:  VEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

SwissProt top hitse value%identityAlignment
O22193 U-box domain-containing protein 43.8e-1227.65Show/hide
Query:  VVSRPGLGACKD--DMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVG
        +VS P     +D  +++  V+ +V  +K    D +RQA   L   A  +     VI   G IV LLV  L S ++  QE A+  L  +S  D+ K  +  
Subjt:  VVSRPGLGACKD--DMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVG

Query:  NGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDE
         G I PLI V+E GS   K  +A  L   +   EN   +   G +  L+ +  N   + +    A   L NL   +E K  +++ GA+   I L      
Subjt:  NGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDE

Query:  AVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVA
         V   ++  L N+A   +  N  + +EGG+  LV V++     S++  E A  A+  LS +S  + N+++  G +  L+   ++G    +E A
Subjt:  AVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVA

P39968 Vacuolar protein 81.1e-1125Show/hide
Query:  KDDMKFY----VRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLI
        KD + FY    ++ + T +     +L+R A    LA A   EKYV+ +    E++  ++  L SQ+ ++Q AA   L  ++  +  K ++V  G + PLI
Subjt:  KDDMKFY----VRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLI

Query:  RVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIV
          M   +   +  A  C+       +N   ++  G +  L K+  +   + +    A G L N+   EE ++ ++  GA+   + L  S D  VQ     
Subjt:  RVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIV

Query:  FLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAK
         L NIA   D  N+  + +   R + +++    S SS+    A  A+ NL+ S  SY   ++  G L +L+  +++  + L  +A    +R        +
Subjt:  FLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAK

Query:  KAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAM----VMIPKNRKRF
          + D GF+   V+ L  K  E  E+   A+S +        KNRK F
Subjt:  KAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAM----VMIPKNRKRF

Q2U5T5 Vacuolar protein 86.1e-1024.92Show/hide
Query:  DLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTEN
        DL+R A +        D + V       + +  ++  L S + E+Q AA   L  ++     K ++V  G +APLIR M   +   +  A  C+     +
Subjt:  DLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTEN

Query:  SENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGG-VR
         +N   ++  G +  L+++  + D + +    A G L N+   ++ ++ ++  GAI   ++L  S D  VQ      L NIA    +  +L   E   V+
Subjt:  SENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGG-VR

Query:  ALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEV
        +LV ++D   SS+ K    A  A+ NL+      + I+ + G L  LL  L++  + L   A+   +R        +  + D GF+   V  LG+   E 
Subjt:  ALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEV

Query:  REMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQ
         E+   A+S +      R   A  +RN E++LQ
Subjt:  REMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQ

Q4WVW4 Vacuolar protein 82.3e-0924.62Show/hide
Query:  DLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTEN
        DL+R A +        D + V       + +  ++  L S + E+Q AA   L  ++     K ++V  G + PLIR M   +   +  A  C+     +
Subjt:  DLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTEN

Query:  SENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGG-VR
         +N   ++  G +  L+++  + D + +    A G L N+   ++ ++ ++  GAI   ++L  S D  VQ      L NIA    +  +L   E   V+
Subjt:  SENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGG-VR

Query:  ALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEV
        +LV ++D   SS+ K    A  A+ NL+      + I+ + G L  LL  L++  + L   A+   +R        +  + D GF+   V  LG+   E 
Subjt:  ALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEV

Query:  REMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQ
         E+   A+S +      R   A  +RN E++LQ
Subjt:  REMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQ

Q6FJV1 Vacuolar protein 84.5e-1326.36Show/hide
Query:  KDDMKFY----VRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLI
        KD + FY    ++ + T +     +L+R A    LA A   EKYV+ +    +++  ++  L SQ+ ++Q AA   L  ++  +  K ++V  G + PLI
Subjt:  KDDMKFY----VRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAALQVLHIISGFDSYKAVLVGNGVIAPLI

Query:  -RVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSI
         ++M    EV  N A  C+       +N   ++  G +  L K+  +   + +    A G L N+   EE +R ++  GA+   + L  S D  VQ    
Subjt:  -RVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAILTFIRLARSRDEAVQINSI

Query:  VFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEA
          L NIA   D  N+  + +   R + +++    S SS+    A  A+ NL+ S  SY   ++  G L +L+  +++  V L  +A    +R        
Subjt:  VFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEA

Query:  KKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAM----VMIPKNRKRF
        +  + D GF+P  VK L  +  E  E+   A+S +        KNRK F
Subjt:  KKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAM----VMIPKNRKRF

Arabidopsis top hitse value%identityAlignment
AT1G01830.2 ARM repeat superfamily protein1.4e-5729.77Show/hide
Query:  LRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGL--IAADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFSGKLLMQSDLDVICAKFNR
        L +V  LI S++S + +VK F+ +WK I  K+E++ + L  +++  C S +N   ++ ++ V +T +E  +LA +C    + GKL MQSDLD +  K + 
Subjt:  LRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGL--IAADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFSGKLLMQSDLDVICAKFNR

Query:  HAKKLSDIYTAGILSQG-FAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAA
        + +    +   G+L +    + +S     + +      +++++ R++IG  + K  AL +LL A  EDEK V + +     V  LV  L +  T ++E A
Subjt:  HAKKLSDIYTAGILSQG-FAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAA

Query:  LQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRF
        + ++ +++        L+  GV+ PL+R++E GS   K  AA  + + +   ENA  ++ HGG+T L+ +C   DS ++    A   L N+ AV E+++ 
Subjt:  LQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRF

Query:  MIEEGAILTFIRLAR------SRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVN--ILMSYG
        + EEG I   I L        SR+   +      LQN+    D++ + +V EGGV +L+  LD          + A+ A+ NL    I  VN  I ++  
Subjt:  MIEEGAILTFIRLAR------SRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVN--ILMSYG

Query:  FLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNV-EMLLQMLDTEEGN
         L  L + L++G +  Q+ A     R +  S E K+ +G+ G +PE VK L +KS   RE AA+A++ +V   + R+   +D ++V   L+ +LD+  GN
Subjt:  FLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNV-EMLLQMLDTEEGN

Query:  SGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLN
        +  K++  + L  ++GS   ++ +V+ G +  ++KL+E EV  A KL+ KL   K RS  +
Subjt:  SGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLN

AT1G01830.3 ARM repeat superfamily protein1.4e-5729.77Show/hide
Query:  LRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGL--IAADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFSGKLLMQSDLDVICAKFNR
        L +V  LI S++S + +VK F+ +WK I  K+E++ + L  +++  C S +N   ++ ++ V +T +E  +LA +C    + GKL MQSDLD +  K + 
Subjt:  LRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGL--IAADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFSGKLLMQSDLDVICAKFNR

Query:  HAKKLSDIYTAGILSQG-FAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAA
        + +    +   G+L +    + +S     + +      +++++ R++IG  + K  AL +LL A  EDEK V + +     V  LV  L +  T ++E A
Subjt:  HAKKLSDIYTAGILSQG-FAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAA

Query:  LQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRF
        + ++ +++        L+  GV+ PL+R++E GS   K  AA  + + +   ENA  ++ HGG+T L+ +C   DS ++    A   L N+ AV E+++ 
Subjt:  LQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRF

Query:  MIEEGAILTFIRLAR------SRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVN--ILMSYG
        + EEG I   I L        SR+   +      LQN+    D++ + +V EGGV +L+  LD          + A+ A+ NL    I  VN  I ++  
Subjt:  MIEEGAILTFIRLAR------SRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVN--ILMSYG

Query:  FLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNV-EMLLQMLDTEEGN
         L  L + L++G +  Q+ A     R +  S E K+ +G+ G +PE VK L +KS   RE AA+A++ +V   + R+   +D ++V   L+ +LD+  GN
Subjt:  FLDNLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNV-EMLLQMLDTEEGN

Query:  SGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLN
        +  K++  + L  ++GS   ++ +V+ G +  ++KL+E EV  A KL+ KL   K RS  +
Subjt:  SGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLN

AT1G61350.1 ARM repeat superfamily protein1.5e-17357.98Show/hide
Query:  KPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFSGKLLMQSDLDVICAKFN
        K ++ + I  ISSLISLSHS+K F+ KW+LIR KL+EL SGL +  N +S  +P++S LI  ++ +  + +DLA RCV++SFSGKLLMQSDLDV+  KF+
Subjt:  KPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFSGKLLMQSDLDVICAKFN

Query:  RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAA
         H + LS IY+AGILS GFAIVV +P   ACKDDM+FY+RD++TRMKIG  ++K+QALV L  A  ED++YVK++IEI ++VN+LV FL S E  +QE +
Subjt:  RHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQEAA

Query:  LQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRF
         + +  ISGF SY+ VL+ +GVI PL+RV+E G+ VG+  +ARCL+K TENSENAWSVSAHGGV+ALLKICS +D   ELIG +CGVL NLV VEEIKRF
Subjt:  LQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRF

Query:  MIEEG-AILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVL-DPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLL
        MIEE   + TFI+L  S++E VQ+NSI  L ++   D+    +LV+EGG++ LV VL DP S SSSK+ E+A+RAI+NL F S   +N LM   FLD+LL
Subjt:  MIEEG-AILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVL-DPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLL

Query:  YFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQMLDTEEG-----NSG
          LRNGE+S+QE ALKV  RL    EE K+ MG+ GFMPE VKFL AKS +VREMA+ AL  ++ +P+NRK+FAQD+ N+  +LQ+LD E+G     +SG
Subjt:  YFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQMLDTEEG-----NSG

Query:  NKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS
        N +FL SIL SLT  +S RRKI +SGY+K+IEKLAE E  DAKKLV+KLS N+FRS+L+GIWHS
Subjt:  NKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLSTNKFRSLLNGIWHS

AT2G05810.1 ARM repeat superfamily protein1.3e-6029.87Show/hide
Query:  KPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIA-ADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFS-GKLLMQSDLDVICAK
        +P +  +  ++S L+  S +V+ F  +W+++R KL  LNS L + +++    +NP +  L+  ++        L+ +C   SFS GKLLMQSDLD+  + 
Subjt:  KPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIA-ADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFS-GKLLMQSDLDVICAK

Query:  FNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQE
         + H   L  +  +G+L Q  AIV+S P   + KDD+ F++RD+ TR++IG ++ K+++L +LL    ++EK  ++I + G +  L+          ++E
Subjt:  FNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQE

Query:  AALQVLHII--SGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEE
         AL  + ++  S  DS K V    G + PL+R++E GS   K  AA  +   T +   AW++SA+GGVT L++ C +   + +      G +SN+ AVEE
Subjt:  AALQVLHII--SGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEE

Query:  IKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKE-GGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLD
        I+  + EEGAI   I+L  S   +VQ  +  F+  I+   +    L+V+E GG++ L+ ++  + SS+  T+E  + A+  +S        +  S  F+ 
Subjt:  IKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKE-GGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLD

Query:  NLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFL-GAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQMLDTEEGNSGN
         L   +++G V LQ+++  +   L+  S+  K+A+ D   +   ++ +   K   ++E A EA  +++ +  NRK   +D ++V  L+QMLD       N
Subjt:  NLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFL-GAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQMLDTEEGNSGN

Query:  KRFLFSILNSLT--GSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLST-NKFRSL
        K     ++ ++   GS + R K++  G  + ++ L E EV  AKK V++L+  N+ +S+
Subjt:  KRFLFSILNSLT--GSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLST-NKFRSL

AT2G05810.2 ARM repeat superfamily protein1.3e-6029.87Show/hide
Query:  KPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIA-ADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFS-GKLLMQSDLDVICAK
        +P +  +  ++S L+  S +V+ F  +W+++R KL  LNS L + +++    +NP +  L+  ++        L+ +C   SFS GKLLMQSDLD+  + 
Subjt:  KPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIA-ADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFS-GKLLMQSDLDVICAK

Query:  FNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQE
         + H   L  +  +G+L Q  AIV+S P   + KDD+ F++RD+ TR++IG ++ K+++L +LL    ++EK  ++I + G +  L+          ++E
Subjt:  FNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQE

Query:  AALQVLHII--SGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEE
         AL  + ++  S  DS K V    G + PL+R++E GS   K  AA  +   T +   AW++SA+GGVT L++ C +   + +      G +SN+ AVEE
Subjt:  AALQVLHII--SGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEE

Query:  IKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKE-GGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLD
        I+  + EEGAI   I+L  S   +VQ  +  F+  I+   +    L+V+E GG++ L+ ++  + SS+  T+E  + A+  +S        +  S  F+ 
Subjt:  IKRFMIEEGAILTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKE-GGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLD

Query:  NLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFL-GAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQMLDTEEGNSGN
         L   +++G V LQ+++  +   L+  S+  K+A+ D   +   ++ +   K   ++E A EA  +++ +  NRK   +D ++V  L+QMLD       N
Subjt:  NLLYFLRNGEVSLQEVALKVAVRLSGTSEEAKKAMGDGGFMPEFVKFL-GAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQMLDTEEGNSGN

Query:  KRFLFSILNSLT--GSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLST-NKFRSL
        K     ++ ++   GS + R K++  G  + ++ L E EV  AKK V++L+  N+ +S+
Subjt:  KRFLFSILNSLT--GSSSGRRKIVNSGYMKNIEKLAEAEVYDAKKLVRKLST-NKFRSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGAAGAACGGGGTCCGAAACCTGCAGAAACGGACCCGACGAACTCCCCGGATTTCTCACTCGACAAGCCCACTCTCCGGCAAGTAATTTTACTCATTTCTTCCTT
GATATCTCTTTCTCATTCCGTTAAAGTATTTTCCTCCAAATGGAAATTAATTCGCGACAAGCTTGAAGAGTTGAATTCCGGCTTAATCGCGGCCGATAACTGTGATTCCG
ACGAAAATCCCGCAATCTCAGACCTAATTCGGAAGGTAATTCAGACGGCTACTGAATGCCACGATCTTGCTCGCCGCTGTGTTGACCTGTCTTTCAGTGGGAAGCTTTTG
ATGCAGAGTGATTTGGATGTAATCTGTGCGAAATTTAACCGCCATGCGAAGAAGCTTTCGGACATATATACAGCAGGCATTTTGTCGCAAGGGTTCGCCATCGTCGTTTC
TAGGCCTGGCCTCGGAGCTTGTAAGGATGATATGAAGTTTTATGTGAGGGATATTGTAACGAGAATGAAGATTGGTTGTTCAGATCTGAAGAGACAAGCTCTCGTTAATC
TTCTCGCTGCTGCAATTGAAGACGAAAAGTATGTGAAAGTGATAATAGAAATCGGCGAGATAGTGAATCTCCTTGTTAATTTTCTTGGTTCTCAGGAGACGGAACTCCAA
GAAGCGGCGCTGCAAGTGCTTCATATAATTTCTGGGTTTGATTCTTATAAAGCAGTTCTAGTTGGGAATGGGGTAATTGCGCCATTGATTCGGGTTATGGAATGTGGGAG
TGAGGTAGGGAAGAATATAGCCGCAAGGTGTTTGTTGAAATTCACTGAGAATTCTGAAAACGCTTGGTCTGTATCTGCTCATGGCGGAGTAACAGCTCTGTTGAAAATCT
GCTCCAATGCTGATTCTAAAACAGAATTGATCGGCCCTGCTTGTGGGGTTCTCAGCAATCTCGTTGCTGTTGAAGAAATTAAGAGATTTATGATTGAAGAAGGCGCAATT
TTAACGTTTATTAGGCTTGCTCGATCCAGAGATGAAGCTGTGCAGATAAACTCCATTGTCTTTCTGCAAAACATAGCTTATGGGGATGATTCAGTAAACAAATTGCTGGT
TAAAGAAGGTGGAGTTCGGGCATTAGTTCGTGTTTTGGATCCAAAATCTTCATCCTCATCCAAAACCCTAGAGGTAGCGATTCGAGCAATTGAAAACCTCAGTTTCTCGT
CAATTAGTTATGTAAATATTTTGATGAGCTACGGGTTCCTGGATAATCTTCTTTATTTCTTACGAAATGGGGAGGTTTCTCTTCAAGAAGTAGCTCTGAAAGTTGCAGTT
AGGCTAAGCGGGACATCAGAGGAAGCCAAGAAAGCAATGGGGGATGGAGGTTTCATGCCAGAATTCGTCAAGTTTCTTGGTGCAAAGTCTTTTGAAGTTCGAGAAATGGC
AGCCGAGGCACTCTCAGCGATGGTCATGATCCCCAAAAACAGAAAGAGATTTGCTCAGGACAATCGAAACGTAGAGATGCTTCTGCAAATGCTCGACACAGAGGAGGGAA
ATTCAGGTAACAAAAGGTTCCTCTTCTCAATACTAAACTCATTAACAGGAAGCAGTAGTGGAAGAAGGAAGATTGTGAATTCTGGGTACATGAAAAACATTGAAAAACTT
GCAGAAGCTGAAGTTTATGATGCCAAAAAGCTCGTCAGGAAATTGTCCACAAACAAATTTCGTAGTCTGTTAAATGGAATCTGGCATTCTTGA
mRNA sequenceShow/hide mRNA sequence
AAGAAAACCAGTAACCACAGACCTGCAATCTTGATAAGTAGCTCTATATCACGGAGTCTCCGTTTTGCTTTCAGAGGACTGATTCTTCGCCGGAGCGTGCACAGCCAACT
CGCCGGAAAATGATTTTGCTCCATTTCCAGTGACAACTAAAATGAGAGAAGAACGGGGTCCGAAACCTGCAGAAACGGACCCGACGAACTCCCCGGATTTCTCACTCGAC
AAGCCCACTCTCCGGCAAGTAATTTTACTCATTTCTTCCTTGATATCTCTTTCTCATTCCGTTAAAGTATTTTCCTCCAAATGGAAATTAATTCGCGACAAGCTTGAAGA
GTTGAATTCCGGCTTAATCGCGGCCGATAACTGTGATTCCGACGAAAATCCCGCAATCTCAGACCTAATTCGGAAGGTAATTCAGACGGCTACTGAATGCCACGATCTTG
CTCGCCGCTGTGTTGACCTGTCTTTCAGTGGGAAGCTTTTGATGCAGAGTGATTTGGATGTAATCTGTGCGAAATTTAACCGCCATGCGAAGAAGCTTTCGGACATATAT
ACAGCAGGCATTTTGTCGCAAGGGTTCGCCATCGTCGTTTCTAGGCCTGGCCTCGGAGCTTGTAAGGATGATATGAAGTTTTATGTGAGGGATATTGTAACGAGAATGAA
GATTGGTTGTTCAGATCTGAAGAGACAAGCTCTCGTTAATCTTCTCGCTGCTGCAATTGAAGACGAAAAGTATGTGAAAGTGATAATAGAAATCGGCGAGATAGTGAATC
TCCTTGTTAATTTTCTTGGTTCTCAGGAGACGGAACTCCAAGAAGCGGCGCTGCAAGTGCTTCATATAATTTCTGGGTTTGATTCTTATAAAGCAGTTCTAGTTGGGAAT
GGGGTAATTGCGCCATTGATTCGGGTTATGGAATGTGGGAGTGAGGTAGGGAAGAATATAGCCGCAAGGTGTTTGTTGAAATTCACTGAGAATTCTGAAAACGCTTGGTC
TGTATCTGCTCATGGCGGAGTAACAGCTCTGTTGAAAATCTGCTCCAATGCTGATTCTAAAACAGAATTGATCGGCCCTGCTTGTGGGGTTCTCAGCAATCTCGTTGCTG
TTGAAGAAATTAAGAGATTTATGATTGAAGAAGGCGCAATTTTAACGTTTATTAGGCTTGCTCGATCCAGAGATGAAGCTGTGCAGATAAACTCCATTGTCTTTCTGCAA
AACATAGCTTATGGGGATGATTCAGTAAACAAATTGCTGGTTAAAGAAGGTGGAGTTCGGGCATTAGTTCGTGTTTTGGATCCAAAATCTTCATCCTCATCCAAAACCCT
AGAGGTAGCGATTCGAGCAATTGAAAACCTCAGTTTCTCGTCAATTAGTTATGTAAATATTTTGATGAGCTACGGGTTCCTGGATAATCTTCTTTATTTCTTACGAAATG
GGGAGGTTTCTCTTCAAGAAGTAGCTCTGAAAGTTGCAGTTAGGCTAAGCGGGACATCAGAGGAAGCCAAGAAAGCAATGGGGGATGGAGGTTTCATGCCAGAATTCGTC
AAGTTTCTTGGTGCAAAGTCTTTTGAAGTTCGAGAAATGGCAGCCGAGGCACTCTCAGCGATGGTCATGATCCCCAAAAACAGAAAGAGATTTGCTCAGGACAATCGAAA
CGTAGAGATGCTTCTGCAAATGCTCGACACAGAGGAGGGAAATTCAGGTAACAAAAGGTTCCTCTTCTCAATACTAAACTCATTAACAGGAAGCAGTAGTGGAAGAAGGA
AGATTGTGAATTCTGGGTACATGAAAAACATTGAAAAACTTGCAGAAGCTGAAGTTTATGATGCCAAAAAGCTCGTCAGGAAATTGTCCACAAACAAATTTCGTAGTCTG
TTAAATGGAATCTGGCATTCTTGAATGTAATTTCCCTGTTTGTTAATATCCTTTTCTTCTTCTTCTTCTTCTTCTTACATGATTTTGTTAATTAAATCCATACCTGATTT
GATTCACAACTACTTCAGACTCCCACATTCTCTCATTAAAAAACTTACCTTTGAAAAACAGCTTAATAAGAGTGTAATTAGAGAGGTCTAATTTAATCATTCTCCAACCA
CCCACATAAAACTTTTGTATTATAGCCCTGAGAGAGAGCAAATACATTATTAAATCAGTGTACTTGATGAGGATGGAAGGAAGAGAGCACATGGGGGAAGAGAAGACATT
GAATACAACAGAGAGGAATACGAGTAGTTTTAATCAAGCTTTCTG
Protein sequenceShow/hide protein sequence
MREERGPKPAETDPTNSPDFSLDKPTLRQVILLISSLISLSHSVKVFSSKWKLIRDKLEELNSGLIAADNCDSDENPAISDLIRKVIQTATECHDLARRCVDLSFSGKLL
MQSDLDVICAKFNRHAKKLSDIYTAGILSQGFAIVVSRPGLGACKDDMKFYVRDIVTRMKIGCSDLKRQALVNLLAAAIEDEKYVKVIIEIGEIVNLLVNFLGSQETELQ
EAALQVLHIISGFDSYKAVLVGNGVIAPLIRVMECGSEVGKNIAARCLLKFTENSENAWSVSAHGGVTALLKICSNADSKTELIGPACGVLSNLVAVEEIKRFMIEEGAI
LTFIRLARSRDEAVQINSIVFLQNIAYGDDSVNKLLVKEGGVRALVRVLDPKSSSSSKTLEVAIRAIENLSFSSISYVNILMSYGFLDNLLYFLRNGEVSLQEVALKVAV
RLSGTSEEAKKAMGDGGFMPEFVKFLGAKSFEVREMAAEALSAMVMIPKNRKRFAQDNRNVEMLLQMLDTEEGNSGNKRFLFSILNSLTGSSSGRRKIVNSGYMKNIEKL
AEAEVYDAKKLVRKLSTNKFRSLLNGIWHS