; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038055 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038055
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUPF0505 protein C16orf62 homolog isoform X1
Genome locationchr2:12009107..12025369
RNA-Seq ExpressionLag0038055
SyntenyLag0038055
Gene Ontology termsGO:0015031 - protein transport (biological process)
GO:0032456 - endocytic recycling (biological process)
GO:0042147 - retrograde transport, endosome to Golgi (biological process)
GO:0005768 - endosome (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0030906 - retromer, cargo-selective complex (cellular component)
InterPro domainsIPR005378 - Vacuolar protein sorting-associated protein 35
IPR029705 - VPS35 endosomal protein sorting factor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607792.1 VPS35 endosomal protein sorting factor-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.0e-28787.16Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSCVND NAQLKHFIPAKETKTGSS D KVLLVG++EPTIEYIVKCIFK VSQRQL+GTL+ALGLG NMENS C+SIVLH+ILKELPVEV+SS AM+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FL LIDRSNDSSFRQFLNYRL G+RLCERRP +DIVDAVM+NVL+VIAQNESLDEYLTVIDAYLDIVLQN  D  V TIL+ ISQRTCNR IDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSI+GKLLSHYQHLEDVFALSHFLEILDLLVGRP+ IIIINILKMATR+S IRDPAT+ELLFEISQALND FDFANMKDD+NQPAHLLSRFVQLVDFG E
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTIDE+K+TLVHSSNGLAVKAL+DAK H NFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHS +LIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        +DIKEGSRAAAD +L LSSIQKLCSLLV++PGNP HGSAYFPKILVSFVNDIPWMTP+MRTRILCAILSLLATCSQNRLPYHADNG+LWGSNNVF+GD A
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        Y  ELVSLSE IVQNLV+A+ QESS A RG+LALE C++ILSSFT++DETYAICSKL+ETA+LCMSDSNKYLQST  RLEEKSQ
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

KAG7037366.1 hypothetical protein SDJN02_00991 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-28787.16Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSCVND NAQLKHFIPAKETKTGSS D KVLLVG++EPTIEYIVKCIFK+VSQRQL+GTL+ALGLG NMENS C+SIVLH+ILKELPVEV+SS AM+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FL LIDRSNDSSFRQFLNYRL G+RLCERRP +DIVDAVM+NVL+VIAQNESLDEYLTVIDAYLDIVLQN  D  V TIL+ ISQRTCNR IDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSI+GKLLSHYQHLEDVFALSHFLEILDLLVGRP+ IIIINILKMATR+S IRDPAT+ELLFEISQALND FDFANMKDD+NQPAHLLSRFVQLVDFG E
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTIDE+K+TLVHSSNGLAVKAL+DAK H NFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHS +LIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        +DIKEGSRAAAD +L LSSIQKLCSLLV++PGNP HGSAYFPKILVSFVNDIPWMTP+MRTRILCAILSLLATCSQNRLPYHADNG+LWGSNNVF+GD A
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        Y  ELVSLSE IVQNLV+A+ QESS A RG+LALE C++ILSSFT++DETYAICSKL+ETA+LCMSDSNKYLQST  RLEEKSQ
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

XP_022981376.1 UPF0505 protein C16orf62 homolog [Cucurbita maxima]1.9e-29088.01Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSCVND NAQLKHFIPAKETKTGSS D KVLLVG++EPTIEYIVKCIFK+VSQRQLDGTL+ALGLG NMENS C+SIVLH+ILKELPVEV+SS AM+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FL LIDRSNDSSFRQFLNYRL GLRLCERRP +DIVDAVM+NVL+VIAQNESLDEYLTVIDAYLDIVLQN  D CV TIL+AISQRTCNR IDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSI+GKLLSHYQHLEDVFALSHFLEILDLLVGRP+ IIII+ILKMATR+S IRDPAT+ELLFEISQALND FDFANMKDD+NQPAHLLSRFVQLVDFG E
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTIDE+K+TLVHSSNGLAVKAL+DAK H NFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHS ELIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        +D+KEGSRAAAD +L LSSIQKLCSLLV++PGNP HGSAYFPKILVSFVNDIPWMTP+MRTRILCAILSLLATCSQNRLPYHADNG+ WGSNNVF+GDLA
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        Y  ELVSLSE IV+NLV+AI QESS A RGILALE C++ LSSFT++DETYAICSKL+ETAKLCMSDSNKYLQSTF RLEEKSQ
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

XP_023525123.1 UPF0505 protein C16orf62 homolog [Cucurbita pepo subsp. pepo]9.7e-28787.16Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSCVND NAQLKHFIPAKETKTGSS D KVLLVG++EPTIEYIVKCIFK+VSQRQL+GTL+ALGLG NMENS C+SIVLH+ILKELPVEV+SS AM+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FL LIDRSNDSSFRQFLNYRL G+RLCERRP +DIVDAVM+NVL+VIAQNESLDEYLTVIDAYLDIVLQN  D  V TIL+AISQRTCNR IDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSI+GKLLSHYQHLEDVFALSHFLEILDLLVGRP+ IIIINILKMATR+S IRDPAT+ELLFEISQALND FDFANMKDD+NQPAHLLSRFVQLVDFG E
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTIDE+K+TLVH SNGLAVKAL+DAK H NFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHS +LIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        +DIKEGSRAAAD +L LSSIQKLCSLLV++PGNP HGSAYFPKILVSFVNDIPWMTP+MRTRILCAILSLLATCSQNRLPYHADNG+ WGSNNVF+GD A
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        Y  ELVSLSE IVQNLV+AI QESS A RGILALE C++ILSSFT++DETYAICS L+ TAKLCMSD NKYLQSTF RLEEKSQ
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

XP_038898827.1 VPS35 endosomal protein-sorting factor-like isoform X3 [Benincasa hispida]4.4e-28786.47Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        G+LVSCVNDMNAQLK+FIPAKE  TGSS D+KVLLVG+MEPTIEYIVKCIFK  SQRQLDGTLLALGLG NMENS C+SIVLHHILKEL VEVVSSNAM+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FLQLID SNDSSFRQF+NYRLLGLRLCE+RP + IVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQ+  D C+ TILEAISQRTCN+ IDENG+LSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSI+GKLLSHYQHLEDVFALSHFLEILD+LVGRPR+I+II+ILKMATRNSCIRDPATIELLFEISQALND FDFANMKDDDNQP HLLSRFVQLVDFGIE
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFG IDELKETLVHSSNGLAVKAL+D   H NFVK+CIAFSEVTLPSIS  IKQFNLYLETAEVA L GL+SHSDELIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        M+IKEGSRAAA+ EL LSSI+KLCS LV++PGNP HGSAYFPKILVSFVNDIPWMTPRMRT ILCA+L LLA CSQNRLPYHADNGVLWGSNN+F+GD A
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        Y  ELVSLS+HIV+NLVDA+LQESSPA RG++ALEACN+ILSSFTI+DETYAICSKLIETAKLCM++SNKYLQSTF  LEEKS+
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

TrEMBL top hitse value%identityAlignment
A0A0A0K5F0 Uncharacterized protein2.8e-27983.78Show/hide
Query:  HFLP----GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVE
        H LP    G+LVSCVNDMNAQLKHFI AKE  T  S D+KVLLVG+MEPTIEYI+KC+FK+VSQR+LD TLLALGLG NME S C+S+VLHHILKEL VE
Subjt:  HFLP----GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVE

Query:  VVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVI
        VVSSNAM+FLQLID SNDSSF QF+NYRLLGLRLCE+RP + IVD ++NNVLKVIAQNESLDEYLTVIDAYLD VLQN  D C+ TILE ISQR+CN+ I
Subjt:  VVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVI

Query:  DENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFV
        DENG+LSLQSI+GKLLSHYQ +EDVFALSHFLEILDLLVGRPRS+III+ILKMATRNS IRDPATIELLFEISQALND FDFANMK+DDNQP HLLSRFV
Subjt:  DENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFV

Query:  QLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELID
        QLVDFGIERERHLAFLVECRGAFGTID+LKETLVHSSNGL VKAL+DAK +VNFVK+CIAFSEVTLPSIST IKQFNLYLETAEVALLGGLISH+DELID
Subjt:  QLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELID

Query:  SAISCLHNMDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSN
        SAISCLHNM+IKEGSRAAA+ EL LSSIQKLCSLLV++PGNPSHGS +FPKILVSFV ++PWMTPRM+T ILCAIL LLA CSQNRLPYHAD GVLWGSN
Subjt:  SAISCLHNMDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSN

Query:  NVFYGDLAYCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        NVF+GD A   ELVSLSEHIVQNLVDA+LQESSPA RG +ALEACN+ILSSFTI+DETYAICSKL+ETAKLCM++SNKYLQSTFH LE+KSQ
Subjt:  NVFYGDLAYCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

A0A6J1CD86 UPF0505 protein C16orf62 homolog isoform X36.8e-28687.35Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSC+ND NAQLKHFIPAKETKTG+S D+KVLLVG+MEP IEY VKCIFKDVSQRQLD TL   GLG NM+NS C SIVLHH+LKELPVEVVSSNA++
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FLQLI+RSNDSSF QFLNYRLLGLRLCERRP +DIVDAVMNN+LKVIAQNESLDEYLTVIDAYLDIVLQN  D  V TILEAISQ+T NRVIDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSIIGKLLS YQHLEDVFALSHFLEILDLLVGRPR+II I ILKMATRNS IRDPATIELLFEISQALND  DFAN+K DD+QPAHLLSRFVQLVDFGIE
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTI+EL+ETLVHSSNGLAVKAL+DA  HVNFVKSCIAFSEVTLPSIS  IKQFNLYLETAEVALLGGLISHSD+LIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        MDIK+GSRAAAD +L LSSIQKLCSLLV++PGNP H S YFPKIL+SFVNDIPWMTPRMRTRILCAIL LLATCSQNRLPYHADNGV WGSNNVF GD A
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQF
        Y  ELVSLSEHIVQ LVDAI QESS A RGI+ALEACN+ILSSFTIRDETYAICSKL+ETAKL MSDSNKYLQSTFH LEEKSQF
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQF

A0A6J1CE53 UPF0505 protein C16orf62 homolog isoform X16.8e-28687.35Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSC+ND NAQLKHFIPAKETKTG+S D+KVLLVG+MEP IEY VKCIFKDVSQRQLD TL   GLG NM+NS C SIVLHH+LKELPVEVVSSNA++
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FLQLI+RSNDSSF QFLNYRLLGLRLCERRP +DIVDAVMNN+LKVIAQNESLDEYLTVIDAYLDIVLQN  D  V TILEAISQ+T NRVIDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSIIGKLLS YQHLEDVFALSHFLEILDLLVGRPR+II I ILKMATRNS IRDPATIELLFEISQALND  DFAN+K DD+QPAHLLSRFVQLVDFGIE
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTI+EL+ETLVHSSNGLAVKAL+DA  HVNFVKSCIAFSEVTLPSIS  IKQFNLYLETAEVALLGGLISHSD+LIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        MDIK+GSRAAAD +L LSSIQKLCSLLV++PGNP H S YFPKIL+SFVNDIPWMTPRMRTRILCAIL LLATCSQNRLPYHADNGV WGSNNVF GD A
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQF
        Y  ELVSLSEHIVQ LVDAI QESS A RGI+ALEACN+ILSSFTIRDETYAICSKL+ETAKL MSDSNKYLQSTFH LEEKSQF
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQF

A0A6J1FL44 UPF0505 protein C16orf62 homolog4.0e-28686.82Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSCVND NAQLKHFIPAKETKTGSS D KVLLVG++EPTIEYIVKCIFK VSQRQL+GTL+ALGLG NMENS C+SIVLH+ILKELPVEV+SS AM+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FL LIDRSNDSSFRQFLNYRL G+RLCERRP +DIVDAVM+NVL+VIAQNESLDEYLTVIDAYLDIVLQN  D  V TIL+ ISQRTCNR IDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSI+GKLLSHYQHLEDVFALSHFLEILDLLVGRP+ IIIINILKMATR+S IRDPAT+ELLFEISQALND FDFANMKDD+NQPAHLLSRFVQLVDFG E
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTIDE+K+TLVHSSNGLAVKAL+DAK H NFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHS +LIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        +DIKEGSRAAAD +L LSSIQKLCSLLV++PGNP HGSAYFPKILVSFVNDIPWMTP+MRTRILCAILSLLATCSQNRLPYHADNG+LWG NNVF+GD A
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        Y  ELVSLSE IVQNLV+A+ QESS A RG+LALE C++ILSSFT++DETYAICS L+ETAKLCMSDSNKYLQST  RLEE SQ
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

A0A6J1IWD8 UPF0505 protein C16orf62 homolog9.2e-29188.01Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK
        GLLVSCVND NAQLKHFIPAKETKTGSS D KVLLVG++EPTIEYIVKCIFK+VSQRQLDGTL+ALGLG NMENS C+SIVLH+ILKELPVEV+SS AM+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMK

Query:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL
        FL LIDRSNDSSFRQFLNYRL GLRLCERRP +DIVDAVM+NVL+VIAQNESLDEYLTVIDAYLDIVLQN  D CV TIL+AISQRTCNR IDENGLLSL
Subjt:  FLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSL

Query:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE
        QSI+GKLLSHYQHLEDVFALSHFLEILDLLVGRP+ IIII+ILKMATR+S IRDPAT+ELLFEISQALND FDFANMKDD+NQPAHLLSRFVQLVDFG E
Subjt:  QSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIE

Query:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN
        RERHLAFLVECRGAFGTIDE+K+TLVHSSNGLAVKAL+DAK H NFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHS ELIDSAISCLHN
Subjt:  RERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHN

Query:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA
        +D+KEGSRAAAD +L LSSIQKLCSLLV++PGNP HGSAYFPKILVSFVNDIPWMTP+MRTRILCAILSLLATCSQNRLPYHADNG+ WGSNNVF+GDLA
Subjt:  MDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLA

Query:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ
        Y  ELVSLSE IV+NLV+AI QESS A RGILALE C++ LSSFT++DETYAICSKL+ETAKLCMSDSNKYLQSTF RLEEKSQ
Subjt:  YCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQ

SwissProt top hitse value%identityAlignment
Q557H3 VPS35 endosomal protein sorting factor-like1.3e-4724.95Show/hide
Query:  VGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDI
        +G+  P++E++++C+    +   L+  L       N       S++L+HI+   P E + SN+  F   I  ++  S+ ++  Y   G+ L   +P  + 
Subjt:  VGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDI

Query:  VDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPR
        + +++N+V KV+   E++ +Y++V + +++ VL + S+   +  L+ I +        E     LQSI+ K+ +H      + + ++FL +LDL  G  +
Subjt:  VDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPR

Query:  SIIIINILK-MATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAV
          I  + L+ ++T      DP  I       +AL+D  +  + +D+  Q   L+   +  +DFG + E+ L F VECR  F   D +K  LV+    +  
Subjt:  SIIIINILK-MATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAV

Query:  KALRDAKNH-----VNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHNM-DIKEGSRAAADTELFLSSIQKLCSLLV
        K L   K        +F+++C+A+  +T+PSI     + NLYL ++ VAL    +S +D L+ +AI+ +  +  I E  +  +  +  +S +    SLLV
Subjt:  KALRDAKNH-----VNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHNM-DIKEGSRAAADTELFLSSIQKLCSLLV

Query:  IVPGNPSHGSAYFPKILVSFVNDIPW-MTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQN-LVDAILQESSP
        + PG+P  G  Y  K L   + +  W  +   ++++   +L L ++ +Q  LPYH +   +  ++ +F  D  +  EL      +++  L D  L +  P
Subjt:  IVPGNPSHGSAYFPKILVSFVNDIPW-MTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQN-LVDAILQESSP

Query:  ATR-----GILALEACNAILSSFTIRDETYAICSKLIETAK
                GI+ ++  NA+L+   +  +T ++   L   AK
Subjt:  ATR-----GILALEACNAILSSFTIRDETYAICSKLIETAK

Q5R8N4 VPS35 endosomal protein-sorting factor-like9.8e-4023.85Show/hide
Query:  PTIEYIVKCIFKDVSQRQLDGTL-LALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAV
        P +++I +CI     +  L   +     LG N       +++L+ ++     E +++ +M F+ +I   ++S F + L +R LGL L    P       +
Subjt:  PTIEYIVKCIFKDVSQRQLDGTL-LALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAV

Query:  MNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTIL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSII
        +N   KVI + ++  +Y+   + +++   ++ +   VNT+L + I   T +R   E+    LQ II K+++H+     + ++  FL  LD+       + 
Subjt:  MNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTIL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSII

Query:  IINILKMA---TRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVK
        +   +  A    +    +DP  +  L  + + ++D  +   ++D+    ++L++ F+++V FG + E+ L+F VE R  F  ++ +   L+HS N LA++
Subjt:  IINILKMA---TRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVK

Query:  ALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHNMD--IKEGSRAAADTELFLSSIQKLCSLLV
          +  K NH      FV++C+A+  +T+PS+     + NLYL + +VAL    +S +D    +AIS +  +   I    +        L  +    S L+
Subjt:  ALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHNMD--IKEGSRAAADTELFLSSIQKLCSLLV

Query:  IVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQNLVD---AILQES
        IVP +P HG  +  + L++ + D  W      + RI   +L LL+  SQ    YH D   +  +++++ GD  +  E   L E ++  +++    + ++ 
Subjt:  IVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQNLVD---AILQES

Query:  SPATRGILALEACNAILSSFTIRD
        +   +  L L   N+IL+   +R+
Subjt:  SPATRGILALEACNAILSSFTIRD

Q5XI83 VPS35 endosomal protein-sorting factor-like1.6e-3726.39Show/hide
Query:  SIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNT
        +++L+ ++     E +++ +M F+ +I   ++S F + L +R LG+ L    P  +    ++N   KVI + +S  +Y+   + +++   ++ +   VNT
Subjt:  SIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNT

Query:  IL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANM
        +L + I   T +R   E+    LQSII K+++H+     +F++  FL  LD+     +  + + + K      CI + A I  L               +
Subjt:  IL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANM

Query:  KDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFNLYL
        +D+    AHL++ F+++V FG + E+ L+F VE R  F  ++ +   L+HS N LA++  +  K NH      FV++C+A+  +T+PS+     + NLYL
Subjt:  KDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFNLYL

Query:  ETAEVALLGGLISHSDELIDSAISCLHNM--DIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCAILS
         + +VAL    +S +D    +AIS +  +   I    +        L  +    S L+IVP +P HG  +  + L++ + D  W  +   + RI  ++L 
Subjt:  ETAEVALLGGLISHSDELIDSAISCLHNM--DIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCAILS

Query:  LLATCSQNRLPYHAD----NGVLWGSNNVFYGDLAYCQE--LVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRD
        LL+  SQ+   YH D    N  L+G ++ F  + +   E  +V + EH+     D  L+  S     +L L   N+IL+   +R+
Subjt:  LLATCSQNRLPYHAD----NGVLWGSNNVFYGDLAYCQE--LVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRD

Q7Z3J2 VPS35 endosomal protein-sorting factor-like8.9e-4124.05Show/hide
Query:  PTIEYIVKCIFKDVSQRQLDGTL-LALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAV
        P +++I +CI     +  L   +     LG N       +++L+ ++     E +++ +M F+ +I   ++S F + L +R LGL L    P       +
Subjt:  PTIEYIVKCIFKDVSQRQLDGTL-LALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAV

Query:  MNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTIL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSII
        +N   KVI + ++  +Y+   + +++   ++ +   VNT+L + I   T +R   E+    LQ II K+++H+     +F++  FL  LD+       + 
Subjt:  MNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTIL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSII

Query:  IINILKMA---TRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVK
        +   +  A    +    +DP  +  L  + + ++D  +   ++D+    ++L++ F+++V FG + E+ L+F VE R  F  ++ +   L+HS N LA++
Subjt:  IINILKMA---TRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVK

Query:  ALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHNMD--IKEGSRAAADTELFLSSIQKLCSLLV
          +  K NH      FV++C+A+  +T+PS++    + NLYL + +VAL    +S +D    +AIS +  +   I    +        L  +    S L+
Subjt:  ALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHNMD--IKEGSRAAADTELFLSSIQKLCSLLV

Query:  IVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQNLVD---AILQES
        IVP +P HG  +  + L++ + D  W      + RI   +L LL+  SQ    YH D   +  +++++ GD  +  E   L E ++  +++    + ++ 
Subjt:  IVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQNLVD---AILQES

Query:  SPATRGILALEACNAILSSFTIRD
        +   +  L L   N+IL+   +R+
Subjt:  SPATRGILALEACNAILSSFTIRD

Q8BWQ6 VPS35 endosomal protein-sorting factor-like2.1e-4225.36Show/hide
Query:  SIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNT
        +++L+ ++     E V++ +M F+ +I   ++S F + L +R LGL L    P  +    ++N   KVI + +S  +Y+   + +++   ++ +   VNT
Subjt:  SIVLHHILKELPVEVVSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNT

Query:  IL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMA---TRNSCIRDPATIELLFEISQALNDGFDF
        +L + I   T +R   E+    LQSII K+++H+     +F++  FL  LD+       + +   +  A    +    +DP  +  L  I + ++D  + 
Subjt:  IL-EAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMA---TRNSCIRDPATIELLFEISQALNDGFDF

Query:  ANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFN
          ++D+    AHL++ F+++V FG + E+ L+F VE R  F  ++ +   L+HS N LA++  +  K NH      FV++C+A+  +T+PS+     + N
Subjt:  ANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAK-NH----VNFVKSCIAFSEVTLPSISTHIKQFN

Query:  LYLETAEVALLGGLISHSDELIDSAISCLHNM--DIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCA
        LYL + +VAL    +S +D    +AI  +  +   I    +        L  +    S L+IVP +P HG  +  + L++ + D  W  +   + RI  +
Subjt:  LYLETAEVALLGGLISHSDELIDSAISCLHNM--DIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWM-TPRMRTRILCA

Query:  ILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQNLVD---AILQESSPATRGILALEACNAILSSFTIRD
        +L LL+  SQ+   YH D   +  +++++ GD  +  E   L E ++  +++    + ++ +   + +L L   N+IL+   +R+
Subjt:  ILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQNLVD---AILQESSPATRGILALEACNAILSSFTIRD

Arabidopsis top hitse value%identityAlignment
AT1G50730.1 unknown protein1.0e-14849.15Show/hide
Query:  GLLVSCVNDMNAQLKHFIPAKETKTGSS--MDDKVLLVGMMEPTIEYIVKCIFKDVSQ-RQLDGTLLALGLGTN----MENSLCISIVLHHILKELPVEV
        G L+ C+ D+   L    P    K G S   DDK LL  ++EP IEYI+KC+F    Q   + G L  LG G N      NS  +SI+LH++LKELP E+
Subjt:  GLLVSCVNDMNAQLKHFIPAKETKTGSS--MDDKVLLVGMMEPTIEYIVKCIFKDVSQ-RQLDGTLLALGLGTN----MENSLCISIVLHHILKELPVEV

Query:  VSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVID
        VSS AM+ L +I  SND SF Q LNYRLLG RL E +     + ++++ V++  +Q +SL +YL ++DAY+D++LQN+ +  ++ +L+ I     ++ + 
Subjt:  VSSNAMKFLQLIDRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVID

Query:  ENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDN-QPAHLLSRFV
        E    SLQSII KLLSH+++L++V  L+HF+EILDL+ G  +S + +++L M TRN CI D  T++LLFE+SQAL D  DF N+KDDDN Q +HL+SRFV
Subjt:  ENGLLSLQSIIGKLLSHYQHLEDVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDN-QPAHLLSRFV

Query:  QLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELID
        ++VD+G E ERHL FL ECR AF  I ELKETLV SSN LAVKAL+  K H+NFVKSC+AFSEVT+PSIS+  K  NLYLETAEVALLGGLISHSDEL+ 
Subjt:  QLVDFGIERERHLAFLVECRGAFGTIDELKETLVHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELID

Query:  SAISCLHNMDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSN
        SA+  L N+ + +G + + D +   S I KLCSLLV++PGNP  G     K + S      W T R++ +I CAI+SLL+T SQ+ LPYH+ N  + G+ 
Subjt:  SAISCLHNMDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPSHGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSN

Query:  NVFYGDLAYCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLE
         +F+GD +Y QELVS ++ ++  L+DAI QESS  +RG +ALEACN I S+  + ++   +C +L+ETAK C+  +++Y++ST   L+
Subjt:  NVFYGDLAYCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFTIRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACTTTTTACCAGGATTGCTTGTCTCATGTGTCAATGACATGAATGCTCAATTGAAACATTTCATACCAGCAAAAGAAACCAAAACTGGCAGTTCTATGGATGATAA
AGTCTTGCTTGTTGGTATGATGGAACCAACAATCGAATACATTGTTAAATGCATATTTAAGGATGTCTCTCAGAGACAATTAGACGGAACACTTTTAGCGCTTGGACTTG
GAACAAATATGGAGAATTCGCTGTGTATCTCAATCGTTCTTCATCACATACTAAAGGAACTTCCAGTTGAAGTAGTAAGCTCGAATGCTATGAAATTTCTCCAGCTCATT
GATCGCAGCAATGATTCATCCTTCCGTCAGTTCTTGAATTACAGGTTACTCGGGCTCAGGCTTTGTGAAAGGAGACCCTCTATGGATATTGTGGATGCTGTAATGAATAA
TGTACTTAAGGTTATTGCGCAAAATGAGAGCCTTGATGAGTATCTGACAGTCATTGATGCCTATTTGGATATTGTTCTTCAAAATCAGTCGGATTGCTGTGTAAATACGA
TTTTAGAAGCTATTTCACAGCGAACATGCAATAGAGTGATAGATGAGAATGGACTCCTCAGTCTGCAGTCAATTATAGGGAAGCTTCTTTCTCATTACCAGCATTTGGAA
GATGTATTTGCTCTGAGCCATTTTCTGGAGATTTTGGACTTGCTTGTTGGGAGACCAAGGAGCATTATCATCATTAATATTCTTAAAATGGCTACTAGGAACTCTTGTAT
ACGTGATCCAGCAACCATAGAATTGCTTTTCGAAATTTCTCAGGCTCTTAATGATGGCTTTGATTTTGCCAACATGAAAGATGATGATAACCAACCAGCACATTTGCTTT
CTCGTTTTGTCCAACTGGTGGACTTTGGGATAGAGAGGGAGCGCCATCTAGCATTCCTAGTTGAGTGTCGTGGAGCATTTGGTACCATAGATGAGCTTAAGGAAACTCTC
GTGCATTCTAGCAATGGTTTAGCTGTAAAGGCTTTAAGAGATGCGAAGAACCATGTCAATTTTGTCAAATCCTGCATAGCATTTTCTGAAGTCACATTACCGTCAATATC
AACTCATATTAAGCAGTTCAATCTTTATCTTGAGACTGCAGAGGTCGCCCTGTTAGGTGGTTTAATTTCTCATTCAGATGAATTAATAGATTCAGCAATCAGCTGCTTGC
ACAATATGGACATTAAGGAGGGCTCCCGTGCAGCAGCCGACACTGAACTTTTTCTCTCCTCAATTCAAAAATTATGCAGCCTCTTGGTTATTGTCCCTGGTAATCCTAGT
CATGGAAGTGCTTACTTTCCAAAGATTTTAGTATCATTTGTAAATGATATACCATGGATGACTCCTAGAATGAGGACAAGGATCTTATGTGCGATACTTTCATTATTGGC
AACATGTTCCCAAAATAGACTCCCATATCATGCAGATAATGGAGTGTTGTGGGGTTCAAACAATGTCTTCTACGGTGACTTGGCCTATTGTCAAGAACTTGTCTCTTTGT
CTGAGCATATTGTACAGAATCTAGTCGATGCTATTCTCCAAGAGTCTTCTCCGGCTACACGTGGAATACTGGCGCTTGAAGCTTGTAATGCCATCCTATCGTCTTTCACT
ATAAGAGATGAAACATATGCAATTTGCTCGAAGTTGATTGAGACTGCTAAATTGTGTATGAGTGACAGCAACAAATATTTGCAGTCAACCTTTCATCGCTTAGAGGAAAA
GTCACAATTCAATGAAACAGCCTTATTTTGTCAGTGCTTTCCAAAGGCAAAGGCAATTTTGGACCACCCCGATATACAAGGAGCTGACGAGGACAACCGGGGAGGAATCG
GGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAGTGAGACGGGCCAAGACCGAAGGGGTCGGGTTTTGGGCCCGACCCCCTGCTCGGCCTCGGCCATGGGCCGAGGCC
GAGCCTGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGATGCCCCGATTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTA
TTTATATCCCTCTTCGCCACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTATATTCTCTATTCTCTGCTTTCTCCTCTTGCTTTTACTTTTCCACGCCCTACCGTTC
TGCTTGCTGACTTAAGCATCGGAGCCGGTGTGGCAAGTACCACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTTCCCCCTCAACTACAAATTTACCGTTGGT
GGCACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACTTTTTACCAGGATTGCTTGTCTCATGTGTCAATGACATGAATGCTCAATTGAAACATTTCATACCAGCAAAAGAAACCAAAACTGGCAGTTCTATGGATGATAA
AGTCTTGCTTGTTGGTATGATGGAACCAACAATCGAATACATTGTTAAATGCATATTTAAGGATGTCTCTCAGAGACAATTAGACGGAACACTTTTAGCGCTTGGACTTG
GAACAAATATGGAGAATTCGCTGTGTATCTCAATCGTTCTTCATCACATACTAAAGGAACTTCCAGTTGAAGTAGTAAGCTCGAATGCTATGAAATTTCTCCAGCTCATT
GATCGCAGCAATGATTCATCCTTCCGTCAGTTCTTGAATTACAGGTTACTCGGGCTCAGGCTTTGTGAAAGGAGACCCTCTATGGATATTGTGGATGCTGTAATGAATAA
TGTACTTAAGGTTATTGCGCAAAATGAGAGCCTTGATGAGTATCTGACAGTCATTGATGCCTATTTGGATATTGTTCTTCAAAATCAGTCGGATTGCTGTGTAAATACGA
TTTTAGAAGCTATTTCACAGCGAACATGCAATAGAGTGATAGATGAGAATGGACTCCTCAGTCTGCAGTCAATTATAGGGAAGCTTCTTTCTCATTACCAGCATTTGGAA
GATGTATTTGCTCTGAGCCATTTTCTGGAGATTTTGGACTTGCTTGTTGGGAGACCAAGGAGCATTATCATCATTAATATTCTTAAAATGGCTACTAGGAACTCTTGTAT
ACGTGATCCAGCAACCATAGAATTGCTTTTCGAAATTTCTCAGGCTCTTAATGATGGCTTTGATTTTGCCAACATGAAAGATGATGATAACCAACCAGCACATTTGCTTT
CTCGTTTTGTCCAACTGGTGGACTTTGGGATAGAGAGGGAGCGCCATCTAGCATTCCTAGTTGAGTGTCGTGGAGCATTTGGTACCATAGATGAGCTTAAGGAAACTCTC
GTGCATTCTAGCAATGGTTTAGCTGTAAAGGCTTTAAGAGATGCGAAGAACCATGTCAATTTTGTCAAATCCTGCATAGCATTTTCTGAAGTCACATTACCGTCAATATC
AACTCATATTAAGCAGTTCAATCTTTATCTTGAGACTGCAGAGGTCGCCCTGTTAGGTGGTTTAATTTCTCATTCAGATGAATTAATAGATTCAGCAATCAGCTGCTTGC
ACAATATGGACATTAAGGAGGGCTCCCGTGCAGCAGCCGACACTGAACTTTTTCTCTCCTCAATTCAAAAATTATGCAGCCTCTTGGTTATTGTCCCTGGTAATCCTAGT
CATGGAAGTGCTTACTTTCCAAAGATTTTAGTATCATTTGTAAATGATATACCATGGATGACTCCTAGAATGAGGACAAGGATCTTATGTGCGATACTTTCATTATTGGC
AACATGTTCCCAAAATAGACTCCCATATCATGCAGATAATGGAGTGTTGTGGGGTTCAAACAATGTCTTCTACGGTGACTTGGCCTATTGTCAAGAACTTGTCTCTTTGT
CTGAGCATATTGTACAGAATCTAGTCGATGCTATTCTCCAAGAGTCTTCTCCGGCTACACGTGGAATACTGGCGCTTGAAGCTTGTAATGCCATCCTATCGTCTTTCACT
ATAAGAGATGAAACATATGCAATTTGCTCGAAGTTGATTGAGACTGCTAAATTGTGTATGAGTGACAGCAACAAATATTTGCAGTCAACCTTTCATCGCTTAGAGGAAAA
GTCACAATTCAATGAAACAGCCTTATTTTGTCAGTGCTTTCCAAAGGCAAAGGCAATTTTGGACCACCCCGATATACAAGGAGCTGACGAGGACAACCGGGGAGGAATCG
GGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAGTGAGACGGGCCAAGACCGAAGGGGTCGGGTTTTGGGCCCGACCCCCTGCTCGGCCTCGGCCATGGGCCGAGGCC
GAGCCTGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGATGCCCCGATTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTA
TTTATATCCCTCTTCGCCACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTATATTCTCTATTCTCTGCTTTCTCCTCTTGCTTTTACTTTTCCACGCCCTACCGTTC
TGCTTGCTGACTTAAGCATCGGAGCCGGTGTGGCAAGTACCACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTTCCCCCTCAACTACAAATTTACCGTTGGT
GGCACGTGA
Protein sequenceShow/hide protein sequence
MHFLPGLLVSCVNDMNAQLKHFIPAKETKTGSSMDDKVLLVGMMEPTIEYIVKCIFKDVSQRQLDGTLLALGLGTNMENSLCISIVLHHILKELPVEVVSSNAMKFLQLI
DRSNDSSFRQFLNYRLLGLRLCERRPSMDIVDAVMNNVLKVIAQNESLDEYLTVIDAYLDIVLQNQSDCCVNTILEAISQRTCNRVIDENGLLSLQSIIGKLLSHYQHLE
DVFALSHFLEILDLLVGRPRSIIIINILKMATRNSCIRDPATIELLFEISQALNDGFDFANMKDDDNQPAHLLSRFVQLVDFGIERERHLAFLVECRGAFGTIDELKETL
VHSSNGLAVKALRDAKNHVNFVKSCIAFSEVTLPSISTHIKQFNLYLETAEVALLGGLISHSDELIDSAISCLHNMDIKEGSRAAADTELFLSSIQKLCSLLVIVPGNPS
HGSAYFPKILVSFVNDIPWMTPRMRTRILCAILSLLATCSQNRLPYHADNGVLWGSNNVFYGDLAYCQELVSLSEHIVQNLVDAILQESSPATRGILALEACNAILSSFT
IRDETYAICSKLIETAKLCMSDSNKYLQSTFHRLEEKSQFNETALFCQCFPKAKAILDHPDIQGADEDNRGGIGLKDGPRRQNRQVRRAKTEGVGFWARPPARPRPWAEA
EPVRSRLVPTASGCPDFAWFDLKRLRNPKKARRMNRYLYPSSPLKRGSRILSLNYILYSLLSPLAFTFPRPTVLLADLSIGAGVASTTPVCRFTVLQATSFPLNYKFTVG
GT