; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001483 (gene) of Snake gourd v1 genome

Gene IDTan0001483
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationLG11:24394956..24397476
RNA-Seq ExpressionTan0001483
SyntenyTan0001483
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601794.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.0e-12673.03Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLAILTNQR++RRSQAQSDVLQLLQL H QRALLRVEQVIK+QN LDAYVLIEGYLNLLIER SLLEQ RECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEA+AGLLF+ASRCGDFPELHEIKSV TS FGKEFTARAVELRNNCGVNHLLMQKLSTR PNLESRMD+LK IASENGI  Q+DE+   P SN    
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSEA----------DAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK
          A N+RQN SE             E PSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSE QDP                    KK EVE+KPK
Subjt:  KLAGNTRQNHSEA----------DAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK

Query:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY
         EI+Y  G+  EEEGSKIR  AEVKNSMCG  S+RE   +EN EMDEQR +N+  G +ME  LEK       SFRLNLEKKP+SVRTRR RGY
Subjt:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY

XP_022139373.1 uncharacterized protein LOC111010325 [Momordica charantia]2.7e-12270.81Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGRKLDALLGRNFRASKFRPLLNLALSRLA+LTNQR VRRSQA+SD LQLLQL HH RALLRVEQVIKEQN LDAYVLIEGYLNLLIERT LLEQ RECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEAV+GL+F+ASRCGDFPEL EIKSV T+ FGKEFTARAVELRNNCGV+H +MQKLSTRQPNLESRM+VL+ IASEN I  Q+DE    P S  N+E
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSEA-----------DAETPSGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQD--------PCSG--------KKPEVES
        K A N RQN+ +             A+ PSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+ QD        P SG        KK EVES
Subjt:  KLAGNTRQNHSEA-----------DAETPSGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQD--------PCSG--------KKPEVES

Query:  KPKPEIKYGNGREDEEEGSKIRAEVKNSMCGSSRESSDEENSEMDEQRGSNLGNGLDMET--------KLEKASFRLNLEKKPMSVRTRRTRGY
        KPK EI+Y N RE+EE GSKI AEVKNSM  S   S   E+SEM+EQR  N+  GLDMET        + +K SF LNLEKKPMSVRTRR RG+
Subjt:  KPKPEIKYGNGREDEEEGSKIRAEVKNSMCGSSRESSDEENSEMDEQRGSNLGNGLDMET--------KLEKASFRLNLEKKPMSVRTRRTRGY

XP_022921602.1 uncharacterized protein LOC111429814 [Cucurbita moschata]4.7e-12773.28Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGR LDALLGRNFRASKFRPLLNLALSRLAILTNQR++RRSQAQSDVLQLLQL H QRALLRVEQVIK+QN LDAYVLIEGYLNLLIERTSLLEQ R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEA+AGLLF+ASRCGDFPELHEIKSV TS FGKEFTARAVELRNNCGVNHLLMQKLSTR PNLESRMD+LK IASENGI  Q+DE+   P SN    
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSEA----------DAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK
          A N+RQN SE             E PSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSE QDP                    KK EVE+KPK
Subjt:  KLAGNTRQNHSEA----------DAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK

Query:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY
         EI+Y  G+  EEEGSKIR  AEVKNSMCG  S+RE   +EN EMDEQR +N+  G +ME  LEK       SFRLNLEKKP+SVRTRR RGY
Subjt:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY

XP_022971762.1 uncharacterized protein LOC111470447 [Cucurbita maxima]6.8e-12672.77Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLAILTNQR++RR+QAQSDVLQLLQL H QRALLRVEQVIK+QN LDAYVL+EGYLNLLIERTSLLEQ RECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEA+AGLLF+ASRCGDFPELHEIKSV T+ FGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMD+LK IASENGI  Q+DE+   P SN    
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSE----------ADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK
          A N+RQN SE             E PSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSE QDP                    KK  VE+KPK
Subjt:  KLAGNTRQNHSE----------ADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK

Query:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY
         EI+Y  GR  EEEGSKIR   EVKNSMCG  S+RE   +EN EMDEQR SN+  G +ME  LEK       SFRLNLEKKP+SVRTRR RGY
Subjt:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY

XP_023538782.1 uncharacterized protein LOC111799606 [Cucurbita pepo subsp. pepo]1.4e-12674.61Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGR LDALLGRNFRASKFRPLLNLALSRLAILTNQR++RRSQAQSDVLQLLQLRH QRALLRVEQVIK+QN LDAYVLIEGYLNLLIERTSLLEQ RECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDE---SLSPPPSNY
        EELKEA+AGLLF+ASRCGDFPELHEIKSV TS FGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMD+LK IASENGI  Q+DE   S     S+ 
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDE---SLSPPPSNY

Query:  NQEKLAGNTRQNHSEADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQD--------PCSG---------KKPEVESKPKPEIKYGN
        NQ +  G   +N  +   E PSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR E QD        P SG         KK +VE+KPK EI+Y  
Subjt:  NQEKLAGNTRQNHSEADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQD--------PCSG---------KKPEVESKPKPEIKYGN

Query:  GREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY
        GR  EEEGSKIR  AEVKNSMCG  S+RE   +EN EMDEQR SN+  G +ME  LEK       SFRLNLEKKP+SVRTRR RG+
Subjt:  GREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY

TrEMBL top hitse value%identityAlignment
A0A0A0KAU0 Uncharacterized protein5.3e-10867.54Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGRKLDALLGRNFRASKFRPLLNL+LSRL+ILT QR+V  SQA SDVLQLLQL HH RALLRVE+VIK+QN LDAYVLIEGYLNLL+ERT+LLEQ  ECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEAVAGLLF+ASRCGDFPELHEIKSV T+ FGKEFTARAVELRNNCGVN  LMQKLSTRQP LE+RMD LK IASENGI  QID+     PS+  QE
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSEADA-----ETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR-----SEPQDPCSG---------KKPEVESKPKPEI-KYGN
        K+  N RQ+ +E  +     E  SGSK  YKDVADAAQAAFESAAQAAAAARAAMELSR     S P  P SG         +K EVESK K E+ +YGN
Subjt:  KLAGNTRQNHSEADA-----ETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR-----SEPQDPCSG---------KKPEVESKPKPEI-KYGN

Query:  GREDEEEGSKIRAEVKNSMCGSSRESSDEENSEMDEQRGSNLGNGLDMETKL------EKASFRLNLEKKPMSVRTRRTRGY
        GR+ E EG                   +EE   MDE+R S   NGL METK+      EK SFRLNLEKKP+SVRTRR  GY
Subjt:  GREDEEEGSKIRAEVKNSMCGSSRESSDEENSEMDEQRGSNLGNGLDMETKL------EKASFRLNLEKKPMSVRTRRTRGY

A0A6J1CFG6 uncharacterized protein LOC1110103251.3e-12270.81Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGRKLDALLGRNFRASKFRPLLNLALSRLA+LTNQR VRRSQA+SD LQLLQL HH RALLRVEQVIKEQN LDAYVLIEGYLNLLIERT LLEQ RECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEAV+GL+F+ASRCGDFPEL EIKSV T+ FGKEFTARAVELRNNCGV+H +MQKLSTRQPNLESRM+VL+ IASEN I  Q+DE    P S  N+E
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSEA-----------DAETPSGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQD--------PCSG--------KKPEVES
        K A N RQN+ +             A+ PSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+ QD        P SG        KK EVES
Subjt:  KLAGNTRQNHSEA-----------DAETPSGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQD--------PCSG--------KKPEVES

Query:  KPKPEIKYGNGREDEEEGSKIRAEVKNSMCGSSRESSDEENSEMDEQRGSNLGNGLDMET--------KLEKASFRLNLEKKPMSVRTRRTRGY
        KPK EI+Y N RE+EE GSKI AEVKNSM  S   S   E+SEM+EQR  N+  GLDMET        + +K SF LNLEKKPMSVRTRR RG+
Subjt:  KPKPEIKYGNGREDEEEGSKIRAEVKNSMCGSSRESSDEENSEMDEQRGSNLGNGLDMET--------KLEKASFRLNLEKKPMSVRTRRTRGY

A0A6J1E0Z0 uncharacterized protein LOC1114298142.3e-12773.28Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGR LDALLGRNFRASKFRPLLNLALSRLAILTNQR++RRSQAQSDVLQLLQL H QRALLRVEQVIK+QN LDAYVLIEGYLNLLIERTSLLEQ R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEA+AGLLF+ASRCGDFPELHEIKSV TS FGKEFTARAVELRNNCGVNHLLMQKLSTR PNLESRMD+LK IASENGI  Q+DE+   P SN    
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSEA----------DAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK
          A N+RQN SE             E PSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSE QDP                    KK EVE+KPK
Subjt:  KLAGNTRQNHSEA----------DAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK

Query:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY
         EI+Y  G+  EEEGSKIR  AEVKNSMCG  S+RE   +EN EMDEQR +N+  G +ME  LEK       SFRLNLEKKP+SVRTRR RGY
Subjt:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY

A0A6J1E4E5 uncharacterized protein LOC1114298232.9e-12270.99Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGR LDALLGRNFRASKFRPLLNL LSRLAILTNQR++RRSQAQSDVLQLLQL H QRALLRVEQVIK+QN LDAYVLIEGYLNLLIER SLLEQ RECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEA+AGLLF+ASRCGDFPELHEIKS  TS FGKEFTARAVELRNNCGVNHLLMQKLSTR PNLESRMD+LK IASENGI  Q+DE+   P SN    
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSEAD----------AETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDPCSGKKP-----------------EVESKPK
          A N+RQN  E D           E PSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR E QDP     P                 +VE+KPK
Subjt:  KLAGNTRQNHSEAD----------AETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDPCSGKKP-----------------EVESKPK

Query:  PEIKYGNGREDEEEGSKI--RAEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY
         EI+Y  GR  EEEGSK+  + EVKNSMCG  S+RE   +EN EMDEQR +N+  G +ME  LEKA      SF LNLEKKP+ VRT R RGY
Subjt:  PEIKYGNGREDEEEGSKI--RAEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY

A0A6J1I2V3 uncharacterized protein LOC1114704473.3e-12672.77Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLAILTNQR++RR+QAQSDVLQLLQL H QRALLRVEQVIK+QN LDAYVL+EGYLNLLIERTSLLEQ RECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EELKEA+AGLLF+ASRCGDFPELHEIKSV T+ FGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMD+LK IASENGI  Q+DE+   P SN    
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  KLAGNTRQNHSE----------ADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK
          A N+RQN SE             E PSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSE QDP                    KK  VE+KPK
Subjt:  KLAGNTRQNHSE----------ADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDP-----------------CSGKKPEVESKPK

Query:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY
         EI+Y  GR  EEEGSKIR   EVKNSMCG  S+RE   +EN EMDEQR SN+  G +ME  LEK       SFRLNLEKKP+SVRTRR RGY
Subjt:  PEIKYGNGREDEEEGSKIR--AEVKNSMCG--SSRESSDEENSEMDEQRGSNLGNGLDMETKLEKA------SFRLNLEKKPMSVRTRRTRGY

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog6.4e-1028.48Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A+ ++   L     +RA +RVE +I+E  +++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV

Query:  AGLLFSASRC-GDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L+++A R   +  EL  +     + + KE+  +         VN  LM KLS   P
Subjt:  AGLLFSASRC-GDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q54I39 IST1-like protein4.4e-1128.57Show/hide
Query:  GRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAVAG
        G ++ + K +  L LA+SR+ IL N++       + +V +LL+ ++ + A +RVE +I+++ +++ + +IE    LL  R +L+    E P E+KE++  
Subjt:  GRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAVAG

Query:  LLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPP
        L++S+ R    PEL +IK+   + +GK      +E   NC     VN  ++ KLS   P+       L  IA +  + +       PPP
Subjt:  LLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPP

Q568Z6 IST1 homolog6.4e-1028.48Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A+ ++   L     +RA +RVE +I+E  +++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV

Query:  AGLLFSASRC-GDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L+++A R   +  EL  +     + + KE+  +         VN  LM KLS   P
Subjt:  AGLLFSASRC-GDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q5R6G8 IST1 homolog1.4e-0930.19Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A+ ++   L     +RA +RVE +I+E  +++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV

Query:  AGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNN--CGVNHLLMQKLSTRQP
        + L+++A R     E+ E+K V      K         R N    VN  LM KLS   P
Subjt:  AGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNN--CGVNHLLMQKLSTRQP

Q9CX00 IST1 homolog6.4e-1028.48Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A+ ++   L     +RA +RVE +I+E  +++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV

Query:  AGLLFSASRC-GDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L+++A R   +  EL  +     + + KE+  +         VN  LM KLS   P
Subjt:  AGLLFSASRC-GDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein7.4e-6243.03Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP
        MG+KLDALLGR+F+ +KF+ L+ LAL+RL+IL NQRQ R SQA SDV +LL+L  H+ A  RV+QV+K+QN LD    I GY  L ++R  L E +R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECP

Query:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE
        EEL EAV+GLLF+ASR G+FPEL EI++V  S FGK+  AR++ELR+NCGV+  ++QKLSTR P  E RM  LK IA+EN I  ++D++ +      N +
Subjt:  EELKEAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQE

Query:  --------KLAGNTRQNHSEADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDPCS-----GKKPEVESKPKPEIKYGNGREDEEE
                KL     +      +++    K+KYKDVADAAQAAFESAA AA AA+AA+ELS+  P+   S     G+     S+ K   +   G +D  E
Subjt:  --------KLAGNTRQNHSEADAETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDPCS-----GKKPEVESKPKPEIKYGNGREDEEE

Query:  G-SKIRAEVKNSMCGSS----------RE-------------SSDEE----------NSEMDEQR---GSNLGNGL---DMETKLEKASFRLNLEKKPMS
        G   + +E K SM  S           RE              S+EE          +   DEQ    GSN  +      M   +E    R    K P+S
Subjt:  G-SKIRAEVKNSMCGSS----------RE-------------SSDEE----------NSEMDEQR---GSNLGNGL---DMETKLEKASFRLNLEKKPMS

Query:  VRTRRTRGY
        VRTR+ RGY
Subjt:  VRTRRTRGY

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein4.1e-2832.55Show/hide
Query:  LDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELK
        L+ L  R    +K +  LNLA++R+ +L N+R ++    + ++   LQ      A +RVE VI+E N+  AY ++E +   ++ R  +LE  +ECP EL+
Subjt:  LDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELK

Query:  EAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQEKLAG
        EA+A ++F+A RC + P+L +IK++F + +GKEF   A ELR + GVN  +++KLS   P+  +R+ +LK IA E  +    D S +      + E L G
Subjt:  EAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQEKLAG

Query:  NTRQNHSEADAETPSGSKQKY--KDVADAAQAAFESAAQAAAAARAAMELSRSEP
          +Q H +        S+Q Y    V+   ++    A Q     +A   +S+S P
Subjt:  NTRQNHSEADAETPSGSKQKY--KDVADAAQAAFESAAQAAAAARAAMELSRSEP

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein7.4e-3036.36Show/hide
Query:  LDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELK
        LD+   + F+A+K + LL L + R+ ++ N+R+ +  Q + ++ +LL+      A +RVE +I+E+ ++ A  ++E +  L+  R  ++E  RECP +LK
Subjt:  LDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELK

Query:  EAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASEN
        EA++ + F+A RC D  EL +++ +F S +GKEF A A EL+ + GVN  L++ LS R P+ E+++ +LK IA E+
Subjt:  EAVAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASEN

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein1.3e-2937.04Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV
        +L R F+ +K +  L +A SRL IL N+++++  Q + ++ QLL+      A +RVE V++E+  + AY LI  Y  LL+ R  ++E  + CP +LKEAV
Subjt:  LLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAV

Query:  AGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSN
          +LF++ R  D PEL EI   FT+ +GK+F+  AVELR + GV+ LL++KLS + P+  +++ +L  IA E+ +  +    +   P +
Subjt:  AGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSN

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein7.7e-2729.43Show/hide
Query:  ALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEA
        +L  R F +SK +    +A++R+ ++ N+R V   Q + D+  LLQ      A +RVE VI+EQN+  A  +IE +  L++ R +++ + ++CP +LKE 
Subjt:  ALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEA

Query:  VAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIA-----------SENGIAEQIDESLSPPPS
        +A L+F+A RC + PEL +++ +F   +GK+F + A +LR +CGVN +L+ KLS R P  E ++ ++K IA           +E  + +  +ES+  P  
Subjt:  VAGLLFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIA-----------SENGIAEQIDESLSPPPS

Query:  NYNQEKLAGNTRQNHSEAD-------AETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDPCSGKKPEVESKPKPEIKYGNGREDEEEG
          +   L  N    +   D       + +       Y D   AA+AA E A QA AAA+ A  L+        S K+  V S      K     +     
Subjt:  NYNQEKLAGNTRQNHSEAD-------AETPSGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDPCSGKKPEVESKPKPEIKYGNGREDEEEG

Query:  SKIRAEVKNSMCGSSRESSDEENSEMDEQRGSN
           R + ++S   S       EN  M  +   N
Subjt:  SKIRAEVKNSMCGSSRESSDEENSEMDEQRGSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAAAACTAGACGCTCTTCTCGGAAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAATCTCGCCCTCTCTCGCCTCGCCATCTTAACCAACCAACGCCA
AGTCCGGCGCTCTCAAGCTCAATCCGACGTCCTCCAGCTCCTCCAGCTCCGCCACCACCAACGCGCTCTTCTTCGAGTTGAGCAAGTGATTAAGGAGCAGAATGTGTTAG
ATGCGTACGTTTTGATTGAAGGCTATCTCAATCTCTTGATTGAAAGGACTTCTCTCCTCGAACAGCACAGAGAGTGCCCTGAGGAATTGAAAGAGGCGGTTGCAGGGTTG
CTATTCTCTGCTTCCAGATGTGGGGATTTCCCTGAACTTCATGAGATTAAATCCGTTTTCACCTCTTGTTTTGGCAAGGAATTCACTGCTCGAGCTGTTGAATTACGCAA
CAACTGTGGAGTCAATCATTTGTTAATGCAAAAACTGTCCACAAGGCAGCCCAATTTGGAGAGTAGAATGGACGTGCTTAAATTCATTGCTTCTGAGAATGGAATCGCTG
AGCAAATTGACGAATCTTTATCTCCTCCTCCTTCCAATTACAATCAGGAAAAACTTGCTGGAAACACAAGACAAAACCATTCTGAGGCAGATGCTGAAACGCCGTCTGGT
TCTAAACAAAAGTACAAGGATGTGGCGGATGCAGCGCAAGCGGCTTTCGAATCAGCGGCTCAAGCGGCAGCTGCTGCCAGAGCTGCCATGGAGCTCTCTCGATCTGAACC
ACAAGACCCTTGCTCTGGAAAGAAACCTGAGGTCGAATCTAAACCGAAACCGGAAATCAAATATGGAAATGGAAGAGAAGATGAAGAAGAAGGCAGCAAAATCAGGGCGG
AAGTGAAGAATTCGATGTGTGGTTCAAGTAGAGAGTCATCTGATGAAGAAAACAGTGAAATGGATGAGCAGAGAGGTTCAAATCTTGGGAATGGGCTGGACATGGAAACG
AAACTAGAGAAGGCAAGCTTTCGTTTAAATCTGGAGAAGAAGCCAATGTCAGTGAGAACAAGAAGAACGCGTGGATACTGA
mRNA sequenceShow/hide mRNA sequence
GGAGACTTCGTCTATAAGCAACACACAGAAACACACAAACCCACCGCCACTGCCTGTGCCTTCCACTCTGAAACAAACATAAATATAATATATAAACCCATTTCGCCATG
AAAGCTTCAAAATCCTTCACTTTTGGCAGCCATGGGAAGAAAACTAGACGCTCTTCTCGGAAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAATCTCGCCCTCT
CTCGCCTCGCCATCTTAACCAACCAACGCCAAGTCCGGCGCTCTCAAGCTCAATCCGACGTCCTCCAGCTCCTCCAGCTCCGCCACCACCAACGCGCTCTTCTTCGAGTT
GAGCAAGTGATTAAGGAGCAGAATGTGTTAGATGCGTACGTTTTGATTGAAGGCTATCTCAATCTCTTGATTGAAAGGACTTCTCTCCTCGAACAGCACAGAGAGTGCCC
TGAGGAATTGAAAGAGGCGGTTGCAGGGTTGCTATTCTCTGCTTCCAGATGTGGGGATTTCCCTGAACTTCATGAGATTAAATCCGTTTTCACCTCTTGTTTTGGCAAGG
AATTCACTGCTCGAGCTGTTGAATTACGCAACAACTGTGGAGTCAATCATTTGTTAATGCAAAAACTGTCCACAAGGCAGCCCAATTTGGAGAGTAGAATGGACGTGCTT
AAATTCATTGCTTCTGAGAATGGAATCGCTGAGCAAATTGACGAATCTTTATCTCCTCCTCCTTCCAATTACAATCAGGAAAAACTTGCTGGAAACACAAGACAAAACCA
TTCTGAGGCAGATGCTGAAACGCCGTCTGGTTCTAAACAAAAGTACAAGGATGTGGCGGATGCAGCGCAAGCGGCTTTCGAATCAGCGGCTCAAGCGGCAGCTGCTGCCA
GAGCTGCCATGGAGCTCTCTCGATCTGAACCACAAGACCCTTGCTCTGGAAAGAAACCTGAGGTCGAATCTAAACCGAAACCGGAAATCAAATATGGAAATGGAAGAGAA
GATGAAGAAGAAGGCAGCAAAATCAGGGCGGAAGTGAAGAATTCGATGTGTGGTTCAAGTAGAGAGTCATCTGATGAAGAAAACAGTGAAATGGATGAGCAGAGAGGTTC
AAATCTTGGGAATGGGCTGGACATGGAAACGAAACTAGAGAAGGCAAGCTTTCGTTTAAATCTGGAGAAGAAGCCAATGTCAGTGAGAACAAGAAGAACGCGTGGATACT
GAAGAAGAAGATCACATATGTATGTAATTATGTGGTTTAATTTGGAAATTTTGCTAGTGTAGTGTATGGAAAATTCAACATTTTTGTTTGTTTTACGTGGTATATTCAAA
GTTCATGTTCAGTTATTTAATCAACCTCCCAGTCTACACCATAAACAAACAAACATGGGTTCGACTTGAAACCGGGTTCTCCTTTCTATCACACTCTTATATCATCCAAT
TGCGTTCTTCTTTACAAACTCTTTCCAAGAAATCTAATGAAAGCATTGATTTGTATATTCAACGAGCAAAAGATATTAATGATTGTCTGGCTAATGTTAGTATCTATATT
GAGGATGAAGAGATGTTCATGAATGGTCTGCCTATTGAGTTTAGTGTTTTTCAAACTTTTATTAGAATGAGATCTGAACCTATTACCTTCGAGCATCTTTATGCTCTACT
TAAAATGCGAAAACAGTCAATTGAGATGCAAATAAAGAATGTTGAAGCCTTGGTTTCAACCACTGCGTTGTTAACTCTCAACCGATCGCAAAATAACTCTAGAGGAGGGC
ATACTACGTGGTAATAGAGGAAACTCCAATCCGATTAGAGGACGAGGAAATTTTTCTTCTACTAGAGGTTGTTTTATCTCAAACTCTTTGAATTCTGATCCTTCTAATCA
TCTTGTGTGCAAAATTTGTAAGAAACCAGGATACAATGCTTTGGATTGCTTCCACCAGATGAATGATGCCTATCAAGGTTGCCATCCTCCTGAACAACTAGCTACCATGT
TCCTTCTTATTCTTCTACGAATCAACCATCTAACTCTGTTTGGCTAGCAGACTCGGGATGCAATTCACATGTTACTCAGGGTGCTTCCCTTCTTGCCTCGACAGTTCCTT
GCCAGAATGATGAACAAATTTCAGTTGGTGATGGTCAAGGACTTTGTTGGGGTATATGCCATAAACTCATGGTTTTTGTAAATTCTTGATGAA
Protein sequenceShow/hide protein sequence
MGRKLDALLGRNFRASKFRPLLNLALSRLAILTNQRQVRRSQAQSDVLQLLQLRHHQRALLRVEQVIKEQNVLDAYVLIEGYLNLLIERTSLLEQHRECPEELKEAVAGL
LFSASRCGDFPELHEIKSVFTSCFGKEFTARAVELRNNCGVNHLLMQKLSTRQPNLESRMDVLKFIASENGIAEQIDESLSPPPSNYNQEKLAGNTRQNHSEADAETPSG
SKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSEPQDPCSGKKPEVESKPKPEIKYGNGREDEEEGSKIRAEVKNSMCGSSRESSDEENSEMDEQRGSNLGNGLDMET
KLEKASFRLNLEKKPMSVRTRRTRGY