; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS028327 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS028327
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationscaffold47:3393515..3395360
RNA-Seq ExpressionMS028327
SyntenyMS028327
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031582.1 Ist1 domain-containing protein [Cucumis melo var. makuwa]7.7e-14459.26Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        +IK AL RLK+LKKKRYSI++ LREDL ELI+NGYQQIAF RVEQL++DE+LME Y+LIEN CEFIL  FSHVRKHKTCPDDVIEAISSLIFASAR GD 
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------
        PELK VR LFEERFG+SFA  AVEL PGNLVN QIKEKL+++ VS+HEKQ LI+E+ARDCF PA+LALEY PD  QKQVL N DQT  E K E       
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------

Query:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE
             + +DSN    +      S S  VC S     DASS E  PFCEE +VY DDVVEL +PS E G L DQR FKFK  +T  R ENV+  + QSLIE
Subjt:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE

Query:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL
        + HD S K+SVS RS+Q +N  PK   R+     QE+ SL+H KKK  KCCCLSCHS  L   ++NY LEQ  YVYSE       +  SE  ++D     
Subjt:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL

Query:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA
            + +    +EF T  R ++ RN  +GT+VYDVFVYSHCQP ENKETN K EE      HE         S FTKC K A KYPSHVHPKLPDYDEIA
Subjt:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA

Query:  AKFIALKREYSQR
         +FI+LKREY QR
Subjt:  AKFIALKREYSQR

KAE8645768.1 hypothetical protein Csa_020499 [Cucumis sativus]2.2e-15159.46Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        +IK AL RLK+L+KKRYSI++ LREDL ELI+NGYQQIAF RVEQLI+DE+LMEAY+LIENFCE IL  FSH+RKHKTCPDD+ EAISSLIFASAR GD 
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
        PELK VRKLFEERFG+SF   AVEL PGNLVN QIKEKL+++PV +HEKQ LI+E+ARDCF PA+LALE CPDWH+KQVL+NGDQT  + K E       
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN

Query:  AEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNG
        +EE ER V+                C S S     S +   DAS+ E  PFCEE +VY DDVVEL +PS E G L DQR FKFKS +T  R ENV  G+ 
Subjt:  AEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNG

Query:  QSLIEEHHDNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVD
        QSLIE H+  S K++VS RS+Q +N SPK   RRS    QE+ SL+H KKKLMKC CLSCHS  L S ++NYC+EQ  YV+SE       +  SE    D
Subjt:  QSLIEEHHDNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVD

Query:  PHFILDTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPD
                 + +    +EF T  R +  RN  +GT+VYDVFVYS CQP ENKETN K +E+ST  KHE         S FTKC K ADKYPSHVHPKLP+
Subjt:  PHFILDTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPD

Query:  YDEIAAKFIALKREYSQR
        Y+EIAAKFI LKREY +R
Subjt:  YDEIAAKFIALKREYSQR

XP_016901704.1 PREDICTED: uncharacterized protein LOC103495691 [Cucumis melo]3.5e-14459.26Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        +IK AL RLK+LKKKRYSI++ LREDL ELI+NGYQQIAF RVEQL++DE+LME Y+LIEN CEFIL  FSHVRKHKTCPDDVIEAISSLIFASAR GD 
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------
        PELK VR LFEERFG+SFA  AVEL PGNLVN QIKEKL+++ VS+HEKQ LI+E+ARDCF PA+LALEY PD  QKQVL+N DQT  E K E       
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------

Query:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE
             + +DSN    +      S S  VC S     DASS E  PFCEE +VY DDVVEL +PS E G L DQR FKFK  +T  R ENV+  + QSLIE
Subjt:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE

Query:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL
        + HD S K+SVS RS+Q +N  PK   R+     QE+ SL+H KKK  KCCCLSCHS  L   ++NY LEQ  YVYSE       +  SE  ++D     
Subjt:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL

Query:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA
            + +    +EF T  R ++ RN  +GT+VYDVFVYSHCQP ENKETN K EE      HE         S FTKC K A KYPSHVHPKLPDYDEIA
Subjt:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA

Query:  AKFIALKREYSQR
         +FI+LKREY QR
Subjt:  AKFIALKREYSQR

XP_022147829.1 uncharacterized protein LOC111016673 [Momordica charantia]2.1e-28298.19Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAY+LIENFCEFIL NFSHVRKHKTCPDDVIEAISSLIFASARCGDL
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
        PELKSVRKLFEERFGRSFA TAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISE+ARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN

Query:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEEHHDNSKKRSVS
        AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTG+RENVQYGNGQ LIEEHH+NSKKRSVS
Subjt:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEEHHDNSKKRSVS

Query:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTE
        ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKK+LMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILD SLSGKDRHRVEFDTLPRTE
Subjt:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTE

Query:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
        KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
Subjt:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ

XP_038888606.1 uncharacterized protein LOC120078409 [Benincasa hispida]8.4e-16762.24Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHK------TCPDDVIEAISSLIFAS
        +I+ AL RL+MLKKKRYSI++QLREDL EL++NGYQQIAF RVEQLI+DE LMEAY+LIENFCEFIL  FSH++KHK      TCPDD+IEAISSLIFAS
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHK------TCPDDVIEAISSLIFAS

Query:  ARCGDLPELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEP
        ARCGD PELKSVRKLFE+RFGRSFA  AVELCPGNLVN QIKEKLL+KPVSDHEKQ  I+++ARDCF P+ILALEY PDWHQKQV +N D+T+ E KEEP
Subjt:  ARCGDLPELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEP

Query:  -----------NIQDSNAEEFERKVMCVSPSQEVCSSSAT---------LEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGI
                   + +D+N +  + +    S S  VC SS             DASSPE  PFC+E++VYLD++VEL + SME G   DQR FKFKS VT +
Subjt:  -----------NIQDSNAEEFERKVMCVSPSQEVCSSSAT---------LEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGI

Query:  RENVQYGNGQSLIEEHHDNSKKRSVSERSSQSMNKSPK--RSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------V
         ENV+ GN QSLIE+ HD+S  + VS RS+Q +  SPK  R +     QE+ SL++ KKK MKCCCLSCHS  L S ++NYCLEQP YVYSE       V
Subjt:  RENVQYGNGQSLIEEHHDNSKKRSVSERSSQSMNKSPK--RSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------V

Query:  LISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYP
          S  K  D       S +      +EFDT  RT+K  N   GT+VYDVFVYSHCQPVENKETNAKPEE+ T  K+E+S+GFNG+++ FTKC K ADKYP
Subjt:  LISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYP

Query:  SHVHPKLPDYDEIAAKFIALKREYSQR
        SHVHPKLPDYDEIAAKFIALKREY Q+
Subjt:  SHVHPKLPDYDEIAAKFIALKREYSQR

TrEMBL top hitse value%identityAlignment
A0A0A0K3R2 Uncharacterized protein1.1e-15159.46Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        +IK AL RLK+L+KKRYSI++ LREDL ELI+NGYQQIAF RVEQLI+DE+LMEAY+LIENFCE IL  FSH+RKHKTCPDD+ EAISSLIFASAR GD 
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
        PELK VRKLFEERFG+SF   AVEL PGNLVN QIKEKL+++PV +HEKQ LI+E+ARDCF PA+LALE CPDWH+KQVL+NGDQT  + K E       
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN

Query:  AEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNG
        +EE ER V+                C S S     S +   DAS+ E  PFCEE +VY DDVVEL +PS E G L DQR FKFKS +T  R ENV  G+ 
Subjt:  AEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNG

Query:  QSLIEEHHDNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVD
        QSLIE H+  S K++VS RS+Q +N SPK   RRS    QE+ SL+H KKKLMKC CLSCHS  L S ++NYC+EQ  YV+SE       +  SE    D
Subjt:  QSLIEEHHDNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVD

Query:  PHFILDTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPD
                 + +    +EF T  R +  RN  +GT+VYDVFVYS CQP ENKETN K +E+ST  KHE         S FTKC K ADKYPSHVHPKLP+
Subjt:  PHFILDTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPD

Query:  YDEIAAKFIALKREYSQR
        Y+EIAAKFI LKREY +R
Subjt:  YDEIAAKFIALKREYSQR

A0A1S4E0G2 uncharacterized protein LOC1034956911.7e-14459.26Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        +IK AL RLK+LKKKRYSI++ LREDL ELI+NGYQQIAF RVEQL++DE+LME Y+LIEN CEFIL  FSHVRKHKTCPDDVIEAISSLIFASAR GD 
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------
        PELK VR LFEERFG+SFA  AVEL PGNLVN QIKEKL+++ VS+HEKQ LI+E+ARDCF PA+LALEY PD  QKQVL+N DQT  E K E       
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------

Query:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE
             + +DSN    +      S S  VC S     DASS E  PFCEE +VY DDVVEL +PS E G L DQR FKFK  +T  R ENV+  + QSLIE
Subjt:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE

Query:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL
        + HD S K+SVS RS+Q +N  PK   R+     QE+ SL+H KKK  KCCCLSCHS  L   ++NY LEQ  YVYSE       +  SE  ++D     
Subjt:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL

Query:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA
            + +    +EF T  R ++ RN  +GT+VYDVFVYSHCQP ENKETN K EE      HE         S FTKC K A KYPSHVHPKLPDYDEIA
Subjt:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA

Query:  AKFIALKREYSQR
         +FI+LKREY QR
Subjt:  AKFIALKREYSQR

A0A5D3C6Z3 Ist1 domain-containing protein3.7e-14459.26Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        +IK AL RLK+LKKKRYSI++ LREDL ELI+NGYQQIAF RVEQL++DE+LME Y+LIEN CEFIL  FSHVRKHKTCPDDVIEAISSLIFASAR GD 
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------
        PELK VR LFEERFG+SFA  AVEL PGNLVN QIKEKL+++ VS+HEKQ LI+E+ARDCF PA+LALEY PD  QKQVL N DQT  E K E       
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-------

Query:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE
             + +DSN    +      S S  VC S     DASS E  PFCEE +VY DDVVEL +PS E G L DQR FKFK  +T  R ENV+  + QSLIE
Subjt:  ----PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIR-ENVQYGNGQSLIE

Query:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL
        + HD S K+SVS RS+Q +N  PK   R+     QE+ SL+H KKK  KCCCLSCHS  L   ++NY LEQ  YVYSE       +  SE  ++D     
Subjt:  EHHDNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFIL

Query:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA
            + +    +EF T  R ++ RN  +GT+VYDVFVYSHCQP ENKETN K EE      HE         S FTKC K A KYPSHVHPKLPDYDEIA
Subjt:  DTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIA

Query:  AKFIALKREYSQR
         +FI+LKREY QR
Subjt:  AKFIALKREYSQR

A0A6J1CEV2 IST1-like protein isoform X32.5e-7940.5Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        +IK+   RLK+LK K+  I +QLRED+ ELI NGY+Q AFNRVEQ+++DES M AY ++ NFCEFILQN S++RKHK CP+DV EA+SSL+FASARCGDL
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
        PEL+ +RKLF ER+GR F T+AVEL PGNLVN QIKEKL    VS+ +K  +I+E+ARDCF+P +LALEY  DWHQKQV            E P  Q  +
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN

Query:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEEHHDNSKKRSVS
        ++E +R+   +   + +     +    S      F EE +V++DDVVEL + +  EG   DQ LFKFK+      E+  Y    S  + H D S   S S
Subjt:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEEHHDNSKKRSVS

Query:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTE
        E  + S   S K S R S ++      H  +K                   N C E          ISE+K+      +    + K+     F   PR  
Subjt:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTE

Query:  KIRN--------CEVGTIVYDVFVY----SHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSS-----------------FTKCTKAADKYPSHVHPK
        + R          ++ +  YDVF Y    S+ +  + KET       +   K+ SS      K +                 FT+      K PSHVHPK
Subjt:  KIRN--------CEVGTIVYDVFVY----SHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSS-----------------FTKCTKAADKYPSHVHPK

Query:  LPDYDEIAAKFIALKREYSQR
        LPDYD+IAAKF+ALKRE+ Q+
Subjt:  LPDYDEIAAKFIALKREYSQR

A0A6J1D262 uncharacterized protein LOC1110166731.0e-28298.19Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAY+LIENFCEFIL NFSHVRKHKTCPDDVIEAISSLIFASARCGDL
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
        PELKSVRKLFEERFGRSFA TAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISE+ARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN

Query:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEEHHDNSKKRSVS
        AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTG+RENVQYGNGQ LIEEHH+NSKKRSVS
Subjt:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEEHHDNSKKRSVS

Query:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTE
        ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKK+LMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILD SLSGKDRHRVEFDTLPRTE
Subjt:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTE

Query:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
        KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
Subjt:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog1.8e-1028.28Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL
        ++  ++RLK+L+KK+  + ++ R+++A+ ++ G  + A  RVE +IR++ L+EA  ++E +C+ +L  F  ++  K     + E++S+LI+A+ R   ++
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK
         ELK V      ++ + +     +LC  N    VN ++  KL V+
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK

Q3ZBV1 IST1 homolog1.8e-1028.28Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL
        ++  ++RLK+L+KK+  + ++ R+++A+ ++ G  + A  RVE +IR++ L+EA  ++E +C+ +L  F  ++  K     + E++S+LI+A+ R   ++
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK
         ELK V      ++ + +     +LC  N    VN ++  KL V+
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK

Q54I39 IST1-like protein1.9e-1231.88Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDLP
        +K A+SR+++LK K+ +I+R  + ++AEL+    ++ A  RVE +IRDE L+E + +IE  CE +    + +      P ++ E+I +L+++S R   +P
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDLP

Query:  ELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKL
        EL+ ++   + ++G+     A   C  + VNP+I  KL
Subjt:  ELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKL

Q5R6G8 IST1 homolog1.8e-1028.28Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL
        ++  ++RLK+L+KK+  + ++ R+++A+ ++ G  + A  RVE +IR++ L+EA  ++E +C+ +L  F  ++  K     + E++S+LI+A+ R   ++
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK
         ELK V      ++ + +     +LC  N    VN ++  KL V+
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK

Q9CX00 IST1 homolog1.8e-1028.28Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL
        ++  ++RLK+L+KK+  + ++ R+++A+ ++ G  + A  RVE +IR++ L+EA  ++E +C+ +L  F  ++  K     + E++S+LI+A+ R   ++
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK
         ELK V      ++ + +     +LC  N    VN ++  KL V+
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGN---LVNPQIKEKLLVK

Arabidopsis top hitse value%identityAlignment
AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein2.6e-2536.48Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        ++K  + R+K+++ +R + ++Q+R ++A+L+  G +  A  RVE +IR+E +M A  ++E FCE I      +   + CP D+ EAISS+ FA+ RC DL
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARD
         EL+ V+ LF  ++G+ F   A EL P + VN ++ E L V+  S   K  L+ E+A +
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARD

AT1G51900.1 Regulator of Vps4 activity in the MVB pathway protein8.2e-2734.35Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHK--TCPDDVIEAISSLIFASARCG
        ++K+  SRL +LK ++Y+  R LR D+ + I +   + A  R EQL+  E+ +  Y  +  F +FIL  FS  +KH      DD  EA+SSLIFAS +C 
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHK--TCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKL-LVKPVSDHEKQGLISEMARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNI
        ++PEL  + +L  +R+G+ + TTA+++ PGNLVN +IKEKL     VS+ +K  ++ E+A++  +R  IL L Y      K  ++N         EE N+
Subjt:  DLPELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKL-LVKPVSDHEKQGLISEMARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNI

Query:  QDSNAEEFERKVMCVSPSQEVCSSSATLED
         D +  E  +   C++   E      +++D
Subjt:  QDSNAEEFERKVMCVSPSQEVCSSSATLED

AT1G51900.1 Regulator of Vps4 activity in the MVB pathway protein6.3e-0360.71Show/hide
Query:  HVHPKLPDYDEIAAKFIALKREYSQRKQ
        HVHPKLPDYD+IA KF  LK    +R++
Subjt:  HVHPKLPDYDEIAAKFIALKREYSQRKQ

AT2G14830.1 Regulator of Vps4 activity in the MVB pathway protein1.3e-4029.84Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        ++K+   RL +LK K+Y+I   LR D+A+L+  G +  A +R +QL  DE+LM  Y+L+ +F + IL N S++R+ +  PD + EA+S+L+FASARCGDL
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDS
        PEL+++R LF +R+G  F  TA+ L PGN VNPQ+ EKL +  VSD  K  L+ E+  +   R  +LA+EY P++H KQVL+                 S
Subjt:  PELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDS

Query:  NAEEFERKVMCVSPSQEVCSSSA----------TLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEE
           E E++VM  + +Q  CSS            TL DA   E      +     DD +E      EE    DQ +F+F+                   E 
Subjt:  NAEEFERKVMCVSPSQEVCSSSA----------TLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEE

Query:  HHDNSKKRSVSERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRH
          D  K+R    R  +S + S                S P  K + C        W              Y Y      ++K+            GK  +
Subjt:  HHDNSKKRSVSERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRH

Query:  RVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREY
                            IVY+VF                P++       ES  G    K +             HVHPKLPDYD+I A F AL+++ 
Subjt:  RVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREY

Query:  SQRKQH
         Q+++H
Subjt:  SQRKQH

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein1.0e-2437.34Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDLP
        ++ A SRLK+LK K+   ++QLR +LA+L+ +G    A  RVE ++R+E  + AY LI  +CE ++     +   K CP D+ EA++S++FAS R  D+P
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDLP

Query:  ELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARD
        EL  + K F  ++G+ F+T+AVEL P + V+  + EKL  K      K  ++  +A +
Subjt:  ELKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARD

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein1.1e-2636.31Show/hide
Query:  KRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPE
        K A++R+K+++ KR  +++Q+R D+A L+ +G    A  RVE +IR++++  A  +IE FCE I+   + + K K CP D+ E I+SLIFA+ RC ++PE
Subjt:  KRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPE

Query:  LKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARD
        L  +R +F +++G+ F + A +L P   VN  + +KL V+      K  ++ E+A++
Subjt:  LKSVRKLFEERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAAACGAGCTCTGTCCCGGCTTAAGATGCTCAAGAAGAAAAGATATTCAATCCTCAGGCAATTGCGTGAAGATTTGGCTGAGCTGATTAGTAATGGCTATCAACA
AATTGCCTTCAACAGGGTGGAGCAGCTTATTAGAGATGAAAGCCTTATGGAGGCGTACAATTTGATTGAGAATTTCTGTGAATTCATCCTCCAGAACTTCTCCCATGTCA
GAAAACACAAGACCTGCCCAGATGATGTTATTGAAGCAATTTCAAGTCTCATATTTGCCTCTGCAAGATGTGGGGACTTGCCTGAACTTAAATCGGTTCGGAAGCTCTTC
GAGGAGCGATTTGGTCGGAGTTTTGCCACGACTGCTGTCGAATTGTGCCCTGGAAATCTTGTGAATCCACAGATTAAAGAGAAGCTTCTTGTGAAGCCTGTTTCAGATCA
TGAGAAGCAAGGATTGATCAGTGAAATGGCTAGAGATTGCTTTCGTCCAGCAATTTTGGCTCTTGAATACTGTCCTGACTGGCATCAGAAACAGGTACTGGAGAATGGAG
ATCAAACTCATTGTGAGAGCAAAGAAGAACCAAACATTCAGGATTCAAATGCTGAAGAGTTCGAGAGAAAAGTAATGTGTGTCAGTCCATCACAGGAAGTGTGTAGCAGC
TCTGCCACTTTGGAAGATGCATCCTCTCCTGAATGCTCTCCATTTTGTGAAGAAACTCTTGTTTATCTCGATGATGTAGTTGAGCTTCCGAACCCCTCGATGGAAGAAGG
ATACTTGTGGGATCAGAGATTATTCAAGTTCAAATCACCTGTTACGGGAATAAGAGAAAACGTTCAATATGGCAATGGTCAAAGTCTCATAGAAGAACATCACGACAACT
CGAAGAAGAGATCGGTCTCGGAAAGATCTAGCCAGAGTATGAATAAATCTCCGAAAAGATCGATGAGAAGATCGACGCATCAAGAGGATGATAGTTTAAGTCATCCAAAG
AAGAAATTAATGAAGTGTTGCTGCCTATCATGCCATAGCTCCTGGTTGCCATCTGGAATGGAAAACTACTGCTTGGAGCAGCCAAGTTATGTGTATTCTGAAGTTTTGAT
ATCAGAGAAGAAGAAAGTTGATCCACATTTTATTTTAGATACTAGTTTGTCTGGCAAAGACCGCCATCGAGTCGAATTCGATACGCTCCCCAGGACAGAAAAGATAAGAA
ATTGTGAGGTTGGAACTATAGTTTATGATGTTTTTGTCTATTCTCATTGCCAGCCAGTTGAGAACAAGGAAACTAATGCAAAACCAGAAGAAATCAGTACAAAGTGTAAG
CATGAATCTTCTCTCGGTTTCAATGGAATGAAAAGTAGTTTCACAAAATGTACGAAAGCTGCCGATAAGTATCCGAGCCATGTTCATCCGAAGTTGCCCGACTACGACGA
AATTGCAGCCAAATTCATAGCTCTGAAGAGAGAATATTCACAGAGAAAGCAGCACCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATCAAACGAGCTCTGTCCCGGCTTAAGATGCTCAAGAAGAAAAGATATTCAATCCTCAGGCAATTGCGTGAAGATTTGGCTGAGCTGATTAGTAATGGCTATCAACA
AATTGCCTTCAACAGGGTGGAGCAGCTTATTAGAGATGAAAGCCTTATGGAGGCGTACAATTTGATTGAGAATTTCTGTGAATTCATCCTCCAGAACTTCTCCCATGTCA
GAAAACACAAGACCTGCCCAGATGATGTTATTGAAGCAATTTCAAGTCTCATATTTGCCTCTGCAAGATGTGGGGACTTGCCTGAACTTAAATCGGTTCGGAAGCTCTTC
GAGGAGCGATTTGGTCGGAGTTTTGCCACGACTGCTGTCGAATTGTGCCCTGGAAATCTTGTGAATCCACAGATTAAAGAGAAGCTTCTTGTGAAGCCTGTTTCAGATCA
TGAGAAGCAAGGATTGATCAGTGAAATGGCTAGAGATTGCTTTCGTCCAGCAATTTTGGCTCTTGAATACTGTCCTGACTGGCATCAGAAACAGGTACTGGAGAATGGAG
ATCAAACTCATTGTGAGAGCAAAGAAGAACCAAACATTCAGGATTCAAATGCTGAAGAGTTCGAGAGAAAAGTAATGTGTGTCAGTCCATCACAGGAAGTGTGTAGCAGC
TCTGCCACTTTGGAAGATGCATCCTCTCCTGAATGCTCTCCATTTTGTGAAGAAACTCTTGTTTATCTCGATGATGTAGTTGAGCTTCCGAACCCCTCGATGGAAGAAGG
ATACTTGTGGGATCAGAGATTATTCAAGTTCAAATCACCTGTTACGGGAATAAGAGAAAACGTTCAATATGGCAATGGTCAAAGTCTCATAGAAGAACATCACGACAACT
CGAAGAAGAGATCGGTCTCGGAAAGATCTAGCCAGAGTATGAATAAATCTCCGAAAAGATCGATGAGAAGATCGACGCATCAAGAGGATGATAGTTTAAGTCATCCAAAG
AAGAAATTAATGAAGTGTTGCTGCCTATCATGCCATAGCTCCTGGTTGCCATCTGGAATGGAAAACTACTGCTTGGAGCAGCCAAGTTATGTGTATTCTGAAGTTTTGAT
ATCAGAGAAGAAGAAAGTTGATCCACATTTTATTTTAGATACTAGTTTGTCTGGCAAAGACCGCCATCGAGTCGAATTCGATACGCTCCCCAGGACAGAAAAGATAAGAA
ATTGTGAGGTTGGAACTATAGTTTATGATGTTTTTGTCTATTCTCATTGCCAGCCAGTTGAGAACAAGGAAACTAATGCAAAACCAGAAGAAATCAGTACAAAGTGTAAG
CATGAATCTTCTCTCGGTTTCAATGGAATGAAAAGTAGTTTCACAAAATGTACGAAAGCTGCCGATAAGTATCCGAGCCATGTTCATCCGAAGTTGCCCGACTACGACGA
AATTGCAGCCAAATTCATAGCTCTGAAGAGAGAATATTCACAGAGAAAGCAGCACCAATGA
Protein sequenceShow/hide protein sequence
MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYNLIENFCEFILQNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPELKSVRKLF
EERFGRSFATTAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEMARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSNAEEFERKVMCVSPSQEVCSS
SATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGIRENVQYGNGQSLIEEHHDNSKKRSVSERSSQSMNKSPKRSMRRSTHQEDDSLSHPK
KKLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDTSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCK
HESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ