; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g01730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g01730
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationchr4:1111982..1115413
RNA-Seq ExpressionMoc04g01730
SyntenyMoc04g01730
Gene Ontology termsGO:0015031 - protein transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645768.1 hypothetical protein Csa_020499 [Cucumis sativus]8.2e-25664.41Show/hide
Query:  PKPKYVMLSDS--QASLRPPPYRRNIPRYHSKSH---GGGCGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHT
        PKPKYVMLSD   QASLRPPPYRRN+PRYHSK++   GGG GCC KCICCCYCF+FFLIFAL G GYFL+ YY+PQ+PSYKVSDFSVHAF+VK DFSL+T
Subjt:  PKPKYVMLSDS--QASLRPPPYRRNIPRYHSKSH---GGGCGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHT

Query:  EFIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRK
        EFIVIV+ADNPN NI FVYGKDSSVSV+YSKS+LC+G+IPNFRQPSKNVTDI+ILLSGNSEFGSGLQEALMQNRHSGKIPL V+VKVPVTVV+G  S +K
Subjt:  EFIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRK

Query:  ITVLVNCSLVVD-------------------NFKKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEF
        + V VNCSLVVD                   +FKK+IK AL RLK+L+KKRYSI++ LREDL ELI+NGYQQIAF RVEQLI+DE+LMEAYDLIENFCE 
Subjt:  ITVLVNCSLVVD-------------------NFKKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEF

Query:  ILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAI
        IL+ FSH+RKHKTCPDD+ EAISSLIFASAR GD PELK VRKLFEERFG+SF   AVEL PGNLVN QIKEKL+++PV +HEKQ LI+EIARDCF PA+
Subjt:  ILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAI

Query:  LALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSNAEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVEL
        LALE CPDWH+KQVL+NGDQT  + K E       +EE ER V+                C S S     S +   DAS+ E  PFCEE +VY DDVVEL
Subjt:  LALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSNAEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVEL

Query:  PNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGLIEEHHNNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKRLMKCCCLSCHSSWL
         +PS E G L DQR FKFKS +T  R ENV  G+ Q LIE H+ ++KK +VS RS+Q +N SPK   RRS    QE+ SL+H KK+LMKC CLSCHS  L
Subjt:  PNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGLIEEHHNNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKRLMKCCCLSCHSSWL

Query:  PSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKC
         S ++NYC+EQ  YV+SE       +  SE    D      N  + +    +EF T  R +  RN  +GT+VYDVFVYS CQP ENKETN K +E+ST  
Subjt:  PSGMENYCLEQPSYVYSE-------VLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKC

Query:  KHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQR
        KHE         S FTKC K ADKYPSHVHPKLP+Y+EIAAKFI LKREY +R
Subjt:  KHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQR

KAG6572356.1 NDR1/HIN1-like protein 6, partial [Cucurbita argyrosperma subsp. sororia]8.1e-17174.46Show/hide
Query:  PKPKYVMLSDS-QASLRPPPYRRNIPRYHSKSHGGG---CGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHTE
        PKPKYVMLSD+ Q SLRPPPYRR++PRYHSK+HGGG    GCC KCICCCYCFL FLIF+L G  Y L+SYY+PQIPSYKVS F+V  FDVKPDFSL+TE
Subjt:  PKPKYVMLSDS-QASLRPPPYRRNIPRYHSKSHGGG---CGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHTE

Query:  FIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRKI
        FIVIV ADNPN NI+FVYGK S VSVLY++S LC G IPNFRQPSKNVTDI+I L+G SEFGSG QEALMQNRHSG+IPL V VKVPVT+V+G  S +K+
Subjt:  FIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRKI

Query:  TVLVNCSLVVDN-------------------FKKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFI
         V VNCSL+VDN                   FKK+I++AL RLKMLKKKRYSI RQLREDL ELISNGYQQIAFNRVEQL+RDESLMEAYDLIENFCEFI
Subjt:  TVLVNCSLVVDN-------------------FKKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFI

Query:  LLNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAIL
        L+ FSHVRK+KTCPDD++EAISSLIFASARCGD PELK VRK FEE +G+ FA  AVEL PGNLVNPQIKEKL++K VSD EKQ L++EIARDCF PAIL
Subjt:  LLNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAIL

Query:  ALEYCPDWHQKQVLE
        ALEYCPDW+Q+  L+
Subjt:  ALEYCPDWHQKQVLE

XP_016901704.1 PREDICTED: uncharacterized protein LOC103495691 [Cucumis melo]1.5e-14559.22Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        KK+IK AL RLK+LKKKRYSI++ LREDL ELI+NGYQQIAF RVEQL++DE+LME YDLIEN CEFIL+ FSHVRKHKTCPDDVIEAISSLIFASAR G
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-----
        D PELK VR LFEERFG+SFA  AVEL PGNLVN QIKEKL+++ VS+HEKQ LI+EIARDCF PA+LALEY PD  QKQVL+N DQT  E K E     
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-----

Query:  ------PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGL
               + +DSN    +      S S  VC S     DASS E  PFCEE +VY DDVVEL +PS E G L DQR FKFK  +T  R ENV+  + Q L
Subjt:  ------PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGL

Query:  IEEHHNNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHF
        IE+ H+ S K+SVS RS+Q +N  PK   R+     QE+ SL+H KK+  KCCCLSCHS  L   ++NY LEQ  YVYSE       +  SE  ++D   
Subjt:  IEEHHNNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHF

Query:  ILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDE
           +  + +    +EF T  R ++ RN  +GT+VYDVFVYSHCQP ENKETN K EE      HE         S FTKC K A KYPSHVHPKLPDYDE
Subjt:  ILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDE

Query:  IAAKFIALKREYSQR
        IA +FI+LKREY QR
Subjt:  IAAKFIALKREYSQR

XP_022147829.1 uncharacterized protein LOC111016673 [Momordica charantia]2.4e-287100Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
        PELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
Subjt:  PELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN

Query:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLIEEHHNNSKKRSVS
        AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLIEEHHNNSKKRSVS
Subjt:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLIEEHHNNSKKRSVS

Query:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTE
        ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTE
Subjt:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTE

Query:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
        KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
Subjt:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ

XP_038888606.1 uncharacterized protein LOC120078409 [Benincasa hispida]1.7e-16862.38Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHK------TCPDDVIEAISSLIF
        KK+I+ AL RL+MLKKKRYSI++QLREDL EL++NGYQQIAF RVEQLI+DE LMEAYDLIENFCEFIL+ FSH++KHK      TCPDD+IEAISSLIF
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHK------TCPDDVIEAISSLIF

Query:  ASARCGDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKE
        ASARCGD PELKSVRKLFE+RFGRSFA  AVELCPGNLVN QIKEKLL+KPVSDHEKQ  I++IARDCF P+ILALEY PDWHQKQV +N D+T+ E KE
Subjt:  ASARCGDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKE

Query:  EP-----------NIQDSNAEEFERKVMCVSPSQEVCSSSAT---------LEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVT
        EP           + +D+N +  + +    S S  VC SS             DASSPE  PFC+E++VYLD++VEL + SME G   DQR FKFKS VT
Subjt:  EP-----------NIQDSNAEEFERKVMCVSPSQEVCSSSAT---------LEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVT

Query:  GMRENVQYGNGQGLIEEHHNNSKKRSVSERSSQSMNKSPK--RSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE------
         M ENV+ GN Q LIE+ H++S  + VS RS+Q +  SPK  R +     QE+ SL++ KK+ MKCCCLSCHS  L S ++NYCLEQP YVYSE      
Subjt:  GMRENVQYGNGQGLIEEHHNNSKKRSVSERSSQSMNKSPK--RSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE------

Query:  -VLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADK
         V  S  K  D      +S +      +EFDT  RT+K  N   GT+VYDVFVYSHCQPVENKETNAKPEE+ T  K+E+S+GFNG+++ FTKC K ADK
Subjt:  -VLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADK

Query:  YPSHVHPKLPDYDEIAAKFIALKREYSQR
        YPSHVHPKLPDYDEIAAKFIALKREY Q+
Subjt:  YPSHVHPKLPDYDEIAAKFIALKREYSQR

TrEMBL top hitse value%identityAlignment
A0A0A0K3R2 Uncharacterized protein4.4e-15459.81Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        KK+IK AL RLK+L+KKRYSI++ LREDL ELI+NGYQQIAF RVEQLI+DE+LMEAYDLIENFCE IL+ FSH+RKHKTCPDD+ EAISSLIFASAR G
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQD
        D PELK VRKLFEERFG+SF   AVEL PGNLVN QIKEKL+++PV +HEKQ LI+EIARDCF PA+LALE CPDWH+KQVL+NGDQT  + K E     
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQD

Query:  SNAEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYG
          +EE ER V+                C S S     S +   DAS+ E  PFCEE +VY DDVVEL +PS E G L DQR FKFKS +T  R ENV  G
Subjt:  SNAEEFERKVM----------------CVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYG

Query:  NGQGLIEEHHNNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKK
        + Q LIE H+ ++KK +VS RS+Q +N SPK   RRS    QE+ SL+H KK+LMKC CLSCHS  L S ++NYC+EQ  YV+SE       +  SE   
Subjt:  NGQGLIEEHHNNSKKRSVSERSSQSMNKSPKRSMRRS--THQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKK

Query:  VDPHFILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKL
         D      N  + +    +EF T  R +  RN  +GT+VYDVFVYS CQP ENKETN K +E+ST  KHE         S FTKC K ADKYPSHVHPKL
Subjt:  VDPHFILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKL

Query:  PDYDEIAAKFIALKREYSQR
        P+Y+EIAAKFI LKREY +R
Subjt:  PDYDEIAAKFIALKREYSQR

A0A1S4E0G2 uncharacterized protein LOC1034956917.5e-14659.22Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        KK+IK AL RLK+LKKKRYSI++ LREDL ELI+NGYQQIAF RVEQL++DE+LME YDLIEN CEFIL+ FSHVRKHKTCPDDVIEAISSLIFASAR G
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-----
        D PELK VR LFEERFG+SFA  AVEL PGNLVN QIKEKL+++ VS+HEKQ LI+EIARDCF PA+LALEY PD  QKQVL+N DQT  E K E     
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-----

Query:  ------PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGL
               + +DSN    +      S S  VC S     DASS E  PFCEE +VY DDVVEL +PS E G L DQR FKFK  +T  R ENV+  + Q L
Subjt:  ------PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGL

Query:  IEEHHNNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHF
        IE+ H+ S K+SVS RS+Q +N  PK   R+     QE+ SL+H KK+  KCCCLSCHS  L   ++NY LEQ  YVYSE       +  SE  ++D   
Subjt:  IEEHHNNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHF

Query:  ILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDE
           +  + +    +EF T  R ++ RN  +GT+VYDVFVYSHCQP ENKETN K EE      HE         S FTKC K A KYPSHVHPKLPDYDE
Subjt:  ILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDE

Query:  IAAKFIALKREYSQR
        IA +FI+LKREY QR
Subjt:  IAAKFIALKREYSQR

A0A5D3C6Z3 Ist1 domain-containing protein1.7e-14559.22Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        KK+IK AL RLK+LKKKRYSI++ LREDL ELI+NGYQQIAF RVEQL++DE+LME YDLIEN CEFIL+ FSHVRKHKTCPDDVIEAISSLIFASAR G
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-----
        D PELK VR LFEERFG+SFA  AVEL PGNLVN QIKEKL+++ VS+HEKQ LI+EIARDCF PA+LALEY PD  QKQVL N DQT  E K E     
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEE-----

Query:  ------PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGL
               + +DSN    +      S S  VC S     DASS E  PFCEE +VY DDVVEL +PS E G L DQR FKFK  +T  R ENV+  + Q L
Subjt:  ------PNIQDSNAEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMR-ENVQYGNGQGL

Query:  IEEHHNNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHF
        IE+ H+ S K+SVS RS+Q +N  PK   R+     QE+ SL+H KK+  KCCCLSCHS  L   ++NY LEQ  YVYSE       +  SE  ++D   
Subjt:  IEEHHNNSKKRSVSERSSQSMNKSPKRSMRR--STHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSE-------VLISEKKKVDPHF

Query:  ILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDE
           +  + +    +EF T  R ++ RN  +GT+VYDVFVYSHCQP ENKETN K EE      HE         S FTKC K A KYPSHVHPKLPDYDE
Subjt:  ILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDE

Query:  IAAKFIALKREYSQR
        IA +FI+LKREY QR
Subjt:  IAAKFIALKREYSQR

A0A6J1D1Q8 NDR1/HIN1-like protein 61.0e-118100Show/hide
Query:  MDSGHPKPKYVMLSDSQASLRPPPYRRNIPRYHSKSHGGGCGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHT
        MDSGHPKPKYVMLSDSQASLRPPPYRRNIPRYHSKSHGGGCGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHT
Subjt:  MDSGHPKPKYVMLSDSQASLRPPPYRRNIPRYHSKSHGGGCGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHT

Query:  EFIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRK
        EFIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRK
Subjt:  EFIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRK

Query:  ITVLVNCSLVVDN
        ITVLVNCSLVVDN
Subjt:  ITVLVNCSLVVDN

A0A6J1D262 uncharacterized protein LOC1110166731.1e-287100Show/hide
Query:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
        MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDL
Subjt:  MIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDL

Query:  PELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
        PELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN
Subjt:  PELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSN

Query:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLIEEHHNNSKKRSVS
        AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLIEEHHNNSKKRSVS
Subjt:  AEEFERKVMCVSPSQEVCSSSATLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLIEEHHNNSKKRSVS

Query:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTE
        ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTE
Subjt:  ERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTE

Query:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
        KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ
Subjt:  KIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESSLGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog1.8e-1127.16Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL
        ++  ++RLK+L+KK+  + ++ R+++A+ ++ G  + A  RVE +IR++ L+EA +++E +C+ +L  F  ++  K     + E++S+LI+A+ R   ++
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL

Query:  PELKSVRKLFEERFGRSFAATAVELCPGN---LVNPQIKEKLLVKPVSDHEKQGLISEIARD
         ELK V      ++ + +     +LC  N    VN ++  KL V+       +  + EIA++
Subjt:  PELKSVRKLFEERFGRSFAATAVELCPGN---LVNPQIKEKLLVKPVSDHEKQGLISEIARD

Q54I39 IST1-like protein2.1e-1231.91Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        K  +K A+SR+++LK K+ +I+R  + ++AEL+    ++ A  RVE +IRDE L+E + +IE  CE +    + +      P ++ E+I +L+++S R  
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKL
         +PEL+ ++   + ++G+     A   C  + VNP+I  KL
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKL

Q5R6G8 IST1 homolog1.8e-1127.16Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL
        ++  ++RLK+L+KK+  + ++ R+++A+ ++ G  + A  RVE +IR++ L+EA +++E +C+ +L  F  ++  K     + E++S+LI+A+ R   ++
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL

Query:  PELKSVRKLFEERFGRSFAATAVELCPGN---LVNPQIKEKLLVKPVSDHEKQGLISEIARD
         ELK V      ++ + +     +LC  N    VN ++  KL V+       +  + EIA++
Subjt:  PELKSVRKLFEERFGRSFAATAVELCPGN---LVNPQIKEKLLVKPVSDHEKQGLISEIARD

Q9CX00 IST1 homolog1.8e-1127.16Show/hide
Query:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL
        ++  ++RLK+L+KK+  + ++ R+++A+ ++ G  + A  RVE +IR++ L+EA +++E +C+ +L  F  ++  K     + E++S+LI+A+ R   ++
Subjt:  IKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARC-GDL

Query:  PELKSVRKLFEERFGRSFAATAVELCPGN---LVNPQIKEKLLVKPVSDHEKQGLISEIARD
         ELK V      ++ + +     +LC  N    VN ++  KL V+       +  + EIA++
Subjt:  PELKSVRKLFEERFGRSFAATAVELCPGN---LVNPQIKEKLLVKPVSDHEKQGLISEIARD

Q9ZVD2 NDR1/HIN1-like protein 132.8e-1729.11Show/hide
Query:  PKPKYVMLSDSQASLRPPPYRRNIPRY----HSKSHGGGCGCCFKCICCCYCFLFFLIFALIGFGY-FLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHT
        P    + +   Q    PPP   N  R+      K++   C CCF    C +    F++  L G  +  L+  Y P+ P Y +  FSV   ++     +  
Subjt:  PKPKYVMLSDSQASLRPPPYRRNIPRY----HSKSHGGGCGCCFKCICCCYCFLFFLIFALIGFGY-FLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHT

Query:  EFIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNS-EFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFR
         F V VR+ N N  I   Y K+SSV V Y+   +  G +P F QP+KNVT + ++LSG+  +  SG+++ +        +P  +++K PV +  G     
Subjt:  EFIVIVRADNPNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNS-EFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFR

Query:  KITVLVNCSLVVD
         + V V+C + VD
Subjt:  KITVLVNCSLVVD

Arabidopsis top hitse value%identityAlignment
AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein3.1e-2737.89Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        K ++K  + R+K+++ +R + ++Q+R ++A+L+  G +  A  RVE +IR+E +M A +++E FCE I +    +   + CP D+ EAISS+ FA+ RC 
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARD
        DL EL+ V+ LF  ++G+ F A A EL P + VN ++ E L V+  S   K  L+ EIA +
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARD

AT1G51900.1 Regulator of Vps4 activity in the MVB pathway protein8.2e-2835.06Show/hide
Query:  KMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHK--TCPDDVIEAISSLIFASARC
        K++K+  SRL +LK ++Y+  R LR D+ + I +   + A  R EQL+  E+ +  Y  +  F +FILL FS  +KH      DD  EA+SSLIFAS +C
Subjt:  KMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHK--TCPDDVIEAISSLIFASARC

Query:  GDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKL-LVKPVSDHEKQGLISEIARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPN
         ++PEL  + +L  +R+G+ +  TA+++ PGNLVN +IKEKL     VS+ +K  ++ EIA++  +R  IL L Y      K  ++N         EE N
Subjt:  GDLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKL-LVKPVSDHEKQGLISEIARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPN

Query:  IQDSNAEEFERKVMCVSPSQEVCSSSATLED
        + D +  E  +   C++   E      +++D
Subjt:  IQDSNAEEFERKVMCVSPSQEVCSSSATLED

AT1G51900.1 Regulator of Vps4 activity in the MVB pathway protein9.1e-0360.71Show/hide
Query:  HVHPKLPDYDEIAAKFIALKREYSQRKQ
        HVHPKLPDYD+IA KF  LK    +R++
Subjt:  HVHPKLPDYDEIAAKFIALKREYSQRKQ

AT2G14830.1 Regulator of Vps4 activity in the MVB pathway protein6.4e-4134.29Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        K ++K+   RL +LK K+Y+I   LR D+A+L+  G +  A +R +QL  DE+LM  Y L+ +F + ILLN S++R+ +  PD + EA+S+L+FASARCG
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQ
        DLPEL+++R LF +R+G  F  TA+ L PGN VNPQ+ EKL +  VSD  K  L+ EI  +   R  +LA+EY P++H KQVL+                
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDC-FRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQ

Query:  DSNAEEFERKVMCVSPSQEVCSSSA----------TLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLI
         S   E E++VM  + +Q  CSS            TL DA   E      +     DD +E      EE    DQ +F+F+                   
Subjt:  DSNAEEFERKVMCVSPSQEVCSSSA----------TLEDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLI

Query:  EEHHNNSKKRSVSERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCC
        E+     K+     RS+ S +    + +    +      S  +K   KCC
Subjt:  EEHHNNSKKRSVSERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLMKCC

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein2.2e-2537.27Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        K  ++ A SRLK+LK K+   ++QLR +LA+L+ +G    A  RVE ++R+E  + AY+LI  +CE +++    +   K CP D+ EA++S++FAS R  
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARD
        D+PEL  + K F  ++G+ F+ +AVEL P + V+  + EKL  K      K  ++  IA +
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARD

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein3.1e-2736.65Show/hide
Query:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG
        K   K A++R+K+++ KR  +++Q+R D+A L+ +G    A  RVE +IR++++  A ++IE FCE I+   + + K K CP D+ E I+SLIFA+ RC 
Subjt:  KKMIKRALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCG

Query:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARD
        ++PEL  +R +F +++G+ F + A +L P   VN  + +KL V+      K  ++ EIA++
Subjt:  DLPELKSVRKLFEERFGRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCGGTCATCCTAAACCCAAATATGTTATGCTCTCCGACAGCCAGGCCAGCCTCCGCCCGCCGCCGTACCGCCGCAACATCCCCCGCTACCACTCCAAGTCCCA
CGGCGGCGGCTGCGGCTGCTGCTTCAAATGCATCTGCTGCTGCTACTGCTTCCTCTTCTTCCTAATCTTCGCCCTCATCGGATTTGGGTATTTTCTCTGGTCTTATTACT
CGCCCCAAATCCCTTCCTACAAGGTCAGCGATTTCAGCGTCCACGCCTTCGATGTGAAGCCCGATTTCAGCCTCCACACTGAATTCATCGTCATCGTCAGAGCCGACAAC
CCCAACAACAACATCGAATTCGTCTACGGCAAGGACAGCTCTGTTTCCGTCCTCTACTCCAAATCCCAGCTCTGCGCCGGCAAGATTCCAAATTTCCGGCAGCCCTCCAA
GAATGTCACCGATATCAACATTTTGCTGAGCGGGAACAGCGAATTCGGATCTGGGCTGCAGGAGGCGCTCATGCAGAACCGGCACAGCGGCAAAATTCCTCTGTTCGTCC
AGGTCAAGGTGCCGGTCACGGTGGTCGTCGGAGGATTCTCGTTCAGGAAGATCACGGTGTTGGTGAATTGTTCGTTGGTGGTGGATAATTTCAAGAAGATGATCAAACGA
GCTCTGTCCCGGCTTAAGATGCTCAAGAAGAAAAGATATTCAATCCTCAGGCAATTGCGTGAAGATTTGGCTGAGCTGATTAGTAATGGCTATCAACAAATTGCCTTCAA
CAGGGTGGAGCAGCTTATTAGAGATGAAAGCCTTATGGAGGCGTACGATTTGATTGAGAATTTCTGTGAATTCATCCTCCTGAACTTCTCCCATGTCAGAAAACACAAGA
CCTGCCCAGATGATGTTATTGAAGCAATTTCAAGTCTCATATTTGCCTCTGCAAGATGTGGGGACTTGCCTGAACTTAAATCGGTTCGGAAGCTCTTCGAGGAGCGATTT
GGTCGGAGTTTCGCCGCGACTGCTGTCGAATTGTGCCCTGGAAATCTTGTGAATCCACAGATTAAAGAGAAGCTTCTTGTGAAGCCTGTTTCAGATCATGAGAAGCAAGG
ATTGATCAGTGAAATAGCTAGAGATTGCTTTCGTCCAGCAATTTTGGCTCTTGAATACTGTCCTGACTGGCATCAGAAACAGGTACTGGAGAATGGAGATCAAACTCATT
GTGAGAGCAAAGAAGAACCAAACATTCAGGATTCAAATGCTGAAGAGTTCGAGAGAAAAGTAATGTGTGTCAGTCCATCACAGGAAGTGTGTAGCAGCTCTGCCACTTTG
GAAGATGCATCCTCTCCTGAATGCTCTCCATTTTGTGAAGAAACTCTTGTTTATCTCGATGATGTAGTTGAGCTTCCGAACCCCTCGATGGAAGAAGGATACTTGTGGGA
TCAGAGATTATTCAAGTTCAAATCACCTGTTACGGGAATGAGAGAAAACGTTCAATATGGCAATGGTCAAGGTCTCATAGAAGAACATCACAACAACTCGAAGAAGAGAT
CAGTCTCGGAAAGATCTAGCCAGAGTATGAATAAATCTCCGAAAAGATCGATGAGAAGATCGACGCATCAAGAGGATGATAGTTTAAGTCATCCAAAGAAGAGATTAATG
AAGTGTTGCTGCCTATCATGCCATAGCTCCTGGTTGCCATCTGGAATGGAAAACTACTGCTTGGAGCAGCCAAGTTATGTGTATTCTGAAGTTTTGATATCAGAGAAGAA
GAAAGTTGATCCACATTTTATTTTAGATAATAGTTTGTCTGGCAAAGACCGCCATCGAGTCGAATTCGATACGCTCCCCAGGACAGAAAAGATAAGAAATTGTGAGGTTG
GAACTATAGTTTATGATGTTTTTGTCTATTCTCATTGCCAGCCAGTTGAGAACAAGGAAACTAATGCAAAACCAGAAGAAATCAGTACAAAGTGTAAGCATGAATCTTCT
CTCGGTTTCAATGGAATGAAAAGTAGTTTCACAAAATGTACGAAAGCTGCCGATAAGTATCCGAGCCATGTTCATCCGAAGTTGCCCGACTACGACGAAATCGCAGCCAA
ATTCATAGCTCTGAAGAGAGAATATTCACAGAGAAAGCAGCACCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCGGTCATCCTAAACCCAAATATGTTATGCTCTCCGACAGCCAGGCCAGCCTCCGCCCGCCGCCGTACCGCCGCAACATCCCCCGCTACCACTCCAAGTCCCA
CGGCGGCGGCTGCGGCTGCTGCTTCAAATGCATCTGCTGCTGCTACTGCTTCCTCTTCTTCCTAATCTTCGCCCTCATCGGATTTGGGTATTTTCTCTGGTCTTATTACT
CGCCCCAAATCCCTTCCTACAAGGTCAGCGATTTCAGCGTCCACGCCTTCGATGTGAAGCCCGATTTCAGCCTCCACACTGAATTCATCGTCATCGTCAGAGCCGACAAC
CCCAACAACAACATCGAATTCGTCTACGGCAAGGACAGCTCTGTTTCCGTCCTCTACTCCAAATCCCAGCTCTGCGCCGGCAAGATTCCAAATTTCCGGCAGCCCTCCAA
GAATGTCACCGATATCAACATTTTGCTGAGCGGGAACAGCGAATTCGGATCTGGGCTGCAGGAGGCGCTCATGCAGAACCGGCACAGCGGCAAAATTCCTCTGTTCGTCC
AGGTCAAGGTGCCGGTCACGGTGGTCGTCGGAGGATTCTCGTTCAGGAAGATCACGGTGTTGGTGAATTGTTCGTTGGTGGTGGATAATTTCAAGAAGATGATCAAACGA
GCTCTGTCCCGGCTTAAGATGCTCAAGAAGAAAAGATATTCAATCCTCAGGCAATTGCGTGAAGATTTGGCTGAGCTGATTAGTAATGGCTATCAACAAATTGCCTTCAA
CAGGGTGGAGCAGCTTATTAGAGATGAAAGCCTTATGGAGGCGTACGATTTGATTGAGAATTTCTGTGAATTCATCCTCCTGAACTTCTCCCATGTCAGAAAACACAAGA
CCTGCCCAGATGATGTTATTGAAGCAATTTCAAGTCTCATATTTGCCTCTGCAAGATGTGGGGACTTGCCTGAACTTAAATCGGTTCGGAAGCTCTTCGAGGAGCGATTT
GGTCGGAGTTTCGCCGCGACTGCTGTCGAATTGTGCCCTGGAAATCTTGTGAATCCACAGATTAAAGAGAAGCTTCTTGTGAAGCCTGTTTCAGATCATGAGAAGCAAGG
ATTGATCAGTGAAATAGCTAGAGATTGCTTTCGTCCAGCAATTTTGGCTCTTGAATACTGTCCTGACTGGCATCAGAAACAGGTACTGGAGAATGGAGATCAAACTCATT
GTGAGAGCAAAGAAGAACCAAACATTCAGGATTCAAATGCTGAAGAGTTCGAGAGAAAAGTAATGTGTGTCAGTCCATCACAGGAAGTGTGTAGCAGCTCTGCCACTTTG
GAAGATGCATCCTCTCCTGAATGCTCTCCATTTTGTGAAGAAACTCTTGTTTATCTCGATGATGTAGTTGAGCTTCCGAACCCCTCGATGGAAGAAGGATACTTGTGGGA
TCAGAGATTATTCAAGTTCAAATCACCTGTTACGGGAATGAGAGAAAACGTTCAATATGGCAATGGTCAAGGTCTCATAGAAGAACATCACAACAACTCGAAGAAGAGAT
CAGTCTCGGAAAGATCTAGCCAGAGTATGAATAAATCTCCGAAAAGATCGATGAGAAGATCGACGCATCAAGAGGATGATAGTTTAAGTCATCCAAAGAAGAGATTAATG
AAGTGTTGCTGCCTATCATGCCATAGCTCCTGGTTGCCATCTGGAATGGAAAACTACTGCTTGGAGCAGCCAAGTTATGTGTATTCTGAAGTTTTGATATCAGAGAAGAA
GAAAGTTGATCCACATTTTATTTTAGATAATAGTTTGTCTGGCAAAGACCGCCATCGAGTCGAATTCGATACGCTCCCCAGGACAGAAAAGATAAGAAATTGTGAGGTTG
GAACTATAGTTTATGATGTTTTTGTCTATTCTCATTGCCAGCCAGTTGAGAACAAGGAAACTAATGCAAAACCAGAAGAAATCAGTACAAAGTGTAAGCATGAATCTTCT
CTCGGTTTCAATGGAATGAAAAGTAGTTTCACAAAATGTACGAAAGCTGCCGATAAGTATCCGAGCCATGTTCATCCGAAGTTGCCCGACTACGACGAAATCGCAGCCAA
ATTCATAGCTCTGAAGAGAGAATATTCACAGAGAAAGCAGCACCAATGA
Protein sequenceShow/hide protein sequence
MDSGHPKPKYVMLSDSQASLRPPPYRRNIPRYHSKSHGGGCGCCFKCICCCYCFLFFLIFALIGFGYFLWSYYSPQIPSYKVSDFSVHAFDVKPDFSLHTEFIVIVRADN
PNNNIEFVYGKDSSVSVLYSKSQLCAGKIPNFRQPSKNVTDINILLSGNSEFGSGLQEALMQNRHSGKIPLFVQVKVPVTVVVGGFSFRKITVLVNCSLVVDNFKKMIKR
ALSRLKMLKKKRYSILRQLREDLAELISNGYQQIAFNRVEQLIRDESLMEAYDLIENFCEFILLNFSHVRKHKTCPDDVIEAISSLIFASARCGDLPELKSVRKLFEERF
GRSFAATAVELCPGNLVNPQIKEKLLVKPVSDHEKQGLISEIARDCFRPAILALEYCPDWHQKQVLENGDQTHCESKEEPNIQDSNAEEFERKVMCVSPSQEVCSSSATL
EDASSPECSPFCEETLVYLDDVVELPNPSMEEGYLWDQRLFKFKSPVTGMRENVQYGNGQGLIEEHHNNSKKRSVSERSSQSMNKSPKRSMRRSTHQEDDSLSHPKKRLM
KCCCLSCHSSWLPSGMENYCLEQPSYVYSEVLISEKKKVDPHFILDNSLSGKDRHRVEFDTLPRTEKIRNCEVGTIVYDVFVYSHCQPVENKETNAKPEEISTKCKHESS
LGFNGMKSSFTKCTKAADKYPSHVHPKLPDYDEIAAKFIALKREYSQRKQHQ