; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010652 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010652
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationscaffold35:1197375..1199480
RNA-Seq ExpressionMS010652
SyntenyMS010652
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134834.1 IST1-like protein isoform X2 [Cucumis sativus]6.1e-18284.51Show/hide
Query:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
        +AATEAAARSLKIVK FI +LR  FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
Subjt:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA

Query:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE
        RLSIIAKQR+CP DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAA DLRPNCGVNRLLIDKLS+RTP+G+VKLKIMKEIAKEH+IEWDTTE
Subjt:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE

Query:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE
        SEKELLKP EELIEGPRTFVSAASLPVKP+ + S  D AQI RTTNSRE+++ HFQD+ASAAEAAAKAAKQAIAAA+AAAYLANKD NR   G  GF L 
Subjt:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE

Query:  F-EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPV
        F  G P NS+PT S+NM NHQFKAGE+ T P QS GRCSSLK NEETRNVNT+Y+ AYRR+SYNPTDIKFDESDCEEET+M+DE+    ++PPDRNPPP 
Subjt:  F-EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPV

Query:  PSSRVHPKLPDYDTLAARFEALKYRK
        PSSRVHPKLPDYDTLAARFEALKYRK
Subjt:  PSSRVHPKLPDYDTLAARFEALKYRK

XP_008440890.1 PREDICTED: IST1-like protein [Cucumis melo]1.0e-18183.73Show/hide
Query:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
        +AATEAAARSLKIVK FI +LR  FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANE+IELFCELVVA
Subjt:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA

Query:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE
        RLSIIAKQR+CP DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAA DLRPNCGVNRLLIDKLS+RTP+G+VKLKIMKEIAKEH+IEWDTTE
Subjt:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE

Query:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEF
        SEKELLKPPEELIEGPRTFVSAASLPVKP+ + S  D AQI RTTN      HFQDTASAAEAAAKAAKQAIAAA+AAAYLANKD N+  R   GF L F
Subjt:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEF

Query:  EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVPS
         G P NS+   S+NMD HQFKAGE+ T P QS GRC SLK NEETRNVNT+Y+ AYRR+SYNPTDIKFDESDCEEET+MDDE+    ++PPDRNPPP PS
Subjt:  EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVPS

Query:  SRVHPKLPDYDTLAARFEALKYRK
        SRVHPKLPDYDTLAARFEALKYRK
Subjt:  SRVHPKLPDYDTLAARFEALKYRK

XP_022132738.1 IST1-like protein [Momordica charantia]9.6e-22899.3Show/hide
Query:  TAAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVV
        TAAATEAAARSLKIVKLFIALLRRDFN SKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLA NEIIELFCELVV
Subjt:  TAAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVV

Query:  ARLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTT
        ARLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTT
Subjt:  ARLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTT

Query:  ESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE
        ESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE
Subjt:  ESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE

Query:  FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP
        FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP
Subjt:  FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP

Query:  SSRVHPKLPDYDTLAARFEALKYRKI
        SSRVHPKLPDYDTLAARFEALKY+KI
Subjt:  SSRVHPKLPDYDTLAARFEALKYRKI

XP_031743525.1 IST1-like protein isoform X1 [Cucumis sativus]8.8e-18184.31Show/hide
Query:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
        +AATEAAARSLKIVK FI +LR  FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
Subjt:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA

Query:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE
        RLSIIAKQR+CP DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAA DLRPNCGVNRLLIDKLS+RTP+G+VKLKIMKEIAKEH+IEWDTTE
Subjt:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE

Query:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQI-GRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGL
        SEKELLKP EELIEGPRTFVSAASLPVKP+ + S  D AQI  RTTNSRE+++ HFQD+ASAAEAAAKAAKQAIAAA+AAAYLANKD NR   G  GF L
Subjt:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQI-GRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGL

Query:  EF-EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPP
         F  G P NS+PT S+NM NHQFKAGE+ T P QS GRCSSLK NEETRNVNT+Y+ AYRR+SYNPTDIKFDESDCEEET+M+DE+    ++PPDRNPPP
Subjt:  EF-EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPP

Query:  VPSSRVHPKLPDYDTLAARFEALKYRK
         PSSRVHPKLPDYDTLAARFEALKYRK
Subjt:  VPSSRVHPKLPDYDTLAARFEALKYRK

XP_038882989.1 IST1-like protein [Benincasa hispida]3.1e-18685.41Show/hide
Query:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
        +AATEAAARS+KIVK FI LLRR FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
Subjt:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA

Query:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE
        RLSIIAKQR+CP DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAA DLRPNCGVNRLLIDKLS+RTP+GQVKLKIMKEIAKEH+IEWDTTE
Subjt:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE

Query:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE
        SEKELLKPPEELIEGPRTFVSAASLPVKP+ + SA D AQ  RT NSRED++ HFQD+ASAAEAAAKAAKQAIAAA+AAAYLANKD NR  +   GFGL 
Subjt:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE

Query:  FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP
        F G P NS+PTNSHNMD H+FK GEE T P QS GRCSSLK NE+T NVNT+Y+ AYRR+SYNPTDIKFDESDCEEETQMD+ +    ++PPDRNPPPVP
Subjt:  FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP

Query:  SSRVHPKLPDYDTLAARFEALKYRK
        SSRVHPKLPDYDTLAARFEALKYRK
Subjt:  SSRVHPKLPDYDTLAARFEALKYRK

TrEMBL top hitse value%identityAlignment
A0A0A0KKW3 Uncharacterized protein3.0e-18284.51Show/hide
Query:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
        +AATEAAARSLKIVK FI +LR  FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
Subjt:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA

Query:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE
        RLSIIAKQR+CP DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAA DLRPNCGVNRLLIDKLS+RTP+G+VKLKIMKEIAKEH+IEWDTTE
Subjt:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE

Query:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE
        SEKELLKP EELIEGPRTFVSAASLPVKP+ + S  D AQI RTTNSRE+++ HFQD+ASAAEAAAKAAKQAIAAA+AAAYLANKD NR   G  GF L 
Subjt:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE

Query:  F-EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPV
        F  G P NS+PT S+NM NHQFKAGE+ T P QS GRCSSLK NEETRNVNT+Y+ AYRR+SYNPTDIKFDESDCEEET+M+DE+    ++PPDRNPPP 
Subjt:  F-EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPV

Query:  PSSRVHPKLPDYDTLAARFEALKYRK
        PSSRVHPKLPDYDTLAARFEALKYRK
Subjt:  PSSRVHPKLPDYDTLAARFEALKYRK

A0A1S3B2X8 IST1-like protein5.0e-18283.73Show/hide
Query:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
        +AATEAAARSLKIVK FI +LR  FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANE+IELFCELVVA
Subjt:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA

Query:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE
        RLSIIAKQR+CP DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAA DLRPNCGVNRLLIDKLS+RTP+G+VKLKIMKEIAKEH+IEWDTTE
Subjt:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE

Query:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEF
        SEKELLKPPEELIEGPRTFVSAASLPVKP+ + S  D AQI RTTN      HFQDTASAAEAAAKAAKQAIAAA+AAAYLANKD N+  R   GF L F
Subjt:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEF

Query:  EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVPS
         G P NS+   S+NMD HQFKAGE+ T P QS GRC SLK NEETRNVNT+Y+ AYRR+SYNPTDIKFDESDCEEET+MDDE+    ++PPDRNPPP PS
Subjt:  EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVPS

Query:  SRVHPKLPDYDTLAARFEALKYRK
        SRVHPKLPDYDTLAARFEALKYRK
Subjt:  SRVHPKLPDYDTLAARFEALKYRK

A0A5A7SMF5 IST1-like protein5.0e-18283.73Show/hide
Query:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA
        +AATEAAARSLKIVK FI +LR  FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANE+IELFCELVVA
Subjt:  AAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVA

Query:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE
        RLSIIAKQR+CP DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAA DLRPNCGVNRLLIDKLS+RTP+G+VKLKIMKEIAKEH+IEWDTTE
Subjt:  RLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTE

Query:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEF
        SEKELLKPPEELIEGPRTFVSAASLPVKP+ + S  D AQI RTTN      HFQDTASAAEAAAKAAKQAIAAA+AAAYLANKD N+  R   GF L F
Subjt:  SEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEF

Query:  EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVPS
         G P NS+   S+NMD HQFKAGE+ T P QS GRC SLK NEETRNVNT+Y+ AYRR+SYNPTDIKFDESDCEEET+MDDE+    ++PPDRNPPP PS
Subjt:  EGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVPS

Query:  SRVHPKLPDYDTLAARFEALKYRK
        SRVHPKLPDYDTLAARFEALKYRK
Subjt:  SRVHPKLPDYDTLAARFEALKYRK

A0A6J1BX61 IST1-like protein4.7e-22899.3Show/hide
Query:  TAAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVV
        TAAATEAAARSLKIVKLFIALLRRDFN SKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLA NEIIELFCELVV
Subjt:  TAAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVV

Query:  ARLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTT
        ARLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTT
Subjt:  ARLSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTT

Query:  ESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE
        ESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE
Subjt:  ESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLE

Query:  FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP
        FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP
Subjt:  FEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVP

Query:  SSRVHPKLPDYDTLAARFEALKYRKI
        SSRVHPKLPDYDTLAARFEALKY+KI
Subjt:  SSRVHPKLPDYDTLAARFEALKYRKI

A0A6J1EML3 IST1-like protein1.1e-16578.65Show/hide
Query:  ATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARL
        A  AAARS++IVK FI+LLRR FN SKCKTAAKMAVARIKLLRNKREAVV+QMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCEL+V+RL
Subjt:  ATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARL

Query:  SIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESE
        SIIAKQR+CP DLKEGVASLIFA PRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLS+RTP+GQVKL IMKEIAKEHQIEWDTTESE
Subjt:  SIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESE

Query:  KELLKPPEELIEGPRTFVSAASLPVKPM-ANQSARDIAQIGR-------TTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGG
        KELLKPPEELI+GPRTFVSAAS+PVKP+ A+QSA D A I R        TNSRED++ HFQDTASAAEAAAKAAKQAIAAAQAAAYLANK+SNR  R  
Subjt:  KELLKPPEELIEGPRTFVSAASLPVKPM-ANQSARDIAQIGR-------TTNSREDDT-HFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGG

Query:  LGFGLEFEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDR
         G  L F G                           PQSLGRC SLK NEE RN NT+++ AYRRYSYNPTDIKFDESDCEEETQMD+E  G  ++PPDR
Subjt:  LGFGLEFEGLPTNSSPTNSHNMDNHQFKAGEETTIPPQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDR

Query:  NPPPVPSSRVHPKLPDYDTLAARFEALKYRK
        NPPPVPSSRVHPKLPDYDTLAARFEALK+ +
Subjt:  NPPPVPSSRVHPKLPDYDTLAARFEALKYRK

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog2.1e-2034.08Show/hide
Query:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV
        +L   F   + +   ++ + R+KLL  K+  + ++ R++IA  L +G+D  ARIRVEH+IRE  ++ A EI+EL+C+L++AR  +I   +E    L E V
Subjt:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV

Query:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD
        ++LI+AAPR  SE+ EL  + +    KY K++       +    VN  L+ KLS+  P   +  + + EIAK + + ++
Subjt:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD

Q54I39 IST1-like protein1.6e-2025.5Show/hide
Query:  FNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGVASLIF
        ++  K K   K+AV+RI++L+NK+  +VR  +R++A LL+   + +ARIRVE +IR++ ++   +IIE+ CEL+ AR+++I    E P ++KE + +L++
Subjt:  FNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGVASLIF

Query:  AAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELIEGPRTFVSAAS
        ++ R  +IPEL  ++N  + KYGK   + A +   +  VN  ++ KLS  TP   +  + + EIA++  ++W  ++       PP +LI           
Subjt:  AAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELIEGPRTFVSAAS

Query:  LPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEFEGLPTNSSPTNSHNMDNHQFKAGE
        +P +P+  Q    I Q            H Q      +   +  +Q       +  + +                    PT S   +   +     +  +
Subjt:  LPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEFEGLPTNSSPTNSHNMDNHQFKAGE

Query:  ETTIP--PQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPV-PSSRVHPKLPDYDTLAARFEALK
            P  P S    +S     +   ++TN  + Y    +N  +  ++ ++       ++ +    N   +  PPP  PSS      PDYD L ARFEALK
Subjt:  ETTIP--PQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPV-PSSRVHPKLPDYDTLAARFEALK

Q568Z6 IST1 homolog2.1e-2034.08Show/hide
Query:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV
        +L   F   + +   ++ + R+KLL  K+  + ++ R++IA  L +G+D  ARIRVEH+IRE  ++ A EI+EL+C+L++AR  +I   +E    L E V
Subjt:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV

Query:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD
        ++LI+AAPR  SE+ EL  + +    KY K++       +    VN  L+ KLS+  P   +  + + EIAK + + ++
Subjt:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD

Q5R6G8 IST1 homolog4.8e-2034.08Show/hide
Query:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV
        +L   F   + +   ++ + R+KLL  K+  + ++ R++IA  L +G+D  ARIRVEH+IRE  ++ A EI+EL+C+L++AR  +I   +E    L E V
Subjt:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV

Query:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD
        ++LI+AAPR  SE+ EL  + +    KY K +       +    VN  L+ KLS+  P   +  + + EIAK + + ++
Subjt:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD

Q9CX00 IST1 homolog2.1e-2034.08Show/hide
Query:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV
        +L   F   + +   ++ + R+KLL  K+  + ++ R++IA  L +G+D  ARIRVEH+IRE  ++ A EI+EL+C+L++AR  +I   +E    L E V
Subjt:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV

Query:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD
        ++LI+AAPR  SE+ EL  + +    KY K++       +    VN  L+ KLS+  P   +  + + EIAK + + ++
Subjt:  ASLIFAAPRC-SEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWD

Arabidopsis top hitse value%identityAlignment
AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein2.0e-5340.26Show/hide
Query:  VKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPP
        + L   L  R    +KCKT+  +A+AR+KLL+NKR+  ++ M+++IA  LQ+GQ+  ARIRVEHVIRE N+ AA EI+ELFCE ++AR+ I+  ++ECP 
Subjt:  VKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPP

Query:  DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELI
        +L+E +AS+IFAAPRCSE+P+L  ++N+F  KYGK+F+  A++LRP+ GVNR +I+KLS  +PSG  +LK++KEIA+E+ + WD++ +E E +K  E+L+
Subjt:  DLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELI

Query:  EGPRTF-----------------VSAASLPVKPMANQSARDIAQ------IGRTTNSREDDTHFQ----------DTASAAEAAAKAAKQAIAAAQAAAY
         G +                    S+ S  V+ +  ++ +   +      + ++  S +  + FQ          D    A AA  +A +A AAA+AAA 
Subjt:  EGPRTF-----------------VSAASLPVKPMANQSARDIAQ------IGRTTNSREDDTHFQ----------DTASAAEAAAKAAKQAIAAAQAAAY

Query:  LAN
        L N
Subjt:  LAN

AT1G34220.1 Regulator of Vps4 activity in the MVB pathway protein2.0e-5842.86Show/hide
Query:  ALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEG
        +   + F  +KCKT  K+ + RIKL+RN+REA ++QMRR+IA LL++GQ+ATARIRVEH+IRE+ ++AA EI+ELFCEL+  RL II  QRECP DLKE 
Subjt:  ALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEG

Query:  VASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNR------------------------------LLIDKLSIRTPSGQVKLKIMKEI
        ++S+ FAAPRCS++ EL  ++ +F  KYGK+FV+AA++L+P+ GVNR                               L++ LS+R PS + KLK++KEI
Subjt:  VASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNR------------------------------LLIDKLSIRTPSGQVKLKIMKEI

Query:  AKEHQIEWDTTESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHF
        A+EH+++WD   +E +L K  E+L++GP+ F   + LP+    N+   ++  +       + D+ +
Subjt:  AKEHQIEWDTTESEKELLKPPEELIEGPRTFVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHF

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein3.6e-6348.31Show/hide
Query:  ALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEG
        +   + F  +KCKT  K+ + RIKL+RN+REA ++QMRR+IA LL++GQ+ATARIRVEH+IRE+ ++AA EI+ELFCEL+  RL II  QRECP DLKE 
Subjt:  ALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEG

Query:  VASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELIEGPRT
        ++S+ FAAPRCS++ EL  ++ +F  KYGK+FV+AA++L+P+ GVNR L++ LS+R PS + KLK++KEIA+EH+++WD   +E +L K  E+L++GP+ 
Subjt:  VASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELIEGPRT

Query:  FVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHF
        F   + LP+    N+   ++  +       + D+ +
Subjt:  FVSAASLPVKPMANQSARDIAQIGRTTNSREDDTHF

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein4.1e-5147.44Show/hide
Query:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV
        +L+R F  +KCKTA +MA +R+K+L+NK+E  ++Q+RR++A LL+SGQ  TARIRVEHV+RE+  +AA E+I ++CEL+V RL +I  Q+ CP DLKE V
Subjt:  LLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQRECPPDLKEGV

Query:  ASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELIEGPRTF
         S++FA+ R S++PELS +   F  KYGKDF ++A +LRP+ GV+RLL++KLS + P G  K+KI+  IA+EH + W+  +S  E      EL+ G  +F
Subjt:  ASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELIEGPRTF

Query:  VSAASLPVKPMANQS
          A+S+ +    N +
Subjt:  VSAASLPVKPMANQS

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein6.6e-11855.74Show/hide
Query:  AATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVAR
        AA  A+A++ K++K  ++L RR FN SKCKTAAKMAVARIKL+RNKR  VV+QMRRDIA+LLQSGQDATARIRVEHVIREQN+ AANEIIELFCEL+V+R
Subjt:  AATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVAR

Query:  LSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTES
        L+II KQ++CP DLKEG+ASLIFAAPRCSEIPEL  LR++F KKYGKDFVSAATDLRP+CGVNR+LIDKLS+R P G+ KLKIMKEIAKE Q++WDTTE+
Subjt:  LSIIAKQRECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTES

Query:  EKELLKPPEELIEGPRTFVSAASLPVKPMA-NQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANK--DSNRGGRGGLGFGL
        E+ELLKP EE I+GPR FVSA+SLPV   A N+       + R+T+S   +TH+ DT SAAEAA + AKQA+AAAQ A+ LA +   SN+          
Subjt:  EKELLKPPEELIEGPRTFVSAASLPVKPMA-NQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANK--DSNRGGRGGLGFGL

Query:  EFEGLPTNSS-PTNSHNMDNHQFKAG-------EETT---IPPQSLGRCSSLKNNEETRNVN-TNYDEAY------------RRYSYNP--------TDI
        EF     +S+   +S  MD+H    G        ET+     P +  R    +++     +N ++Y+E Y            RR+SYNP        ++I
Subjt:  EFEGLPTNSS-PTNSHNMDNHQFKAG-------EETT---IPPQSLGRCSSLKNNEETRNVN-TNYDEAY------------RRYSYNP--------TDI

Query:  KFDESD-CEEETQMDDESRG-ATNQPPDRNPPPVPSS----------RVHPKLPDYDTLAARFEALKYRK
        KFDESD  EEET+ D+ S+G  ++ PP+R PP  P S          +VHPKLPDYD LAARFEA+++ K
Subjt:  KFDESD-CEEETQMDDESRG-ATNQPPDRNPPPVPSS----------RVHPKLPDYDTLAARFEALKYRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACCGCCGCCGCCACGGAGGCCGCCGCTCGTTCCCTCAAGATCGTCAAGCTCTTCATCGCCCTCCTCCGCCGCGACTTCAACTGCTCCAAATGCAAGACAGCGGCAAAAAT
GGCGGTGGCCAGGATAAAACTGCTGAGGAACAAGCGGGAGGCAGTGGTGAGACAGATGAGGCGGGACATTGCTCTCCTTCTCCAGTCTGGGCAGGATGCCACTGCTCGTA
TTCGGGTTGAACATGTCATAAGGGAACAGAATGTCTTGGCTGCAAATGAGATTATTGAGCTCTTCTGCGAGTTAGTAGTGGCTAGACTATCAATAATAGCAAAACAAAGA
GAATGTCCACCAGATCTAAAAGAAGGGGTTGCTAGTTTGATTTTTGCCGCTCCGAGGTGCTCGGAAATACCAGAACTTTCTGCACTTAGGAACGTTTTTGAGAAGAAATA
TGGCAAAGATTTTGTCTCTGCTGCCACTGATTTAAGGCCCAACTGTGGAGTGAATCGGCTGCTCATTGACAAGCTCTCAATTCGAACTCCGTCGGGTCAAGTGAAGCTCA
AAATAATGAAGGAGATTGCCAAGGAGCACCAGATTGAATGGGACACAACGGAATCTGAGAAAGAGCTGCTAAAGCCTCCTGAAGAACTTATTGAAGGGCCTCGGACTTTT
GTCAGTGCTGCCAGCTTACCTGTGAAGCCTATGGCAAACCAATCTGCTCGAGATATTGCCCAGATTGGAAGAACAACGAACAGTCGAGAGGATGATACGCACTTTCAAGA
TACAGCTTCTGCTGCAGAAGCTGCTGCGAAAGCGGCGAAGCAAGCGATTGCTGCTGCACAGGCTGCTGCCTATTTGGCAAACAAAGACTCAAACAGAGGTGGTCGAGGTG
GTTTGGGTTTCGGTCTCGAATTTGAAGGTCTTCCGACTAATTCTAGCCCAACCAACTCTCATAATATGGATAATCATCAGTTCAAGGCAGGAGAAGAGACGACAATCCCA
CCTCAGAGCTTAGGCAGATGCTCCTCTCTGAAGAATAATGAAGAGACCAGGAATGTAAATACAAATTATGATGAGGCTTACAGAAGATATAGCTACAATCCTACAGACAT
AAAGTTCGACGAATCGGATTGTGAAGAAGAAACTCAAATGGACGACGAATCTAGAGGGGCTACTAATCAACCTCCTGACCGGAATCCTCCTCCTGTACCCTCGTCTCGGG
TTCACCCGAAGCTACCAGATTACGATACACTTGCTGCTCGCTTTGAAGCTCTCAAGTACAGAAAAATT
mRNA sequenceShow/hide mRNA sequence
ACCGCCGCCGCCACGGAGGCCGCCGCTCGTTCCCTCAAGATCGTCAAGCTCTTCATCGCCCTCCTCCGCCGCGACTTCAACTGCTCCAAATGCAAGACAGCGGCAAAAAT
GGCGGTGGCCAGGATAAAACTGCTGAGGAACAAGCGGGAGGCAGTGGTGAGACAGATGAGGCGGGACATTGCTCTCCTTCTCCAGTCTGGGCAGGATGCCACTGCTCGTA
TTCGGGTTGAACATGTCATAAGGGAACAGAATGTCTTGGCTGCAAATGAGATTATTGAGCTCTTCTGCGAGTTAGTAGTGGCTAGACTATCAATAATAGCAAAACAAAGA
GAATGTCCACCAGATCTAAAAGAAGGGGTTGCTAGTTTGATTTTTGCCGCTCCGAGGTGCTCGGAAATACCAGAACTTTCTGCACTTAGGAACGTTTTTGAGAAGAAATA
TGGCAAAGATTTTGTCTCTGCTGCCACTGATTTAAGGCCCAACTGTGGAGTGAATCGGCTGCTCATTGACAAGCTCTCAATTCGAACTCCGTCGGGTCAAGTGAAGCTCA
AAATAATGAAGGAGATTGCCAAGGAGCACCAGATTGAATGGGACACAACGGAATCTGAGAAAGAGCTGCTAAAGCCTCCTGAAGAACTTATTGAAGGGCCTCGGACTTTT
GTCAGTGCTGCCAGCTTACCTGTGAAGCCTATGGCAAACCAATCTGCTCGAGATATTGCCCAGATTGGAAGAACAACGAACAGTCGAGAGGATGATACGCACTTTCAAGA
TACAGCTTCTGCTGCAGAAGCTGCTGCGAAAGCGGCGAAGCAAGCGATTGCTGCTGCACAGGCTGCTGCCTATTTGGCAAACAAAGACTCAAACAGAGGTGGTCGAGGTG
GTTTGGGTTTCGGTCTCGAATTTGAAGGTCTTCCGACTAATTCTAGCCCAACCAACTCTCATAATATGGATAATCATCAGTTCAAGGCAGGAGAAGAGACGACAATCCCA
CCTCAGAGCTTAGGCAGATGCTCCTCTCTGAAGAATAATGAAGAGACCAGGAATGTAAATACAAATTATGATGAGGCTTACAGAAGATATAGCTACAATCCTACAGACAT
AAAGTTCGACGAATCGGATTGTGAAGAAGAAACTCAAATGGACGACGAATCTAGAGGGGCTACTAATCAACCTCCTGACCGGAATCCTCCTCCTGTACCCTCGTCTCGGG
TTCACCCGAAGCTACCAGATTACGATACACTTGCTGCTCGCTTTGAAGCTCTCAAGTACAGAAAAATT
Protein sequenceShow/hide protein sequence
TAAATEAAARSLKIVKLFIALLRRDFNCSKCKTAAKMAVARIKLLRNKREAVVRQMRRDIALLLQSGQDATARIRVEHVIREQNVLAANEIIELFCELVVARLSIIAKQR
ECPPDLKEGVASLIFAAPRCSEIPELSALRNVFEKKYGKDFVSAATDLRPNCGVNRLLIDKLSIRTPSGQVKLKIMKEIAKEHQIEWDTTESEKELLKPPEELIEGPRTF
VSAASLPVKPMANQSARDIAQIGRTTNSREDDTHFQDTASAAEAAAKAAKQAIAAAQAAAYLANKDSNRGGRGGLGFGLEFEGLPTNSSPTNSHNMDNHQFKAGEETTIP
PQSLGRCSSLKNNEETRNVNTNYDEAYRRYSYNPTDIKFDESDCEEETQMDDESRGATNQPPDRNPPPVPSSRVHPKLPDYDTLAARFEALKYRKI