; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021294 (gene) of Snake gourd v1 genome

Gene IDTan0021294
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein GLE1 isoform X1
Genome locationLG04:899612..908728
RNA-Seq ExpressionTan0021294
SyntenyTan0021294
Gene Ontology termsGO:0006446 - regulation of translational initiation (biological process)
GO:0006449 - regulation of translational termination (biological process)
GO:0015031 - protein transport (biological process)
GO:0016973 - poly(A)+ mRNA export from nucleus (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0044614 - nuclear pore cytoplasmic filaments (cellular component)
GO:0000822 - inositol hexakisphosphate binding (molecular function)
GO:0005543 - phospholipid binding (molecular function)
GO:0031369 - translation initiation factor binding (molecular function)
InterPro domainsIPR012476 - GLE1-like
IPR038506 - GLE1-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601557.1 Protein GLE1, partial [Cucurbita argyrosperma subsp. sororia]5.8e-29685.89Show/hide
Query:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSS--------RDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNS
        SPVKLTLRCPS+VG VT DPQPDFSFDDLRVEL SLEEKLK STTPFTKT S        RDFP  KT KRSSKPFVMGVYEDELQ++F+DGEVVCDQ S
Subjt:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSS--------RDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNS

Query:  NAKRFNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQ
        NAKRFNCDG FLSDSEDSDNESTLETRA+LMEDVDLVESSLAQLT DHLLN KEEIRNQLGRLETNLT LNEKS++A SQI+KYYEARREADRRLDTQYQ
Subjt:  NAKRFNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQ

Query:  REIAEGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQA
        REIAEGLDKYLTT+QRHHEQISQREERKIRSDAAFEEAKRKEKA+LEEKKR EKAKAEAEAKA+AEEA KAAIEAE RA++E AEREAAEN+KKV  VQA
Subjt:  REIAEGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQA

Query:  QETTEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSEL
        QET  G  TT+P+NSV Q KGTALDGTNVSRS GSMVRAS++ALTLEQERLQK KEVEEGNQ LRLSSNKDFSTHERH+ARLIRQI GTKENV TKTS+L
Subjt:  QETTEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSEL

Query:  LKIFMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESV
        LKIFMDPLCPQTISIAAFAKKVVSQCES +NA FACSHV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH+GYSEAAFESKE +YK +GFRE+ DGKMESV
Subjt:  LKIFMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESV

Query:  QDYLKRLEAYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQI
        +DYLKRLEAYM LYGAL+QTEVPG RN+HGLEEGWAWLARFLN+VPPNIYTATALYAFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL+ KENPGLKQI
Subjt:  QDYLKRLEAYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQI

Query:  IVQIESYLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRS
        IVQIESYLEDRKFLQEPEGR L+GSLLSS +VPE EY+++SY RS
Subjt:  IVQIESYLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRS

KAG7032320.1 Protein GLE1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-29885.58Show/hide
Query:  RSPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSS--------RDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQN
        +SPVKLTLRCPS+VG VT DPQPDFSFDDLRVEL SLEEKLK STTPFTKT S        RDFP  KT KR SKPFVMGVYEDELQ++F+DGEVVCDQ 
Subjt:  RSPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSS--------RDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQN

Query:  SNAKRFNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQY
        SNAKRFNCDG FLSDSEDSDNESTLETRA+LMEDVDLVESSLAQLT DHLLN KEEIRNQLGRLETNLT LNEKS++A SQI+KYYEARREADRRLDTQY
Subjt:  SNAKRFNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQY

Query:  QREIAEGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQ
        QREIAEGLDKYLTT+QRHHEQISQREERKIRSDAAFEEAKRKEKA+LEEKKR EKAKAEAEAKA+AEEA KAAIEAE RA++E AEREAAEN+KKV  VQ
Subjt:  QREIAEGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQ

Query:  AQETTEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSE
        AQET  G  TT+P+NSV Q KGTALDGTNVSRS GSMVRAS++ALTLEQERLQK KEVEEGNQ LRLSSNKDFSTHERH+ARLIRQI GTKENV TKTS+
Subjt:  AQETTEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSE

Query:  LLKIFMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMES
        LLKIFMDPLCPQTISIAAFAKKVVSQCES +NA FACSHV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH+GYSEAAFESKE +YK +GFRE+ DGKMES
Subjt:  LLKIFMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMES

Query:  VQDYLKRLEAYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQ
        V+DYLKRLEAYM LYGAL+QTEVPG RN+HGLEEGWAWLARFLN+VPPNIYTATALYAFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL+ KENPGLKQ
Subjt:  VQDYLKRLEAYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQ

Query:  IIVQIESYLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        IIVQIESYLEDRKFLQEPEGR L+GSLLSS +VPE EY+++SY RS NSYFY
Subjt:  IIVQIESYLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

XP_022957181.1 protein GLE1 isoform X1 [Cucurbita moschata]1.7e-30086.78Show/hide
Query:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD
        SPVKLTLRCPS+VG VT DPQPDFSFDDLRVEL SLEEKLK STTPFTKT SRDFP  KT KRSSKPFVMGVYEDELQ++F+DGEVVCDQ SNAKRFNCD
Subjt:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD

Query:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD
        G FLSDSEDSDNESTL TRA+LMEDVDLVESSLAQLT DHLLN KEEIRNQLGRLETNLT LNEKS++A SQI+KYYEARREADRRLDTQYQREIAEGLD
Subjt:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD

Query:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL
        KYLTT+QRHHEQISQREERKIRSDAAFEEAKRKEKA+LEEKKR EKAKAEAEAKA+AEEA KAAIEAE RA++E AEREAAEN+KKV  VQAQET  G  
Subjt:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL

Query:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL
        TT+P+NSV Q KGTALDGTNVSRS GSMVRAS++ALTLEQERLQK KEVEEGNQ LRLSSNKDFSTHERH+ARLIRQI GTKENV TKTS+LLKIFMDPL
Subjt:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL

Query:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE
        CPQTISIAAFAKKVVSQCES +NA FACSHV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH+GYSEAAFESKE +YK +GFRE+ DGKMESV+DYLKRLE
Subjt:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE

Query:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL
        AYM LYGAL+QTEVPG RN+HGLEEGWAWLARFLN+VPPNIYTATALYAFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL+ KENPGLKQIIVQIESYL
Subjt:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL

Query:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        EDRKFLQEPEGR L+GSLLSS +VPE EY+++SY RS NSYFY
Subjt:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

XP_022997611.1 protein GLE1 isoform X1 [Cucurbita maxima]1.9e-29986.47Show/hide
Query:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD
        SPVKLTLRCPS+VG VT DPQPDFSFDDLR EL SLEEKLK STTPFTKT SRDFP  KT KRSSKPFVMGVYEDELQ++FSDGEVVCDQ SNAKRFNCD
Subjt:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD

Query:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD
        G FLSDSEDSDNE+TLETRA+LMEDVDLVESSLAQLTYDHLLN KEEIRNQLGRLETNLT LNEKS++A SQI+KYYEARREADRRLDTQYQREIAEGLD
Subjt:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD

Query:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL
        KYLTT+QRHHEQISQREERKIRSDAAFEEAKRKEKA+LEEKKR EKAKAEAEAKA+AEEA KAAIEAE RA++E AEREA EN+KKV  VQAQET  G  
Subjt:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL

Query:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL
        TT+P+NSV Q KGTALDGTNVSRS GSMVRAS++ALTLEQERLQK KE+EEGNQ LRLSSNKDFSTHERH+ARLIRQI GTKENV TKTS+LLKIFMDPL
Subjt:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL

Query:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE
        CPQTISIAAFAKKVVSQCES +NA FACSHV+VLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKE +YK +GFRE+ DGKMESV+DYLKRLE
Subjt:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE

Query:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL
        AYM LYGAL+QTEVPG RN+HGLEEGWAWLARFLN+VPPNIYTATALYAFL+ AG+AL RKYKSQFRKLLNII+DNFL AL+ KENPGLKQIIVQIESYL
Subjt:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL

Query:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        EDRKFLQEPEGR L+GSLLSS +VPE EY+++SY RS NSYFY
Subjt:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

XP_023544608.1 protein GLE1 isoform X1 [Cucurbita pepo subsp. pepo]5.4e-30286.94Show/hide
Query:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD
        SPVKLTLRCPS+VG VT DPQPDFSFDDLRVEL SLEEKLK STTPFTKT SRDFP  KT KRSSKPFVMGVYEDELQ++F+DGEVVCDQ SNAKRFNCD
Subjt:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD

Query:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD
        G FLSDSEDSDNESTLETRA+LMEDVDLVESSLAQLT DHLLN KEEIRNQLGRLETNLT LNEKS++A SQI+KYYEARREADRRLDTQYQREIAEGLD
Subjt:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD

Query:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL
        KYLTT+QRHHEQISQREERKIRSDAAFEEAKRKEKA+LEEKKR EKAKAEAEAKA+AEEA KAAIEAE RA++E AEREAAEN+KKV  VQAQET  G  
Subjt:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL

Query:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL
        TT+P+NSV QPKGTALDGTNVSRS G+MVRAS++ALTLEQERLQK KEVEEGNQ LRLSSNKDFSTHERH+ARLIRQI GTKENV TKTS+LLKIFMDPL
Subjt:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL

Query:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE
        CPQTISIAAFAKKVVSQCES +NA FACSHV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH+GYSEAAFESKE +YK +GFRE+ DGKMESV+DYLKRLE
Subjt:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE

Query:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL
        AYM LYGAL+QTEVPG RN+HGLEEGWAWLARFLN+VPPNIYTATALYAFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL+ KENPGLKQIIVQIESYL
Subjt:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL

Query:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        EDRKFLQEPEGR L+GSLLSS +VPE EY+++SY RS NSYFY
Subjt:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

TrEMBL top hitse value%identityAlignment
A0A6J1DD13 protein GLE1 isoform X13.6e-28383.1Show/hide
Query:  PVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCDG
        PVKLTLRCPS+VG VTVDPQPDFSFDDLRVELNSLEEKLK S+TPFTKT SRDF  TKT++RSS+PFVMGVYEDELQD+FSDGE V DQ+     FNCD 
Subjt:  PVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCDG

Query:  IFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDK
        IFLSDSEDSDNES++  RAYLMEDVDLVES +AQLT+DH LNMKEEIRNQL R+ T LT+LNEK+N AFSQI+KYYEARREADRRLDTQYQR+IAEGLDK
Subjt:  IFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDK

Query:  YLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGP
        YLT IQR HE ISQ EERKIRSDAAFEEAKRKEKA LEEKKRQEKAKAEAE  AKARAEEAKKAAIEAERRA+QEAA REAAEN+KKV V Q QETTE  
Subjt:  YLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGP

Query:  LTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDP
        LTT+P+NSV QPKG A  GTN+S+S GSM+RAS++ALTLE+ERLQK KEVEEGNQ LRLSSNKDFS+HERH+ARLIRQIRGTKENVRTK SELLKIFMDP
Subjt:  LTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDP

Query:  LCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLE
         CPQTISIAAFAKKVVSQCES DNA FAC HV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH GYSEAAFESK  +YK IGFRE+DGKME+++DYLKRLE
Subjt:  LCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLE

Query:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSAL--KRKENPGLKQIIVQIES
         YM LYGALIQTEV GVRNLHGLEEGWAWLARFLN+VPPNIYTATAL AFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL  K KEN GLKQ+++ IES
Subjt:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSAL--KRKENPGLKQIIVQIES

Query:  YLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        YLEDRKFLQEPEGR+LQGSLLS+V+VP+ ++ QESYGRSSNSYFY
Subjt:  YLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

A0A6J1DDZ4 protein GLE1 isoform X21.1e-28483.36Show/hide
Query:  PVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCDG
        PVKLTLRCPS+VG VTVDPQPDFSFDDLRVELNSLEEKLK S+TPFTKT SRDF  TKT++RSS+PFVMGVYEDELQD+FSDGE V DQ+     FNCD 
Subjt:  PVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCDG

Query:  IFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDK
        IFLSDSEDSDNES++  RAYLMEDVDLVES +AQLT+DH LNMKEEIRNQL R+ T LT+LNEK+N AFSQI+KYYEARREADRRLDTQYQR+IAEGLDK
Subjt:  IFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDK

Query:  YLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPLT
        YLT IQR HE ISQ EERKIRSDAAFEEAKRKEKA LEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRA+QEAA REAAEN+KKV V Q QETTE  LT
Subjt:  YLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPLT

Query:  TEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPLC
        T+P+NSV QPKG A  GTN+S+S GSM+RAS++ALTLE+ERLQK KEVEEGNQ LRLSSNKDFS+HERH+ARLIRQIRGTKENVRTK SELLKIFMDP C
Subjt:  TEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPLC

Query:  PQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAY
        PQTISIAAFAKKVVSQCES DNA FAC HV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH GYSEAAFESK  +YK IGFRE+DGKME+++DYLKRLE Y
Subjt:  PQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAY

Query:  MNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSAL--KRKENPGLKQIIVQIESYL
        M LYGALIQTEV GVRNLHGLEEGWAWLARFLN+VPPNIYTATAL AFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL  K KEN GLKQ+++ IESYL
Subjt:  MNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSAL--KRKENPGLKQIIVQIESYL

Query:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        EDRKFLQEPEGR+LQGSLLS+V+VP+ ++ QESYGRSSNSYFY
Subjt:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

A0A6J1DFL6 protein GLE1 isoform X43.6e-28383.1Show/hide
Query:  PVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCDG
        PVKLTLRCPS+VG VTVDPQPDFSFDDLRVELNSLEEKLK S+TPFTKT SRDF  TKT++RSS+PFVMGVYEDELQD+FSDGE V DQ+     FNCD 
Subjt:  PVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCDG

Query:  IFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDK
        IFLSDSEDSDNES++  RAYLMEDVDLVES +AQLT+DH LNMKEEIRNQL R+ T LT+LNEK+N AFSQI+KYYEARREADRRLDTQYQR+IAEGLDK
Subjt:  IFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDK

Query:  YLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGP
        YLT IQR HE ISQ EERKIRSDAAFEEAKRKEKA LEEKKRQEKAKAEAE  AKARAEEAKKAAIEAERRA+QEAA REAAEN+KKV V Q QETTE  
Subjt:  YLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGP

Query:  LTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDP
        LTT+P+NSV QPKG A  GTN+S+S GSM+RAS++ALTLE+ERLQK KEVEEGNQ LRLSSNKDFS+HERH+ARLIRQIRGTKENVRTK SELLKIFMDP
Subjt:  LTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDP

Query:  LCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLE
         CPQTISIAAFAKKVVSQCES DNA FAC HV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH GYSEAAFESK  +YK IGFRE+DGKME+++DYLKRLE
Subjt:  LCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLE

Query:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSAL--KRKENPGLKQIIVQIES
         YM LYGALIQTEV GVRNLHGLEEGWAWLARFLN+VPPNIYTATAL AFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL  K KEN GLKQ+++ IES
Subjt:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSAL--KRKENPGLKQIIVQIES

Query:  YLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        YLEDRKFLQEPEGR+LQGSLLS+V+VP+ ++ QESYGRSSNSYFY
Subjt:  YLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

A0A6J1GYI2 protein GLE1 isoform X18.5e-30186.78Show/hide
Query:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD
        SPVKLTLRCPS+VG VT DPQPDFSFDDLRVEL SLEEKLK STTPFTKT SRDFP  KT KRSSKPFVMGVYEDELQ++F+DGEVVCDQ SNAKRFNCD
Subjt:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD

Query:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD
        G FLSDSEDSDNESTL TRA+LMEDVDLVESSLAQLT DHLLN KEEIRNQLGRLETNLT LNEKS++A SQI+KYYEARREADRRLDTQYQREIAEGLD
Subjt:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD

Query:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL
        KYLTT+QRHHEQISQREERKIRSDAAFEEAKRKEKA+LEEKKR EKAKAEAEAKA+AEEA KAAIEAE RA++E AEREAAEN+KKV  VQAQET  G  
Subjt:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL

Query:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL
        TT+P+NSV Q KGTALDGTNVSRS GSMVRAS++ALTLEQERLQK KEVEEGNQ LRLSSNKDFSTHERH+ARLIRQI GTKENV TKTS+LLKIFMDPL
Subjt:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL

Query:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE
        CPQTISIAAFAKKVVSQCES +NA FACSHV+VLVTSQVPSAMVLLLAEFHRACIYTVPKH+GYSEAAFESKE +YK +GFRE+ DGKMESV+DYLKRLE
Subjt:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE

Query:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL
        AYM LYGAL+QTEVPG RN+HGLEEGWAWLARFLN+VPPNIYTATALYAFL+ AG+ALFRKYKSQFRKLLNII+DNFLSAL+ KENPGLKQIIVQIESYL
Subjt:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL

Query:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        EDRKFLQEPEGR L+GSLLSS +VPE EY+++SY RS NSYFY
Subjt:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

A0A6J1K7Y4 protein GLE1 isoform X19.3e-30086.47Show/hide
Query:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD
        SPVKLTLRCPS+VG VT DPQPDFSFDDLR EL SLEEKLK STTPFTKT SRDFP  KT KRSSKPFVMGVYEDELQ++FSDGEVVCDQ SNAKRFNCD
Subjt:  SPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRFNCD

Query:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD
        G FLSDSEDSDNE+TLETRA+LMEDVDLVESSLAQLTYDHLLN KEEIRNQLGRLETNLT LNEKS++A SQI+KYYEARREADRRLDTQYQREIAEGLD
Subjt:  GIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLD

Query:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL
        KYLTT+QRHHEQISQREERKIRSDAAFEEAKRKEKA+LEEKKR EKAKAEAEAKA+AEEA KAAIEAE RA++E AEREA EN+KKV  VQAQET  G  
Subjt:  KYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKV-VVQAQETTEGPL

Query:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL
        TT+P+NSV Q KGTALDGTNVSRS GSMVRAS++ALTLEQERLQK KE+EEGNQ LRLSSNKDFSTHERH+ARLIRQI GTKENV TKTS+LLKIFMDPL
Subjt:  TTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPL

Query:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE
        CPQTISIAAFAKKVVSQCES +NA FACSHV+VLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKE +YK +GFRE+ DGKMESV+DYLKRLE
Subjt:  CPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFREN-DGKMESVQDYLKRLE

Query:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL
        AYM LYGAL+QTEVPG RN+HGLEEGWAWLARFLN+VPPNIYTATALYAFL+ AG+AL RKYKSQFRKLLNII+DNFL AL+ KENPGLKQIIVQIESYL
Subjt:  AYMNLYGALIQTEVPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYL

Query:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY
        EDRKFLQEPEGR L+GSLLSS +VPE EY+++SY RS NSYFY
Subjt:  EDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYGRSSNSYFY

SwissProt top hitse value%identityAlignment
Q0WPZ7 Protein GLE12.3e-11742.84Show/hide
Query:  CPSKVGLVTVDPQPDFSFDDLRVELNSLEEKL---KMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSN---------AKR
        CP  V  +++DP+P+++F+ L  E+ S+E+KL    M   P T T+ R       + R    FVM V EDE++    +GE   D+             KR
Subjt:  CPSKVGLVTVDPQPDFSFDDLRVELNSLEEKL---KMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSN---------AKR

Query:  FNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIA
        F CD ++LSD  D + +   E   Y+M  + L ES+L ++  DH   +K++IRNQ+  +ET +    E S SA ++++KY E R+E +R+LD QYQR++A
Subjt:  FNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIA

Query:  EGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKVVVQAQET
        E LD +LT +QR H+  SQ EERKIRS+ A EEA+RKE+A  EEK RQEKA+AEA+  AK RAEE KK   E ER+A +E AE+E A+         Q+ 
Subjt:  EGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKVVVQAQET

Query:  TEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKI
         E     E +        T    T+ +++ G+ +RA++SAL LE  RL+K +E+E  NQ+L+  SN++FS+ E+H+ R+IRQI GTK++V  K ++++KI
Subjt:  TEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKI

Query:  FMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYL
        F DP CP +ISIAAFAKK+V+  E  +   FACS+V+V + SQ P  M +LLAEFH+ACIYTVPKHI  S++A++S                     D  
Subjt:  FMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYL

Query:  KRLEAYMNLYGALIQTE--VPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALK-RKENPGLKQII
        +RL++ M LYGAL+QT+  V    N+HG+E GWAWLARFLN +P N  TATAL +FL+ AG+ L ++YKSQF K++N++ ++FL  L+ +K+   L  II
Subjt:  KRLEAYMNLYGALIQTE--VPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALK-RKENPGLKQII

Query:  VQIESYLEDRKFLQEPEGRNLQ-GSLLSSVVVPEIEYSQESYGRSSNSY
         +I +YL+DR +L+EPEGR ++  S LSS +  E+     +     N Y
Subjt:  VQIESYLEDRKFLQEPEGRNLQ-GSLLSSVVVPEIEYSQESYGRSSNSY

Q3ZBK7 Nucleoporin GLE15.2e-2131.63Show/hide
Query:  FAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALI
        F K+   +  S   AAF  + V   +    P    L+LA  H+ C Y+VP +  + E    + E + + +G++  D K+E   ++LKR+   + LY A+I
Subjt:  FAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALI

Query:  QTEVP-GVR---NLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYLEDRKFL
        Q   P G R   + HGL  GW WLA+ LN  P +  TAT L+ FL   G AL ++Y+ QF K++ +I +++   ++   + G     ++++ +LE  K L
Subjt:  QTEVP-GVR---NLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYLEDRKFL

Query:  QEPEGRNLQGSLLSS
        Q  E    +G L +S
Subjt:  QEPEGRNLQGSLLSS

Q53GS7 Nucleoporin GLE11.2e-2032.09Show/hide
Query:  FAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALI
        F K+   +  S   AAF  + V   +    P    L+LA  H+ C Y+VP +  + E    + E + + +G++  D K+E   ++LKR+   + LY A+I
Subjt:  FAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALI

Query:  QTEVP-GVR---NLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYLEDRKFL
        Q   P G R   + HGL  GW WLA+ LN  P +  TAT L+ FL   G AL ++Y+ QF K+L +I +++   ++   + G     ++++ +LE  K L
Subjt:  QTEVP-GVR---NLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYLEDRKFL

Query:  QEPEGRNLQGSLLSS
        Q  +    +G L SS
Subjt:  QEPEGRNLQGSLLSS

Q5RAS2 Nucleoporin GLE11.2e-2032.09Show/hide
Query:  FAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALI
        F K+   +  S   AAF  + V   +    P    L+LA  H+ C Y+VP +  + E    + E + + +G++  D K+E   ++LKR+   + LY A+I
Subjt:  FAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALI

Query:  QTEVP-GVR---NLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYLEDRKFL
        Q   P G R   + HGL  GW WLA+ LN  P +  TAT L+ FL   G AL ++Y+ QF K+L +I +++   ++   + G     ++++ +LE  K L
Subjt:  QTEVP-GVR---NLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYLEDRKFL

Query:  QEPEGRNLQGSLLSS
        Q  +    +G L SS
Subjt:  QEPEGRNLQGSLLSS

Q6DRB1 Nucleoporin GLE15.0e-1627.9Show/hide
Query:  KSNSAFSQIDKYYEARREADRRLDTQYQREIAE-GLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKA
        + N   SQ+ +   A   AD +  +     +AE  L +  + ++   E +SQ  ERK          K+KE+   EEK+RQE+ KA+ E     E+ K A
Subjt:  KSNSAFSQIDKYYEARREADRRLDTQYQREIAE-GLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKA

Query:  AIEAERRAVQEAAEREAAENVKKVVVQAQETTEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDF
        A+ A+ +A +E  +  A ++  K     Q+           N  +Q    A D  N ++      +  K  + L++       ++         SS    
Subjt:  AIEAERRAVQEAAEREAAENVKKVVVQAQETTEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDF

Query:  STHERHVARLIRQIRGTKENVRTKTSELLKIFMDPLCPQTISIAAF--AKKVVSQCE----SLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTV
              + +L+ +   T       TS+          PQ +  A++  A+K V Q E    S  +AAF    V   +    P    L+LA  H+ C Y V
Subjt:  STHERHVARLIRQIRGTKENVRTKTSELLKIFMDPLCPQTISIAAF--AKKVVSQCE----SLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTV

Query:  PKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALIQTEVP-----GVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAA
        P H    E+   S E + K +G+R ++GK+E    +LKR+   + LY A+IQ   P     G+ +LHG+  GW W+A+ LN  P    TAT L+ FL   
Subjt:  PKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALIQTEVP-----GVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAA

Query:  GYALFRKYKSQFRKLLNIITDNF
        G AL ++Y+ QF KL+ II + +
Subjt:  GYALFRKYKSQFRKLLNIITDNF

Arabidopsis top hitse value%identityAlignment
AT1G13120.1 null1.6e-11842.84Show/hide
Query:  CPSKVGLVTVDPQPDFSFDDLRVELNSLEEKL---KMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSN---------AKR
        CP  V  +++DP+P+++F+ L  E+ S+E+KL    M   P T T+ R       + R    FVM V EDE++    +GE   D+             KR
Subjt:  CPSKVGLVTVDPQPDFSFDDLRVELNSLEEKL---KMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSN---------AKR

Query:  FNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIA
        F CD ++LSD  D + +   E   Y+M  + L ES+L ++  DH   +K++IRNQ+  +ET +    E S SA ++++KY E R+E +R+LD QYQR++A
Subjt:  FNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIA

Query:  EGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKVVVQAQET
        E LD +LT +QR H+  SQ EERKIRS+ A EEA+RKE+A  EEK RQEKA+AEA+  AK RAEE KK   E ER+A +E AE+E A+         Q+ 
Subjt:  EGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAE--AKARAEEAKKAAIEAERRAVQEAAEREAAENVKKVVVQAQET

Query:  TEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKI
         E     E +        T    T+ +++ G+ +RA++SAL LE  RL+K +E+E  NQ+L+  SN++FS+ E+H+ R+IRQI GTK++V  K ++++KI
Subjt:  TEGPLTTEPINSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKI

Query:  FMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYL
        F DP CP +ISIAAFAKK+V+  E  +   FACS+V+V + SQ P  M +LLAEFH+ACIYTVPKHI  S++A++S                     D  
Subjt:  FMDPLCPQTISIAAFAKKVVSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYL

Query:  KRLEAYMNLYGALIQTE--VPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALK-RKENPGLKQII
        +RL++ M LYGAL+QT+  V    N+HG+E GWAWLARFLN +P N  TATAL +FL+ AG+ L ++YKSQF K++N++ ++FL  L+ +K+   L  II
Subjt:  KRLEAYMNLYGALIQTE--VPGVRNLHGLEEGWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALK-RKENPGLKQII

Query:  VQIESYLEDRKFLQEPEGRNLQ-GSLLSSVVVPEIEYSQESYGRSSNSY
         +I +YL+DR +L+EPEGR ++  S LSS +  E+     +     N Y
Subjt:  VQIESYLEDRKFLQEPEGRNLQ-GSLLSSVVVPEIEYSQESYGRSSNSY

AT4G05523.1 BEST Arabidopsis thaliana protein match is: embryo defective 1745 (TAIR:AT1G13120.1)8.5e-1934.12Show/hide
Query:  CPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRF-NCDGIFLSDS
        CP  V   + DP+P+ SFD L  E  S E+KL      FTK      P TKT  R+   FVM V ED++ D   D E    Q+    +   CD ++LSD 
Subjt:  CPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSDGEVVCDQNSNAKRF-NCDGIFLSDS

Query:  EDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDKYLTTIQ
        ++ D+E         +E   + +   A    DH     E+I+NQ+  +E  +    E++ SA + ++KY E RRE +R LD QY+RE++E L+ ++  ++
Subjt:  EDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQREIAEGLDKYLTTIQ

Query:  RHHEQISQREE
        R H  IS+  E
Subjt:  RHHEQISQREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGATAATAGAGGAGGAGGTGTCTTGTTCGTTCATGATTGCAAAAGAGGGGAAAAAAACTTCAGTTCCTATGTTAGGAGTCCTGTGAAGTTGACCCTTCGCTGCCC
TTCTAAAGTTGGTCTGGTTACGGTTGACCCGCAACCTGATTTTAGCTTTGATGATTTGCGGGTTGAACTTAATTCCTTGGAAGAGAAGCTAAAGATGTCTACAACGCCTT
TCACCAAAACATCTTCACGAGATTTTCCCACCACAAAAACTCTGAAGAGAAGCTCCAAGCCTTTTGTAATGGGCGTGTATGAGGATGAATTACAGGACGTATTTAGTGAT
GGTGAAGTGGTTTGCGACCAAAATTCAAATGCTAAGCGCTTTAATTGTGATGGGATTTTTCTTAGTGATAGTGAAGATTCTGACAATGAATCAACCCTTGAAACACGGGC
CTATCTGATGGAGGATGTGGATTTAGTGGAAAGTTCTTTAGCTCAGCTAACATATGATCATCTTCTTAATATGAAGGAAGAAATTAGGAACCAACTTGGAAGGTTGGAGA
CAAACCTTACTGTTTTAAATGAAAAGTCAAATTCTGCATTCTCTCAAATTGATAAATACTATGAGGCAAGACGAGAAGCTGACCGGAGACTTGATACTCAATATCAACGC
GAGATTGCAGAAGGTCTGGATAAGTACTTGACTACTATTCAACGGCACCATGAACAAATATCACAAAGAGAAGAAAGAAAAATAAGAAGTGATGCGGCTTTTGAAGAAGC
CAAAAGAAAGGAGAAGGCCCTTCTAGAAGAAAAAAAGCGTCAAGAAAAGGCTAAAGCAGAAGCCGAGGCAAAAGCCAGAGCTGAGGAAGCAAAGAAAGCTGCCATAGAAG
CTGAGAGGAGAGCAGTGCAAGAAGCAGCTGAAAGGGAAGCTGCTGAAAACGTAAAAAAGGTTGTTGTACAAGCACAAGAAACTACTGAGGGCCCACTAACTACCGAACCA
ATAAACTCAGTCAGCCAACCTAAAGGAACGGCTTTGGATGGAACAAATGTATCAAGATCACAAGGTAGCATGGTTAGAGCCTCAAAAAGTGCTTTGACTTTAGAGCAAGA
GAGACTACAAAAATTTAAGGAAGTAGAGGAGGGAAACCAAACATTGAGATTGAGCTCAAATAAGGATTTTAGCACCCACGAAAGACATGTTGCTAGGCTTATTCGACAAA
TACGGGGAACGAAAGAGAATGTCAGAACAAAAACTAGTGAACTTCTCAAGATTTTCATGGATCCTCTTTGTCCACAAACCATTAGCATTGCGGCATTTGCAAAGAAGGTC
GTTTCTCAATGTGAAAGTCTAGATAATGCAGCTTTTGCATGCAGTCACGTTGTTGTTCTTGTTACCTCTCAGGTCCCAAGTGCTATGGTTCTTCTTCTTGCAGAATTCCA
TAGAGCTTGCATCTACACAGTCCCGAAGCACATAGGATATTCAGAGGCTGCATTTGAATCCAAAGAGTTACATTACAAAGCTATTGGATTCAGAGAGAATGATGGAAAGA
TGGAGAGTGTTCAGGATTATTTGAAGCGGTTAGAAGCATACATGAATCTATATGGGGCCTTGATTCAGACAGAAGTACCTGGTGTTCGTAACCTACATGGTCTTGAAGAG
GGTTGGGCATGGCTGGCAAGATTTCTAAATTCGGTTCCCCCGAATATATATACTGCAACCGCACTCTATGCATTCTTACGAGCAGCAGGCTATGCACTGTTCAGAAAATA
TAAATCTCAGTTCAGGAAGCTTTTGAACATCATTACTGACAACTTCTTGAGTGCACTCAAAAGAAAGGAGAACCCAGGGTTGAAACAGATAATCGTGCAGATTGAATCCT
ATTTGGAAGATAGGAAGTTCCTTCAAGAACCAGAAGGAAGAAACTTGCAGGGTTCCTTGTTGTCCAGTGTTGTTGTGCCGGAGATAGAGTACTCCCAAGAATCATACGGC
CGTTCAAGTAATTCATATTTCTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGATAATAGAGGAGGAGGTGTCTTGTTCGTTCATGATTGCAAAAGAGGGGAAAAAAACTTCAGTTCCTATGTTAGGAGTCCTGTGAAGTTGACCCTTCGCTGCCC
TTCTAAAGTTGGTCTGGTTACGGTTGACCCGCAACCTGATTTTAGCTTTGATGATTTGCGGGTTGAACTTAATTCCTTGGAAGAGAAGCTAAAGATGTCTACAACGCCTT
TCACCAAAACATCTTCACGAGATTTTCCCACCACAAAAACTCTGAAGAGAAGCTCCAAGCCTTTTGTAATGGGCGTGTATGAGGATGAATTACAGGACGTATTTAGTGAT
GGTGAAGTGGTTTGCGACCAAAATTCAAATGCTAAGCGCTTTAATTGTGATGGGATTTTTCTTAGTGATAGTGAAGATTCTGACAATGAATCAACCCTTGAAACACGGGC
CTATCTGATGGAGGATGTGGATTTAGTGGAAAGTTCTTTAGCTCAGCTAACATATGATCATCTTCTTAATATGAAGGAAGAAATTAGGAACCAACTTGGAAGGTTGGAGA
CAAACCTTACTGTTTTAAATGAAAAGTCAAATTCTGCATTCTCTCAAATTGATAAATACTATGAGGCAAGACGAGAAGCTGACCGGAGACTTGATACTCAATATCAACGC
GAGATTGCAGAAGGTCTGGATAAGTACTTGACTACTATTCAACGGCACCATGAACAAATATCACAAAGAGAAGAAAGAAAAATAAGAAGTGATGCGGCTTTTGAAGAAGC
CAAAAGAAAGGAGAAGGCCCTTCTAGAAGAAAAAAAGCGTCAAGAAAAGGCTAAAGCAGAAGCCGAGGCAAAAGCCAGAGCTGAGGAAGCAAAGAAAGCTGCCATAGAAG
CTGAGAGGAGAGCAGTGCAAGAAGCAGCTGAAAGGGAAGCTGCTGAAAACGTAAAAAAGGTTGTTGTACAAGCACAAGAAACTACTGAGGGCCCACTAACTACCGAACCA
ATAAACTCAGTCAGCCAACCTAAAGGAACGGCTTTGGATGGAACAAATGTATCAAGATCACAAGGTAGCATGGTTAGAGCCTCAAAAAGTGCTTTGACTTTAGAGCAAGA
GAGACTACAAAAATTTAAGGAAGTAGAGGAGGGAAACCAAACATTGAGATTGAGCTCAAATAAGGATTTTAGCACCCACGAAAGACATGTTGCTAGGCTTATTCGACAAA
TACGGGGAACGAAAGAGAATGTCAGAACAAAAACTAGTGAACTTCTCAAGATTTTCATGGATCCTCTTTGTCCACAAACCATTAGCATTGCGGCATTTGCAAAGAAGGTC
GTTTCTCAATGTGAAAGTCTAGATAATGCAGCTTTTGCATGCAGTCACGTTGTTGTTCTTGTTACCTCTCAGGTCCCAAGTGCTATGGTTCTTCTTCTTGCAGAATTCCA
TAGAGCTTGCATCTACACAGTCCCGAAGCACATAGGATATTCAGAGGCTGCATTTGAATCCAAAGAGTTACATTACAAAGCTATTGGATTCAGAGAGAATGATGGAAAGA
TGGAGAGTGTTCAGGATTATTTGAAGCGGTTAGAAGCATACATGAATCTATATGGGGCCTTGATTCAGACAGAAGTACCTGGTGTTCGTAACCTACATGGTCTTGAAGAG
GGTTGGGCATGGCTGGCAAGATTTCTAAATTCGGTTCCCCCGAATATATATACTGCAACCGCACTCTATGCATTCTTACGAGCAGCAGGCTATGCACTGTTCAGAAAATA
TAAATCTCAGTTCAGGAAGCTTTTGAACATCATTACTGACAACTTCTTGAGTGCACTCAAAAGAAAGGAGAACCCAGGGTTGAAACAGATAATCGTGCAGATTGAATCCT
ATTTGGAAGATAGGAAGTTCCTTCAAGAACCAGAAGGAAGAAACTTGCAGGGTTCCTTGTTGTCCAGTGTTGTTGTGCCGGAGATAGAGTACTCCCAAGAATCATACGGC
CGTTCAAGTAATTCATATTTCTACTGA
Protein sequenceShow/hide protein sequence
MRDNRGGGVLFVHDCKRGEKNFSSYVRSPVKLTLRCPSKVGLVTVDPQPDFSFDDLRVELNSLEEKLKMSTTPFTKTSSRDFPTTKTLKRSSKPFVMGVYEDELQDVFSD
GEVVCDQNSNAKRFNCDGIFLSDSEDSDNESTLETRAYLMEDVDLVESSLAQLTYDHLLNMKEEIRNQLGRLETNLTVLNEKSNSAFSQIDKYYEARREADRRLDTQYQR
EIAEGLDKYLTTIQRHHEQISQREERKIRSDAAFEEAKRKEKALLEEKKRQEKAKAEAEAKARAEEAKKAAIEAERRAVQEAAEREAAENVKKVVVQAQETTEGPLTTEP
INSVSQPKGTALDGTNVSRSQGSMVRASKSALTLEQERLQKFKEVEEGNQTLRLSSNKDFSTHERHVARLIRQIRGTKENVRTKTSELLKIFMDPLCPQTISIAAFAKKV
VSQCESLDNAAFACSHVVVLVTSQVPSAMVLLLAEFHRACIYTVPKHIGYSEAAFESKELHYKAIGFRENDGKMESVQDYLKRLEAYMNLYGALIQTEVPGVRNLHGLEE
GWAWLARFLNSVPPNIYTATALYAFLRAAGYALFRKYKSQFRKLLNIITDNFLSALKRKENPGLKQIIVQIESYLEDRKFLQEPEGRNLQGSLLSSVVVPEIEYSQESYG
RSSNSYFY