; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011625 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011625
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr1:29307291..29311233
RNA-Seq ExpressionLag0011625
SyntenyLag0011625
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI10421.1 hypothetical protein PRUPE_4G046500 [Prunus persica]2.1e-9241.9Show/hide
Query:  DEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNR
        DEE  NS+      L+S   SD++    ++ Y E +     +  +F +GMKF +   L+ AI+ Y     YE    KN+  R++ +C+EGC WRL+AS  
Subjt:  DEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNR

Query:  QDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSN
        Q ++TY IKS    HSC++   NK++T+++L+  Y+ R +  P +    +  +V  +    +S  Q Y+A++ A   +EGT   +Y KLW+YCEE+R++N
Subjt:  QDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSN

Query:  PGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPV
         GS+ L+ +D      PP F+ +YVCL ACK GF+ GCRP+IG+D CHLKG   GQ+L +VGIDAND ++P+A AV ++ +SK SW WFL+ L++D+G V
Subjt:  PGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPV

Query:  EQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFKAKATTRSM-----------FDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSK
         + GW F++DQQKGL  A E V P A HR C+RHLY N K+ FK +A    +           F+ EM+KL  LD+ A+ W+       WA++ F   +K
Subjt:  EQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFKAKATTRSM-----------FDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSK

Query:  CDMLLNNISESFNSYILEAR
        CDMLLNN+ E FNS+IL AR
Subjt:  CDMLLNNISESFNSYILEAR

XP_020080379.1 uncharacterized protein LOC109704055 [Ananas comosus]4.7e-9241.52Show/hide
Query:  DYD-GDDNE----NQDAHEGGGASLNFGNDGVSLNLED---EELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRD
        DYD  DD+E    N D H   G          S  ++D   +   +++ E    L+S   S +D +   + Y EF      +  +F +GM F ++ E R 
Subjt:  DYD-GDDNE----NQDAHEGGGASLNFGNDGVSLNLED---EELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRD

Query:  AIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNI
        AIREYS++  Y +   KNE  ++   C+EGC W ++AS   D  T  +K   S H C +   NK VT+SWLA  Y+DRFR  P     A  + V +D ++
Subjt:  AIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNI

Query:  CISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTA
         +SR + YRA++ A+  +EG+   +YA LWDY  E+++SNPGST L+  D  Q      F+ +YVCL ACK GFL GCRPII LDGC LK    GQ+L A
Subjt:  CISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTA

Query:  VGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATT
        VG+DAND IFP+A AVVDV +S  SW WFL +LL D+       W FM+D+QKGL+ A+E + P AEHR C+RHLY+N +  +K           A+A+ 
Subjt:  VGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATT

Query:  RSMFDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR
         + F++ M  + + D  A++W+   P + W++  FR   KCDMLLNN+ E FN YILEAR
Subjt:  RSMFDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR

XP_022156713.1 uncharacterized protein LOC111023558 [Momordica charantia]1.3e-14564.99Show/hide
Query:  KHSENVLNNAENIVDNEVIEETPLATEGTHKSLKLVDICDISDNNVHIEELLNIDYDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSH
        +H  N+ +  E+ +D EVI ETP+A + TH SL+L DIC  SD  +HIEELLN+DYDG+D++     +  G  LN  + GV L+LE+EELCNSEC SFSH
Subjt:  KHSENVLNNAENIVDNEVIEETPLATEGTHKSLKLVDICDISDNNVHIEELLNIDYDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSH

Query:  LNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYST
        LNSPIQSDDD VE SQ YVEFHP MLK+ +EFTLGMKFGTLTELRDAIREYS+R G+ELIYQKNE  R+TVRCQ+GC+WRLHASNRQD+           
Subjt:  LNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYST

Query:  HSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQI
                            YL                 VR DFNI ISR+QAYRARKIAM KLEGT +M+YAKL DYCEEL RSNP S+ LLS+DR QI
Subjt:  HSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQI

Query:  HMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKG
        HM PIF+SMY+CL ACK+GFLQ CRPIIGLDGCHLK  + GQ+LTAVGIDAN+GIFPLALAVVDV +SK+SWTWFLENLLD IGPVE+  WVFMTDQQKG
Subjt:  HMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKG

Query:  LVPAIEKVAPTAEHRIC
        L+PAIEKVAP AEHRIC
Subjt:  LVPAIEKVAPTAEHRIC

XP_030922963.1 uncharacterized protein LOC115949829 [Quercus lobata]4.5e-9541.35Show/hide
Query:  ENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIY
        +    +    A + FG D       DE+   SE  +     S +   DD  E    Y ++HP     +V F L M+F T  ++ DAI+ YS+  G  + +
Subjt:  ENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIY

Query:  QKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAM
        +KN+  R+ V+C + C + L+    +D+ T+++K L   H+C R+  N+  +++WL  + +D  R+ P++    I D+V + F + ISRYQA RA+  A 
Subjt:  QKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAM

Query:  DKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALA
        + + GT   +YA+L DYCEEL R NPGST L+S +RPQ H+ P F  +YVCL ACK GFL  CRP IG+DGCHLKG   GQ+LTAVG D N+G++P+A A
Subjt:  DKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALA

Query:  VVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMKKLVQLD
        VV+  ++K SWTWF++ LLDDIG +  +GW FM+DQQKGL    +++ P  +HR C+RHLY N  K +K           A+AT  S F  EM K+ +L+
Subjt:  VVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMKKLVQLD

Query:  KSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR
          A++W+     ++W + AFR + KCD L NN  E+FNS ILE R
Subjt:  KSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR

XP_034229423.1 uncharacterized protein LOC117638399 [Prunus dulcis]2.3e-9139.37Show/hide
Query:  DICDISDNNVHIEELLNIDYDGDDNENQDAHEG-----GGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVE
        D  D+ DN+  ++         DDN   DA +      G       N   ++ +ED  +   +  S   L+S   S ++    ++ Y EF+     +  +
Subjt:  DICDISDNNVHIEELLNIDYDGDDNENQDAHEG-----GGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVE

Query:  FTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDM
        F +GMKF +   L+ AI+ Y     YE    KN+  R++ +C+EGC WRL+AS  Q ++TY IKS    HSC++   NK++T+++L+  Y+ R +  P +
Subjt:  FTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDM

Query:  DWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLD
            +  +V  +    +S  Q Y+A++ A   +EGT   +Y KLW+YCEE+R++N GS+ L+ +D      PP F+ +YVCL ACK GF+ GCRP+IG+D
Subjt:  DWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLD

Query:  GCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFKA
         CHLKG   GQ+L AVGIDAND I+P+A AV ++ +SK SW WFL+ L++D+G V + GW F++DQQKGL  A E V P A HR C+RHLY N K+ FK 
Subjt:  GCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFKA

Query:  KATTRSM-----------FDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR
        +A    +           F+ EM+KL  LD+ A+ W+       WA++ F   +KCDMLLNN+ E FNS+IL AR
Subjt:  KATTRSM-----------FDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR

TrEMBL top hitse value%identityAlignment
A0A2N9EJP6 SWIM-type domain-containing protein8.3e-9540.35Show/hide
Query:  YDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRC
        +D D+  ++        +L  G +   L+  ++     + +S + +     S DD  E +  Y ++ P     +V+F + M+F T  ++ DAI+ YS+  
Subjt:  YDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRC

Query:  GYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYR
           L ++KN+  R+ V+C++GC + L+    +++ T+ +K+L   H+C ++  N+  +++WL  + +D+ R+ P M    I  +V S F + ISR+QA R
Subjt:  GYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYR

Query:  ARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGI
        A+  A + ++GT K +YA+L DYCEEL RSNP ST +LS +RPQ H+ P+F   YVCL ACK GFL GCRP IG+DGCHLKG   GQ+LTAVG D N+G+
Subjt:  ARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGI

Query:  FPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMK
        +P+A AVV+  ++K SWTWF+ NLL D+G   +K W FM+DQQKGLV   + + P  +HR C+RHLY N  K  K           A+AT  + F  EM+
Subjt:  FPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMK

Query:  KLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR
        K+ +L+  A++W+ AV  K+W +  FR++SKCD L NN  E+FNS IL+ R
Subjt:  KLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR

A0A2N9FLJ6 Uncharacterized protein9.2e-9441.18Show/hide
Query:  HEGGGASLNFGNDGVSLN--LEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKN
        + GG  ++  G+D V+ +    + +   +  ES +       SDD+S  +   Y E   F+   +++F LGM+F T  ++ DA+R YSV  G +L ++KN
Subjt:  HEGGGASLNFGNDGVSLN--LEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKN

Query:  ESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKL
        +  R+ V+C  GC + L      ++ T+ +K +   H+C  +  NKH +  WL  + ++  +  P +   +I  QV S FN+ ISR+ AY+A+++A   +
Subjt:  ESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKL

Query:  EGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVD
        +G+ K +YA+L DY EEL RSNPGS  ++  +RPQ H+ P F   YVCL ACK GFL GCRP IGLDGCHLKG   GQ+LTA+G DAN+G++P+A AV +
Subjt:  EGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVD

Query:  VDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMKKLVQLDKSA
          + K+SW WFLE L  DIG +  + W FM+DQQKGLVP  EK+ P  +HR C+RHLY N  K FK           A+AT  + F+ EM+K+  ++  A
Subjt:  VDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMKKLVQLDKSA

Query:  FDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR
        + W++  P ++W++ +F ++SKCD L NN  ESFN+ ILE R
Subjt:  FDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR

A0A2N9FZE3 SWIM-type domain-containing protein6.0e-9343.33Show/hide
Query:  VSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWR
        +S++  D+    S+  S   LNSP  SDD   E    Y +F    L + V FTLGM F    EL++A+  Y ++ G+ L + KNE  ++ VRC EGC W+
Subjt:  VSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWR

Query:  LHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCE
        L A    D  ++ I S  S H+C+R   ++ V++ W+A  Y+D FR  PDM+   ++ +V  D  + +SR +AYRA++ A   +EG+   +Y ++ DYCE
Subjt:  LHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCE

Query:  ELRRSNPGSTCLLSIDRPQIHMP-PIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENL
        EL R+NPG+T      +  +H+P   F+ +YVCL  CK GFL GCRPIIGLD CHLKGA  GQ+L AVG+D N+G++P+A AV +  +S ++WTWFLENL
Subjt:  ELRRSNPGSTCLLSIDRPQIHMP-PIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENL

Query:  LDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMKKLVQLDKSAFDWVNAVPTKYWAKY
        + DIG     GW F++DQQKGLVPA++KV   + HR C+RHL+AN KK+ K           A+A+T   FD  M+++  + K A D +       WA++
Subjt:  LDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMKKLVQLDKSAFDWVNAVPTKYWAKY

Query:  AFRSHSKCDMLLNNISESFNSYILEAR
        AF  + KCDMLLNN+ E+FNS I+ AR
Subjt:  AFRSHSKCDMLLNNISESFNSYILEAR

A0A2N9H0N6 SWIM-type domain-containing protein8.3e-9540.35Show/hide
Query:  YDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRC
        +D D+  ++        +L  G +   L+  ++     + +S + +     S DD  E +  Y ++ P     +V+F + M+F T  ++ DAI+ YS+  
Subjt:  YDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRC

Query:  GYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYR
           L ++KN+  R+ V+C++GC + L+    +++ T+ +K+L   H+C ++  N+  +++WL  + +D+ R+ P M    I  +V S F + ISR+QA R
Subjt:  GYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYR

Query:  ARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGI
        A+  A + ++GT K +YA+L DYCEEL RSNP ST +LS +RPQ H+ P+F   YVCL ACK GFL GCRP IG+DGCHLKG   GQ+LTAVG D N+G+
Subjt:  ARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGI

Query:  FPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMK
        +P+A AVV+  ++K SWTWF+ NLL D+G   +K W FM+DQQKGLV   + + P  +HR C+RHLY N  K  K           A+AT  + F  EM+
Subjt:  FPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFK-----------AKATTRSMFDNEMK

Query:  KLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR
        K+ +L+  A++W+ AV  K+W +  FR++SKCD L NN  E+FNS IL+ R
Subjt:  KLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEAR

A0A6J1DRD2 uncharacterized protein LOC1110235586.1e-14664.99Show/hide
Query:  KHSENVLNNAENIVDNEVIEETPLATEGTHKSLKLVDICDISDNNVHIEELLNIDYDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSH
        +H  N+ +  E+ +D EVI ETP+A + TH SL+L DIC  SD  +HIEELLN+DYDG+D++     +  G  LN  + GV L+LE+EELCNSEC SFSH
Subjt:  KHSENVLNNAENIVDNEVIEETPLATEGTHKSLKLVDICDISDNNVHIEELLNIDYDGDDNENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSH

Query:  LNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYST
        LNSPIQSDDD VE SQ YVEFHP MLK+ +EFTLGMKFGTLTELRDAIREYS+R G+ELIYQKNE  R+TVRCQ+GC+WRLHASNRQD+           
Subjt:  LNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRCQEGCSWRLHASNRQDDSTYLIKSLYST

Query:  HSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQI
                            YL                 VR DFNI ISR+QAYRARKIAM KLEGT +M+YAKL DYCEEL RSNP S+ LLS+DR QI
Subjt:  HSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQI

Query:  HMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKG
        HM PIF+SMY+CL ACK+GFLQ CRPIIGLDGCHLK  + GQ+LTAVGIDAN+GIFPLALAVVDV +SK+SWTWFLENLLD IGPVE+  WVFMTDQQKG
Subjt:  HMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKG

Query:  LVPAIEKVAPTAEHRIC
        L+PAIEKVAP AEHRIC
Subjt:  LVPAIEKVAPTAEHRIC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64255.1 MuDR family transposase4.8e-1020.96Show/hide
Query:  GGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRI
        GG+S     D + + LE E   NSE +   HL      + DS+             L +  +  +G+ F    EL+ A+   S++   + + ++      
Subjt:  GGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRI

Query:  TVRC-QEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTT
           C +  C W L A+  +      I      H+C       H           + F+++ + D   I   VR      IS  + +  +KI  +      
Subjt:  TVRC-QEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTT

Query:  KMKYAKL-------WDYCEE----LRRSNPGSTCLLSIDRPQIHMPPIFRS---MYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDG
        ++   K        WD   E    L  +   S  LL   +  +   P F S   ++       +GF Q CRP+I +D  +L      +++ A G+DA + 
Subjt:  KMKYAKL-------WDYCEE----LRRSNPGSTCLLSIDRPQIHMPPIFRS---MYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDG

Query:  IFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVA-----PTAEHRICLRHLYANLKKHFKA-----------KATTRSM
         FPLA AV   + S   W WFL  + + +   ++KG   ++     ++  + +       P A HR  L H Y+   + F +             + +  
Subjt:  IFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVA-----PTAEHRICLRHLYANLKKHFKA-----------KATTRSM

Query:  FDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEARRLCNAWKDAVQCLEGCSFLI
        F + M  + + +  A  W++  P   WA  A  +  +  ++  N    F         +CNA++ A   + G   L+
Subjt:  FDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEARRLCNAWKDAVQCLEGCSFLI

AT1G64260.1 MuDR family transposase5.8e-1621.97Show/hide
Query:  LGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRC-QEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDR-FRAQPDM
        LG+ F    EL+ A+  + +R     I ++ E    T  C +  C W L A+  ++     I      H+C+ +  N     S  A + ++R  R QP +
Subjt:  LGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITVRC-QEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDR-FRAQPDM

Query:  DWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMP----PIFRSMYVCLKACKDGFLQGCRPI
            +    +      +   +    +   + ++ G     +  +         SN      L +D      P      FR ++       +GF Q CRP+
Subjt:  DWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMP----PIFRSMYVCLKACKDGFLQGCRPI

Query:  IGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVA-----PTAEHRICLRHLY
        I +D   L G    +++ A G+DA +  FPLA AV   + S  SW WF   + + +   ++K    ++   + +V  + +       P A H+ CL HL 
Subjt:  IGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVA-----PTAEHRICLRHLY

Query:  A---------NLKKHFKAKATT--RSMFDNEMKKLVQLDKSAFDWVNAVPTKYWA
        +         NL+   +   +T  +  FD+ M  + + +  A+ W++ +P   WA
Subjt:  A---------NLKKHFKAKATT--RSMFDNEMKKLVQLDKSAFDWVNAVPTKYWA

AT1G76320.1 FAR1-related sequence 44.6e-0520.53Show/hide
Query:  GCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASW-------LANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTT
        GC   +H   R D   Y + S    H+        H   S          ++ L R +  P  D K +      DF     R Q  + R++ +D   G  
Subjt:  GCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASW-------LANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTT

Query:  KMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAV-VDVDQ
        ++    L ++   ++  NP      ++D  + H+  +    +V  K  +D   +    ++  +  +        ++  VG+  N  + P+ L   +  D 
Subjt:  KMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAV-VDVDQ

Query:  SKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANL---------------KKHFKA--KATTRSMFDNEMKKLV-QL
        +  ++ W +++ L  +G   QK  V +TDQ   +  AI  V P   H  CL H+   L               KK FK   ++ +   FD    KL+ + 
Subjt:  SKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANL---------------KKHFKA--KATTRSMFDNEMKKLV-QL

Query:  DKSAFDWVNAV--PTKYWAKYAFRSHSKCDMLLNNISESFNSYILEARRLCNAWKDAVQCLEGCSFLIDAVAHTNARSHF
              W+ ++    K+WA    R  +   + +   SES NS      R  +      + LEG   +++      A++ F
Subjt:  DKSAFDWVNAV--PTKYWAKYAFRSHSKCDMLLNNISESFNSYILEARRLCNAWKDAVQCLEGCSFLIDAVAHTNARSHF

AT1G76320.2 FAR1-related sequence 44.6e-0520.53Show/hide
Query:  GCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASW-------LANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTT
        GC   +H   R D   Y + S    H+        H   S          ++ L R +  P  D K +      DF     R Q  + R++ +D   G  
Subjt:  GCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASW-------LANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTT

Query:  KMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAV-VDVDQ
        ++    L ++   ++  NP      ++D  + H+  +    +V  K  +D   +    ++  +  +        ++  VG+  N  + P+ L   +  D 
Subjt:  KMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAV-VDVDQ

Query:  SKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANL---------------KKHFKA--KATTRSMFDNEMKKLV-QL
        +  ++ W +++ L  +G   QK  V +TDQ   +  AI  V P   H  CL H+   L               KK FK   ++ +   FD    KL+ + 
Subjt:  SKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANL---------------KKHFKA--KATTRSMFDNEMKKLV-QL

Query:  DKSAFDWVNAV--PTKYWAKYAFRSHSKCDMLLNNISESFNSYILEARRLCNAWKDAVQCLEGCSFLIDAVAHTNARSHF
              W+ ++    K+WA    R  +   + +   SES NS      R  +      + LEG   +++      A++ F
Subjt:  DKSAFDWVNAV--PTKYWAKYAFRSHSKCDMLLNNISESFNSYILEARRLCNAWKDAVQCLEGCSFLIDAVAHTNARSHF

AT4G15090.1 FRS (FAR1 Related Sequences) transcription factor family1.6e-0520.79Show/hide
Query:  QAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDA
        Q  + R +A++  EG +++    L +Y + +++ NP     + ++  Q       R+++      +D +L     ++  D  ++K      +   +G++ 
Subjt:  QAYRARKIAMDKLEGTTKMKYAKLWDYCEELRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDA

Query:  NDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHF
        +     L  A+V  D+S +++ W ++  L  +G    K  V +TDQ K L+ A+ ++ P   H   L H+   + ++F
Subjt:  NDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGWVFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTGGCTAGAAAGAAGGGAAGATGAAAGGAACGATCCTTTGAAAAATCGATTGGAGGGAGTAGAAGGGTTTTTCGGCGTTTTCTCTTTGGCTCTTTTTTGGCGTCC
TCTCCGATCTTCTCCAACGTTCGTCTTCTTCGAATACAAGCACTCTGAGAATGTATTAAATAATGCAGAAAATATTGTGGATAATGAAGTAATTGAGGAGACACCTCTAG
CCACAGAAGGGACACATAAGAGCTTAAAATTAGTTGACATTTGTGATATTAGTGACAACAATGTGCACATTGAAGAATTATTAAATATTGATTACGATGGAGATGATAAT
GAGAACCAAGATGCACATGAGGGTGGTGGTGCAAGTTTGAACTTCGGGAATGATGGTGTGAGTTTGAACTTAGAAGATGAAGAATTATGTAATTCTGAATGTGAGTCTTT
CAGTCATCTAAACTCACCAATTCAATCTGACGATGATAGTGTTGAGAAATCGCAACATTATGTTGAGTTTCACCCTTTTATGTTAAAGGAAAAGGTAGAGTTTACCTTGG
GGATGAAATTTGGAACATTGACCGAATTGAGGGATGCAATAAGAGAATACTCAGTGCGATGTGGATATGAGTTGATTTACCAGAAAAATGAATCAAGAAGGATTACTGTG
AGATGTCAAGAAGGGTGTAGTTGGAGGTTACATGCTTCAAATAGGCAAGATGATAGTACGTATCTAATTAAGTCACTATATAGTACACACTCTTGTGCTAGAAAGACAAC
AAATAAGCATGTAACTGCTAGTTGGCTTGCTAATAATTACTTAGACAGATTCAGGGCTCAACCAGACATGGATTGGAAAGCCATTGTAGATCAAGTGAGGAGTGATTTTA
ATATCTGCATCTCAAGGTATCAAGCATATAGAGCAAGGAAGATAGCAATGGATAAACTTGAAGGCACAACTAAGATGAAATATGCCAAGCTATGGGATTATTGCGAGGAA
TTGAGGAGATCTAATCCAGGGAGTACTTGCTTACTAAGTATTGATAGACCTCAAATTCATATGCCACCTATTTTTAGAAGTATGTATGTTTGTCTTAAAGCATGCAAGGA
TGGCTTTTTGCAAGGTTGTAGACCAATAATTGGGCTTGATGGGTGTCATTTAAAGGGTGCATCTCATGGACAAGTGTTGACTGCTGTGGGAATAGATGCAAATGATGGAA
TTTTTCCATTAGCTTTAGCGGTAGTGGATGTGGATCAGTCCAAGCAATCTTGGACTTGGTTCTTAGAGAATCTACTTGATGACATAGGACCGGTGGAACAAAAAGGATGG
GTATTTATGACTGATCAACAAAAGGGGTTAGTTCCTGCTATTGAAAAAGTGGCGCCTACTGCGGAACATAGGATTTGTTTACGCCATTTGTATGCTAATTTAAAAAAACA
TTTCAAAGCAAAGGCAACAACAAGGTCAATGTTTGACAATGAAATGAAAAAGCTTGTCCAGTTGGATAAGTCGGCTTTTGATTGGGTTAATGCAGTGCCTACTAAATATT
GGGCAAAATATGCATTTAGATCACATTCAAAGTGTGACATGTTGCTTAATAACATAAGTGAATCATTTAATTCCTACATATTGGAGGCTAGGAGATTATGCAATGCTTGG
AAGGATGCAGTGCAATGCTTGGAAGGATGTAGTTTTTTGATAGATGCGGTTGCTCATACTAATGCTCGTTCACATTTTGTGCTGGAATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGTGGCTAGAAAGAAGGGAAGATGAAAGGAACGATCCTTTGAAAAATCGATTGGAGGGAGTAGAAGGGTTTTTCGGCGTTTTCTCTTTGGCTCTTTTTTGGCGTCC
TCTCCGATCTTCTCCAACGTTCGTCTTCTTCGAATACAAGCACTCTGAGAATGTATTAAATAATGCAGAAAATATTGTGGATAATGAAGTAATTGAGGAGACACCTCTAG
CCACAGAAGGGACACATAAGAGCTTAAAATTAGTTGACATTTGTGATATTAGTGACAACAATGTGCACATTGAAGAATTATTAAATATTGATTACGATGGAGATGATAAT
GAGAACCAAGATGCACATGAGGGTGGTGGTGCAAGTTTGAACTTCGGGAATGATGGTGTGAGTTTGAACTTAGAAGATGAAGAATTATGTAATTCTGAATGTGAGTCTTT
CAGTCATCTAAACTCACCAATTCAATCTGACGATGATAGTGTTGAGAAATCGCAACATTATGTTGAGTTTCACCCTTTTATGTTAAAGGAAAAGGTAGAGTTTACCTTGG
GGATGAAATTTGGAACATTGACCGAATTGAGGGATGCAATAAGAGAATACTCAGTGCGATGTGGATATGAGTTGATTTACCAGAAAAATGAATCAAGAAGGATTACTGTG
AGATGTCAAGAAGGGTGTAGTTGGAGGTTACATGCTTCAAATAGGCAAGATGATAGTACGTATCTAATTAAGTCACTATATAGTACACACTCTTGTGCTAGAAAGACAAC
AAATAAGCATGTAACTGCTAGTTGGCTTGCTAATAATTACTTAGACAGATTCAGGGCTCAACCAGACATGGATTGGAAAGCCATTGTAGATCAAGTGAGGAGTGATTTTA
ATATCTGCATCTCAAGGTATCAAGCATATAGAGCAAGGAAGATAGCAATGGATAAACTTGAAGGCACAACTAAGATGAAATATGCCAAGCTATGGGATTATTGCGAGGAA
TTGAGGAGATCTAATCCAGGGAGTACTTGCTTACTAAGTATTGATAGACCTCAAATTCATATGCCACCTATTTTTAGAAGTATGTATGTTTGTCTTAAAGCATGCAAGGA
TGGCTTTTTGCAAGGTTGTAGACCAATAATTGGGCTTGATGGGTGTCATTTAAAGGGTGCATCTCATGGACAAGTGTTGACTGCTGTGGGAATAGATGCAAATGATGGAA
TTTTTCCATTAGCTTTAGCGGTAGTGGATGTGGATCAGTCCAAGCAATCTTGGACTTGGTTCTTAGAGAATCTACTTGATGACATAGGACCGGTGGAACAAAAAGGATGG
GTATTTATGACTGATCAACAAAAGGGGTTAGTTCCTGCTATTGAAAAAGTGGCGCCTACTGCGGAACATAGGATTTGTTTACGCCATTTGTATGCTAATTTAAAAAAACA
TTTCAAAGCAAAGGCAACAACAAGGTCAATGTTTGACAATGAAATGAAAAAGCTTGTCCAGTTGGATAAGTCGGCTTTTGATTGGGTTAATGCAGTGCCTACTAAATATT
GGGCAAAATATGCATTTAGATCACATTCAAAGTGTGACATGTTGCTTAATAACATAAGTGAATCATTTAATTCCTACATATTGGAGGCTAGGAGATTATGCAATGCTTGG
AAGGATGCAGTGCAATGCTTGGAAGGATGTAGTTTTTTGATAGATGCGGTTGCTCATACTAATGCTCGTTCACATTTTGTGCTGGAATTCTAA
Protein sequenceShow/hide protein sequence
MLWLERREDERNDPLKNRLEGVEGFFGVFSLALFWRPLRSSPTFVFFEYKHSENVLNNAENIVDNEVIEETPLATEGTHKSLKLVDICDISDNNVHIEELLNIDYDGDDN
ENQDAHEGGGASLNFGNDGVSLNLEDEELCNSECESFSHLNSPIQSDDDSVEKSQHYVEFHPFMLKEKVEFTLGMKFGTLTELRDAIREYSVRCGYELIYQKNESRRITV
RCQEGCSWRLHASNRQDDSTYLIKSLYSTHSCARKTTNKHVTASWLANNYLDRFRAQPDMDWKAIVDQVRSDFNICISRYQAYRARKIAMDKLEGTTKMKYAKLWDYCEE
LRRSNPGSTCLLSIDRPQIHMPPIFRSMYVCLKACKDGFLQGCRPIIGLDGCHLKGASHGQVLTAVGIDANDGIFPLALAVVDVDQSKQSWTWFLENLLDDIGPVEQKGW
VFMTDQQKGLVPAIEKVAPTAEHRICLRHLYANLKKHFKAKATTRSMFDNEMKKLVQLDKSAFDWVNAVPTKYWAKYAFRSHSKCDMLLNNISESFNSYILEARRLCNAW
KDAVQCLEGCSFLIDAVAHTNARSHFVLEF