CuGenDBv2

Sed0016998 (gene) of Chayote v1 genome

Gene ID: Sed0016998
Organism: Sechium edule (Chayote v1)
Description: Exonuclease domain-containing protein
Genome location: LG08:1335962..1339961
RNA-Seq Expression: Sed0016998
Synteny: Sed0016998
Gene Ontology terms:
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004386 - helicase activity (molecular function)
GO:0008408 - 3'-5' exonuclease activity (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR013520 - Exonuclease, RNase T/DNA polymerase III
IPR036397 - Ribonuclease H superfamily
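
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, it can be split into its parts and the length of the gene span computed. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seq_id, start, end = parse_location("LG08:1335962..1339961")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the gene spans 4000 bp on LG08; with 0-based, half-open coordinates the span would instead be `end - start`.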


Homology
GenBank top hits (e value, % identity, alignment)
XP_022921575.1 protein NEN1-like [Cucurbita moschata] | e value: 1.7e-247 | % identity: 86.21
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P   RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKL ELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+S+PTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI MPAPQPKGTIDSLALLTQKFGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL
        AVTR RAAKSSPQG N+NNS                SS+SR GG+PISL D+QGEAHPILSLVT SSEDG SNL A+SDATESD+FN+  LSDQ+ GESL
Subjt:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL

Query:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP
        ETDTSM EEH SVSTEAS TE+TSTS+S+ TEFLEPDQVSV SITASF+PFFRGSQRIQLSHKDDCLQLLCNNL+VRFGISTKFTDYAGRPRL+FVVDVP
Subjt:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP

Query:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL
        PSLCRVLEASDGVAQRLFSESGSGSEWRPVV+RKNGYFNYPTMR+HIPTAVSGD+AIYATEMYQKE+S  AQRLIFS+FDA EL+SLINPG IIDAFISL
Subjt:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL

Query:  DTYDYQQSAGIRLVAKKLIIHS
        DTYDYQQSAGIRLVAKKLIIHS
Subjt:  DTYDYQQSAGIRLVAKKLIIHS

XP_022987423.1 protein NEN1-like [Cucurbita maxima] | e value: 3.8e-247 | % identity: 86.21
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P   RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKL ELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+S+PTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI MPAPQPKGTIDSLALLTQKFGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL
        AVTR RAAKSSPQG+N NNS                SS+SR GG+PISL D+QGEAHPILSLVT SSEDG SNL A+SDATESD+FN+  LSDQ+ GESL
Subjt:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL

Query:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP
        ETDTSM EEH  VSTEAS TE+TSTSLSA TEFLEPDQVSV SITASF+PFFRGSQRIQLSHKDDCLQLLCNNL+VRFGISTKFTDYAGRPRL+FVVDVP
Subjt:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP

Query:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL
        PSLCRVLEASDGVAQRLFSESGSGSEWRPVV+RKNGYFNYPTMR+HIPTAV GD+AIYATEMYQKE+S  AQRLIFS+FDA EL+SLINPG IIDAFISL
Subjt:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL

Query:  DTYDYQQSAGIRLVAKKLIIHS
        DTYDYQQSAGIRLVAKKLIIHS
Subjt:  DTYDYQQSAGIRLVAKKLIIHS

XP_023515369.1 protein NEN1-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.0e-244 | % identity: 83.4
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P   RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKL ELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+S+PTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI MPAP+PKGTIDSLALLTQKFGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSW+SPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNS----------------------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAF
        AVTR RAAKSSPQG+N+NNS                              SS+SR GG+PISL D+QGEAHPILSLVT SSEDG SNL A+SDATESD+F
Subjt:  AVTRCRAAKSSPQGVNSNNS----------------------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAF

Query:  NIHILSDQLTGESLETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTD
        N+  LSDQ+ GESLET +SM EEH  VSTEAS TE+TSTSLSA TEFLEPDQVSV SITASF+PFFRGSQRIQLSHKDDCLQLLCNNL+VRFGISTKFTD
Subjt:  NIHILSDQLTGESLETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTD

Query:  YAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELES
        YAGRPRL+FVVDVPPSLCRVLEASDGVAQRLFSESGSGSEWRPVV+RKNGYFNYPTMR+HIPTAVSGD+AIYATEMYQKE+S  AQRLIFS+FDA EL+S
Subjt:  YAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELES

Query:  LINPGVIIDAFISLDTYDYQQSAGIRLVAKKLIIHS
        LINPG IIDAFISLDTYDYQQSAGIRLVAKKLIIHS
Subjt:  LINPGVIIDAFISLDTYDYQQSAGIRLVAKKLIIHS

XP_023515370.1 protein NEN1-like isoform X2 [Cucurbita pepo subsp. pepo] | e value: 7.1e-246 | % identity: 85.63
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P   RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKL ELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+S+PTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI MPAP+PKGTIDSLALLTQKFGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSW+SPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL
        AVTR RAAKSSPQG+N+NNS                SS+SR GG+PISL D+QGEAHPILSLVT SSEDG SNL A+SDATESD+FN+  LSDQ+ GESL
Subjt:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL

Query:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP
        ET +SM EEH  VSTEAS TE+TSTSLSA TEFLEPDQVSV SITASF+PFFRGSQRIQLSHKDDCLQLLCNNL+VRFGISTKFTDYAGRPRL+FVVDVP
Subjt:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP

Query:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL
        PSLCRVLEASDGVAQRLFSESGSGSEWRPVV+RKNGYFNYPTMR+HIPTAVSGD+AIYATEMYQKE+S  AQRLIFS+FDA EL+SLINPG IIDAFISL
Subjt:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL

Query:  DTYDYQQSAGIRLVAKKLIIHS
        DTYDYQQSAGIRLVAKKLIIHS
Subjt:  DTYDYQQSAGIRLVAKKLIIHS

XP_038880150.1 protein NEN1-like [Benincasa hispida] | e value: 6.5e-247 | % identity: 87.57
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M   E RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKLVELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+SSPTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI +PAP+PKGTIDSLALLTQ+FGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSM-EEHVSVST
        AVTR R AKSSPQG +SNN  HSS+SR GGNPISL D QGE HPILSLVT SSEDG SNLAES ATESD+FN+  LSDQ+TGESLETD SM EEH +VST
Subjt:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSM-EEHVSVST

Query:  EASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQ
        EAS +EDTSTSLSA TEFLEPDQVSV  ITASF+PFFRGSQRIQLSHKDDCLQLLCNNL+VRFGISTKFTDYAGRP+LSFVVDVPPSLC VLEASDGVAQ
Subjt:  EASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQ

Query:  RLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVA
        RLFS+SGSGSEWRPVV+RKN YFNYPTMR+HIPTAVSGD+AIYATEMYQKESS  AQRLIFS+FDAAEL+SLI PG I+DAFISLDTYDYQQSAGIRLVA
Subjt:  RLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVA

Query:  KKLIIHS
        K+LIIHS
Subjt:  KKLIIHS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LSC3 Exonuclease domain-containing protein | e value: 7.0e-239 | % identity: 84.19
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P E RSEIAFFDVETTVPTR+GQ FSILEFGAILVCP+KLVELESYSTLV+PS+LSLI+SLSVRCNGITRDAV+SSPTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI +PAP+PKGTIDSLALLTQ+FGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTE
        AVTR RAAKSSPQG +SNN+ H+S+S  GGNPISL  +QGE HPILSLVT  SED  S LAE  ATESD+FN+H  SDQ+TG +LETD +MEEH +VSTE
Subjt:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTE

Query:  ASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQR
        AS +EDTSTSL   T+FLEPDQVSV  ITASF+PFFRGSQRIQL HKDDCLQLLCNNL+VRFGISTKFTDYAGRPRLSFVVDVPP+LC VLEASDGVAQR
Subjt:  ASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQR

Query:  LFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAK
        LFS+SGSGSEWRP V RKNGYFNYPTMR+HIPTAVSGD+A YATEM+QKE+S   QRLIFS+FDAAEL+SLI PG I+DAFISLDTYDYQQSAGIRLVAK
Subjt:  LFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAK

Query:  KLIIHS
        KLII S
Subjt:  KLIIHS

A0A1S3BWV4 protein NEN1 | e value: 4.0e-242 | % identity: 85.01
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P E RSEIAFFDVETTVPTR+GQ FSILEFGAILVCP+KLVELESYSTLV+PS+LSLI+SLSVRCNGITRDAV+SSPTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFA+I +PAP+PKGTIDSLALLTQ+FGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSME-EHVSVST
        AVTR RAAKSSPQG +SNN+ H+S+S  GGNPISL D+QGEAHPILSLVT SSEDG  NLAE  ATESD+FN+H +SDQ+TG +LETD +ME EH +VST
Subjt:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSME-EHVSVST

Query:  EASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQ
        EAS +EDTSTSL   T+FLEPDQVSV  ITASF+PFFRGSQRIQL HKDDCLQLLCNNL+VRFGISTKFTDYAGRPRLSFVVDVPPSLC VLEASDGVAQ
Subjt:  EASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQ

Query:  RLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVA
        RLFS+SGSGSEWRP V RKNGYFNYPTMR+HIPTAVSGD+A YATEMYQK++S   QRLIFS+FDAAEL+SLI PG I+DAFISLDTYDYQQSAGIRLVA
Subjt:  RLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVA

Query:  KKLIIHS
        KKLIIHS
Subjt:  KKLIIHS

A0A6J1C0W8 protein NEN1-like | e value: 5.0e-237 | % identity: 84.58
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P E RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKLVELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+S+P F+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIRE+FAEI MPAP+PKGTIDSLALLTQKFGRRAGDMKMA+LATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTE
         VTR RAAKSSP G NSNN+  SS+SR GGNP SL D+QGE HPILSLVT SSEDG S+L ESDATESD+FN+H LSD+ T ES + D SMEEH SVSTE
Subjt:  AVTRCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTE

Query:  ASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQR
        AS  ED S SLS+ TEFLEPDQVSV SITASF+PFFRGSQRIQLSH D+CLQLLCNNLKVRFGISTKFTD AGRPRLSFVVDVPPSLC VLEASDGVAQR
Subjt:  ASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQR

Query:  LFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAK
        L S+SGS S+WRP+V+RKNGYF+YPTMRIHIPTAV GD AIY TEMYQKE S  AQRLIFS+FDAAEL+SLINP   +DAFISLDTYDYQQSAGIRLVAK
Subjt:  LFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAK

Query:  KLIIHS
        KLI HS
Subjt:  KLIIHS

A0A6J1E665 protein NEN1-like | e value: 8.2e-248 | % identity: 86.21
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P   RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKL ELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+S+PTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI MPAPQPKGTIDSLALLTQKFGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL
        AVTR RAAKSSPQG N+NNS                SS+SR GG+PISL D+QGEAHPILSLVT SSEDG SNL A+SDATESD+FN+  LSDQ+ GESL
Subjt:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL

Query:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP
        ETDTSM EEH SVSTEAS TE+TSTS+S+ TEFLEPDQVSV SITASF+PFFRGSQRIQLSHKDDCLQLLCNNL+VRFGISTKFTDYAGRPRL+FVVDVP
Subjt:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP

Query:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL
        PSLCRVLEASDGVAQRLFSESGSGSEWRPVV+RKNGYFNYPTMR+HIPTAVSGD+AIYATEMYQKE+S  AQRLIFS+FDA EL+SLINPG IIDAFISL
Subjt:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL

Query:  DTYDYQQSAGIRLVAKKLIIHS
        DTYDYQQSAGIRLVAKKLIIHS
Subjt:  DTYDYQQSAGIRLVAKKLIIHS

A0A6J1JJE4 protein NEN1-like | e value: 1.8e-247 | % identity: 86.21
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        M P   RSEIAFFDVETTVPTR+GQ FSILEFGAILVCPRKL ELESYSTLVRPS+LSLI+SLSVRCNGITRDAV+S+PTF+QIADRV+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NILRFDCARIREAFAEI MPAPQPKGTIDSLALLTQKFGRRAGDMKMATLA+YFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVF ENSWVSPN
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL
        AVTR RAAKSSPQG+N NNS                SS+SR GG+PISL D+QGEAHPILSLVT SSEDG SNL A+SDATESD+FN+  LSDQ+ GESL
Subjt:  AVTRCRAAKSSPQGVNSNNS--------------PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNL-AESDATESDAFNIHILSDQLTGESL

Query:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP
        ETDTSM EEH  VSTEAS TE+TSTSLSA TEFLEPDQVSV SITASF+PFFRGSQRIQLSHKDDCLQLLCNNL+VRFGISTKFTDYAGRPRL+FVVDVP
Subjt:  ETDTSM-EEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVP

Query:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL
        PSLCRVLEASDGVAQRLFSESGSGSEWRPVV+RKNGYFNYPTMR+HIPTAV GD+AIYATEMYQKE+S  AQRLIFS+FDA EL+SLINPG IIDAFISL
Subjt:  PSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISL

Query:  DTYDYQQSAGIRLVAKKLIIHS
        DTYDYQQSAGIRLVAKKLIIHS
Subjt:  DTYDYQQSAGIRLVAKKLIIHS

SwissProt top hits (e value, % identity, alignment)
F4JJ23 Protein NEN4 | e value: 3.3e-68 | % identity: 61.84
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        MA     +EI FFD+ET VP + GQ F ILEFGAI+VCP+KL ELES++TL++P +LS+++  S R +GITR  V ++P+F  +A+++  LL+GRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSW-VSP
        NI RFDC RI+EAFAEI   AP+P G IDSL LL+ KFG+RAG+MKMA+LA YFG+G Q HRSLDDVRMNLEVLK+CATVLFLES+LP   LE  W  S 
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSW-VSP

Query:  NAVTRCR
          +TR R
Subjt:  NAVTRCR

Q0V842 Protein NEN2 | e value: 2.5e-145 | % identity: 55.86
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        + PAE RSEIAFFDVETT+P R GQG++ILEFG+ILVCP+KLVEL++YS LVRP+NL+LI   SV+CNGI R+ V S+ TF+ IAD V+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NIL+FDC RIREAFAEI    P+PKGTIDSLALLTQ+FGRRAGDMKMATLATYFG+G QTHRSLDDVRMN EVLKYCATVLFLESSLP+  +ENS  +  
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVT---RCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSV
          T   R R  K SP       SP  ++ +TG N  ++        PILS V+S+              ++D F++  L +++  E L++D  MEE  + 
Subjt:  AVT---RCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSV

Query:  STEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASD
         +E   +E T         F+E D++SV SI A+ +P + GSQ  ++QL   D  LQL C  LKVRFGI+ KF D AGR RL+FV+D+ PSLC VL+  D
Subjt:  STEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASD

Query:  GVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS--SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSA
          AQ +  +SGSGS+W P+VI   G+ N PT RIHIPT ++GDI  YA E++QKE S  +  Q+LI S   A E+ESL+NP  ++DAF+SL+ YDYQQ A
Subjt:  GVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS--SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSA

Query:  GIRLVAKKLIIH
        GIRLVA+KL+IH
Subjt:  GIRLVAKKLIIH

Q9CA74 Protein NEN3 | e value: 5.5e-140 | % identity: 53.2
Query:  RSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILRFD
        RSEIAFFD+ET VPT+ G+ F+ILEFGAILVCPR+L EL SYSTLVRP++LSLI++L+ R +GITRD V+S+ TFS+IAD+V+D+LHGRIWAGHNI+RFD
Subjt:  RSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILRFD

Query:  CARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTRCR
        C RIREAFAEI +  P+PK TIDSL+LL+QKFG+RAGDMKMA+LATYFG+G Q HRSLDDVRMNLEV+KYCATVLFLESS+P++  + SW SP    R R
Subjt:  CARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTRCR

Query:  A-AKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTEASHTE
        +  K    GV  +++  SS+ +T  +  S+     + HPI+SL+T  SE   S+         D  +I  L  +L   +L+TD +  E      +   + 
Subjt:  A-AKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTEASHTE

Query:  DTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSES
         +  S +    FL  ++VSV SI AS +PF+RGS R++L H D  L L  ++LKVRFGIS KF D+AGRP+L+ +VD P  LC++L+A D  A  L ++S
Subjt:  DTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSES

Query:  GSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS-SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKKLII
         + S+WRP VIRK G+ NYPT R+HI +  +GD  +  T++YQKE      Q+L  S  +  +LES + PG ++DAF SL+ Y YQQ AGIRL  KKL+I
Subjt:  GSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS-SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKKLII

Q9FLR0 Protein NEN1 | e value: 1.2e-142 | % identity: 55.45
Query:  EVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILR
        E RSEIAFFDVETTVP ++GQ F+ILEFG+ILVCP+KL EL SY+TLV+P++LSLI+SLSVRCNGI RD VV +P F+ IAD V+D+LHGRIWAGHNILR
Subjt:  EVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILR

Query:  FDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTR
        FDCARIREAFAEI    P+PKG IDSL LLTQKFGRRAGDMKMATLA YFG+G QTHRSLDDVRMNLEVLKYCATVLFLESSLP   ++NS VSP  ++ 
Subjt:  FDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTR

Query:  CRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGES-LETDTSMEEHVSVSTEASH
         R   +S +G                                + VT+S      +++E+ A + D FN+ +L +++  ++ +++D  MEE     ++   
Subjt:  CRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGES-LETDTSMEEHVSVSTEASH

Query:  TEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRL
        +E+TS        FL PD +S+ +I A   PF+ GSQ  +++L H D  LQL C+ LK+RFG++ KF D  GR RL+FVVD+ PSL  +LEA D  AQ+L
Subjt:  TEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRL

Query:  FSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKK
          +SGS SEW PVV    G+ NYP  RIHI T ++GD A YATE++Q+ESS   Q+LIFS  +  ELESL+  G ++DAF+SL+ YDYQQ AGIRLVAKK
Subjt:  FSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKK

Query:  LIIHS
        L+I S
Subjt:  LIIHS

Arabidopsis top hits (e value, % identity, alignment)
AT1G74390.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 3.9e-141 | % identity: 53.2
Query:  RSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILRFD
        RSEIAFFD+ET VPT+ G+ F+ILEFGAILVCPR+L EL SYSTLVRP++LSLI++L+ R +GITRD V+S+ TFS+IAD+V+D+LHGRIWAGHNI+RFD
Subjt:  RSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILRFD

Query:  CARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTRCR
        C RIREAFAEI +  P+PK TIDSL+LL+QKFG+RAGDMKMA+LATYFG+G Q HRSLDDVRMNLEV+KYCATVLFLESS+P++  + SW SP    R R
Subjt:  CARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTRCR

Query:  A-AKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTEASHTE
        +  K    GV  +++  SS+ +T  +  S+     + HPI+SL+T  SE   S+         D  +I  L  +L   +L+TD +  E      +   + 
Subjt:  A-AKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTEASHTE

Query:  DTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSES
         +  S +    FL  ++VSV SI AS +PF+RGS R++L H D  L L  ++LKVRFGIS KF D+AGRP+L+ +VD P  LC++L+A D  A  L ++S
Subjt:  DTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSES

Query:  GSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS-SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKKLII
         + S+WRP VIRK G+ NYPT R+HI +  +GD  +  T++YQKE      Q+L  S  +  +LES + PG ++DAF SL+ Y YQQ AGIRL  KKL+I
Subjt:  GSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS-SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKKLII

AT1G74390.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 1.5e-132 | % identity: 51.2
Query:  RSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILRFD
        RSEIAFFD+ET VPT+ G+ F+ILEFGAILVCPR+L EL SYSTLVRP++LSLI++L+ R +GITRD V+S+ TFS+IAD+V+D+LHGRIWAGHNI+RFD
Subjt:  RSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILRFD

Query:  CARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTRCR
        C RIREAFAEI +  P+PK TIDSL+LL+QKFG+RAGDMKMA+LATYFG+G Q HRSLDDVRMNLE           ESS+P++  + SW SP    R R
Subjt:  CARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTRCR

Query:  A-AKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTEASHTE
        +  K    GV  +++  SS+ +T  +  S+     + HPI+SL+T  SE   S+         D  +I  L  +L   +L+TD +  E      +   + 
Subjt:  A-AKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTEASHTE

Query:  DTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSES
         +  S +    FL  ++VSV SI AS +PF+RGS R++L H D  L L  ++LKVRFGIS KF D+AGRP+L+ +VD P  LC++L+A D  A  L ++S
Subjt:  DTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSES

Query:  GSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS-SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKKLII
         + S+WRP VIRK G+ NYPT R+HI +  +GD  +  T++YQKE      Q+L  S  +  +LES + PG ++DAF SL+ Y YQQ AGIRL  KKL+I
Subjt:  GSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS-SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKKLII

AT4G39810.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 2.3e-69 | % identity: 61.84
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        MA     +EI FFD+ET VP + GQ F ILEFGAI+VCP+KL ELES++TL++P +LS+++  S R +GITR  V ++P+F  +A+++  LL+GRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSW-VSP
        NI RFDC RI+EAFAEI   AP+P G IDSL LL+ KFG+RAG+MKMA+LA YFG+G Q HRSLDDVRMNLEVLK+CATVLFLES+LP   LE  W  S 
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSW-VSP

Query:  NAVTRCR
          +TR R
Subjt:  NAVTRCR

AT5G07710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 8.4e-144 | % identity: 55.45
Query:  EVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILR
        E RSEIAFFDVETTVP ++GQ F+ILEFG+ILVCP+KL EL SY+TLV+P++LSLI+SLSVRCNGI RD VV +P F+ IAD V+D+LHGRIWAGHNILR
Subjt:  EVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILR

Query:  FDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTR
        FDCARIREAFAEI    P+PKG IDSL LLTQKFGRRAGDMKMATLA YFG+G QTHRSLDDVRMNLEVLKYCATVLFLESSLP   ++NS VSP  ++ 
Subjt:  FDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTR

Query:  CRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGES-LETDTSMEEHVSVSTEASH
         R   +S +G                                + VT+S      +++E+ A + D FN+ +L +++  ++ +++D  MEE     ++   
Subjt:  CRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGES-LETDTSMEEHVSVSTEASH

Query:  TEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRL
        +E+TS        FL PD +S+ +I A   PF+ GSQ  +++L H D  LQL C+ LK+RFG++ KF D  GR RL+FVVD+ PSL  +LEA D  AQ+L
Subjt:  TEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRL

Query:  FSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKK
          +SGS SEW PVV    G+ NYP  RIHI T ++GD A YATE++Q+ESS   Q+LIFS  +  ELESL+  G ++DAF+SL+ YDYQQ AGIRLVAKK
Subjt:  FSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKK

Query:  LIIHS
        L+I S
Subjt:  LIIHS

AT5G61390.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 1.8e-146 | % identity: 55.86
Query:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH
        + PAE RSEIAFFDVETT+P R GQG++ILEFG+ILVCP+KLVEL++YS LVRP+NL+LI   SV+CNGI R+ V S+ TF+ IAD V+D+LHGRIWAGH
Subjt:  MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGH

Query:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN
        NIL+FDC RIREAFAEI    P+PKGTIDSLALLTQ+FGRRAGDMKMATLATYFG+G QTHRSLDDVRMN EVLKYCATVLFLESSLP+  +ENS  +  
Subjt:  NILRFDCARIREAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPN

Query:  AVT---RCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSV
          T   R R  K SP       SP  ++ +TG N  ++        PILS V+S+              ++D F++  L +++  E L++D  MEE  + 
Subjt:  AVT---RCRAAKSSPQGVNSNNSPHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSV

Query:  STEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASD
         +E   +E T         F+E D++SV SI A+ +P + GSQ  ++QL   D  LQL C  LKVRFGI+ KF D AGR RL+FV+D+ PSLC VL+  D
Subjt:  STEASHTEDTSTSLSAPTEFLEPDQVSVPSITASFIPFFRGSQ--RIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASD

Query:  GVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS--SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSA
          AQ +  +SGSGS+W P+VI   G+ N PT RIHIPT ++GDI  YA E++QKE S  +  Q+LI S   A E+ESL+NP  ++DAF+SL+ YDYQQ A
Subjt:  GVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIAIYATEMYQKESS--SDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSA

Query:  GIRLVAKKLIIH
        GIRLVA+KL+IH
Subjt:  GIRLVAKKLIIH


Sequences
CDS sequence
ATGGCTCCGGCGGAGGTCCGATCGGAGATTGCCTTCTTCGACGTGGAGACCACAGTCCCGACCCGCAAAGGCCAAGGGTTTTCAATTCTGGAGTTCGGAGCCATTCTGGT
GTGCCCTAGAAAGCTGGTCGAACTGGAAAGCTATTCCACTCTCGTTCGCCCCTCCAATCTCTCGCTCATTAACTCCTTGTCGGTGCGATGCAATGGCATTACCAGAGACG
CCGTCGTTTCTTCCCCTACTTTCTCCCAAATTGCCGATAGGGTTTTCGATCTCCTTCACGGACGGATATGGGCAGGGCATAACATATTGAGATTTGATTGTGCTCGAATA
AGGGAGGCTTTTGCGGAAATTGATATGCCAGCCCCGCAACCGAAAGGTACTATTGATTCTTTGGCCTTGTTGACTCAGAAGTTTGGAAGAAGAGCTGGTGACATGAAGAT
GGCGACTCTTGCTACTTATTTCGGGATAGGACAACAAACTCACAGGAGTTTGGATGATGTCAGGATGAATCTCGAAGTTCTTAAGTATTGTGCAACTGTCTTGTTTCTGG
AATCAAGTCTACCCGAAGTTTTTCTGGAGAACAGTTGGGTTTCACCAAATGCCGTTACAAGGTGTCGTGCTGCAAAATCATCTCCACAGGGAGTTAATTCGAATAATAGC
CCTCATTCATCAAATTCAAGGACAGGTGGCAATCCAATATCTCTGAAGGATGAACAAGGGGAAGCTCATCCGATATTATCTCTTGTAACTAGCAGCTCCGAGGACGGAAC
CTCAAATCTGGCCGAATCTGATGCAACTGAATCAGATGCTTTTAACATCCACATTCTCAGTGATCAACTAACAGGAGAATCACTCGAAACAGATACCAGTATGGAAGAAC
ATGTCTCAGTATCTACTGAAGCATCTCACACAGAGGATACTTCCACAAGTCTCAGCGCCCCTACAGAATTCTTGGAGCCAGATCAGGTCTCTGTCCCATCCATCACAGCA
TCTTTCATTCCATTCTTTCGTGGTAGCCAAAGAATACAATTATCGCACAAAGATGACTGTTTACAGCTTCTTTGTAATAATCTGAAAGTTCGATTTGGCATAAGCACGAA
ATTCACTGATTACGCTGGGCGTCCAAGGCTGAGTTTTGTGGTCGACGTACCACCGAGTTTATGCAGGGTTCTTGAAGCATCCGATGGCGTGGCTCAGAGATTATTTTCAG
AATCTGGCAGCGGCTCCGAGTGGAGGCCTGTCGTGATAAGAAAGAATGGCTATTTCAACTACCCGACAATGAGAATACACATTCCAACTGCAGTAAGTGGAGATATAGCT
ATTTATGCTACAGAGATGTACCAAAAGGAATCATCAAGTGATGCACAAAGGCTGATATTCAGTCAATTTGATGCTGCAGAGCTTGAAAGCTTGATCAACCCTGGAGTTAT
CATAGATGCATTCATTTCATTGGATACATATGATTATCAACAGAGTGCAGGCATTAGATTGGTAGCAAAAAAGTTGATCATCCATTCCTCTTAA
mRNA sequence
ACATTTTGTAAGAGTACTTTTCTAGGTTATTTGAATAAAAATAAAGAAATTGGTTTCTTCTTCATCTTCACCTTCAGACTCATCTGTACGGCGGTGATTATCGTTATAAA
ACCTTCATTTCTTCGCGCCTTCCTTCTTCTTCTTCTTCTTCAACTTCGAGAGAAATCGAATCAGCTTCTCATCTCTTCTCCGATCGAAAATTTGATCGGTTCAATCATGG
CTCCGGCGGAGGTCCGATCGGAGATTGCCTTCTTCGACGTGGAGACCACAGTCCCGACCCGCAAAGGCCAAGGGTTTTCAATTCTGGAGTTCGGAGCCATTCTGGTGTGC
CCTAGAAAGCTGGTCGAACTGGAAAGCTATTCCACTCTCGTTCGCCCCTCCAATCTCTCGCTCATTAACTCCTTGTCGGTGCGATGCAATGGCATTACCAGAGACGCCGT
CGTTTCTTCCCCTACTTTCTCCCAAATTGCCGATAGGGTTTTCGATCTCCTTCACGGACGGATATGGGCAGGGCATAACATATTGAGATTTGATTGTGCTCGAATAAGGG
AGGCTTTTGCGGAAATTGATATGCCAGCCCCGCAACCGAAAGGTACTATTGATTCTTTGGCCTTGTTGACTCAGAAGTTTGGAAGAAGAGCTGGTGACATGAAGATGGCG
ACTCTTGCTACTTATTTCGGGATAGGACAACAAACTCACAGGAGTTTGGATGATGTCAGGATGAATCTCGAAGTTCTTAAGTATTGTGCAACTGTCTTGTTTCTGGAATC
AAGTCTACCCGAAGTTTTTCTGGAGAACAGTTGGGTTTCACCAAATGCCGTTACAAGGTGTCGTGCTGCAAAATCATCTCCACAGGGAGTTAATTCGAATAATAGCCCTC
ATTCATCAAATTCAAGGACAGGTGGCAATCCAATATCTCTGAAGGATGAACAAGGGGAAGCTCATCCGATATTATCTCTTGTAACTAGCAGCTCCGAGGACGGAACCTCA
AATCTGGCCGAATCTGATGCAACTGAATCAGATGCTTTTAACATCCACATTCTCAGTGATCAACTAACAGGAGAATCACTCGAAACAGATACCAGTATGGAAGAACATGT
CTCAGTATCTACTGAAGCATCTCACACAGAGGATACTTCCACAAGTCTCAGCGCCCCTACAGAATTCTTGGAGCCAGATCAGGTCTCTGTCCCATCCATCACAGCATCTT
TCATTCCATTCTTTCGTGGTAGCCAAAGAATACAATTATCGCACAAAGATGACTGTTTACAGCTTCTTTGTAATAATCTGAAAGTTCGATTTGGCATAAGCACGAAATTC
ACTGATTACGCTGGGCGTCCAAGGCTGAGTTTTGTGGTCGACGTACCACCGAGTTTATGCAGGGTTCTTGAAGCATCCGATGGCGTGGCTCAGAGATTATTTTCAGAATC
TGGCAGCGGCTCCGAGTGGAGGCCTGTCGTGATAAGAAAGAATGGCTATTTCAACTACCCGACAATGAGAATACACATTCCAACTGCAGTAAGTGGAGATATAGCTATTT
ATGCTACAGAGATGTACCAAAAGGAATCATCAAGTGATGCACAAAGGCTGATATTCAGTCAATTTGATGCTGCAGAGCTTGAAAGCTTGATCAACCCTGGAGTTATCATA
GATGCATTCATTTCATTGGATACATATGATTATCAACAGAGTGCAGGCATTAGATTGGTAGCAAAAAAGTTGATCATCCATTCCTCTTAATGGTAAATTAAGTTGTCCTA
TGATTGTAGTAGCTCTGTTGTTATATTATGCTTAGTTCATTGATGACTAACATTCAAAACTCCAATGTATACCCTTTCATTATTTCAGGGAAAAAACACATCCCTTTTCC
CTATGTCTGCATTGTTTGATTGGATAAAGTACTTATTTCTTTAAGTATTTGAGTATTGTATTGCATGCATATAGTTTGGCTTATGGTTTGAAGATTTGGGTGAATGTGGA
TAAGAGAGAGAGAGAAACAAATGAAGCTGCAACTGACAAAACTAAAGCAAAGGGAAAGTGAAATAAGAACCTGTCTTTAAAATAACCTTTTGAATTCTTGCTGTCGCAAC
TTAATCAAATTATATCATATCATAGGCCTCCTTATTCAAGATATAGAAACTTAGTGATGGATTTATATGAGTAAAACTTGGGTGCTTCCACCAACGGTGAATGACGCAAT
TGGTTATCTTATCTGGTGTTAATTCATTTCAACTTCATGGTCATGAGTTCGAG
Protein sequence
MAPAEVRSEIAFFDVETTVPTRKGQGFSILEFGAILVCPRKLVELESYSTLVRPSNLSLINSLSVRCNGITRDAVVSSPTFSQIADRVFDLLHGRIWAGHNILRFDCARI
REAFAEIDMPAPQPKGTIDSLALLTQKFGRRAGDMKMATLATYFGIGQQTHRSLDDVRMNLEVLKYCATVLFLESSLPEVFLENSWVSPNAVTRCRAAKSSPQGVNSNNS
PHSSNSRTGGNPISLKDEQGEAHPILSLVTSSSEDGTSNLAESDATESDAFNIHILSDQLTGESLETDTSMEEHVSVSTEASHTEDTSTSLSAPTEFLEPDQVSVPSITA
SFIPFFRGSQRIQLSHKDDCLQLLCNNLKVRFGISTKFTDYAGRPRLSFVVDVPPSLCRVLEASDGVAQRLFSESGSGSEWRPVVIRKNGYFNYPTMRIHIPTAVSGDIA
IYATEMYQKESSSDAQRLIFSQFDAAELESLINPGVIIDAFISLDTYDYQQSAGIRLVAKKLIIHSS