; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy4G083890 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy4G083890
OrganismCucumis hystrix (Cucumber (hystrix) v1)
Descriptionmediator of RNA polymerase II transcription subunit 15a-like
Genome locationchrH04:23100046..23108047
RNA-Seq ExpressionChy4G083890
SyntenyChy4G083890
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
GO:0008408 - 3'-5' exonuclease activity (molecular function)
GO:0031490 - chromatin DNA binding (molecular function)
InterPro domainsIPR002562 - 3'-5' exonuclease domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036529 - Coactivator CBP, KIX domain superfamily
IPR036546 - Mediator complex subunit 15, KIX domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443239.1 PREDICTED: uncharacterized protein LOC103486878 [Cucumis melo]0.072.16Show/hide
Query:  LDKKASMATPT-DWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP
        +DKKASMAT T DWRTEIT+ETRQ+   SI+M+L+ Q S + N+ +ISD ARKHEM LFS A S +EYL+ G GK+ KRENH+GSSS +A V+YPQYHQP
Subjt:  LDKKASMATPT-DWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP

Query:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN
                     TPQL RQHP VRQ HQQF MQNQ+ AS QNTSNSQ RPQGF RQD GIHLSSEMFTQHPNFVNLTTQV+KE +SEGF ASKS  QH 
Subjt:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN

Query:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN
        QH    +  A  ERIP+SEV HDAAFAEMEQLKKTFLP  IKAYEP+RKV H +   +  L+KT+E+IL FF S KEKIIASYTKE+F RCL+YIEQ GN
Subjt:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN

Query:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIRS---------------------KLHPQWIHGS
        TIK N NV NK SSLH GQPGLSGSRINHP+QQ  DNVKL CQSVIR TTGS  SS  +APQEKGS+RS                     K+HPQWIH S
Subjt:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIRS---------------------KLHPQWIHGS

Query:  GNTPSIYRSGMSLNPHLNSNFSH----------VAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----
        GNTP+ YRSGMSLN HLNSNFSH          VAERPRPT PCT PL+G ASP PSS IVGLEK SPNVTYHS  NF    HCNPYQLLHSK E     
Subjt:  GNTPSIYRSGMSLNPHLNSNFSH----------VAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----

Query:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP
                    +AEPTS GINGQ STYQA++RLLKAVGSSS+ ALRAAVSGITSVGYMEDA+IDPRC A VTNLRL++G GSSNNMKRKINAMALNNIP
Subjt:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP

Query:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY
        SP S+IPGSEETVTSRTKKLKK +DSSLLEE+RNINKQFIETVLELD+DENLN+RLANAGTVLRCSYSAV DGTN+    VKLPVLTMKLLVPLDYPEDY
Subjt:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY

Query:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
        PVFLSKFD  SSNVDEEC NLSN AMSMLRAFLR APECVSLEEYAR WDECARSV+S+YV+RAGGGSFSARYG+WEDSVA A
Subjt:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

XP_011652173.1 uncharacterized protein LOC105434992 [Cucumis sativus]0.096.12Show/hide
Query:  MATPTDWRTEITQETRQQIVRSIYMMLKEQPSELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAKSLLL
        MATPTDWRTEITQETRQQIVRSIYMMLKEQPSELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVV+YPQYHQPAEAKSLLL
Subjt:  MATPTDWRTEITQETRQQIVRSIYMMLKEQPSELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAKSLLL

Query:  QHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHNQHSMSGAF
        QHIQRTPQLHRQHPNVRQ HQQF MQNQ+GASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQ  QHSMSGA 
Subjt:  QHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHNQHSMSGAF

Query:  ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVV
        ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGL RRGL KTIE+ILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVV
Subjt:  ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVV

Query:  NKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQEKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLNSNFSHVAERPRPTN
        NKPSSLHDGQPGLSGSRINHPVQQ GDNVKLHCQSVIRTTTGSGSSSVAPQE GSIRSKLHPQWIHGSGNTP  YRSG+SLNPHLNSNFSHVAERPRPTN
Subjt:  NKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQEKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLNSNFSHVAERPRPTN

Query:  PCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGY
        PCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSS+EALRAAVSGITSVGY
Subjt:  PCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGY

Query:  MEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLAN
        MEDA+IDP+CRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEI GSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLN+RLAN
Subjt:  MEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLAN

Query:  AGTVLRCSYSAVSDGTNTVKLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVK
        AGTVLR SYSAVSDGTN+VKLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEE RNLSNGA+SMLRAFLR APECVSLE+YARAWDECARSVLSEYV+
Subjt:  AGTVLRCSYSAVSDGTNTVKLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVK

Query:  RAGGGSFSARYGSWEDSVAAA
        RAGGGSFSARYGSWEDSV AA
Subjt:  RAGGGSFSARYGSWEDSVAAA

XP_023528462.1 mediator of RNA polymerase II transcription subunit 15a-like isoform X1 [Cucurbita pepo subsp. pepo]7.92e-17746.12Show/hide
Query:  LDKKASMATPTDWRTEITQETRQQIVRSIYMMLKE-----QPSELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSG---------
        + KK S+AT TDW+ EI   TRQ+ VRSIY MLKE      P+ ++ + I++ A ++E   F  AK++E YL+  TGKM     H+ +SS          
Subjt:  LDKKASMATPTDWRTEITQETRQQIVRSIYMMLKE-----QPSELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSG---------

Query:  ---------QAVVIYPQYHQPAEAKSLLLQHIQRTP-----QLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNF
                 Q  V  P Y QPA   SL  QH Q+       QL RQ  NVRQ HQQ GM NQ+  SPQ   NS C PQG         L S +FTQ+PN 
Subjt:  ---------QAVVIYPQYHQPAEAKSLLLQHIQRTP-----QLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNF

Query:  VNL------TTQVKKEVDSEGFMASKSS-------EQHNQHSMSGAFADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLV
        +NL      TTQVK+ V  E F ASK S       EQ  QH   GA A P  +PNSE W D AFAEMEQLKK  +P   K  + + +      +Q    +
Subjt:  VNL------TTQVKKEVDSEGFMASKSS-------EQHNQHSMSGAFADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLV

Query:  KTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKS-NINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSV---IRTTTGSGSSSVA
          ++K++RF Q  +++II ++TKE+F  CL  I ++    ++ ++N+ NK  SL  GQPGL GSRIN PVQQ  DNVKLH Q V      + GS SS   
Subjt:  KTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKS-NINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSV---IRTTTGSGSSSVA

Query:  PQEKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLH
        P++KGS+RS+    W+    N     +  +++ P +     ++      T+P                                                
Subjt:  PQEKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLH

Query:  SKAEMIAEPTSLGINGQLST-YQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEI
             + EP +LG N QLST  +  +RLLKAV S S EALR AV  I+SV  M D++ +P C +K T+L   D  GSSN++KRKINA  LN++PSP S+ 
Subjt:  SKAEMIAEPTSLGINGQLST-YQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEI

Query:  PGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDYPVFLSK
          S  TV S+  KLKKLS  +LLEEMRNINKQFIETVLELDLDENLN RLANAGTVLRCSY AV+D  N+    VKLPVL++KLLVPLDYPEDYPVFLSK
Subjt:  PGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDYPVFLSK

Query:  FDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSV
         +    NVDE+ R LSN A S LRAFLR  PEC+SLEEYARAW+ECARSV+SEY +RAGGG FSARYG+WEDS+
Subjt:  FDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSV

XP_038903080.1 probable mediator of RNA polymerase II transcription subunit 15c isoform X1 [Benincasa hispida]3.62e-28560.47Show/hide
Query:  LDKKASMATPTDWRTEITQETRQQIVRSIYMMLKEQPSE-----LNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQY
        +DKKA+ AT TDWRTEIT ETR Q  R I MML EQ S      +N+K IS+ AR+HEM LFS AKSK++YL+ GT KM +RENH GSSS Q  V  PQY
Subjt:  LDKKASMATPTDWRTEITQETRQQIVRSIYMMLKEQPSE-----LNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQY

Query:  HQPAEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNL------TTQVKKEVDSEGFM
        HQ AE  SLL QHIQ T QLHRQ+ NV Q HQQFGM NQ   SPQNT NSQ     F R+D GIH S EMFTQHPN VNL      TTQVK+EV+ EGF 
Subjt:  HQPAEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNL------TTQVKKEVDSEGFM

Query:  ASKSSEQHN-------QHSMSGAFADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTK
        ASKSS QH+       Q    GA A PE IP SE WHD AFAEME+LKKT+LP   KA E   +VV  E +Q++     +  +++F Q  ++KII +Y K
Subjt:  ASKSSEQHN-------QHSMSGAFADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTK

Query:  EKFIRCLQYIEQSGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTG--SGSSSVAPQEKGSIRSK---------------
        EKF RCLQ IE+ G  IKS  N+ NK   LH GQPG  GSR+N PVQQ  ++VKLH Q VIR TTG   G+S +A  EKGS+RS+               
Subjt:  EKFIRCLQYIEQSGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTG--SGSSSVAPQEKGSIRSK---------------

Query:  ------LHPQWI----HGSGNTPSIYRSGMSLNP---------HLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRP
                 QW+    + +GN P+IYRSGMSLN          H  S  S  AERP PTNPC   LHGRASP PSSSIV L+K SPNV+Y SSSNF F  
Subjt:  ------LHPQWI----HGSGNTPSIYRSGMSLNP---------HLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRP

Query:  HCNPYQLLHSKAEM----------------IAEPTSLGINGQLSTY-QAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGF
        +CNP + LH KAE+                 A PTSLG NGQL T  QAHNRLLKAV S S EAL  AVSGI+SVGY +DAMIDP C AKVT++RL DG 
Subjt:  HCNPYQLLHSKAEM----------------IAEPTSLGINGQLSTY-QAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGF

Query:  GSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----V
        GSSNNMKRKINA ALNNIPSP S+I GSE TVTSR KKLKKLSD SLLEE+RNINKQF+ETVLELDLDE+LN++LANAGTVLRCSYSA ++  N+    V
Subjt:  GSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----V

Query:  KLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVA
        KLPVL++KLLVPLDYPEDYPVFLSKF+ +S NVD+E R+LSN A  MLRAFLR AP+C+SL EYAR WDECARSV+SEY +RAGGG FS +YG+WED+VA
Subjt:  KLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVA

Query:  AA
         A
Subjt:  AA

XP_038903081.1 mediator of RNA polymerase II transcription subunit 15a-like isoform X2 [Benincasa hispida]1.03e-27160.23Show/hide
Query:  MMLKEQPSE-----LNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAKSLLLQHIQRTPQLHRQHPNVRQ-
        MML EQ S      +N+K IS+ AR+HEM LFS AKSK++YL+ GT KM +RENH GSSS Q  V  PQYHQ AE  SLL QHIQ T QLHRQ+ NV Q 
Subjt:  MMLKEQPSE-----LNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAKSLLLQHIQRTPQLHRQHPNVRQ-

Query:  HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNL------TTQVKKEVDSEGFMASKSSEQHN-------QHSMSGAFADPERI
        HQQFGM NQ   SPQNT NSQ     F R+D GIH S EMFTQHPN VNL      TTQVK+EV+ EGF ASKSS QH+       Q    GA A PE I
Subjt:  HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNL------TTQVKKEVDSEGFMASKSSEQHN-------QHSMSGAFADPERI

Query:  PNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVVNKPSSL
        P SE WHD AFAEME+LKKT+LP   KA E   +VV  E +Q++     +  +++F Q  ++KII +Y KEKF RCLQ IE+ G  IKS  N+ NK   L
Subjt:  PNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVVNKPSSL

Query:  HDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTG--SGSSSVAPQEKGSIRSK---------------------LHPQWI----HGSGNTPSIYRSGM
        H GQPG  GSR+N PVQQ  ++VKLH Q VIR TTG   G+S +A  EKGS+RS+                        QW+    + +GN P+IYRSGM
Subjt:  HDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTG--SGSSSVAPQEKGSIRSK---------------------LHPQWI----HGSGNTPSIYRSGM

Query:  SLNP---------HLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----------------
        SLN          H  S  S  AERP PTNPC   LHGRASP PSSSIV L+K SPNV+Y SSSNF F  +CNP + LH KAE+                
Subjt:  SLNP---------HLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----------------

Query:  IAEPTSLGINGQLSTY-QAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEE
         A PTSLG NGQL T  QAHNRLLKAV S S EAL  AVSGI+SVGY +DAMIDP C AKVT++RL DG GSSNNMKRKINA ALNNIPSP S+I GSE 
Subjt:  IAEPTSLGINGQLSTY-QAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEE

Query:  TVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDYPVFLSKFDLSS
        TVTSR KKLKKLSD SLLEE+RNINKQF+ETVLELDLDE+LN++LANAGTVLRCSYSA ++  N+    VKLPVL++KLLVPLDYPEDYPVFLSKF+ +S
Subjt:  TVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDYPVFLSKFDLSS

Query:  SNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
         NVD+E R+LSN A  MLRAFLR AP+C+SL EYAR WDECARSV+SEY +RAGGG FS +YG+WED+VA A
Subjt:  SNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

TrEMBL top hitse value%identityAlignment
A0A0A0LET7 Uncharacterized protein1.9e-27195.82Show/hide
Query:  MEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVVNKPSSLHDGQPGLSGSRIN
        MEQLKKTFLPYFIKAYEPFRKVVHQEGL RRGL KTIE+ILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVVNKPSSLHDGQPGLSGSRIN
Subjt:  MEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVVNKPSSLHDGQPGLSGSRIN

Query:  HPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQEKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIV
        HPVQQ GDNVKLHCQSVIRTTTGSGSSSVAPQE GSIRSKLHPQWIHGSGNTP  YRSG+SLNPHLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIV
Subjt:  HPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQEKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIV

Query:  GLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRL
        GLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSS+EALRAAVSGITSVGYMEDA+IDP+CRAKVTNLRL
Subjt:  GLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRL

Query:  IDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNTV
        IDGFGSSNNMKRKINAMALNNIPSPSSEI GSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLN+RLANAGTVLR SYSAVSDGTN+V
Subjt:  IDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNTV

Query:  KLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVA
        KLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEE RNLSNGA+SMLRAFLR APECVSLE+YARAWDECARSVLSEYV+RAGGGSFSARYGSWEDSV 
Subjt:  KLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVA

Query:  AA
        AA
Subjt:  AA

A0A1S4DUH4 uncharacterized protein LOC1034868781.3e-29672.16Show/hide
Query:  LDKKASMAT-PTDWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP
        +DKKASMAT  TDWRTEIT+ETRQ+   SI+M+L+ Q S + N+ +ISD ARKHEM LFS A S +EYL+ G GK+ KRENH+GSSS +A V+YPQYHQP
Subjt:  LDKKASMAT-PTDWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP

Query:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN
                     TPQL RQHP VRQ HQQF MQNQ+ AS QNTSNSQ RPQGF RQD GIHLSSEMFTQHPNFVNLTTQV+KE +SEGF ASKS  QH 
Subjt:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN

Query:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN
        QH    +  A  ERIP+SEV HDAAFAEMEQLKKTFLP  IKAYEP+RK VH +   +  L+KT+E+IL FF S KEKIIASYTKE+F RCL+YIEQ GN
Subjt:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN

Query:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIR---------------------SKLHPQWIHGS
        TIK N NV NK SSLH GQPGLSGSRINHP+QQ  DNVKL CQSVIR TTGS  SS  +APQEKGS+R                     SK+HPQWIH S
Subjt:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIR---------------------SKLHPQWIHGS

Query:  GNTPSIYRSGMSLNPHLNSNFS----------HVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----
        GNTP+ YRSGMSLN HLNSNFS          HVAERPRPT PCT PL+G ASP PSS IVGLEK SPNVTYHS  NF    HCNPYQLLHSK E     
Subjt:  GNTPSIYRSGMSLNPHLNSNFS----------HVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----

Query:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP
                    +AEPTS GINGQ STYQA++RLLKAVGSSS+ ALRAAVSGITSVGYMEDA+IDPRC A VTNLRL++G GSSNNMKRKINAMALNNIP
Subjt:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP

Query:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY
        SP S+IPGSEETVTSRTKKLKK +DSSLLEE+RNINKQFIETVLELD+DENLN+RLANAGTVLRCSYSAV DGTN+    VKLPVLTMKLLVPLDYPEDY
Subjt:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY

Query:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
        PVFLSKFD  SSNVDEEC NLSN AMSMLRAFLR APECVSLEEYAR WDECARSV+S+YV+RAGGGSFSARYG+WEDSVA A
Subjt:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

A0A5A7UH89 Putative tartrate dehydrogenase/decarboxylase ttuC1.3e-29672.16Show/hide
Query:  LDKKASMAT-PTDWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP
        +DKKASMAT  TDWRTEIT+ETRQ+   SI+M+L+ Q S + N+ +ISD ARKHEM LFS A S +EYL+ G GK+ KRENH+GSSS +A V+YPQYHQP
Subjt:  LDKKASMAT-PTDWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP

Query:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN
                     TPQL RQHP VRQ HQQF MQNQ+ AS QNTSNSQ RPQGF RQD GIHLSSEMFTQHPNFVNLTTQV+KE +SEGF ASKS  QH 
Subjt:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN

Query:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN
        QH    +  A  ERIP+SEV HDAAFAEMEQLKKTFLP  IKAYEP+RK VH +   +  L+KT+E+IL FF S KEKIIASYTKE+F RCL+YIEQ GN
Subjt:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN

Query:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIR---------------------SKLHPQWIHGS
        TIK N NV NK SSLH GQPGLSGSRINHP+QQ  DNVKL CQSVIR TTGS  SS  +APQEKGS+R                     SK+HPQWIH S
Subjt:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIR---------------------SKLHPQWIHGS

Query:  GNTPSIYRSGMSLNPHLNSNFS----------HVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----
        GNTP+ YRSGMSLN HLNSNFS          HVAERPRPT PCT PL+G ASP PSS IVGLEK SPNVTYHS  NF    HCNPYQLLHSK E     
Subjt:  GNTPSIYRSGMSLNPHLNSNFS----------HVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----

Query:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP
                    +AEPTS GINGQ STYQA++RLLKAVGSSS+ ALRAAVSGITSVGYMEDA+IDPRC A VTNLRL++G GSSNNMKRKINAMALNNIP
Subjt:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP

Query:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY
        SP S+IPGSEETVTSRTKKLKK +DSSLLEE+RNINKQFIETVLELD+DENLN+RLANAGTVLRCSYSAV DGTN+    VKLPVLTMKLLVPLDYPEDY
Subjt:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY

Query:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
        PVFLSKFD  SSNVDEEC NLSN AMSMLRAFLR APECVSLEEYAR WDECARSV+S+YV+RAGGGSFSARYG+WEDSVA A
Subjt:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

A0A6J1D158 probable mediator of RNA polymerase II transcription subunit 15c isoform X11.6e-14544.23Show/hide
Query:  VLDKKASMATPTDWRTEITQETRQQIVRSIYMMLKEQ-PSELNVKLISD----RARKHEMNLFSTAKSKEEYLSTGTGKMIKREN-HQGSSSGQ------
        ++++  ++    DWR EI  E R++IV SI   LKEQ P+  +  +IS+     A K E  +F+ A SK+ Y+   + KM + EN H+GSSS Q      
Subjt:  VLDKKASMATPTDWRTEITQETRQQIVRSIYMMLKEQ-PSELNVKLISD----RARKHEMNLFSTAKSKEEYLSTGTGKMIKREN-HQGSSSGQ------

Query:  -----------AVVI-----------YP------QYHQPAEAKSLLLQHI-QRTPQLH----RQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPR
                   AV +           YP      Q  Q A    LL Q+I Q TPQ H    RQ+ N RQ HQQFGM +Q+   PQNT  S CRP G   
Subjt:  -----------AVVI-----------YP------QYHQPAEAKSLLLQHI-QRTPQLH----RQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPR

Query:  QDTGIHLSSEMFTQHPNFV------NLTTQVKKEVDSEGFMASK-------------SSEQHNQHSMSGAFADPERIPNSEVWHDAAFAEMEQLKKTFLP
        +D+G+++  +MF  H   +      NL  Q+K+EV  E   ASK               EQH QH   G  A     P  E WHD A+ EM+ LK T LP
Subjt:  QDTGIHLSSEMFTQHPNFV------NLTTQVKKEVDSEGFMASK-------------SSEQHNQHSMSGAFADPERIPNSEVWHDAAFAEMEQLKKTFLP

Query:  YFIKAYEPFRK----VVHQEGLQR-RGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKS-NINVVNKPSSLHDGQPGLSGSRINHPVQ
           ++YE   K    V   E +Q+ +  +  ++K+  F +  ++K I  +TKEKF + ++ IE+     ++ N  +VNK   LH GQPG+S S IN PVQ
Subjt:  YFIKAYEPFRK----VVHQEGLQR-RGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKS-NINVVNKPSSLHDGQPGLSGSRINHPVQ

Query:  QCGDNVKLHCQSVIRTTTGSGSSSV--APQEKGS---------------------IRSKLHPQWIH----GSGNTPSIYRSGMSLNPHLN----------
        +  DN   H Q +    TGS  SS    P E GS                     I+ +    WI      + +  +I RSG+SL  HLN          
Subjt:  QCGDNVKLHCQSVIRTTTGSGSSSV--APQEKGS---------------------IRSKLHPQWIH----GSGNTPSIYRSGMSLNPHLN----------

Query:  -SNFSHVAERPRPTNPCT--YPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----------------IAEPTSLGINGQL
         S  S +AER    +PC+  Y   GRASP PSSS VGL K S NV+  SS NF +    N   LL+SK ++                 AEPTSLG +  L
Subjt:  -SNFSHVAERPRPTNPCT--YPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----------------IAEPTSLGINGQL

Query:  ST-YQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDP-----RCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKK
        ST  Q  NRLLKAV S S +ALR A+SGI+SVG M D + +P     RC  K  +L L DGFGSSNNMKRKI A+ LN++PSP S+  GSE TVTSR+KK
Subjt:  ST-YQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDP-----RCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEETVTSRTKK

Query:  LKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDG-------TNTVKLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDE
        LKKL+D++LLEEMRNIN++ +ETVLELD  +N+N+R ANAGTV+RC+YSAVSD         NT+KLPVL++KLLVPLDYPEDYPVFLSKF+    N DE
Subjt:  LKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDG-------TNTVKLPVLTMKLLVPLDYPEDYPVFLSKFDLSSSNVDE

Query:  ECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAA
        ECR+LS  A SMLRAFLR APE +SL EYARAWD+CAR V+SEY +R GGG FS+RYG+WED VAA
Subjt:  ECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAA

E5GBP1 KIX_2 domain-containing protein1.3e-29672.16Show/hide
Query:  LDKKASMAT-PTDWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP
        +DKKASMAT  TDWRTEIT+ETRQ+   SI+M+L+ Q S + N+ +ISD ARKHEM LFS A S +EYL+ G GK+ KRENH+GSSS +A V+YPQYHQP
Subjt:  LDKKASMAT-PTDWRTEITQETRQQIVRSIYMMLKEQPS-ELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQP

Query:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN
                     TPQL RQHP VRQ HQQF MQNQ+ AS QNTSNSQ RPQGF RQD GIHLSSEMFTQHPNFVNLTTQV+KE +SEGF ASKS  QH 
Subjt:  AEAKSLLLQHIQRTPQLHRQHPNVRQ-HQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN

Query:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN
        QH    +  A  ERIP+SEV HDAAFAEMEQLKKTFLP  IKAYEP+RK VH +   +  L+KT+E+IL FF S KEKIIASYTKE+F RCL+YIEQ GN
Subjt:  QHSMSGAF-ADPERIPNSEVWHDAAFAEMEQLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGN

Query:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIR---------------------SKLHPQWIHGS
        TIK N NV NK SSLH GQPGLSGSRINHP+QQ  DNVKL CQSVIR TTGS  SS  +APQEKGS+R                     SK+HPQWIH S
Subjt:  TIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSS--VAPQEKGSIR---------------------SKLHPQWIHGS

Query:  GNTPSIYRSGMSLNPHLNSNFS----------HVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----
        GNTP+ YRSGMSLN HLNSNFS          HVAERPRPT PCT PL+G ASP PSS IVGLEK SPNVTYHS  NF    HCNPYQLLHSK E     
Subjt:  GNTPSIYRSGMSLNPHLNSNFS----------HVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEM----

Query:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP
                    +AEPTS GINGQ STYQA++RLLKAVGSSS+ ALRAAVSGITSVGYMEDA+IDPRC A VTNLRL++G GSSNNMKRKINAMALNNIP
Subjt:  ------------IAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIP

Query:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY
        SP S+IPGSEETVTSRTKKLKK +DSSLLEE+RNINKQFIETVLELD+DENLN+RLANAGTVLRCSYSAV DGTN+    VKLPVLTMKLLVPLDYPEDY
Subjt:  SPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNT----VKLPVLTMKLLVPLDYPEDY

Query:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
        PVFLSKFD  SSNVDEEC NLSN AMSMLRAFLR APECVSLEEYAR WDECARSV+S+YV+RAGGGSFSARYG+WEDSVA A
Subjt:  PVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

SwissProt top hitse value%identityAlignment
F4I171 Mediator of RNA polymerase II transcription subunit 15a2.0e-2322.24Show/hide
Query:  DWRTEITQ------ETRQQIVRSIYMML--KEQPSELNVKLISDRARKHEMNLFSTAKSKE-EYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAK
        DW+ E+ Q      ET    +  IY  +  K Q   +  +  SD+  K  +  F T   +  ++LS     ++     + +   + ++ +   H+P   K
Subjt:  DWRTEITQ------ETRQQIVRSIYMML--KEQPSELNVKLISDRARKHEMNLFSTAKSKE-EYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAK

Query:  SLLLQHIQRTPQLHRQHPNVRQHQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN-QHSM
         +    + ++     Q P  +  Q     NQT    Q+ S     P+   +Q +  ++ S + +  P        +   + +    + + +  +N Q   
Subjt:  SLLLQHIQRTPQLHRQHPNVRQHQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN-QHSM

Query:  SGAFAD--PERIPNSEVWHDAAFAEME------QLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQ
         G+      + + NS     +  + ++      QL  + L +     +  +++  ++  Q+R + +  +++    Q  ++++ A   +++  +  Q  + 
Subjt:  SGAFAD--PERIPNSEVWHDAAFAEME------QLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQ

Query:  SGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQ--EKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLN
        +  T +  +NV       H     + G R N+P+QQ      +    +++  +   S  ++PQ  +K ++     P       N+P +  S  S  P   
Subjt:  SGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQ--EKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLN

Query:  SNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKE
        S     +E+P  ++     +  + +      +  L   +P ++          P  N   +L+S      +P+        +T     RL++AV S S +
Subjt:  SNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKE

Query:  ALRAAVSGITSVGYM------------------EDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMAL-----------NNIPSPSSEIPGSEETVTS
        AL +AVS I SV  M                  ED +   +CR +  N    +G  ++  MKR   AM L           N      SE    E T TS
Subjt:  ALRAAVSGITSVGYM------------------EDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMAL-----------NNIPSPSSEIPGSEETVTS

Query:  RTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLN-------KRLANAGTVLRCSYSAVSDG-------TNTVKLPVLTMKLLVPLDYPEDYPVFLS
          KK +  ++ +LLEE++ IN++ I+TV+E+  DE+           +   GT +R S+ AVS         ++T   P+  ++LLVP  YP   P  L 
Subjt:  RTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLN-------KRLANAGTVLRCSYSAVSDG-------TNTVKLPVLTMKLLVPLDYPEDYPVFLS

Query:  KFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
        K  + +S  +E+   LS+ AM+     LR+  + +SL++ A+ WD CAR+V+ EY ++ GGG+FS++YG+WE  VAA+
Subjt:  KFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

Q84LH3 Werner Syndrome-like exonuclease7.4e-1840.8Show/hide
Query:  VGLDVEWRPYF--GPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYLRYAG
        VGLD+EWRP F  G  P  VAT+Q+CV         ++    PQ+L + + DS+   VG+GI  D  KL+H+YG+ + +V DL DLA  K+G    +  G
Subjt:  VGLDVEWRPYF--GPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYLRYAG

Query:  LKSLWWEVLGREIEKPKYITLSNWD
        L SL   ++ +E+ KP  I L NW+
Subjt:  LKSLWWEVLGREIEKPKYITLSNWD

Q9NVH0 Exonuclease 3'-5' domain-containing protein 22.3e-1134.62Show/hide
Query:  IVGLDVEWRPYFGPKPNPVATLQLC-VGHRCLIFQL--LYC--PAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYL
        ++G+D EW    G K +P++ LQ+      C++ +L  L C     P+ L++ L D +   VGVG  +D  KL  +YGL+V   +DLR LA+ +      
Subjt:  IVGLDVEWRPYFGPKPNPVATLQLC-VGHRCLIFQL--LYC--PAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYL

Query:  RYAGLKSLWWEVLGREIEKPKYITLSNWDA
            LKSL   VL   ++K   +  SNWDA
Subjt:  RYAGLKSLWWEVLGREIEKPKYITLSNWDA

Q9SHV7 Probable mediator of RNA polymerase II transcription subunit 15c2.2e-1425.69Show/hide
Query:  NRLLKAVGSSSKEALRAAVSGITSVGYMED-------------AMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEE-------
        +RL+KA  ++S ++L  +VS I+SV  M D             A +      +  N    +    S  MKR IN +     P  SS+I   E+       
Subjt:  NRLLKAVGSSSKEALRAAVSGITSVGYMED-------------AMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMALNNIPSPSSEIPGSEE-------

Query:  --TVTSRTKKLKKLSDS-SLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTN-----------------TVKLPVLTMKLLVPLD
          + TS   K+  ++   +LL+E++  N + +ETV+E+  +++L       GT++ C+Y+ V+                      ++  +  ++LL P+D
Subjt:  --TVTSRTKKLKKLSDS-SLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTN-----------------TVKLPVLTMKLLVPLD

Query:  YPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
        YP   P+ L +    +S    +  +LS    S     ++   E    +  A+ W++CAR+ + EY +R GGG+FS++YG+WE  + A+
Subjt:  YPEDYPVFLSKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

Q9VGN7 Exonuclease 3'-5' domain-containing protein 21.1e-1035.14Show/hide
Query:  DMVNFWVSTIREINNRRIRPLIVGLDVEWRPYFGPKPNPVATLQLCVGHR--CLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVS
        D    WV  + E+ N      ++G D EW    G +  PVA LQL   HR  C +F+L +    PQ L   L D S   VGV   +D  KL H+YG+ V+
Subjt:  DMVNFWVSTIREINNRRIRPLIVGLDVEWRPYFGPKPNPVATLQLCVGHR--CLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVS

Query:  NVVDLRDLAVNKLGRAYLRYAGLKSLWWEVLGREIEKPKYITLSNWDA
        + +DLR L V     A  +  GL  L    L   ++K   +  SNW+A
Subjt:  NVVDLRDLAVNKLGRAYLRYAGLKSLWWEVLGREIEKPKYITLSNWDA

Arabidopsis top hitse value%identityAlignment
AT1G15780.1 unknown protein1.4e-2422.24Show/hide
Query:  DWRTEITQ------ETRQQIVRSIYMML--KEQPSELNVKLISDRARKHEMNLFSTAKSKE-EYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAK
        DW+ E+ Q      ET    +  IY  +  K Q   +  +  SD+  K  +  F T   +  ++LS     ++     + +   + ++ +   H+P   K
Subjt:  DWRTEITQ------ETRQQIVRSIYMML--KEQPSELNVKLISDRARKHEMNLFSTAKSKE-EYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAK

Query:  SLLLQHIQRTPQLHRQHPNVRQHQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN-QHSM
         +    + ++     Q P  +  Q     NQT    Q+ S     P+   +Q +  ++ S + +  P        +   + +    + + +  +N Q   
Subjt:  SLLLQHIQRTPQLHRQHPNVRQHQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHN-QHSM

Query:  SGAFAD--PERIPNSEVWHDAAFAEME------QLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQ
         G+      + + NS     +  + ++      QL  + L +     +  +++  ++  Q+R + +  +++    Q  ++++ A   +++  +  Q  + 
Subjt:  SGAFAD--PERIPNSEVWHDAAFAEME------QLKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQ

Query:  SGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQ--EKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLN
        +  T +  +NV       H     + G R N+P+QQ      +    +++  +   S  ++PQ  +K ++     P       N+P +  S  S  P   
Subjt:  SGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGDNVKLHCQSVIRTTTGSGSSSVAPQ--EKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLN

Query:  SNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKE
        S     +E+P  ++     +  + +      +  L   +P ++          P  N   +L+S      +P+        +T     RL++AV S S +
Subjt:  SNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHSSSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKE

Query:  ALRAAVSGITSVGYM------------------EDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMAL-----------NNIPSPSSEIPGSEETVTS
        AL +AVS I SV  M                  ED +   +CR +  N    +G  ++  MKR   AM L           N      SE    E T TS
Subjt:  ALRAAVSGITSVGYM------------------EDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMAL-----------NNIPSPSSEIPGSEETVTS

Query:  RTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLN-------KRLANAGTVLRCSYSAVSDG-------TNTVKLPVLTMKLLVPLDYPEDYPVFLS
          KK +  ++ +LLEE++ IN++ I+TV+E+  DE+           +   GT +R S+ AVS         ++T   P+  ++LLVP  YP   P  L 
Subjt:  RTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLN-------KRLANAGTVLRCSYSAVSDG-------TNTVKLPVLTMKLLVPLDYPEDYPVFLS

Query:  KFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA
        K  + +S  +E+   LS+ AM+     LR+  + +SL++ A+ WD CAR+V+ EY ++ GGG+FS++YG+WE  VAA+
Subjt:  KFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA

AT3G12410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.0e-1932.11Show/hide
Query:  YFTHDYYDITIDDDEILTLRTASTDMVNFWVSTIREINNRRIRPLIVGLDVEWRP---YFGPKP---------------NPVATLQLCVGHRCLIFQLLY
        Y TH  Y +    DE +   T  + +++ W+  +   N     PL+VG+ V+W P   Y  P+P               NP   LQLCVG+RCLI QL Y
Subjt:  YFTHDYYDITIDDDEILTLRTASTDMVNFWVSTIREINNRRIRPLIVGLDVEWRP---YFGPKP---------------NPVATLQLCVGHRCLIFQLLY

Query:  CPAAPQALINFLYDSSCTFVGVGIHQDVQKLYH-EYGLIVSNVVDLRDLAVNKLGRAYLRYAGLKSLWWEVLGRE-IEKPKYITLSNWDA
        C   P  L +FL D   TFVGV   QD  KL    + L +  ++D+R    +  GR+ +R +  + +  E +G + +     I++S+W A
Subjt:  CPAAPQALINFLYDSSCTFVGVGIHQDVQKLYH-EYGLIVSNVVDLRDLAVNKLGRAYLRYAGLKSLWWEVLGRE-IEKPKYITLSNWDA

AT3G12470.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.8e-1933.14Show/hide
Query:  THDYYDITIDDDEILTLRTASTDMVNFWVSTIREIN-NRRIRPLIVGLDVEWRPYFGPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTF
        TH  + +    D+++   T +  ++  W+ ++R  N N  + PL+VG+ V+WRP       P  TLQLCVG RCLI QL Y    P+ L  FL D   TF
Subjt:  THDYYDITIDDDEILTLRTASTDMVNFWVSTIREIN-NRRIRPLIVGLDVEWRPYFGPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTF

Query:  VGVGIHQDVQKLYH-EYGLIVSNVVDLRDLAVNKLGRAYLRYAGLKSLWWEVLGR-EIEKPKYITLSNW
        VGV   QD +KL    + + +  ++D+R    +  G A + +   + +  E LGR  +     I +S+W
Subjt:  VGVGIHQDVQKLYH-EYGLIVSNVVDLRDLAVNKLGRAYLRYAGLKSLWWEVLGR-EIEKPKYITLSNW

AT4G13870.1 Werner syndrome-like exonuclease5.2e-1940.8Show/hide
Query:  VGLDVEWRPYF--GPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYLRYAG
        VGLD+EWRP F  G  P  VAT+Q+CV         ++    PQ+L + + DS+   VG+GI  D  KL+H+YG+ + +V DL DLA  K+G    +  G
Subjt:  VGLDVEWRPYF--GPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYLRYAG

Query:  LKSLWWEVLGREIEKPKYITLSNWD
        L SL   ++ +E+ KP  I L NW+
Subjt:  LKSLWWEVLGREIEKPKYITLSNWD

AT4G13870.2 Werner syndrome-like exonuclease5.2e-1940.8Show/hide
Query:  VGLDVEWRPYF--GPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYLRYAG
        VGLD+EWRP F  G  P  VAT+Q+CV         ++    PQ+L + + DS+   VG+GI  D  KL+H+YG+ + +V DL DLA  K+G    +  G
Subjt:  VGLDVEWRPYF--GPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALINFLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYLRYAG

Query:  LKSLWWEVLGREIEKPKYITLSNWD
        L SL   ++ +E+ KP  I L NW+
Subjt:  LKSLWWEVLGREIEKPKYITLSNWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCTTCTTCTTAACATCACAGACCTCCACATCCCTTACTTCACCCATGACTACTACGACATCACTATCGACGACGATGAAATCCTCACTCTCCGCACTGCC
TCCACCGACATGGTCAACTTTTGGGTCTCCACCATCCGTGAAATCAATAACCGCCGCATCCGCCCTCTCATCGTCGGTCTCGACGTCGAGTGGCGTCCTTACTTT
GGCCCTAAACCTAACCCCGTCGCCACTTTACAGCTCTGTGTTGGCCACCGCTGCCTCATCTTCCAACTCCTCTACTGCCCGGCTGCCCCTCAGGCTTTGATCAAT
TTCTTGTACGACTCATCCTGCACTTTCGTCGGCGTCGGAATCCACCAGGACGTTCAAAAGCTTTACCATGAGTACGGTCTGATTGTCTCCAACGTTGTGGATCTT
AGGGATTTGGCTGTTAATAAGTTGGGGAGAGCTTACCTGAGGTATGCCGGACTGAAGAGTCTGTGGTGGGAGGTTCTTGGAAGGGAGATTGAGAAGCCGAAATAT
ATAACTCTGAGTAACTGGGATGCTGGTTTTTTTTTTTTTTTTTCTGTTTGCGTACTGGACAAGAAAGCTAGTATGGCTACTCCAACTGATTGGAGGACGGAAATA
ACTCAAGAAACTAGACAGCAAATTGTTCGTTCAATATATATGATGTTGAAGGAACAGCCTTCTGAGTTGAATGTGAAACTAATTAGTGACCGTGCGAGGAAACAT
GAGATGAACTTGTTTAGTACTGCCAAGTCTAAGGAAGAATATTTGTCTACTGGAACCGGAAAGATGATTAAGAGAGAAAATCACCAGGGGAGTTCAAGCGGTCAA
GCAGTAGTGATTTATCCGCAATATCATCAGCCAGCAGAGGCTAAGTCACTTTTGCTGCAACATATTCAGCGAACGCCACAGCTACATAGACAACATCCAAATGTG
AGACAGCATCAACAATTTGGGATGCAGAATCAAACCGGTGCTAGTCCACAAAATACATCAAATTCACAGTGCAGACCCCAGGGTTTCCCGAGACAAGATACTGGA
ATTCATCTGTCCTCAGAAATGTTCACGCAACATCCTAACTTTGTGAACTTGACCACTCAAGTTAAAAAGGAAGTGGATAGCGAAGGTTTTATGGCCTCCAAATCC
TCCGAGCAGCATAATCAACACTCCATGAGTGGAGCTTTTGCAGATCCAGAAAGAATTCCAAATTCAGAAGTTTGGCATGATGCTGCTTTTGCTGAGATGGAACAG
CTCAAGAAGACGTTCTTGCCTTATTTTATTAAGGCATATGAACCCTTTCGAAAGGTTGTACACCAAGAGGGCCTACAAAGGAGGGGGCTTGTGAAGACAATCGAA
AAAATTCTGAGGTTTTTCCAATCGTCTAAGGAGAAAATCATTGCTTCTTACACCAAGGAGAAGTTTATTCGATGTTTGCAATATATTGAGCAAAGTGGGAACACA
ATTAAGAGCAATATCAATGTTGTTAATAAGCCGTCGTCCCTACATGATGGCCAGCCTGGCCTTAGTGGATCTCGCATAAACCATCCAGTTCAGCAATGTGGTGAT
AATGTGAAACTCCATTGTCAATCAGTAATCCGAACTACAACTGGCTCTGGTAGTTCTTCCGTAGCACCACAAGAAAAAGGTTCCATAAGGTCAAAATTACATCCC
CAATGGATTCATGGTTCAGGAAACACTCCATCTATTTATAGATCAGGAATGTCTTTGAATCCTCATTTGAACTCAAACTTTTCCCATGTTGCTGAGCGACCTCGT
CCAACAAATCCTTGTACTTATCCATTGCATGGAAGAGCTTCTCCACCTCCATCTTCCTCTATAGTTGGACTTGAAAAAATCTCTCCAAATGTTACCTACCATTCA
AGTTCGAACTTTCATTTCCGTCCACATTGTAACCCATATCAGTTACTTCATTCCAAAGCAGAAATGATCGCTGAACCTACTAGTCTTGGAATCAATGGGCAACTG
TCAACGTATCAGGCTCATAATCGCTTACTTAAAGCGGTTGGGTCATCATCAAAGGAAGCACTCAGAGCAGCTGTGTCGGGAATAACTTCAGTTGGGTATATGGAA
GACGCGATGATAGATCCCCGGTGTCGTGCAAAGGTAACAAATTTAAGGTTGATAGACGGATTTGGTTCCTCAAATAATATGAAGCGCAAAATAAACGCAATGGCC
TTAAACAACATACCATCGCCTAGCAGTGAGATTCCTGGATCTGAAGAAACAGTTACATCAAGAACAAAGAAACTCAAGAAACTGTCTGATTCTTCCCTCTTAGAA
GAAATGAGAAACATAAACAAGCAGTTCATCGAAACAGTCTTGGAACTAGATTTGGACGAGAACCTCAATAAACGATTAGCCAATGCGGGTACGGTTCTTCGATGC
TCCTATAGTGCTGTAAGTGATGGTACAAATACTGTGAAACTGCCTGTGCTTACTATGAAGCTGCTTGTACCTCTTGATTATCCTGAAGACTATCCCGTATTTCTA
AGCAAATTCGACTTGTCCTCCAGCAATGTGGACGAAGAATGTAGAAACCTGTCGAACGGAGCAATGTCGATGTTACGTGCATTCCTTCGCGCTGCTCCGGAATGT
GTGTCTCTTGAAGAATATGCAAGGGCATGGGATGAATGTGCTCGCTCTGTGCTATCTGAGTATGTTAAACGTGCGGGTGGGGGAAGCTTCAGTGCCCGATATGGG
TCTTGGGAGGACTCTGTTGCCGCCGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCTTCTTCTTAACATCACAGACCTCCACATCCCTTACTTCACCCATGACTACTACGACATCACTATCGACGACGATGAAATCCTCACTCTCCGCACTGCC
TCCACCGACATGGTCAACTTTTGGGTCTCCACCATCCGTGAAATCAATAACCGCCGCATCCGCCCTCTCATCGTCGGTCTCGACGTCGAGTGGCGTCCTTACTTT
GGCCCTAAACCTAACCCCGTCGCCACTTTACAGCTCTGTGTTGGCCACCGCTGCCTCATCTTCCAACTCCTCTACTGCCCGGCTGCCCCTCAGGCTTTGATCAAT
TTCTTGTACGACTCATCCTGCACTTTCGTCGGCGTCGGAATCCACCAGGACGTTCAAAAGCTTTACCATGAGTACGGTCTGATTGTCTCCAACGTTGTGGATCTT
AGGGATTTGGCTGTTAATAAGTTGGGGAGAGCTTACCTGAGGTATGCCGGACTGAAGAGTCTGTGGTGGGAGGTTCTTGGAAGGGAGATTGAGAAGCCGAAATAT
ATAACTCTGAGTAACTGGGATGCTGGTTTTTTTTTTTTTTTTTCTGTTTGCGTACTGGACAAGAAAGCTAGTATGGCTACTCCAACTGATTGGAGGACGGAAATA
ACTCAAGAAACTAGACAGCAAATTGTTCGTTCAATATATATGATGTTGAAGGAACAGCCTTCTGAGTTGAATGTGAAACTAATTAGTGACCGTGCGAGGAAACAT
GAGATGAACTTGTTTAGTACTGCCAAGTCTAAGGAAGAATATTTGTCTACTGGAACCGGAAAGATGATTAAGAGAGAAAATCACCAGGGGAGTTCAAGCGGTCAA
GCAGTAGTGATTTATCCGCAATATCATCAGCCAGCAGAGGCTAAGTCACTTTTGCTGCAACATATTCAGCGAACGCCACAGCTACATAGACAACATCCAAATGTG
AGACAGCATCAACAATTTGGGATGCAGAATCAAACCGGTGCTAGTCCACAAAATACATCAAATTCACAGTGCAGACCCCAGGGTTTCCCGAGACAAGATACTGGA
ATTCATCTGTCCTCAGAAATGTTCACGCAACATCCTAACTTTGTGAACTTGACCACTCAAGTTAAAAAGGAAGTGGATAGCGAAGGTTTTATGGCCTCCAAATCC
TCCGAGCAGCATAATCAACACTCCATGAGTGGAGCTTTTGCAGATCCAGAAAGAATTCCAAATTCAGAAGTTTGGCATGATGCTGCTTTTGCTGAGATGGAACAG
CTCAAGAAGACGTTCTTGCCTTATTTTATTAAGGCATATGAACCCTTTCGAAAGGTTGTACACCAAGAGGGCCTACAAAGGAGGGGGCTTGTGAAGACAATCGAA
AAAATTCTGAGGTTTTTCCAATCGTCTAAGGAGAAAATCATTGCTTCTTACACCAAGGAGAAGTTTATTCGATGTTTGCAATATATTGAGCAAAGTGGGAACACA
ATTAAGAGCAATATCAATGTTGTTAATAAGCCGTCGTCCCTACATGATGGCCAGCCTGGCCTTAGTGGATCTCGCATAAACCATCCAGTTCAGCAATGTGGTGAT
AATGTGAAACTCCATTGTCAATCAGTAATCCGAACTACAACTGGCTCTGGTAGTTCTTCCGTAGCACCACAAGAAAAAGGTTCCATAAGGTCAAAATTACATCCC
CAATGGATTCATGGTTCAGGAAACACTCCATCTATTTATAGATCAGGAATGTCTTTGAATCCTCATTTGAACTCAAACTTTTCCCATGTTGCTGAGCGACCTCGT
CCAACAAATCCTTGTACTTATCCATTGCATGGAAGAGCTTCTCCACCTCCATCTTCCTCTATAGTTGGACTTGAAAAAATCTCTCCAAATGTTACCTACCATTCA
AGTTCGAACTTTCATTTCCGTCCACATTGTAACCCATATCAGTTACTTCATTCCAAAGCAGAAATGATCGCTGAACCTACTAGTCTTGGAATCAATGGGCAACTG
TCAACGTATCAGGCTCATAATCGCTTACTTAAAGCGGTTGGGTCATCATCAAAGGAAGCACTCAGAGCAGCTGTGTCGGGAATAACTTCAGTTGGGTATATGGAA
GACGCGATGATAGATCCCCGGTGTCGTGCAAAGGTAACAAATTTAAGGTTGATAGACGGATTTGGTTCCTCAAATAATATGAAGCGCAAAATAAACGCAATGGCC
TTAAACAACATACCATCGCCTAGCAGTGAGATTCCTGGATCTGAAGAAACAGTTACATCAAGAACAAAGAAACTCAAGAAACTGTCTGATTCTTCCCTCTTAGAA
GAAATGAGAAACATAAACAAGCAGTTCATCGAAACAGTCTTGGAACTAGATTTGGACGAGAACCTCAATAAACGATTAGCCAATGCGGGTACGGTTCTTCGATGC
TCCTATAGTGCTGTAAGTGATGGTACAAATACTGTGAAACTGCCTGTGCTTACTATGAAGCTGCTTGTACCTCTTGATTATCCTGAAGACTATCCCGTATTTCTA
AGCAAATTCGACTTGTCCTCCAGCAATGTGGACGAAGAATGTAGAAACCTGTCGAACGGAGCAATGTCGATGTTACGTGCATTCCTTCGCGCTGCTCCGGAATGT
GTGTCTCTTGAAGAATATGCAAGGGCATGGGATGAATGTGCTCGCTCTGTGCTATCTGAGTATGTTAAACGTGCGGGTGGGGGAAGCTTCAGTGCCCGATATGGG
TCTTGGGAGGACTCTGTTGCCGCCGCCTAA
Protein sequenceShow/hide protein sequence
MPLLLNITDLHIPYFTHDYYDITIDDDEILTLRTASTDMVNFWVSTIREINNRRIRPLIVGLDVEWRPYFGPKPNPVATLQLCVGHRCLIFQLLYCPAAPQALIN
FLYDSSCTFVGVGIHQDVQKLYHEYGLIVSNVVDLRDLAVNKLGRAYLRYAGLKSLWWEVLGREIEKPKYITLSNWDAGFFFFFSVCVLDKKASMATPTDWRTEI
TQETRQQIVRSIYMMLKEQPSELNVKLISDRARKHEMNLFSTAKSKEEYLSTGTGKMIKRENHQGSSSGQAVVIYPQYHQPAEAKSLLLQHIQRTPQLHRQHPNV
RQHQQFGMQNQTGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFVNLTTQVKKEVDSEGFMASKSSEQHNQHSMSGAFADPERIPNSEVWHDAAFAEMEQ
LKKTFLPYFIKAYEPFRKVVHQEGLQRRGLVKTIEKILRFFQSSKEKIIASYTKEKFIRCLQYIEQSGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQCGD
NVKLHCQSVIRTTTGSGSSSVAPQEKGSIRSKLHPQWIHGSGNTPSIYRSGMSLNPHLNSNFSHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKISPNVTYHS
SSNFHFRPHCNPYQLLHSKAEMIAEPTSLGINGQLSTYQAHNRLLKAVGSSSKEALRAAVSGITSVGYMEDAMIDPRCRAKVTNLRLIDGFGSSNNMKRKINAMA
LNNIPSPSSEIPGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLELDLDENLNKRLANAGTVLRCSYSAVSDGTNTVKLPVLTMKLLVPLDYPEDYPVFL
SKFDLSSSNVDEECRNLSNGAMSMLRAFLRAAPECVSLEEYARAWDECARSVLSEYVKRAGGGSFSARYGSWEDSVAAA