CuGenDBv2

CmoCh10G010920 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh10G010920
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: CCHC-type domain-containing protein
Genome location: Cmo_Chr10:6088771..6097843
RNA-Seq Expression: CmoCh10G010920
Synteny: CmoCh10G010920
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR001995 - Peptidase A2A, retrovirus, catalytic
IPR018061 - Retropepsins
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
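
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr10:6088771..6097843")
print(seqid, start, end, span_length(start, end))
# prints: Cmo_Chr10 6088771 6097843 9073
```

Under that assumption the gene spans 9,073 bp on Cmo_Chr10. With 0-based or half-open coordinates the result would differ by one.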


Homology
GenBank top hits (e value, % identity, alignment)
XP_022930200.1 uncharacterized protein LOC111436716 [Cucurbita moschata] (e value: 6.3e-170; % identity: 93.63)
Query:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD
        MNVVNEMMM AGAYKLQ HKSDHQIAQVLVT FTGQLKDWWDKYLDETTRQQILNHYV+RPTTQIIKEEGPSTRTEVQHER+EDAVNTL YTLIEFFVGD
Subjt:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD

Query:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI
        PLKYQERSAEILMNLKCPTLGDFRWYKD+YFSKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTKYNGTI WQTLSY SIASFIIEEGLRLCNESKI
Subjt:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI

Query:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR
        QNKLNSSISNRKELGRFCDQYGCK IEAPSTSRRKKVKTHPKPYHSYRPRE YRNKPVQSQKPTYSRR+Y PTK H+GKK++TCFKCREEGHYANKCPIR
Subjt:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR

Query:  GKINELDIDQELKN
        GKINELDI  ELKN
Subjt:  GKINELDIDQELKN

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 96.72)
Query:  MDTQSVYESQLNVITADFQIDKEFLKADFMSTTNSSKRNAFFKTYKEAERNELRAQWYSHMEAIKENIPFFEWFEENILQICTLTQRSWKTTKRGEVYSK
        MDTQSVYESQLNVITADFQIDKEFLKADFMSTTNSSKRNAFF+TYKEAERNELRAQWYSHME IKENIPFFEWFEENILQICTLTQRSWKTTKRGEVYSK
Subjt:  MDTQSVYESQLNVITADFQIDKEFLKADFMSTTNSSKRNAFFKTYKEAERNELRAQWYSHMEAIKENIPFFEWFEENILQICTLTQRSWKTTKRGEVYSK

Query:  HPPLEEVEFDNYYGEKVKASPFKHIPEGLEKGNPTLKDIKNIQHQNNYSNKILSTIATQLESIEGKISKKSGSLQEQASSSVPKVDESIPILRPANLEIF
        HPPLEEVEFDNYYGEKVKASPFKHIPEGLEKGNPTLKDIKNIQHQNNYSNKILSTIATQLESIEGKISKKSG+ QEQASSSVPKVDESIPILRPANLEIF
Subjt:  HPPLEEVEFDNYYGEKVKASPFKHIPEGLEKGNPTLKDIKNIQHQNNYSNKILSTIATQLESIEGKISKKSGSLQEQASSSVPKVDESIPILRPANLEIF

Query:  TKRISKEEAAIAKIEEKLDRILNPAMTPHDSSVNVVNNDDEDQDFEDEIANEREEPFYNRIERISRRSQNNASNQKNWYPQPSFPDIQFEEKTLQTQAHY
        TKRISKEEAA+AKIEEKLDRILNPAMTP DSSVNVVNNDDE+Q+FEDEIANEREEPFYNRIERISRRSQNNASNQKNWYPQPSFPDIQFEEKTLQTQAHY
Subjt:  TKRISKEEAAIAKIEEKLDRILNPAMTPHDSSVNVVNNDDEDQDFEDEIANEREEPFYNRIERISRRSQNNASNQKNWYPQPSFPDIQFEEKTLQTQAHY

Query:  DGLAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERI
        DGLAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHER+
Subjt:  DGLAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERI

Query:  EDAVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGS
        EDAVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTKYNGTIPWQTLSYGS
Subjt:  EDAVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGS

Query:  IASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQ
        IASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAP TSR+KKVKTHPKPYHSYRPRE YR+KPVQSQKPTYSRRKYIPTKTHRGKKKQ
Subjt:  IASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQ

Query:  TCFKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPE
        TCFKCR EGHYA KCPI+GKINELDIDQELKNQLLRLTLT+SEQS EGEIL+LQEESDS SST+YESEQEGKRTCEGCINVLTKDQEILLEVVEKVQD E
Subjt:  TCFKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPE

Query:  IQ
        IQ
Subjt:  IQ

XP_023521035.1 uncharacterized protein LOC111784623 [Cucurbita pepo subsp. pepo] (e value: 3.6e-189; % identity: 87.31)
Query:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD
        MN+VNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDE TRQQIL+HYVIRPTTQIIKEEGPSTRTEVQHER+EDAVNTLIYTLIEFFVGD
Subjt:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD

Query:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI
        PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTKYNGTIPWQTLSYG                   
Subjt:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI

Query:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR
                   KELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPY+SYRPRE YRNKPVQSQKPTYSRRKY PTKTHRGKKKQTCFKCREEGHYAN+CPIR
Subjt:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR

Query:  GKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPEIQQK
        GKINELDIDQELKNQLLRL LTDSEQSSEGEILELQEESDSYSSTEYES QEGKRTCEGCINVLTKDQEILLEVVEK +  +  +K
Subjt:  GKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPEIQQK

XP_023522280.1 uncharacterized protein LOC111786173, partial [Cucurbita pepo subsp. pepo] (e value: 9.7e-195; % identity: 96.07)
Query:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD
        MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDE TRQQIL+HYVIRPTTQIIKEEGPSTRTEVQHER+EDAVNTLIYTLIEFFVG+
Subjt:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD

Query:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI
        PLKYQERSAEILMNLKCPTLGDFRWYK MY SKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI
Subjt:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI

Query:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR
        QNKL SSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPY+SYRPRE YRNKPVQSQKPTYSRRKY PTKTHRGKKKQTCFKCREEGHYAN+CPIR
Subjt:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR

Query:  GKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRT
        GKINELDIDQELKNQLLRL LTDSEQSSEGEILELQEESDSYSSTEYES QEGKRT
Subjt:  GKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRT

XP_023552915.1 uncharacterized protein LOC111810441 [Cucurbita pepo subsp. pepo] (e value: 1.6e-181; % identity: 73.19)
Query:  IAKIEEKLDRILNPAMTPHDSSVNVVNNDDEDQDFEDEIANEREEPFYNRIERISRRSQNNASNQKNWYPQPSFPDIQFEEKTLQTQAHYDGLAIYEWNI
        +AKIEEKLDRILNPAMTP DSSVNVVNND ++Q+ ED+IANE EEPFYNRIERISRRSQNN SNQKNW                                
Subjt:  IAKIEEKLDRILNPAMTPHDSSVNVVNNDDEDQDFEDEIANEREEPFYNRIERISRRSQNNASNQKNWYPQPSFPDIQFEEKTLQTQAHYDGLAIYEWNI

Query:  DGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYT
                                                   QLKDWWDKYLDET                        TRTEVQHER+EDAVNTLIYT
Subjt:  DGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYT

Query:  LIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGL
        LIEFFVGDP KYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVN +PKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGL
Subjt:  LIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGL

Query:  RLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGH
        RLCNESKIQNKLNSS+SNRKELGRFCDQYGCKGIEAPSTSRRKK KTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKY PTK H+GKK+QTCFKCREEGH
Subjt:  RLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGH

Query:  YANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCIN
        YANKCPIRGKINEL+IDQELKNQLLRL LTDSEQSS+GEILELQEESDSYS+TEYESEQEGKRT +GC N
Subjt:  YANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCIN

TrEMBL top hits (e value, % identity, alignment)
A0A251VET5 Putative reverse transcriptase domain, Zinc finger, CCHC-type, Aspartic peptidase domain protein (e value: 8.7e-141; % identity: 36.4)
Query:  IDKE---FLKADFMSTTNSSKRNAFFKTYKEAERNELRAQWYSHMEAIKENIPFFEWFE-------------ENILQICTLTQRSWKTTKRGEVYSKHPP
        +DKE   +LK +  +  N+     +F TY E ER      W+ +M    ENIPFF WF              ENI    T+T+      +  ++ S HPP
Subjt:  IDKE---FLKADFMSTTNSSKRNAFFKTYKEAERNELRAQWYSHMEAIKENIPFFEWFE-------------ENILQICTLTQRSWKTTKRGEVYSKHPP

Query:  LEEVEFDNYYGEKVKASPFKHIPEGLEKGNPTLKDIKNIQHQNNYSNKILSTIATQLESIEGKISKKSGSLQEQASSSVPKVDESI--PILRPANLEIFT
          +++  N   + + A P  +I E ++      K++K I  QNNY+N+ L+TI  QLE IE   +K   +L + ++S+   + E+I  PI +P N++I  
Subjt:  LEEVEFDNYYGEKVKASPFKHIPEGLEKGNPTLKDIKNIQHQNNYSNKILSTIATQLESIEGKISKKSGSLQEQASSSVPKVDESI--PILRPANLEIFT

Query:  KRISKEEAAIAKIEEKLDRILNPAMTPHDSSVNVVNNDDEDQDFEDEIANEREEPFYNRIERISRR-SQNNA----SNQKNWYPQPSFPDIQFEEKTLQT
         R++K++  +  I  K+ ++          ++NV+ +D E Q  E++  + + E   N+I+R + + S+NN        +N+YP+P+ PD+QFEE+    
Subjt:  KRISKEEAAIAKIEEKLDRILNPAMTPHDSSVNVVNNDDEDQDFEDEIANEREEPFYNRIERISRR-SQNNA----SNQKNWYPQPSFPDIQFEEKTLQT

Query:  QAHYDGLAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQ
        QA YDG ++YEWNI+G +++ I+N++ EM+MAA AYK  G+ ++ QI  ++V+GFTG LK WWDKYL E  +     H +      IIK E      ++ 
Subjt:  QAHYDGLAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQ

Query:  HERIEDAVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTL
         E   D +NTLI+ +I+ F+G+P  YQER++ IL+NL C  L DFRWYKD++ SKV+ R D    +WKE F+ G+PK F+ RI++ +K  +N  IP++ L
Subjt:  HERIEDAVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTL

Query:  SYGSIASFIIEEGLRLCNESKIQNKL-NSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHR
        SYG I ++I +EGL++CN+ K++ K+    I   KELG FC QYG +  + PS S++K     PK +     R+ Y+ KP    K  Y + K    K  +
Subjt:  SYGSIASFIIEEGLRLCNESKIQNKL-NSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHR

Query:  GKKK-QTCFKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTD--SEQS----SEGEILELQEESDSYSSTEYESEQEGKRTCE--GCINVLTKDQ
        GK+    C+KC  +GHY+  C  + KINEL+I  ELK Q+ ++ + D  SE S    S+ ++  +++ +   SS +   E  G   CE  G IN LT++ 
Subjt:  GKKK-QTCFKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTD--SEQS----SEGEILELQEESDSYSSTEYESEQEGKRTCE--GCINVLTKDQ

Query:  EILLEVVEKVQDPEIQQKIAQRLRDAMTIPKPIEKEERNPYRLQSVLQRFE----KPRELTTQDLQREINNLKQEIQALRSETRSESYTLRKEILNLQER
        + L E++EK+ D E++Q   ++L++   +  PIEKEE+  Y    + QRF+    K   +T  DLQ E+ N+K+EI+ ++S+         KE+    E+
Subjt:  EILLEVVEKVQDPEIQQKIAQRLRDAMTIPKPIEKEERNPYRLQSVLQRFE----KPRELTTQDLQREINNLKQEIQALRSETRSESYTLRKEILNLQER

Query:  LPAQKETVQPNLDEEDFQSSFVGAITTSQFQKWYALVTLKI-YDFKITLKALIDTGADQNCIQEGLIPTKYFEKTTEGLRGANNNKLKINYKLSKVHVCN
           +K+    N  E   + + +  +T+   QKWY  + + +  +F++TL A+ID+GAD NCIQEGLIPTKY+EKTT  L GAN N L I YKL+ VH+C 
Subjt:  LPAQKETVQPNLDEEDFQSSFVGAITTSQFQKWYALVTLKI-YDFKITLKALIDTGADQNCIQEGLIPTKYFEKTTEGLRGANNNKLKINYKLSKVHVCN

Query:  DGICFVNSFLLVKDLGQELILAM
        +  C+    +LVKDL  ++IL +
Subjt:  DGICFVNSFLLVKDLGQELILAM

A0A6J1EW44 uncharacterized protein LOC111436618 (e value: 1.1e-127; % identity: 96.64)
Query:  MNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLKCPTLGDF+WYKDMYFSKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTKYNGTIPWQTLSYGSIAS IIEEGLRLCNESKIQNKLNSSISNRK
Subjt:  MNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQEL
        ELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETY NKPVQSQKP YSRRKYIPTKTHRGKKKQT FKCREEGHY NKCPIRGKINELDIDQEL
Subjt:  ELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQEL

Query:  KNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESE
        KNQLLRLTLTDSEQSSEGEILELQ+ESDSYSSTEYESE
Subjt:  KNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESE

A0A6J1EWB4 uncharacterized protein LOC111436716 (e value: 3.1e-170; % identity: 93.63)
Query:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD
        MNVVNEMMM AGAYKLQ HKSDHQIAQVLVT FTGQLKDWWDKYLDETTRQQILNHYV+RPTTQIIKEEGPSTRTEVQHER+EDAVNTL YTLIEFFVGD
Subjt:  MNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTLIEFFVGD

Query:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI
        PLKYQERSAEILMNLKCPTLGDFRWYKD+YFSKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTKYNGTI WQTLSY SIASFIIEEGLRLCNESKI
Subjt:  PLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKI

Query:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR
        QNKLNSSISNRKELGRFCDQYGCK IEAPSTSRRKKVKTHPKPYHSYRPRE YRNKPVQSQKPTYSRR+Y PTK H+GKK++TCFKCREEGHYANKCPIR
Subjt:  QNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIR

Query:  GKINELDIDQELKN
        GKINELDI  ELKN
Subjt:  GKINELDIDQELKN

A0A6J1EYM2 uncharacterized protein LOC111439730 (e value: 1.1e-159; % identity: 96.32)
Query:  MNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTK+NGTIPWQTLSYGSIASFIIEEGLRL NESKIQNKLNSSISNRK
Subjt:  MNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQEL
        ELGRFCDQYGCKGIEAPSTS R KVKTHPKPYHSYRPRETYRNKPVQSQKP YSRRKYIPTKTH GKKKQTCFKCREEGHYANKCPIRGKINELDIDQEL
Subjt:  ELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQEL

Query:  KNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPEIQQKIAQRLRDAMTIPKPIEKEERNPYRL
        KNQLL LTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPEIQQKIAQRLRDAMTI KP E+EERNPYRL
Subjt:  KNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPEIQQKIAQRLRDAMTIPKPIEKEERNPYRL

A0A6J1EZI3 uncharacterized protein LOC111440658 (e value: 4.4e-169; % identity: 82.26)
Query:  LAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIED
        +  YEWNIDGLSDYLIMNVVNEMMMAAGAYKLQ HKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHER+ED
Subjt:  LAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIED

Query:  AVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIA
        AVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNG+PKHFSRRIKDGLKTK                
Subjt:  AVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIA

Query:  SFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTC
                                   KELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYR+KPVQSQKPTYSRRKYIPTKTHRGKKKQTC
Subjt:  SFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKKKQTC

Query:  FKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSY--SSTEYESEQEGK
        FKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSS+GEIL+LQEESD     +T    +++G+
Subjt:  FKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSY--SSTEYESEQEGK

SwissProt top hits (e value, % identity, alignment)
P03542 Capsid protein (e value: 4.0e-10; % identity: 28.24)
Query:  QERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKL
        +E + +   N+    L DF  Y + Y SK+ I  + +L  ++                     + NGT      S G  A  + EE  ++C+ SK Q KL
Subjt:  QERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKL

Query:  NSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKI
                 +G    +YGCK   +     +K+ K   K Y  Y+ ++ +R+ K  + ++   S++KY P    +GKK   C+ C  EGHYAN+CP R   
Subjt:  NSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKI

Query:  NELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCE
         +  I Q+ +   L+      E   E  ILE +EE +  +STE   E +G  T E
Subjt:  NELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCE

P03543 Capsid protein (e value: 9.0e-10; % identity: 26.63)
Query:  DKYLDETTRQQILNHYVIRPTTQIIKEEGPSTR-TEVQHERIEDAVNTLIYTLI------EFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMY--FS
        + YLD  T   ++ H     T+ I KE   +TR      + IE  +N + YT+       +  V + +  QE++   +  L+   L D  + ++    + 
Subjt:  DKYLDETTRQQILNHYVIRPTTQIIKEEGPSTR-TEVQHERIEDAVNTLIYTLI------EFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMY--FS

Query:  KVLIRTD-SSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPST
        K + +T+ +    +   +++ +P     +     + + NGT      S G  A  + EE  ++C+ SK Q KL         +G    +YG K      T
Subjt:  KVLIRTD-SSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPST

Query:  SRRKKVKTHPKPYHSYRP---RETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQS
        S++K  K + K Y  Y+P   ++ +R+ K  + ++   S+RKY P    +GKK   C+ C  EGHYAN+CP R    +  I Q+ +N  L+      E  
Subjt:  SRRKKVKTHPKPYHSYRP---RETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQS

Query:  SEGEILELQEESDSYSSTEYESE
         E  ILE +EE +  S+ E + E
Subjt:  SEGEILELQEESDSYSSTEYESE

P03544 Capsid protein (e value: 1.4e-10; % identity: 26.53)
Query:  DKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTL--IEFFVGDPLKYQERSAEILM--------------------NLKCPT
        + YLD  T   ++ H       ++I+    +  T    E++ DA+ T+   L   +  V + ++ QE+ A+I M                    N+    
Subjt:  DKYLDETTRQQILNHYVIRPTTQIIKEEGPSTRTEVQHERIEDAVNTLIYTL--IEFFVGDPLKYQERSAEILM--------------------NLKCPT

Query:  LGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCD
        L DF  Y + Y SK+ I  + +L  ++                     + NGT      S G  A  + EE  ++C+ +K Q KL         +G    
Subjt:  LGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCD

Query:  QYGCKGIEAPSTSRRKKVKTHPKPYHSYRP---RETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQELKNQ
        +YGCK      TS++K  K + K Y +Y+P   ++ +R+ K  + ++   S++KY P    +GKK   C+ C  EGHYAN+CP R    +  I Q+ +  
Subjt:  QYGCKGIEAPSTSRRKKVKTHPKPYHSYRP---RETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKINELDIDQELKNQ

Query:  LLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCE
         L+      E   E  ILE +EE +  +STE   E +G  T E
Subjt:  LLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCE

Q02951 Capsid protein (e value: 4.9e-08; % identity: 27.84)
Query:  QERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKL
        +E + +   N+    L DF  Y + Y SK+ I  + +L  ++                     + NGT      S G  A  + EE  ++C  SK Q KL
Subjt:  QERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTLSYGSIASFIIEEGLRLCNESKIQNKL

Query:  NSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKI
                 +G    +YGCK   +      K+ K   K Y  Y+ ++ +R+ K  + ++   S++KY P    +GKK   C+    EGHYAN+CP R   
Subjt:  NSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRN-KPVQSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRGKI

Query:  NELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCE
         +  I Q+ +   L+      E   E  ILE +EE +  +STE   E +G  T E
Subjt:  NELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCE

Q7TD09 Capsid protein (e value: 1.8e-05; % identity: 26.29)
Query:  ERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTL--SYGSIASFIIEEGLRLCNESKIQNK
        E++   L N+    L  F  Y   Y     + T      + E F+  +P  F R++ +G + +        TL  +   +   I EE ++      I   
Subjt:  ERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPWQTL--SYGSIASFIIEEGLRLCNESKIQNK

Query:  LNSSISNRKELGRFCDQYGCKGI-EAPSTSRRKKVKTHPKPYHSYRPRETYRNKPV-QSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRG
        +        E+ +   +YGC  I +   TS++K  K   KP    + +  ++ +   + +K    ++K+ PT    GKK   C+ C EEGHYAN+CP + 
Subjt:  LNSSISNRKELGRFCDQYGCKGI-EAPSTSRRKKVKTHPKPYHSYRPRETYRNKPV-QSQKPTYSRRKYIPTKTHRGKKKQTCFKCREEGHYANKCPIRG

Query:  KINELD-IDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQ
        K    D +   ++ +       +SE S   EI E+ EE DS SS+E + EQ
Subjt:  KINELD-IDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQ

Arabidopsis top hits (e value, % identity, alignment)
No hits found
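
Each alignment block in the Homology section has a Query line, a middle line, and a Subjt line. In BLAST-style output, a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities for a single block under that convention. The function name `midline_identity` is hypothetical, and the inputs are assumed to have the `Query:` label and its padding already removed. It is not meant to reproduce the reported % identity values, which cover whole alignments.

```python
def midline_identity(query, midline):
    """Return (identical_positions, block_length) for one alignment block.

    `query` is the aligned query segment and `midline` is the matching
    middle line, both without the leading 'Query:' label or padding.
    """
    # Trailing spaces on the middle line are often stripped; restore them.
    midline = midline.ljust(len(query))
    # A letter on the middle line marks an identical residue.
    identical = sum(1 for m in midline[:len(query)] if m.isalpha())
    return identical, len(query)


# Final block of the first GenBank hit (XP_022930200.1).
identical, length = midline_identity("GKINELDIDQELKN", "GKINELDI  ELKN")
print(identical, length)
# prints: 12 14
```

Summing both counts over every block of a hit and dividing gives an identity fraction for the aligned region.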

Sequences
CDS sequence
ATGTATGACCCCTTGAAGGTAAAAGAGGTCAGTTATGATCAAAAAAGGGCGTCAATTCACTATGAAGATGGCTCAAGGTCTCCAACCCATACAGATATGGATACT
CAATCCGTCTACGAAAGCCAGCTAAATGTCATTACAGCTGATTTCCAGATAGACAAAGAATTTTTGAAAGCAGATTTTATGTCAACTACCAACTCTTCAAAAAGA
AATGCCTTCTTCAAAACCTACAAAGAAGCTGAGCGAAATGAATTAAGAGCTCAGTGGTATTCTCATATGGAAGCCATCAAAGAAAATATACCATTCTTCGAATGG
TTTGAAGAAAATATTCTGCAGATATGCACCTTGACCCAGAGAAGTTGGAAGACTACCAAAAGAGGCGAGGTATATTCAAAACATCCTCCGTTAGAAGAAGTCGAA
TTCGACAACTACTACGGAGAAAAGGTCAAAGCTAGCCCTTTCAAACATATTCCAGAAGGACTAGAAAAGGGTAATCCGACCCTAAAAGATATCAAGAATATCCAG
CATCAAAACAATTATTCAAATAAGATTTTATCCACGATCGCTACTCAACTTGAAAGTATCGAGGGAAAAATCTCAAAGAAATCAGGAAGTCTGCAGGAACAGGCA
AGCAGCTCTGTACCAAAGGTTGATGAGTCAATACCAATACTCAGACCGGCAAATCTCGAAATCTTTACAAAAAGGATTTCAAAAGAAGAGGCTGCGATAGCCAAG
ATAGAAGAAAAGTTAGACAGGATATTAAATCCGGCCATGACGCCTCATGATTCATCTGTCAATGTCGTTAACAATGACGACGAGGATCAAGATTTCGAAGACGAG
ATTGCAAATGAGAGAGAAGAACCTTTCTATAATCGAATTGAAAGGATCTCTAGAAGAAGTCAAAATAATGCCAGTAATCAGAAAAACTGGTATCCTCAACCGTCT
TTTCCAGATATCCAGTTCGAAGAAAAGACACTACAAACTCAAGCTCATTACGATGGTTTAGCAATCTATGAATGGAATATCGATGGATTGTCTGACTATCTGATA
ATGAATGTTGTCAATGAGATGATGATGGCCGCAGGCGCGTATAAATTACAAGGCCATAAATCCGACCATCAGATAGCTCAAGTTTTGGTAACCGGATTTACCGGG
CAACTCAAGGATTGGTGGGACAAATATCTCGACGAAACAACCCGTCAACAGATATTAAACCACTATGTCATCAGACCAACTACTCAAATTATCAAAGAAGAAGGT
CCATCAACTAGGACCGAGGTACAACATGAAAGAATAGAGGATGCCGTCAACACCCTCATATATACCCTCATAGAATTCTTCGTCGGCGACCCTCTAAAATACCAG
GAGAGATCTGCCGAAATACTCATGAATCTGAAATGCCCTACCTTAGGTGACTTCAGGTGGTACAAAGACATGTACTTCAGCAAAGTTCTTATTAGAACAGATAGT
TCGTTGGAATTCTGGAAAGAGAATTTCGTCAATGGAGTACCAAAACACTTCTCAAGAAGGATCAAAGATGGTCTTAAGACAAAGTACAATGGAACGATTCCATGG
CAAACTTTGTCATACGGATCCATAGCATCCTTCATCATAGAAGAAGGGCTCAGACTTTGTAATGAGTCAAAGATTCAAAACAAGCTCAATTCTTCGATATCAAAC
AGAAAAGAGCTTGGTAGATTTTGCGATCAATATGGATGCAAGGGAATAGAAGCTCCCTCAACCTCCAGGCGAAAAAAGGTTAAGACGCATCCAAAACCTTATCAT
TCATACCGGCCTAGAGAAACATATCGGAATAAACCCGTGCAGTCTCAGAAGCCAACATATTCAAGAAGGAAGTATATTCCTACAAAGACCCATAGAGGAAAGAAA
AAGCAAACTTGCTTCAAATGTCGTGAAGAAGGACACTATGCTAATAAGTGTCCAATCAGGGGAAAGATCAATGAGTTGGATATAGATCAAGAGCTGAAAAATCAG
CTATTGCGACTGACCCTAACCGACTCAGAGCAATCAAGCGAGGGAGAAATCCTTGAACTCCAAGAAGAATCTGATTCATACTCTAGCACTGAATACGAATCAGAA
CAAGAAGGAAAAAGGACGTGCGAAGGATGCATAAATGTCCTTACCAAAGATCAAGAAATTCTGCTCGAAGTAGTAGAAAAGGTTCAAGACCCAGAAATTCAACAG
AAGATTGCTCAACGACTTAGGGATGCCATGACAATTCCTAAGCCAATTGAAAAAGAAGAAAGAAATCCCTACAGACTCCAATCTGTTCTCCAAAGATTTGAAAAA
CCGAGGGAACTCACTACTCAAGATTTACAAAGAGAAATTAATAATCTCAAACAAGAAATACAGGCGCTAAGATCTGAAACCAGGTCAGAATCCTATACGCTTCGT
AAAGAAATTCTCAATCTTCAGGAAAGGCTGCCAGCACAAAAGGAAACGGTGCAACCAAACCTGGACGAAGAAGATTTTCAAAGTTCCTTCGTTGGAGCCATCACC
ACCTCTCAATTTCAAAAATGGTACGCTCTCGTTACCTTAAAAATTTATGATTTCAAAATCACACTCAAGGCTCTCATAGATACTGGAGCCGATCAAAATTGCATT
CAAGAAGGCCTAATACCTACAAAGTATTTTGAAAAAACTACTGAGGGCCTTAGAGGTGCTAACAATAACAAACTCAAAATTAATTACAAACTATCAAAAGTTCAT
GTTTGCAATGATGGAATTTGCTTCGTAAATTCCTTCCTTCTAGTAAAAGACTTAGGACAAGAACTAATCTTAGCAATGACCGGAAAGAAAGGGATAGAAAAGGGG
AAGAAGCCCAATGCTTCTTCAAGTACACCTGCTCCAATGACATCCGATAGCTATGCAATGGATTCTGGTTTCACATTGGTGACCAGATCTAAAGCAAAGCTTTCT
CAAAAACAGGCGGAGGTTATCACTCCGACAAGGCCTTCAGCCTCTTCAACACGGCCATCTGCCGTTACTCCATCGAGACCCACTGTCTCTTCCACTTCAAAGGGA
CCTTCTGTCCCCTCGACTTATTCGGACGCGGTTGTTCCCGTTCGTTTTTCTCCAGTTCCGGAAATCAAAACCTATTTTCAAAAATCTGTGTCTATACCAGAAGCT
CTGGTAGAACCAGAATACGACGACCCGAAAATCGGCGAAGTCGTTAAAAAAGCCTATCCATCGGGATTCTTTTATATCCCAGAAGATCTTCATAAAACCAGAAGG
TTGGCTTTAGGATGGTTGATGGTTGGTATGATGAAGAAACATGCAAGGCGAGGTGGAGCCGGGTGCCTTCAGGCATGCGACATGAACCTTGTTGAGTTCGGGACG
TCGTGGTGA
mRNA sequence
ATGTATGACCCCTTGAAGGTAAAAGAGGTCAGTTATGATCAAAAAAGGGCGTCAATTCACTATGAAGATGGCTCAAGGTCTCCAACCCATACAGATATGGATACT
CAATCCGTCTACGAAAGCCAGCTAAATGTCATTACAGCTGATTTCCAGATAGACAAAGAATTTTTGAAAGCAGATTTTATGTCAACTACCAACTCTTCAAAAAGA
AATGCCTTCTTCAAAACCTACAAAGAAGCTGAGCGAAATGAATTAAGAGCTCAGTGGTATTCTCATATGGAAGCCATCAAAGAAAATATACCATTCTTCGAATGG
TTTGAAGAAAATATTCTGCAGATATGCACCTTGACCCAGAGAAGTTGGAAGACTACCAAAAGAGGCGAGGTATATTCAAAACATCCTCCGTTAGAAGAAGTCGAA
TTCGACAACTACTACGGAGAAAAGGTCAAAGCTAGCCCTTTCAAACATATTCCAGAAGGACTAGAAAAGGGTAATCCGACCCTAAAAGATATCAAGAATATCCAG
CATCAAAACAATTATTCAAATAAGATTTTATCCACGATCGCTACTCAACTTGAAAGTATCGAGGGAAAAATCTCAAAGAAATCAGGAAGTCTGCAGGAACAGGCA
AGCAGCTCTGTACCAAAGGTTGATGAGTCAATACCAATACTCAGACCGGCAAATCTCGAAATCTTTACAAAAAGGATTTCAAAAGAAGAGGCTGCGATAGCCAAG
ATAGAAGAAAAGTTAGACAGGATATTAAATCCGGCCATGACGCCTCATGATTCATCTGTCAATGTCGTTAACAATGACGACGAGGATCAAGATTTCGAAGACGAG
ATTGCAAATGAGAGAGAAGAACCTTTCTATAATCGAATTGAAAGGATCTCTAGAAGAAGTCAAAATAATGCCAGTAATCAGAAAAACTGGTATCCTCAACCGTCT
TTTCCAGATATCCAGTTCGAAGAAAAGACACTACAAACTCAAGCTCATTACGATGGTTTAGCAATCTATGAATGGAATATCGATGGATTGTCTGACTATCTGATA
ATGAATGTTGTCAATGAGATGATGATGGCCGCAGGCGCGTATAAATTACAAGGCCATAAATCCGACCATCAGATAGCTCAAGTTTTGGTAACCGGATTTACCGGG
CAACTCAAGGATTGGTGGGACAAATATCTCGACGAAACAACCCGTCAACAGATATTAAACCACTATGTCATCAGACCAACTACTCAAATTATCAAAGAAGAAGGT
CCATCAACTAGGACCGAGGTACAACATGAAAGAATAGAGGATGCCGTCAACACCCTCATATATACCCTCATAGAATTCTTCGTCGGCGACCCTCTAAAATACCAG
GAGAGATCTGCCGAAATACTCATGAATCTGAAATGCCCTACCTTAGGTGACTTCAGGTGGTACAAAGACATGTACTTCAGCAAAGTTCTTATTAGAACAGATAGT
TCGTTGGAATTCTGGAAAGAGAATTTCGTCAATGGAGTACCAAAACACTTCTCAAGAAGGATCAAAGATGGTCTTAAGACAAAGTACAATGGAACGATTCCATGG
CAAACTTTGTCATACGGATCCATAGCATCCTTCATCATAGAAGAAGGGCTCAGACTTTGTAATGAGTCAAAGATTCAAAACAAGCTCAATTCTTCGATATCAAAC
AGAAAAGAGCTTGGTAGATTTTGCGATCAATATGGATGCAAGGGAATAGAAGCTCCCTCAACCTCCAGGCGAAAAAAGGTTAAGACGCATCCAAAACCTTATCAT
TCATACCGGCCTAGAGAAACATATCGGAATAAACCCGTGCAGTCTCAGAAGCCAACATATTCAAGAAGGAAGTATATTCCTACAAAGACCCATAGAGGAAAGAAA
AAGCAAACTTGCTTCAAATGTCGTGAAGAAGGACACTATGCTAATAAGTGTCCAATCAGGGGAAAGATCAATGAGTTGGATATAGATCAAGAGCTGAAAAATCAG
CTATTGCGACTGACCCTAACCGACTCAGAGCAATCAAGCGAGGGAGAAATCCTTGAACTCCAAGAAGAATCTGATTCATACTCTAGCACTGAATACGAATCAGAA
CAAGAAGGAAAAAGGACGTGCGAAGGATGCATAAATGTCCTTACCAAAGATCAAGAAATTCTGCTCGAAGTAGTAGAAAAGGTTCAAGACCCAGAAATTCAACAG
AAGATTGCTCAACGACTTAGGGATGCCATGACAATTCCTAAGCCAATTGAAAAAGAAGAAAGAAATCCCTACAGACTCCAATCTGTTCTCCAAAGATTTGAAAAA
CCGAGGGAACTCACTACTCAAGATTTACAAAGAGAAATTAATAATCTCAAACAAGAAATACAGGCGCTAAGATCTGAAACCAGGTCAGAATCCTATACGCTTCGT
AAAGAAATTCTCAATCTTCAGGAAAGGCTGCCAGCACAAAAGGAAACGGTGCAACCAAACCTGGACGAAGAAGATTTTCAAAGTTCCTTCGTTGGAGCCATCACC
ACCTCTCAATTTCAAAAATGGTACGCTCTCGTTACCTTAAAAATTTATGATTTCAAAATCACACTCAAGGCTCTCATAGATACTGGAGCCGATCAAAATTGCATT
CAAGAAGGCCTAATACCTACAAAGTATTTTGAAAAAACTACTGAGGGCCTTAGAGGTGCTAACAATAACAAACTCAAAATTAATTACAAACTATCAAAAGTTCAT
GTTTGCAATGATGGAATTTGCTTCGTAAATTCCTTCCTTCTAGTAAAAGACTTAGGACAAGAACTAATCTTAGCAATGACCGGAAAGAAAGGGATAGAAAAGGGG
AAGAAGCCCAATGCTTCTTCAAGTACACCTGCTCCAATGACATCCGATAGCTATGCAATGGATTCTGGTTTCACATTGGTGACCAGATCTAAAGCAAAGCTTTCT
CAAAAACAGGCGGAGGTTATCACTCCGACAAGGCCTTCAGCCTCTTCAACACGGCCATCTGCCGTTACTCCATCGAGACCCACTGTCTCTTCCACTTCAAAGGGA
CCTTCTGTCCCCTCGACTTATTCGGACGCGGTTGTTCCCGTTCGTTTTTCTCCAGTTCCGGAAATCAAAACCTATTTTCAAAAATCTGTGTCTATACCAGAAGCT
CTGGTAGAACCAGAATACGACGACCCGAAAATCGGCGAAGTCGTTAAAAAAGCCTATCCATCGGGATTCTTTTATATCCCAGAAGATCTTCATAAAACCAGAAGG
TTGGCTTTAGGATGGTTGATGGTTGGTATGATGAAGAAACATGCAAGGCGAGGTGGAGCCGGGTGCCTTCAGGCATGCGACATGAACCTTGTTGAGTTCGGGACG
TCGTGGTGATGCCTCTCAGCCTAAGCCCTATGGTGACAGCGGTGTGGTGGCAGAACATGCACGGTGAGGTAGAGTCGGGTGCCTTTAGGCGTGATATGAGCCTTG
ATGGGCATCAGATGTTGTTGCGTTGCCTCCCAACCCCCCCACCACACACTTTCCTTCACCCTTTGTACACAGCTTTTTGTATGTTGCGAAATTAAGGCCCTAGGA
AGTGTTAGTAGGGATGTGGGAGTTGCTCGGAGCGAAGAATTTTGAGTGAAGTCAAGCTTATGGGGATATCGTGAGTCAATACACTTGACTTATATTTGATTTTGT
TTTTAGAAGATTGTTTTTGTGGTG
Protein sequence
MYDPLKVKEVSYDQKRASIHYEDGSRSPTHTDMDTQSVYESQLNVITADFQIDKEFLKADFMSTTNSSKRNAFFKTYKEAERNELRAQWYSHMEAIKENIPFFEW
FEENILQICTLTQRSWKTTKRGEVYSKHPPLEEVEFDNYYGEKVKASPFKHIPEGLEKGNPTLKDIKNIQHQNNYSNKILSTIATQLESIEGKISKKSGSLQEQA
SSSVPKVDESIPILRPANLEIFTKRISKEEAAIAKIEEKLDRILNPAMTPHDSSVNVVNNDDEDQDFEDEIANEREEPFYNRIERISRRSQNNASNQKNWYPQPS
FPDIQFEEKTLQTQAHYDGLAIYEWNIDGLSDYLIMNVVNEMMMAAGAYKLQGHKSDHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVIRPTTQIIKEEG
PSTRTEVQHERIEDAVNTLIYTLIEFFVGDPLKYQERSAEILMNLKCPTLGDFRWYKDMYFSKVLIRTDSSLEFWKENFVNGVPKHFSRRIKDGLKTKYNGTIPW
QTLSYGSIASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFCDQYGCKGIEAPSTSRRKKVKTHPKPYHSYRPRETYRNKPVQSQKPTYSRRKYIPTKTHRGKK
KQTCFKCREEGHYANKCPIRGKINELDIDQELKNQLLRLTLTDSEQSSEGEILELQEESDSYSSTEYESEQEGKRTCEGCINVLTKDQEILLEVVEKVQDPEIQQ
KIAQRLRDAMTIPKPIEKEERNPYRLQSVLQRFEKPRELTTQDLQREINNLKQEIQALRSETRSESYTLRKEILNLQERLPAQKETVQPNLDEEDFQSSFVGAIT
TSQFQKWYALVTLKIYDFKITLKALIDTGADQNCIQEGLIPTKYFEKTTEGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILAMTGKKGIEKG
KKPNASSSTPAPMTSDSYAMDSGFTLVTRSKAKLSQKQAEVITPTRPSASSTRPSAVTPSRPTVSSTSKGPSVPSTYSDAVVPVRFSPVPEIKTYFQKSVSIPEA
LVEPEYDDPKIGEVVKKAYPSGFFYIPEDLHKTRRLALGWLMVGMMKKHARRGGAGCLQACDMNLVEFGTSW
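
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not name explicitly. The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGTATGACCCCTTG"))  # first 15 nt of the CDS
# prints: MYDPL
print(translate("ACGTCGTGGTGA"))     # last 12 nt of the CDS
# prints: TSW*
```

Both fragments agree with the start (`MYDPL`) and end (`TSW`) of the protein sequence listed above, with the trailing `*` corresponding to the `TGA` stop codon.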