; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21380 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21380
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCCHC-type domain-containing protein
Genome locationChr3:17619808..17628813
RNA-Seq ExpressionCSPI03G21380
SyntenyCSPI03G21380
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN64335.1 hypothetical protein VITISV_001808 [Vitis vinifera]6.1e-19134.99Show/hide
Query:  EGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEI
        E  S  R P+F G++Y YWKARM  +LQS D ++W ++  GP  P K VD V  PK ++E++E + +    NAKA+  L CA+ ++E+NRI  C SA+EI
Subjt:  EGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEI

Query:  WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSL
        W  LEITHEGTNQVKESKI++ VHNYELF M   ETI +M TRFT I+N L+ LG+V   SE V KILRSLP  W  KVTAIQEAKDLTKLP+EEL+GSL
Subjt:  WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSL

Query:  MTHEI-IMKEHLEDESKKKKSIALK-TISLEVDPEDEDDFDK-DDIAYFSRKYKNFIKRKKYFKKHLSTQK-------ESKGEKSK---KDEVICYECKR
        MT+EI + K+  E E KKKK+IALK T   E D E+E   ++ DD+A  +RK   +++ +++  K  ++++        S G+K K   K ++IC++CK 
Subjt:  MTHEI-IMKEHLEDESKKKKSIALK-TISLEVDPEDEDDFDK-DDIAYFSRKYKNFIKRKKYFKKHLSTQK-------ESKGEKSK---KDEVICYECKR

Query:  SGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDEHDDKVTLE-PLSIDELFENFE--------------SMQNDLEKL
         GHI+ DCPL K  +K+  KKAM ATW +S ES  E + +E+AN+  MA  D D+ + +    +  +  +EL+E+FE               ++ +LE++
Subjt:  SGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDEHDDKVTLE-PLSIDELFENFE--------------SMQNDLEKL

Query:  SSKYVV-------LKKKYNVLISENKSL------------------------LDTIACFKENE-------------------------------------
          K+ +       L+K++ +L +EN+ L                        L +  C KE++                                     
Subjt:  SSKYVV-------LKKKYNVLISENKSL------------------------LDTIACFKENE-------------------------------------

Query:  ---------------------------NFEQIEEL-----NVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKEN---------------------
                                   N   I +L      V  +   CI KD   DK  F+ H  CE    I + K +                     
Subjt:  ---------------------------NFEQIEEL-----NVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKEN---------------------

Query:  --ELSVLQELDKAK--ETIKKLTIGAQRLDKIIEVGK----SYGDKR-----------GLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHV-----SNH
           + ++ +L+K +    + K+     ++ +  ++GK    S+ +K             +     S TPS    ++  A  IV  F+    V      + 
Subjt:  --ELSVLQELDKAK--ETIKKLTIGAQRLDKIIEVGK----SYGDKR-----------GLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHV-----SNH

Query:  VKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFY------IAPRKNFSN-----KSRVCLKASK--------------------------
            F   C+    E      C +  + +  +   +F +    Y      +APR +  N     K+R   + ++                          
Subjt:  VKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFY------IAPRKNFSN-----KSRVCLKASK--------------------------

Query:  ---------------KNK-----WYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNESICS
                       KNK     ++   GC   +   +  L  F  K+   +       GYS++SKA+RVFNK+T+V+EESI+  +   W N    +   
Subjt:  ---------------KNK-----WYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNESICS

Query:  DDLEK-----------------DFGDLLVNDKGKEIVP------------RSIFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRP
        D+ +K                 +    ++N    +I+             R+I +NLAF+SQIEP++ KDA  DE W++AMQ+ELNQFER++VW+LVPRP
Subjt:  DDLEK-----------------DFGDLLVNDKGKEIVP------------RSIFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRP

Query:  SNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------------------VEQPPGFESFDL
        SN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAP                                        VEQPPGF+SF+ 
Subjt:  SNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------------------VEQPPGFESFDL

Query:  PNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIK
        PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K NDML+VQIYVDDI FG+TN SLCE+FSKCMH                    
Subjt:  PNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIK

Query:  QLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVCFLVF--------------------
                   KY +DLLK+F + E KV KTPMS++ KLD DEKGK +D    RGMIGSLLYLTASRPD ++ +     F                    
Subjt:  QLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVCFLVF--------------------

Query:  --------------------------------------LHFMHLSLC---SKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDN
                                               H +  SL    SKKQ  ++ S+ E EY A +   +++LW++  L D       H  + CDN
Subjt:  --------------------------------------LHFMHLSLC---SKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDN

Query:  YNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL
         + I I+ N V H RTKHIE   HF++ H +   + L+ +ST DQ  DIFTK LS  +F+ +  +L
Subjt:  YNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL

RVW80634.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.2e-20441.34Show/hide
Query:  EGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEI
        E  S  R P+F G++Y YWKARM  +LQS D ++W ++  GP  P K VD V  PK ++E++E + +    NAKA+  L CA+ ++E+NRI  C SA+EI
Subjt:  EGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEI

Query:  WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSL
        W  LEITHEGTNQVKESKI++ VHNYELF M   ETI +M TRFT I+N L+ LG+V   SE V KILRSLP  W  KVTAIQEAKDLTKLP+EEL+GSL
Subjt:  WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSL

Query:  MTHEIIMKEHL-EDESKKKKSIALK-TISLEVDPEDEDDFDK-DDIAYFSRKYKNFIKRKKYFKKHLSTQK-------ESKGEKSK---KDEVICYECKR
        MT+EI + + L E E KKKKSIALK T   E D E+E   ++ DD+A  +RK   +++ +++  K  ++++        S G+K K   K ++IC++CK+
Subjt:  MTHEIIMKEHL-EDESKKKKSIALK-TISLEVDPEDEDDFDK-DDIAYFSRKYKNFIKRKKYFKKHLSTQK-------ESKGEKSK---KDEVICYECKR

Query:  SGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLI
         GHI+ DCPL K  +K+  KKAM ATW +S ES  E + +E+AN+  MA  D D+                                             
Subjt:  SGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLI

Query:  SENKSLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYG
                                                                                                            
Subjt:  SENKSLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYG

Query:  DKRGLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCLK
                                                                                                            
Subjt:  DKRGLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCLK

Query:  ASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSN
         SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN K                      GYS++SKA+RVFNK+T+V+EESI+V+FDES N++  
Subjt:  ASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSN

Query:  ESICSDD--LEKDFGDLLVNDK----------GKEIVP----------------------------------------------RSIFSNLAFVSQIEPR
             DD  LE   G L + DK           KE  P                                              R+I +NLAF+SQIEP+
Subjt:  ESICSDD--LEKDFGDLLVNDK----------GKEIVP----------------------------------------------RSIFSNLAFVSQIEPR

Query:  SFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP------------------
        + KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAP                  
Subjt:  SFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP------------------

Query:  ----------------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGS
                              VEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K  DML+VQIYVDDIIFG+
Subjt:  ----------------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGS

Query:  TNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTAS
        TN SLCE+FSKCMH+EFEMSMMGEL++FLGLQIKQLK+  FI+Q KY +DLLK+F + E KV KTPMS++ KLD DEKGK +D    RGMIGSLLYLTAS
Subjt:  TNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTAS

Query:  RPDTIWKL
        RPD ++ +
Subjt:  RPDTIWKL

RVW98982.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.8e-20439.02Show/hide
Query:  EGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEI
        E  S  R P+F G++Y YWKARM  +LQS D ++W ++  GP  P K VD V  PK ++E++E + +    NAKA+  L CA+ ++E+NRI  C SA+EI
Subjt:  EGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEI

Query:  WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSL
        W  LEITHEGTNQVKESKI++ VHNYELF M   ETI +M TRFT I+N L+ LG+V   SE V KILRSLP  W  KVTAIQEAKDLTKLP+EEL+GSL
Subjt:  WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSL

Query:  MTHEIIMKEHL-EDESKKKKSIALK-TISLEVDPEDEDDFDK-DDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKS---------KKDEVICYECKRS
        MT+EI + + L E E KKKKSIALK T   E D E+E   ++ DD+A  +RK   +++ +++  K  S+++ S+ E S         +K ++IC++CK+ 
Subjt:  MTHEIIMKEHL-EDESKKKKSIALK-TISLEVDPEDEDDFDK-DDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKS---------KKDEVICYECKRS

Query:  GHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLIS
        GHI+ DCPL K  +K+  KKAM ATW +S ES  E + +E+AN+  MA  D D+                                              
Subjt:  GHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLIS

Query:  ENKSLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGD
                                                                                                            
Subjt:  ENKSLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGD

Query:  KRGLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCLKA
                                                                                                            
Subjt:  KRGLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCLKA

Query:  SKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNE
        SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN K                      GYS++SKA+RVFNK+T+V+EESI+V+FDES N++   
Subjt:  SKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNE

Query:  SICSDD--LEKDFGDLLVNDK----------GKEIVP----------------------------------------------RSIFSNLAFVSQIEPRS
            DD  LE   G L + DK           KE  P                                              R+I +NLAF+SQIEP++
Subjt:  SICSDD--LEKDFGDLLVNDK----------GKEIVP----------------------------------------------RSIFSNLAFVSQIEPRS

Query:  FKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP-------------------
         KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAP                   
Subjt:  FKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP-------------------

Query:  ---------------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGST
                             VEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K  DML+VQIYVDDIIFG+T
Subjt:  ---------------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGST

Query:  NSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASR
        N SLCE+FSKCMH+EFEMSMMGEL++FLGLQIKQLK+  FI+Q KY +DLLK+F + E KV KTPMS++ KLD DEKGK +D    RGMIGSLLYLTASR
Subjt:  NSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASR

Query:  PDTIWKLVCFLVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERT--KHIENNCHFV
        PD I   VC            C K+  +   S+++  +  L    +  LW          P+        DN+ +I  +  D    R   K     CHF+
Subjt:  PDTIWKLVCFLVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERT--KHIENNCHFV

Query:  QHHLES------NTLHLQSIST---------IDQPVDIFTKALS---PHR---FTLLLHKLKKIEV
         H L S      N++ L +            +++PVD F  A S   P     FT+L H  K + +
Subjt:  QHHLES------NTLHLQSIST---------IDQPVDIFTKALS---PHR---FTLLLHKLKKIEV

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]0.0e+0098.67Show/hide
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR
        MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR

Query:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK
        ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK
Subjt:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK

Query:  LPLEELIGSLMTHEIIMKEHLEDESKKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH
        LPLEELIGSLMTHEIIMKEHLEDESKKKKSIALKTISLEVDPEDED  D+DDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH
Subjt:  LPLEELIGSLMTHEIIMKEHLEDESKKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH

Query:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENKSL
        IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENKSL
Subjt:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENKSL

Query:  LDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLG
        LDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLG
Subjt:  LDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLG

Query:  YIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCLKASKKNK
        YIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRV  K   KNK
Subjt:  YIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCLKASKKNK

XP_042972981.1 uncharacterized protein LOC122304782, partial [Carya illinoinensis]3.4e-16536.48Show/hide
Query:  QSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEIWN
        QS+SRPP F G NY++WK RM+I+LQ+    +W  +  GPY+P K V  V   K EEE+D  + +  + N  A+N LY AL+ +EFNRI  C++A+EIW+
Subjt:  QSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEIWN

Query:  TLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSLMT
         LE+T+EGT+QVKESKI +  H YE+FKM+ +E+I+ M TRFTNIIN+L  LGKVY+  E VRKIL SLPK WE+KVTAI EA+DL KL + ELIGSL+T
Subjt:  TLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSLMT

Query:  HEIIMKEHLE-------------------DESKKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLS-TQKESKG-EKSKKDEVI
         +     H+                     ++ K K + +  I       +E     +D+        N +   +   K  + T K  K    +  D  I
Subjt:  HEIIMKEHLE-------------------DESKKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLS-TQKESKG-EKSKKDEVI

Query:  CYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLS-----SKYVVL
        C+   R+ ++ T    +   + + + A+      S+++E+       LG          H +   +  LS ++L           +K+       K    
Subjt:  CYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLS-----SKYVVL

Query:  KKKYNVLISENKSL-LDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDK
          K    IS  + L L  +  F  N    ++  L       V ++  +    V FL H            K+   +   +L K  +  K  TI + R D+
Subjt:  KKKYNVLISENKSL-LDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDK

Query:  IIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNF
            GK + +K    + DE+    +            P+    N V      S   +      E ++    +    +   Y       R+K    P + +
Subjt:  IIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNF

Query:  SNKSRVCLKASKKNKWYLD-SGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNESICSDDLEKDF
        + K        K N  Y    GC   +  DR  L  F  K+   +       GYS+ SKAYRVFNKKTL ++ES++VVFDE                   
Subjt:  SNKSRVCLKASKKNKWYLD-SGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNESICSDDLEKDF

Query:  GDLLVNDKGKEIVPRSIFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ
                                 QIEP++  DA  DE WILAMQEELNQFERN VW LVPRP N +IIGTKWVFRNK DE+G I RNKARLVAQG+ Q
Subjt:  GDLLVNDKGKEIVPRSIFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ

Query:  EEGIDYEETFAP----------------------------------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKM
        EEGIDY+ET+AP                                        VEQPPGFE+   PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  
Subjt:  EEGIDYEETFAP----------------------------------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKM

Query:  GKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTT
        GKID TLFIK +N+D+L++QIYVDDIIFG+TN ++C+ F+K M  EFEMSMMGEL+FFLGLQIKQ K + FI+Q KY ++LLKKF +   K   TPMS +
Subjt:  GKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTT

Query:  TKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVCFLV---------------------------------------------------------
        TKLDKDE GK VD K  RGMIGSLLYLTASRPD ++  VC                                                            
Subjt:  TKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVCFLV---------------------------------------------------------

Query:  -----FLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTL
             FL    +S  SKKQ  V+ S+ E EY A     +++L+++  L D      +H  + CDN + I ++ N + H RTKHIE   HF++ H++   +
Subjt:  -----FLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTL

Query:  HLQSISTIDQPVDIFTK
         L+  +T DQ  DIFTK
Subjt:  HLQSISTIDQPVDIFTK

TrEMBL top hitse value%identityAlignment
A0A2N9EG92 CCHC-type domain-containing protein5.7e-21139Show/hide
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR
        MAN     ++EGQST RPP F GS+Y YWK RM +Y++  DY++W I+A GP++P K V+     KLE E++E +++    N KA++ LYCAL   E+NR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR

Query:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK
        +S C SA+EIW+ LE+T+EGTNQVKESK++M VH YELF M  +E I++M TRFTNI+N+LK LGK+YT  ENVRKILRSLPK WEAK+TAI EA+DL  
Subjt:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK

Query:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
        L LEEL GSLMT+E+ M   +E+E  K KK+ ALK+   + D  +E+  ++++IA  +R +K F+K+KK F +    + E+KGE SK +   CY+CK+ G
Subjt:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

Query:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK
        H + +CP +   K K KKKA+K TWDDS ES+S+                                                                  
Subjt:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK

Query:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG
                             N SSD  V                      NL                                               
Subjt:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG

Query:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK
        LGYI+ES                        ++S    +SF P+  N   E      C                             ++   VCL   S 
Subjt:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK

Query:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV
        K+KW+LDSGCSRHMTGD++K  S + K+GG V FGDN K                      GYS+ SKAYRVFNK+T+V++ES++VVFDE+      NN 
Subjt:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV

Query:  SNESIC-----------------------------------SDDLEKDFG-------DLLVNDKGKEIVPRS----IFSNLAFVSQIEPRSFKDAECDEF
         +E I                                    +++L K +        +L++ +    +  RS    I +N+AF+SQIEP++  +A  DE 
Subjt:  SNESIC-----------------------------------SDDLEKDFG-------DLLVNDKGKEIVPRS----IFSNLAFVSQIEPRSFKDAECDEF

Query:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------
        WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+AP                            
Subjt:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------

Query:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS
                    VEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GK+D TLF+     DMLIVQIYVDDIIFGSTN +LC+EFS
Subjt:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS

Query:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIW----
        K M +EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLK+F L   K   TPMS +TKLDKDEKGK VD+K  R +     YL  +    +W    
Subjt:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIW----

Query:  ---KLVCF------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVF
            L+ +                    FL    +S  SKKQ  V+ S+ E EY A     ++ L+++  L D       H  + CDN + I ++ N + 
Subjt:  ---KLVCF------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVF

Query:  HERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL
        H RTKHIE   HF++ H++   + L+ +ST  Q  DI TK L   RF  +  ++
Subjt:  HERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL

A0A2N9FRL0 CCHC-type domain-containing protein1.8e-21738.84Show/hide
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR
        MAN     ++EGQST RPP F GS+Y YWK RM +Y++  DY++W I+A GP++P K V+     KLE E++E +++    N KA++ LYCAL   E+NR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR

Query:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK
        +S C SA+EIW+ LE+T+EGTNQVKESK+++ VH YELF M  +E I++M TRFTNI+N+LK LGK+YT  ENVRKILRSLPK WEAK+TAI EA+DL  
Subjt:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK

Query:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
        L LEEL GSLMT+E+ M   +E+E  K KK+ ALK+   + D  +E+  ++++IA  +R +K F+K+KK F +    + E+KGE SK +   CY+CK+ G
Subjt:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

Query:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK
        H + +CP +   K K KKKA+K TWDDS ES+S+                                                                  
Subjt:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK

Query:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG
                             N SSD  V                      NL                                               
Subjt:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG

Query:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK
        LGYI+ES                        ++S    +SF P+  N   E      C                             ++   VCL   S 
Subjt:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK

Query:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV
        K KW+LDSGCSRHMTGD++K  S + K+GG V FGDN K                      GYS+ SKAYRVFNK+T+V++ES++VVFDE+      NN 
Subjt:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV

Query:  SNESIC-----------------------------------SDDLEKDFG-------DLLVNDKGKEIVPRS----IFSNLAFVSQIEPRSFKDAECDEF
         +E I                                    +++L K +        +L++ +  + +  RS    I +N+AF+SQIEP++  +A  DE 
Subjt:  SNESIC-----------------------------------SDDLEKDFG-------DLLVNDKGKEIVPRS----IFSNLAFVSQIEPRSFKDAECDEF

Query:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------
        WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+AP                            
Subjt:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------

Query:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS
                    VEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GK+D TLF+     DMLIVQIYVDDIIFGSTN +LC+EFS
Subjt:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS

Query:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVC
        K M +EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLK+F L   K   TPMS +TKLDKDEKGK VD+K  RGMIGSLLYLTASRPD ++ +  
Subjt:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVC

Query:  FLVF----------------------------------------------------------LHFMHLSLC---SKKQFVVSRSSIEFEYCALADATSEL
           F                                                           HF+  SL    SKKQ  V+ S+ E EY A     ++ 
Subjt:  FLVF----------------------------------------------------------LHFMHLSLC---SKKQFVVSRSSIEFEYCALADATSEL

Query:  LWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL
        L+++  L D       H  + CDN + I ++ N + H RTKHIE   HF++ H++   + L+ +ST  Q  DI TK L   RF  +  ++
Subjt:  LWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL

A0A2N9G589 CCHC-type domain-containing protein2.0e-21139Show/hide
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR
        MAN     ++EGQST RPP F GS+Y YWK RM +Y++  DY++W I+A GP++P K V+     KLE E++E +++    N KA++ LYCAL   E+NR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR

Query:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK
        +S C SA+EIW+ LE+T+EGTNQVKESK++M VH YELF M  +E I++M TRFTNI+N+LK LGK+YT  ENVRKILRSLPK WEAK+TAI EA+DL  
Subjt:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK

Query:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
        L LEEL GSLMT+E+ M   +E+E  K KK+ ALK+   + D  +E+  ++++IA  +R +K F+K+KK F +    + E+KGE SK +   CY+CK+ G
Subjt:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

Query:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK
        H + +CP +   K K KKKA+K TWDDS ES+S+                                                                  
Subjt:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK

Query:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG
                             N SSD  V                      NL                                               
Subjt:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG

Query:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK
        LGYI+ES                        ++S    +SF P+  N   E      C                             ++   VCL   S 
Subjt:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK

Query:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV
        K KW+LDSGCSRHMTGD++K  S + K+GG V FGDN K                      GYS+ SKAYRVFNK+T+V++ES++VVFDE+      NN 
Subjt:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV

Query:  SNESICSDDLEKDFGDLLVNDKGK------------------EIVPRS----------------------------IFSNLAFVSQIEPRSFKDAECDEF
         +E I  ++       + +++K K                  E +P+S                            I +N+AF+SQIEP++  +A  DE 
Subjt:  SNESICSDDLEKDFGDLLVNDKGK------------------EIVPRS----------------------------IFSNLAFVSQIEPRSFKDAECDEF

Query:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------
        WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+AP                            
Subjt:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------

Query:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS
                    VEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GK+D TLF+     DMLIVQIYVDDIIFGSTN +LC+EFS
Subjt:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS

Query:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIW----
        K M +EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLK+F L   K   TPMS +TKLDKDEKGK VD+K  R +     YL  +    +W    
Subjt:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIW----

Query:  ---KLVCF------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVF
            L+ +                    FL    +S  SKKQ  V+ S+ E EY A     ++ L+++  L D       H  + CDN + I ++ N + 
Subjt:  ---KLVCF------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVF

Query:  HERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL
        H RTKHIE   HF++ H++   + L+ +ST  Q  DI TK L   RF  +  ++
Subjt:  HERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL

A0A2N9HLC7 CCHC-type domain-containing protein3.8e-21538.82Show/hide
Query:  IVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQ
        ++EGQST RPP F GS+Y YWK RM +Y++  DY++W I+A GP++P K V+     KLE E++E +++    N KA++ LYCAL   E+NR+S C SA+
Subjt:  IVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQ

Query:  EIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIG
        EIW+ LE+T+EGTNQVKESK+++ VH YELF M  +E I++M TRFTNI+N+LK LGK+YT  ENVRKILRSLPK WEAK+TAI EA+DL  L LEEL G
Subjt:  EIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIG

Query:  SLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIA-YFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCP
        SLMT+E+ M   +E+E  K KK+ ALK+   + D  +E+  ++++IA   +R +K F+K+KK F +    + E+KGE SK +   CY+CK+ GH + +CP
Subjt:  SLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIA-YFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCP

Query:  LLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENKSLLDTIA
         +   K K KKKA+K TWDDS ES+S+                                                                         
Subjt:  LLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENKSLLDTIA

Query:  CFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDES
                      N SSD  V                      NL                                               LGYI+ES
Subjt:  CFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDES

Query:  STPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASKKNKWYLD
                                ++S    +SF P+  N   E      C                             ++   VCL   S K KW+LD
Subjt:  STPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASKKNKWYLD

Query:  SGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNVSNESICS
        SGCSRHMTGD++K  S + K+GG V FGDN K                      GYS+ SKAYRVFNK+T+V++ES++VVFDE+      NN  +E I  
Subjt:  SGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNVSNESICS

Query:  DDLEKDFGDLLVNDKGK------------------EIVPRS----------------------------IFSNLAFVSQIEPRSFKDAECDEFWILAMQE
        ++       + +++K K                  E +P+S                            I +N+AF+SQIEP++  +A  DE WILAMQE
Subjt:  DDLEKDFGDLLVNDKGK------------------EIVPRS----------------------------IFSNLAFVSQIEPRSFKDAECDEFWILAMQE

Query:  ELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP-----------------------------------
        ELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+AP                                   
Subjt:  ELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP-----------------------------------

Query:  -----VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEF
             VEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GK+D TLF+     DMLIVQIYVDDIIFGSTN +LC+EFSK M +EF
Subjt:  -----VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEF

Query:  EMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVCFLVF---
        EMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLK+F L   K   TPMS +TKLDKDEKGK VD+K  RGMIGSLLYLTASRPD ++ +     F   
Subjt:  EMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVCFLVF---

Query:  -------------------------------------------------------LHFMHLSLC---SKKQFVVSRSSIEFEYCALADATSELLWLRCLL
                                                                HF+  SL    SKKQ  V+ S+ E EY A     ++ L+++  L
Subjt:  -------------------------------------------------------LHFMHLSLC---SKKQFVVSRSSIEFEYCALADATSELLWLRCLL

Query:  IDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL
         D       H  + CDN + I ++ N + H RTKHIE   HF++ H++   + L+ +ST  Q  DI TK L   RF  +  ++
Subjt:  IDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL

A0A2N9I2B8 CCHC-type domain-containing protein3.4e-21139Show/hide
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR
        MAN     ++EGQST RPP F GS+Y YWK RM +Y++  DY++W I+A GP++P K V+     KLE E++E +++    N KA++ LYCAL   E+NR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNR

Query:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK
        +S C SA+EIW+ LE+T+EGTNQVKESK++M VH YELF M  +E I++M TRFTNI+N+LK LGK+YT  ENVRKILRSLPK WEAK+TAI EA+DL  
Subjt:  ISMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK

Query:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
        L LEEL GSLMT+E+ M   +E+E  K KK+ ALK+   + D  +E+  ++++IA  +R +K F+K+KK F +    + E+KGE SK +   CY+CK+ G
Subjt:  LPLEELIGSLMTHEIIMKEHLEDES-KKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

Query:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK
        H + +CP +   K K KKKA+K TWDDS ES+S+                                                                  
Subjt:  HIRTDCPLLKSSK-KSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENK

Query:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG
                             N SSD  V                      NL                                               
Subjt:  SLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRG

Query:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK
        LGYI+ES                        ++S    +SF P+  N   E      C                             ++   VCL   S 
Subjt:  LGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCL-KASK

Query:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV
        K+KW+LDSGCSRHMTGD++K  S + K+GG V FGDN K                      GYS+ SKAYRVFNK+T+V++ES++VVFDE+      NN 
Subjt:  KNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKK----------------------GYSSTSKAYRVFNKKTLVIEESIYVVFDES-----WNNV

Query:  SNESIC-----------------------------------SDDLEKDFG-------DLLVNDKGKEIVPRS----IFSNLAFVSQIEPRSFKDAECDEF
         +E I                                    +++L K +        +L++ +    +  RS    I +N+AF+SQIEP++  +A  DE 
Subjt:  SNESIC-----------------------------------SDDLEKDFG-------DLLVNDKGKEIVPRS----IFSNLAFVSQIEPRSFKDAECDEF

Query:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------
        WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+AP                            
Subjt:  WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAP----------------------------

Query:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS
                    VEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GK+D TLF+     DMLIVQIYVDDIIFGSTN +LC+EFS
Subjt:  ------------VEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFS

Query:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIW----
        K M +EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLK+F L   K   TPMS +TKLDKDEKGK VD+K  R +     YL  +    +W    
Subjt:  KCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIW----

Query:  ---KLVCF------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVF
            L+ +                    FL    +S  SKKQ  V+ S+ E EY A     ++ L+++  L D       H  + CDN + I ++ N + 
Subjt:  ---KLVCF------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVF

Query:  HERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL
        H RTKHIE   HF++ H++   + L+ +ST  Q  DI TK L   RF  +  ++
Subjt:  HERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLLLHKL

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.0e-4825.99Show/hide
Query:  IDESSTPSSS----KTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVC-----
        +DE++  +S     +T F+K S      N  N     +++ F      C     ++      K ++N    +NF   ++  I  +  F N+S+ C     
Subjt:  IDESSTPSSS----KTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEGHIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVC-----

Query:  LKASKK-NKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNESICSDDLEKDFGDLLVND
        LK SK+ NK++L+    R     R   ++ SK +G       N+   S T++  +          + I ++   S    +   I  ++ +     +++N 
Subjt:  LKASKK-NKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGYSSTSKAYRVFNKKTLVIEESIYVVFDESWNNVSNESICSDDLEKDFGDLLVND

Query:  KGKEIVPRSIFSNLAFVSQIEPRSFKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI
                +IF+++       P SF + +  +    W  A+  ELN  + N  W +  RP N +I+ ++WVF  K +E GN IR KARLVA+G+ Q+  I
Subjt:  KGKEIVPRSIFSNLAFVSQIEPRSFKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI

Query:  DYEETFAPVEQPPGFE-------SFDLPNH-------------------------------VYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNT
        DYEETFAPV +   F         ++L  H                               V KL KA+YGLKQA R W++   + L E +F    +D  
Subjt:  DYEETFAPVEQPPGFE-------SFDLPNH-------------------------------VYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNT

Query:  LFIKVKN--NDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLD
        ++I  K   N+ + V +YVDD++  + + +    F + +  +F M+ + E+  F+G++I+  +D I++SQ  Y + +L KF +       TP+   +K++
Subjt:  LFIKVKN--NDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLD

Query:  KDEKGKCVDIKT-CRGMIGSLLY-LTASRPD-----------------TIW----------------------------KLVCFL---------------
         +      D  T CR +IG L+Y +  +RPD                  +W                            K++ ++               
Subjt:  KDEKGKCVDIKT-CRGMIGSLLY-LTASRPD-----------------TIW----------------------------KLVCFL---------------

Query:  --VFLHFMHLSLC--SKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTL
          +F  F    +C  +K+Q  V+ SS E EY AL +A  E LWL+ LL  +     +   ++ DN   I IA+N   H+R KHI+   HF +  +++N +
Subjt:  --VFLHFMHLSLC--SKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTL

Query:  HLQSISTIDQPVDIFTKALSPHRFTLLLHKL
         L+ I T +Q  DIFTK L   RF  L  KL
Subjt:  HLQSISTIDQPVDIFTKALSPHRFTLLLHKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-4327.03Show/hide
Query:  AMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV------------------------------
        AMQEE+   ++N  +KLV  P     +  KWVF+ K D +  ++R KARLV +G+ Q++GID++E F+PV                              
Subjt:  AMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV------------------------------

Query:  ----------EQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK-VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKC
                  EQP GFE     + V KL K+LYGLKQAPR WY +   F+    +     D  ++ K    N+ +I+ +YVDD++    +  L  +    
Subjt:  ----------EQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK-VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKC

Query:  MHNEFEMSMMGELSFFLGLQI--KQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK-------DEKGKCVDIKTCRGMIGSLLY-LTASRP
        +   F+M  +G     LG++I  ++    +++SQEKY   +L++F +   K   TP++   KL K       +EKG    +      +GSL+Y +  +RP
Subjt:  MHNEFEMSMMGELSFFLGLQI--KQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK-------DEKGKCVDIKTCRGMIGSLLY-LTASRP

Query:  DTI-----------------WKLV---------------CF----------------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCAL
        D                   W+ V               CF                            L       +S  SK Q  V+ S+ E EY A 
Subjt:  DTI-----------------WKLV---------------CF----------------------------LVFLHFMHLSLCSKKQFVVSRSSIEFEYCAL

Query:  ADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTL
         +   E++WL+  L ++G  Q  + +++CD+ + I ++ N ++H RTKHI+   H+++  ++  +L +  IST + P D+ TK +  ++F L
Subjt:  ADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTL

P92520 Uncharacterized mitochondrial protein AtMg008201.9e-1753.49Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV
        EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-6031.31Show/hide
Query:  LAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV------
        ++  ++ EPR+   A  DE W  AM  E+N    N  W LV P PS+ +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV      
Subjt:  LAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV------

Query:  ----------------------------------EQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIV
                                           QPPGF   D PN+V KL+KALYGLKQAPRAWY  L  +LL   F     D +LF+  +   ++ +
Subjt:  ----------------------------------EQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIV

Query:  QIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRG
         +YVDDI+    + +L       +   F +    EL +FLG++ K++   + +SQ +Y  DLL +  +   K   TPM+ + KL      K  D    RG
Subjt:  QIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRG

Query:  MIGSLLYLTASRPDTIWKLVCFLVFLHF---------------------------------MH----------------------------LSLCSKKQF
        ++GSL YL  +RPD  + +     F+H                                  +H                            +S  SKKQ 
Subjt:  MIGSLLYLTASRPDTIWKLVCFLVFLHF---------------------------------MH----------------------------LSLCSKKQF

Query:  VVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALS
         V RSS E EY ++A+ +SE+ W+  LL ++G   T   +++CDN     +  N VFH R KHI  + HF+++ ++S  L +  +ST DQ  D  TK LS
Subjt:  VVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALS

Query:  PHRFTLLLHKL
           F     K+
Subjt:  PHRFTLLLHKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-6031.04Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV-------------
        EPR+   A  D+ W  AM  E+N    N  W LV P P + +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV             
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV-------------

Query:  ---------------------------EQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI
                                    QPPGF   D P++V +L+KA+YGLKQAPRAWY  L  +LL   F     D +LF+  +   ++ + +YVDDI
Subjt:  ---------------------------EQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI

Query:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLY
        +    ++ L +     +   F +    +L +FLG++ K++   + +SQ +YT DLL +  +   K   TPM+T+ KL      K  D    RG++GSL Y
Subjt:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLY

Query:  LTASRPDTI-----------------WKLV--------------------------------------------CFLVFLHFMHLSLCSKKQFVVSRSSI
        L  +RPD                   W  +                                             ++V+L    +S  SKKQ  V RSS 
Subjt:  LTASRPDTI-----------------WKLV--------------------------------------------CFLVFLHFMHLSLCSKKQFVVSRSSI

Query:  EFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLL
        E EY ++A+ +SEL W+  LL ++G   +   +++CDN     +  N VFH R KHI  + HF+++ ++S  L +  +ST DQ  D  TK LS   F   
Subjt:  EFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSPHRFTLL

Query:  LHKLKKIEV
          K+  I+V
Subjt:  LHKLKKIEV

Arabidopsis top hitse value%identityAlignment
AT4G05360.1 Zinc knuckle (CCHC-type) family protein1.1e-0424.29Show/hide
Query:  KESKGEKSKKDEVICYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLE
        K SKG++       C+ECK   H+ ++C  L      K+K  K    DS     + EE+ N  L+A +  +          P S         S      
Subjt:  KESKGEKSKKDEVICYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLE

Query:  KLSSKYVVLKKKYNVLISENKSLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLT
                         S+N    D      + E  E  + L     K   +E++++L K +        K       KE E S L+E  K    ++ L 
Subjt:  KLSSKYVVLKKKYNVLISENKSLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLT

Query:  IGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFV------KASPIVPKFNMSNHVSNHVKSS------------------------FVPICHNCG
         G ++L  I+ +GK+  DK GLG+      PS S   FV       AS  V +      +++  ++                         F P+CH+CG
Subjt:  IGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFV------KASPIVPKFNMSNHVSNHVKSS------------------------FVPICHNCG

Query:  VEGHIRPKCFKLKYAQN
        V GHIRP+CF+L   +N
Subjt:  VEGHIRPKCFKLKYAQN

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.0e-5531.99Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV--------------
        EP ++ +A+    W  AM +E+   E    W++   P N   IG KWV++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV              
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV--------------

Query:  --------------------------EQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYV
                                  + PPG+ +       PN V  LKK++YGLKQA R W+ + S  L+   F     D+T F+K+     L V +YV
Subjt:  --------------------------EQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYV

Query:  DDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGS
        DDII  S N +  +E    + + F++  +G L +FLGL+I +    I I Q KY  DLL +  L   K +  PM  +        G  VD K  R +IG 
Subjt:  DDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGS

Query:  LLYLTASRPD---------------------TIWKLVCFL----------------------------------------VFLHFMHLSLCSKKQFVVSR
        L+YL  +R D                      + K++ ++                                        +FL    +S  SKKQ VVS+
Subjt:  LLYLTASRPD---------------------TIWKLVCFL----------------------------------------VFLHFMHLSLCSKKQFVVSR

Query:  SSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSP
        SS E EY AL+ AT E++WL     ++  P +  T+L CDN   I IA N VFHERTKHIE++CH V+           S    D+  D FT+ LSP
Subjt:  SSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKALSP

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-0927.11Show/hide
Query:  IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTC
        +YVDDI+   ++++L       + + F M  +G + +FLG+QIK     +F+SQ KY   +L     N G +   PMST   L  +      K  D    
Subjt:  IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTC

Query:  RGMIGSLLYLTASRPDTIW--KLVC-----------------------------------------------------------FLVFLHFMHLSLCSKK
        R ++G+L YLT +RPD  +   +VC                                                           F  FL    +S  +K+
Subjt:  RGMIGSLLYLTASRPDTIW--KLVC-----------------------------------------------------------FLVFLHFMHLSLCSKK

Query:  QFVVSRSSIEFEYCALADATSELLW
        Q  VSRSS E EY ALA   +EL W
Subjt:  QFVVSRSSIEFEYCALADATSELLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-1853.49Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV
        EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAACCTATTGGCAAATGGTATTGTTGAAGGTCAATCTACTTCTAGACCTCCTTATTTTGATGGTTCAAATTATGCATATTGGAAAGCTAGAATGAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGTCCTTATGTACCCATGAAAAATGTTGATAATGTTGATACGCCTAAATTAGAAGAAGAGTATGATGAAA
ATGAAATGAAAAAGTGTTCATTTAATGCTAAGGCTATTAATTGTTTGTATTGTGCCTTGAGTAAAGATGAATTTAATAGAATATCCATGTGTTCTTCCGCTCAAGAAATT
TGGAATACTCTTGAAATTACTCATGAAGGAACAAATCAAGTTAAAGAGTCTAAAATTAGCATGTTTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTAT
CACCGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGTCTTGGTAAAGTTTATACAACTTCCGAAAATGTTAGAAAAATTCTAAGGTCTCTACCTAAGA
CTTGGGAAGCTAAGGTAACGGCGATTCAAGAAGCAAAGGATCTCACCAAACTTCCACTAGAGGAGCTTATTGGCTCACTCATGACTCATGAGATCATCATGAAGGAGCAC
TTGGAGGATGAGTCCAAAAAGAAGAAGAGCATTGCATTAAAGACTATCTCCTTGGAAGTTGATCCCGAAGATGAGGATGACTTTGATAAAGATGACATTGCTTATTTCTC
ACGTAAGTACAAAAATTTCATAAAAAGAAAGAAATATTTCAAGAAACACCTATCAACCCAAAAAGAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATG
AATGCAAAAGATCGGGTCATATAAGAACGGATTGTCCTCTCCTCAAATCATCTAAGAAATCCAAGAAGAAGGCAATGAAGGCTACATGGGATGATAGTAGTGAAAGTGAA
AGTGAAGTTGAAGAAATGGCAAATCTTGGTCTCATGGCTCATAGTGACAAAGATGATGAACATGATGATAAGGTAACTCTAGAACCTCTTTCTATTGATGAATTATTTGA
AAATTTTGAAAGCATGCAAAATGACCTAGAAAAACTTAGTTCTAAGTATGTTGTGCTTAAGAAGAAATACAATGTTTTAATTAGTGAAAATAAGTCTTTACTTGATACAA
TTGCTTGCTTTAAAGAGAATGAAAATTTTGAGCAAATTGAAGAATTGAATGTCTCTAGTGATAAGCATGTTTGTATTGAGAAAGATGCTTTGCTTGATAAAGTTAGATTT
CTTGAGCATGATAGTTGTGAAAAAGATAACTTGATTAAAGTGCTTAAAGAAAATGAACTAAGTGTTTTACAAGAACTTGATAAAGCTAAAGAAACTATTAAAAAGTTGAC
AATAGGTGCTCAAAGATTGGACAAAATTATTGAAGTAGGAAAATCTTATGGTGATAAGAGAGGTTTAGGCTATATTGATGAATCATCTACTCCATCAAGTTCTAAAACTA
CATTCGTTAAAGCATCACCTATTGTGCCTAAGTTTAACATGTCTAATCATGTGTCTAATCATGTTAAATCTAGTTTTGTACCCATATGTCATAATTGTGGTGTTGAAGGT
CACATTAGACCTAAATGCTTTAAATTGAAGTATGCTCAAAATACTTATTCAAGAAGAAATTTTTCGCAAAGAGCAAAGTTTTACATTGCTCCAAGGAAGAATTTCTCCAA
TAAAAGTAGAGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTCAAGACACATGACGGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAA
AGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTTAT
GTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGT
TCCAAGATCTATATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAA
ATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATC
ATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTAGAACAACCTCCGGGCTTTGAAAGTTTTGA
TTTACCTAATCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGA
TGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGT
GAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGACATCTTCATAAG
TCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTA
AGTGTGTGGATATAAAGACTTGTCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATACCATTTGGAAACTCGTATGTTTTCTAGTTTTTCTTCAC
TTCATGCATCTCTCTCTATGTAGCAAGAAACAATTTGTTGTTTCCCGTTCTAGCATTGAGTTCGAATATTGTGCTCTGGCTGATGCCACTTCAGAATTACTATGGCTTCG
TTGTCTACTTATTGATATGGGAGCCCCACAAACATCACACACTATTCTTCATTGTGATAACTACAATGTCATTCAGATTGCACACAATGATGTATTTCATGAACGAACAA
AGCACATTGAGAACAATTGTCACTTTGTTCAACATCATCTTGAGAGCAACACTCTCCATCTCCAATCCATCTCCACCATTGACCAACCTGTCGATATCTTCACCAAAGCT
CTCTCTCCTCATCGTTTCACTCTGTTACTTCACAAACTCAAGAAGATCGAAGTTCAGATCAACTTGCAACTACAACATCCTAAAGACCGAAGACTAAGAAGTTCTGAAGT
TTATTTTATCTTGTATTCAAGAGAAAGCATCAAAGGATTTGGCACGCCTAGTGGGACCATCTCTACTTTTCATCTCTTTCTCCTTTTGAAGAACATCTCTAAGGAAATCA
TGACATCTCAAGGCAACACTTCCAAAGCTCTAGGTGACATCAGCAAGCGACCAAATACTCGCAGTCGCTTAAGGGAATTTCATTCATCTAAAGATGTGCTGCCGTTTGAG
GTCGCAAAGAATATTTGGGAACAAATCTCTAGTCCACCAAAACATGGAATTAGTCGTGACGCTGCTGAATCAAGTCACACACATATTGTCAAGAATGTTGACAAACGAAA
GGCAATTATTCAAGAAAGTCAACCACAAAATTCGACTTCAATTGCATCTGTGTATGTCCAACAATTGAAGGAGATGATTGCAAACTCCATCAAAATTCAATATGGTGGAC
CCGCTCAAACTTTCTCTTTGTATTCCAAGTTGTATACGAAGATGATCGACAACCTAAGAATGTCGAATGGGTACCAACCACCCAAGTTCCAACAGTTTGATGGAAGCGAC
AACCCAAAACAACATGTTGCTCATTTTATTGAAACATGTGAAACTGCTGGTACACGTGGAGATTTATTGGTCAAACAGTTCGTCCGAACTCTCAAAGGAAATGCCTTTGA
CTGGTACACATACTTGGAGCCTGAATCCATCGACAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAACCTATTGGCAAATGGTATTGTTGAAGGTCAATCTACTTCTAGACCTCCTTATTTTGATGGTTCAAATTATGCATATTGGAAAGCTAGAATGAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGTCCTTATGTACCCATGAAAAATGTTGATAATGTTGATACGCCTAAATTAGAAGAAGAGTATGATGAAA
ATGAAATGAAAAAGTGTTCATTTAATGCTAAGGCTATTAATTGTTTGTATTGTGCCTTGAGTAAAGATGAATTTAATAGAATATCCATGTGTTCTTCCGCTCAAGAAATT
TGGAATACTCTTGAAATTACTCATGAAGGAACAAATCAAGTTAAAGAGTCTAAAATTAGCATGTTTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTAT
CACCGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGTCTTGGTAAAGTTTATACAACTTCCGAAAATGTTAGAAAAATTCTAAGGTCTCTACCTAAGA
CTTGGGAAGCTAAGGTAACGGCGATTCAAGAAGCAAAGGATCTCACCAAACTTCCACTAGAGGAGCTTATTGGCTCACTCATGACTCATGAGATCATCATGAAGGAGCAC
TTGGAGGATGAGTCCAAAAAGAAGAAGAGCATTGCATTAAAGACTATCTCCTTGGAAGTTGATCCCGAAGATGAGGATGACTTTGATAAAGATGACATTGCTTATTTCTC
ACGTAAGTACAAAAATTTCATAAAAAGAAAGAAATATTTCAAGAAACACCTATCAACCCAAAAAGAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATG
AATGCAAAAGATCGGGTCATATAAGAACGGATTGTCCTCTCCTCAAATCATCTAAGAAATCCAAGAAGAAGGCAATGAAGGCTACATGGGATGATAGTAGTGAAAGTGAA
AGTGAAGTTGAAGAAATGGCAAATCTTGGTCTCATGGCTCATAGTGACAAAGATGATGAACATGATGATAAGGTAACTCTAGAACCTCTTTCTATTGATGAATTATTTGA
AAATTTTGAAAGCATGCAAAATGACCTAGAAAAACTTAGTTCTAAGTATGTTGTGCTTAAGAAGAAATACAATGTTTTAATTAGTGAAAATAAGTCTTTACTTGATACAA
TTGCTTGCTTTAAAGAGAATGAAAATTTTGAGCAAATTGAAGAATTGAATGTCTCTAGTGATAAGCATGTTTGTATTGAGAAAGATGCTTTGCTTGATAAAGTTAGATTT
CTTGAGCATGATAGTTGTGAAAAAGATAACTTGATTAAAGTGCTTAAAGAAAATGAACTAAGTGTTTTACAAGAACTTGATAAAGCTAAAGAAACTATTAAAAAGTTGAC
AATAGGTGCTCAAAGATTGGACAAAATTATTGAAGTAGGAAAATCTTATGGTGATAAGAGAGGTTTAGGCTATATTGATGAATCATCTACTCCATCAAGTTCTAAAACTA
CATTCGTTAAAGCATCACCTATTGTGCCTAAGTTTAACATGTCTAATCATGTGTCTAATCATGTTAAATCTAGTTTTGTACCCATATGTCATAATTGTGGTGTTGAAGGT
CACATTAGACCTAAATGCTTTAAATTGAAGTATGCTCAAAATACTTATTCAAGAAGAAATTTTTCGCAAAGAGCAAAGTTTTACATTGCTCCAAGGAAGAATTTCTCCAA
TAAAAGTAGAGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTCAAGACACATGACGGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAA
AGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTTAT
GTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGT
TCCAAGATCTATATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAA
ATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATC
ATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTAGAACAACCTCCGGGCTTTGAAAGTTTTGA
TTTACCTAATCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGA
TGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGT
GAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGACATCTTCATAAG
TCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTA
AGTGTGTGGATATAAAGACTTGTCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATACCATTTGGAAACTCGTATGTTTTCTAGTTTTTCTTCAC
TTCATGCATCTCTCTCTATGTAGCAAGAAACAATTTGTTGTTTCCCGTTCTAGCATTGAGTTCGAATATTGTGCTCTGGCTGATGCCACTTCAGAATTACTATGGCTTCG
TTGTCTACTTATTGATATGGGAGCCCCACAAACATCACACACTATTCTTCATTGTGATAACTACAATGTCATTCAGATTGCACACAATGATGTATTTCATGAACGAACAA
AGCACATTGAGAACAATTGTCACTTTGTTCAACATCATCTTGAGAGCAACACTCTCCATCTCCAATCCATCTCCACCATTGACCAACCTGTCGATATCTTCACCAAAGCT
CTCTCTCCTCATCGTTTCACTCTGTTACTTCACAAACTCAAGAAGATCGAAGTTCAGATCAACTTGCAACTACAACATCCTAAAGACCGAAGACTAAGAAGTTCTGAAGT
TTATTTTATCTTGTATTCAAGAGAAAGCATCAAAGGATTTGGCACGCCTAGTGGGACCATCTCTACTTTTCATCTCTTTCTCCTTTTGAAGAACATCTCTAAGGAAATCA
TGACATCTCAAGGCAACACTTCCAAAGCTCTAGGTGACATCAGCAAGCGACCAAATACTCGCAGTCGCTTAAGGGAATTTCATTCATCTAAAGATGTGCTGCCGTTTGAG
GTCGCAAAGAATATTTGGGAACAAATCTCTAGTCCACCAAAACATGGAATTAGTCGTGACGCTGCTGAATCAAGTCACACACATATTGTCAAGAATGTTGACAAACGAAA
GGCAATTATTCAAGAAAGTCAACCACAAAATTCGACTTCAATTGCATCTGTGTATGTCCAACAATTGAAGGAGATGATTGCAAACTCCATCAAAATTCAATATGGTGGAC
CCGCTCAAACTTTCTCTTTGTATTCCAAGTTGTATACGAAGATGATCGACAACCTAAGAATGTCGAATGGGTACCAACCACCCAAGTTCCAACAGTTTGATGGAAGCGAC
AACCCAAAACAACATGTTGCTCATTTTATTGAAACATGTGAAACTGCTGGTACACGTGGAGATTTATTGGTCAAACAGTTCGTCCGAACTCTCAAAGGAAATGCCTTTGA
CTGGTACACATACTTGGAGCCTGAATCCATCGACAGTTGA
Protein sequenceShow/hide protein sequence
MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNAKAINCLYCALSKDEFNRISMCSSAQEI
WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSLMTHEIIMKEH
LEDESKKKKSIALKTISLEVDPEDEDDFDKDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESE
SEVEEMANLGLMAHSDKDDEHDDKVTLEPLSIDELFENFESMQNDLEKLSSKYVVLKKKYNVLISENKSLLDTIACFKENENFEQIEELNVSSDKHVCIEKDALLDKVRF
LEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIVPKFNMSNHVSNHVKSSFVPICHNCGVEG
HIRPKCFKLKYAQNTYSRRNFSQRAKFYIAPRKNFSNKSRVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGYSSTSKAYRVFNKKTLVIEESIY
VVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPRSIFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNI
IRNKARLVAQGYCQEEGIDYEETFAPVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLC
EEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTCRGMIGSLLYLTASRPDTIWKLVCFLVFLH
FMHLSLCSKKQFVVSRSSIEFEYCALADATSELLWLRCLLIDMGAPQTSHTILHCDNYNVIQIAHNDVFHERTKHIENNCHFVQHHLESNTLHLQSISTIDQPVDIFTKA
LSPHRFTLLLHKLKKIEVQINLQLQHPKDRRLRSSEVYFILYSRESIKGFGTPSGTISTFHLFLLLKNISKEIMTSQGNTSKALGDISKRPNTRSRLREFHSSKDVLPFE
VAKNIWEQISSPPKHGISRDAAESSHTHIVKNVDKRKAIIQESQPQNSTSIASVYVQQLKEMIANSIKIQYGGPAQTFSLYSKLYTKMIDNLRMSNGYQPPKFQQFDGSD
NPKQHVAHFIETCETAGTRGDLLVKQFVRTLKGNAFDWYTYLEPESIDS