CuGenDBv2

Tan0012472 (gene) of Snake gourd v1 genome

Gene ID: Tan0012472
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Genome location: LG03:27995555..27999771
RNA-Seq Expression: Tan0012472
Synteny: Tan0012472
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR036875 - Zinc finger, CCHC-type superfamily
    IPR043502 - DNA/RNA polymerase superfamily
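
The genome location above is written as a sequence name followed by a start..end range. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split and the gene span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and are not part of CuGenDB.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Normalize so that start <= end, in case a range is given in reverse.
    if start > end:
        start, end = end, start
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG03:27995555..27999771")
print(seqid, start, end, span_length(start, end))
# prints: LG03 27995555 27999771 4217
```

Under that assumption the gene spans 4217 bp on LG03. If the coordinates were 0-based and half-open instead, the length would be `end - start`.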


Homology
GenBank top hits (columns: hit | e value | % identity | alignment)
CAN63777.1 hypothetical protein VITISV_043745 [Vitis vinifera] | e value: 4.0e-167 | % identity: 44.26
Query:  KILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLEVESEDEGVLDEE--DIAYFSRKYKKFIK----RKK
        KILR LP  W  KVTAI+EAKDLTKLP+EE +GSLMT+EI++ K+  E E KKKKSIALK  + + E  +E    EE  D+A  +RK  K+++    R+K
Subjt:  KILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLEVESEDEGVLDEE--DIAYFSRKYKKFIK----RKK

Query:  QFNKHISNQK---ESKGEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESESGSDKQTFASWLMVT------RVMNKMMRFL
        +F    +  K    S G+K K   K ++IC++CK  GHI+ +CPL    +K+ KKKAM ATW +S+ES     ++  A+   +        + N    F+
Subjt:  QFNKHISNQK---ESKGEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESESGSDKQTFASWLMVT------RVMNKMMRFL

Query:  EH--DSYEKDNLIKL--------------------LKENELNALQELGKAK--ESIKKLTIGAQRLDKIIEVGKPYGDK-RGLGYIDECSTPSSSKIIFV
         H  ++    N+ K                     L+   ++ + +L K +    + K+     ++ +  ++GK   +  +   +I      S+S+ + +
Subjt:  EH--DSYEKDNLIKL--------------------LKENELNALQELGKAK--ESIKKLTIGAQRLDKIIEVGKPYGDK-RGLGYIDECSTPSSSKIIFV

Query:  KASPIVPNHNMPKIGSK-------------------HDKS----SFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHK
            +      P +G K                     KS     F   C+    E      C +  +     +  +F +    H    N  + ++    
Subjt:  KASPIVPNHNMPKIGSK-------------------HDKS----SFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHK

Query:  FVV--RDKSLHDVVCFSCGEYGHKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGD
         VV  ++++L ++      E     YF       A  VN   +V   ++  LL P    LK +    W        +      K   L+ KD  L  F  
Subjt:  FVV--RDKSLHDVVCFSCGEYGHKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGD

Query:  NKK-----GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKE----IDSSKQEVSL-----NEKKENSSSSMS
                GYS++SK +RVFNKRT+++EES+HV+F ES N++       DD  LE + G L + DK ++     D  K+E  L        +  SS  + 
Subjt:  NKK-----GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKE----IDSSKQEVSL-----NEKKENSSSSMS

Query:  KEWRYAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNI
        K+W++  +HP+D I+G+P  GV+TRSSL N+ NNLAF+ QIEPK+ KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTK VFRNKMDENG I
Subjt:  KEWRYAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNI

Query:  IRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWY
        +RNKARLVAQGY QE  IDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKS FLNG+I EEVY EQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY
Subjt:  IRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWY

Query:  DRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLK
        +RLS FL +  FKMGK+DTTLFIKTKE DMLLVQIYVDDIIFG+TN  LCE+FSKCMHSEFEMSMMGEL+FFLGLQIKQLK+  FI+Q KY KDLL+
Subjt:  DRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLK

RVW80634.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 1.7e-194 | % identity: 46.35
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLE
        M   E+I EM TRFT I N L+ LG+V   SE V KILRSLP  W  KVTAI+EAKDLTKLP+EE +GSLMT+EI++ K+  E E KKKKSIALK  + E
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLE

Query:  VESEDEGVLDEE--DIAYFSRKYKKFIK----RKKQFNKHIS---NQKESKGEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDS
         E  +E    EE  D+A  +RK  K+++    R K+F    +    +  S G+K K   K ++IC++CKK GHI+ DCPL K  +K+  KKAM ATW +S
Subjt:  VESEDEGVLDEE--DIAYFSRKYKKFIK----RKKQFNKHIS---NQKESKGEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDS

Query:  DESESGSDKQTFASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSK
        +ES                                                 E  K KE      +    LD                            
Subjt:  DESESGSDKQTFASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSK

Query:  IIFVKASPIVPNHNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEY
                                            EG                                                              
Subjt:  IIFVKASPIVPNHNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEY

Query:  GHKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK------------------
                                                 SK++KW+LDSG SRHMTGD+SKF  L+K+ GG VTFGDN K                  
Subjt:  GHKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK------------------

Query:  ----GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKEIDS----SKQEVSL-----NEKKENSSSSMSKEWR
            GYS++SKA+RVFNKRT+++EES+HV+FDES N++       DD  LE + G L + DK ++ +S     K++  L      + +  SS  + K+W+
Subjt:  ----GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKEIDS----SKQEVSL-----NEKKENSSSSMSKEWR

Query:  YAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNK
        +  +HP+D I+G+P  GV+TRSSL N+ NNLAF+SQIEPK+ KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTK VFRNKMDENG I+RNK
Subjt:  YAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNK

Query:  ARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLS
        ARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLS
Subjt:  ARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLS

Query:  NFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEG
         FLL+  FKMGK+DTTLFIKTKE DMLLVQIYVDDIIFG+TN  LCE+FSKCMHSEFEMSMMGEL++FLGLQIKQLK+  FI+Q KY KDLLKRF   E 
Subjt:  NFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEG

Query:  KIAKTPMSTSTKLDKDEKGLWYPRNVELNLIG
        K+ KTPMS+S KLD DEKG      +   +IG
Subjt:  KIAKTPMSTSTKLDKDEKGLWYPRNVELNLIG

RVW93906.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 2.3e-191 | % identity: 45.77
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLE
        M   E+I EM TRFT I N L+ LG+V   SE V KILRSLP  W  KVTAI+EAKDLTKLP+EE +GSLMT+EI++ K+  E E KKKKSIALK  + E
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLE

Query:  VESEDEGVLDEE--DIAYFSRKYKKFIK----RKKQFNKHIS---NQKESKGEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDS
         E  +E    EE  D+A  +RK  K+++    R K+F    +    +  S G+K K   K ++IC++CKK GHI+ DCPL K  +K+  KKAM ATW +S
Subjt:  VESEDEGVLDEE--DIAYFSRKYKKFIK----RKKQFNKHIS---NQKESKGEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDS

Query:  DESESGSDKQTFASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSK
        +E                              S+E++                  K KE      +    LD                            
Subjt:  DESESGSDKQTFASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSK

Query:  IIFVKASPIVPNHNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEY
                                            EG                                                              
Subjt:  IIFVKASPIVPNHNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEY

Query:  GHKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK------------------
                                                 SK++KW+LDSG SRHMTGD+SKF  L+K+ GG VTFGDN K                  
Subjt:  GHKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK------------------

Query:  ----GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKEIDS----SKQEVSL-----NEKKENSSSSMSKEWR
            GYS++SKA+RVFNKRT+++EES+HV+FDES N +       DD  LE + G L + DK ++ +S     K+E  L      + +  SS  + K+W+
Subjt:  ----GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKEIDS----SKQEVSL-----NEKKENSSSSMSKEWR

Query:  YAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNK
        +  +HP+D I+G+P  GV+TRSSL N+ NNLAF+SQIEPK+ KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTK VFRNKMDENG I+RNK
Subjt:  YAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNK

Query:  ARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLS
        ARLVAQGY QEEGIDYEETF  VARLEAIRMLLAFA +K+FILYQMDVKS FLNG+I EEVYVEQPP F+SF+ PNHV+KLKKALYGLKQAPRAWY+RLS
Subjt:  ARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLS

Query:  NFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEG
         FLL+  FKMGK+DTTLFIKTKE DMLLVQIYVDDIIFG+TN  LCE+FSKCMHSEFEMSMMGEL++FLGLQIKQLK+  F++Q KY KDLLKRF   E 
Subjt:  NFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEG

Query:  KIAKTPMSTSTKLDKDEKGLWYPRNVELNLIGY
        K+ KTPMS+  KLD DEKG      +   +IG+
Subjt:  KIAKTPMSTSTKLDKDEKGLWYPRNVELNLIGY

RVW98982.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 1.8e-196 | % identity: 44.42
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLE
        M   E+I EM TRFT I N L+ LG+V   SE V KILRSLP  W  KVTAI+EAKDLTKLP+EE +GSLMT+EI++ K+  E E KKKKSIALK  + E
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITM-KEHMEDEPKKKKSIALKTLSLE

Query:  VESEDEGVLDEE--DIAYFSRKYKKFIK----RKKQFNKHISNQKESK--GEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDSD
         E  +E    EE  D+A  +RK  K+++    R K+F+   ++++ES   G+K K   K ++IC++CKK GHI+ DCPL K  +K+  KKAM ATW +S+
Subjt:  VESEDEGVLDEE--DIAYFSRKYKKFIK----RKKQFNKHISNQKESK--GEKSK---KDEVICYECKKSGHIRTDCPLLK-SSKKSKKKAMKATWDDSD

Query:  ESESGSDKQTFASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKI
        ES                                                 E  K KE      +    LD                             
Subjt:  ESESGSDKQTFASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKI

Query:  IFVKASPIVPNHNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYG
                                           EG                                                               
Subjt:  IFVKASPIVPNHNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYG

Query:  HKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK-------------------
                                                SK++KW+LDSG SRHMTGD+SKF  L+K+ GG VTFGDN K                   
Subjt:  HKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK-------------------

Query:  ---GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKEIDSSKQEVSLNEKKEN--------------SSSSMS
           GYS++SKA+RVFNKRT+++EES+HV+FDES N++       DD  LE + G L + DK ++ +S +     N KKE+              SS  + 
Subjt:  ---GYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDD--LERNFGDLLVSDKDKEIDSSKQEVSLNEKKEN--------------SSSSMS

Query:  KEWRYAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNI
        K+W++  +HP+D I+G+P  GV+TRSSL N+ NNLAF+SQIEPK+ KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTK VFRNKMDENG I
Subjt:  KEWRYAPSHPKDLILGDPEQGVKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNI

Query:  IRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWY
        +RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY
Subjt:  IRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWY

Query:  DRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFK
        +RLS FLL+  FKMGK+DTTLFIKTKE DMLLVQIYVDDIIFG+TN  LCE+FSKCMHSEFEMSMMGEL++FLGLQIKQLK+  FI+Q KY KDLLKRF 
Subjt:  DRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFK

Query:  FNEGKIAKTPMSTSTKLDKDEK-------------------------------------------------------------GLWYPRNVELNLIGYSD
          E K+ KTPMS+S KLD DEK                                                             GLWYP+     LIG+SD
Subjt:  FNEGKIAKTPMSTSTKLDKDEK-------------------------------------------------------------GLWYPRNVELNLIGYSD

Query:  ADFA
        ADFA
Subjt:  ADFA

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus] | e value: 1.6e-171 | % identity: 66.47
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEPKKKKSIALKTLSLEV
        MDANE+IT+MFTRFTNI N LKGLGKVYT SENVRKILRSLPK WEAKVTAI+EAKDLTKLPLEE IGSLMTHEI MKEH+EDE KKKKSIALKT+SLEV
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEPKKKKSIALKTLSLEV

Query:  ESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSKKSKKKAMKATWDDSDESES---------
        + EDE  LDE+DIAYFSRKYK FIKRKK F KH+S QKESKGEKSKKDEVICYECK+SGHIRTDCPLLKSSKKSKKKAMKATWDDS ESES         
Subjt:  ESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSKKSKKKAMKATWDDSDESES---------

Query:  ---GSDK----------------QTFASWLMVTRVMNKM----------------------------------------------------------MRF
            SDK                + F ++  +   + K+                                                          +RF
Subjt:  ---GSDK----------------QTFASWLMVTRVMNKM----------------------------------------------------------MRF

Query:  LEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPNHNMPKIGSKHDKSSFV
        LEHDS EKDNLIK+LKENEL+ LQEL KAKE+IKKLTIGAQRLDKIIEVGK YGDKRGLGYIDE STPSSSK  FVKASPIVP  NM    S H KSSFV
Subjt:  LEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPNHNMPKIGSKHDKSSFV

Query:  PICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKSKALNVNAKI-WVPKFV
        PICH+CGVEGHIRP CFKLKYA  T SRRNFSQ  KF+ APR NFS KSRVHKFV+++KSLH+VVCFSCG+YGHKAY CYLS+S A NV  K+ W+PK+V
Subjt:  PICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKSKALNVNAKI-WVPKFV

Query:  NANLLGPKQV
        NAN+LGPKQV
Subjt:  NANLLGPKQV

TrEMBL top hits (columns: hit | e value | % identity | alignment)
A0A2N9FWR3 CCHC-type domain-containing protein | e value: 1.2e-198 | % identity: 45.88
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE
        M  +E+I+EM TRFTNI N+LK LGK+YTN ENVRKILRSLPK WEAK+TAI EA+DL  L LEE  GSLMT+E+ M   +E+E  K KK+ ALK+   +
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE

Query:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF
         ++ +E   +EE+IA  +R +KKF+K+KK F +    + E+KGE SK +   CY+CKK GH + +CP +   K K KKKA+K TWDDSDES+S       
Subjt:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF

Query:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN
                           D+   DN                      +  L +                    LGYI+E +                  
Subjt:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN

Query:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS
               S+ + +SF P+  +                                                    D+S  + +C       H    C +SK 
Subjt:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS

Query:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA
                                     S K+KW+LDSG SRHMTGD++KF +L+ KDGG V FGDN K                      GYS+ SKA
Subjt:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA

Query:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG
        YRVFNKRT++++ESMHVVFDE+      NN  +E I  ++   +   + +S+K K ++D  K E        N    + K W    SHPK+LI+G+ E+G
Subjt:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG

Query:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE
        V TRS L ++ NN+AF+SQIEPK+  +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTK VFRNK DE G I+RNKARLVAQGY QEEGIDY 
Subjt:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE

Query:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL
        ET+APVARLEAIRMLLAFA +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GKLDTTL
Subjt:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL

Query:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE
        F+     DML+VQIYVDDIIFGSTN  LC+EFSK M  EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLKRF     K   TPMS STKLDKDE
Subjt:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE

Query:  K------------------------GLWYPRNVELNLIGYSDADFA
        K                        GLWYP++   +LI Y+DADFA
Subjt:  K------------------------GLWYPRNVELNLIGYSDADFA

A0A2N9G589 CCHC-type domain-containing protein | e value: 2.8e-198 | % identity: 45.88
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE
        M  +E+I+EM TRFTNI N+LK LGK+YTN ENVRKILRSLPK WEAK+TAI EA+DL  L LEE  GSLMT+E+ M   +E+E  K KK+ ALK+   +
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE

Query:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF
         ++ +E   +EE+IA  +R +KKF+K+KK F +    + E+KGE SK +   CY+CKK GH + +CP +   K K KKKA+K TWDDSDES+S       
Subjt:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF

Query:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN
                           D+   DN                      +  L +                    LGYI+E +                  
Subjt:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN

Query:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS
               S+ + +SF P+  +                                                    D+S  + +C       H    C +SK 
Subjt:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS

Query:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA
                                     S K KW+LDSG SRHMTGD++KF +L+ KDGG V FGDN K                      GYS+ SKA
Subjt:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA

Query:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG
        YRVFNKRT++++ESMHVVFDE+      NN  +E I  ++   +   + +S+K K ++D  K E        N    + K W    SHPK+LI+G+ E G
Subjt:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG

Query:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE
        V TRS L ++ NN+AF+SQIEPK+  +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTK VFRNK DE G I+RNKARLVAQGY QEEGIDY 
Subjt:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE

Query:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL
        ET+APVARLEAIRMLLAFA +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GKLDTTL
Subjt:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL

Query:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE
        F+     DML+VQIYVDDIIFGSTN  LC+EFSK M  EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLKRF     K   TPMS STKLDKDE
Subjt:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE

Query:  K------------------------GLWYPRNVELNLIGYSDADFA
        K                        GLWYP++   +LI Y+DADFA
Subjt:  K------------------------GLWYPRNVELNLIGYSDADFA

A0A2N9HSW7 CCHC-type domain-containing protein | e value: 2.5e-199 | % identity: 46.09
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE
        M  +E+I+EM TRFTNI N+LK LGK+YTN ENVRKILRSLPK WEAK+TAI EA+DL  L LEE  GSLMT+E+ M   +E+E  K KK+ ALK+   +
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE

Query:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF
         ++ +E   +EE+IA  +R +KKF+K+KK F +    + E+KGE SK +   CY+CKK GH + +CP +   K K KKKA+K TWDDSDES+S       
Subjt:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF

Query:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN
                           D+   DN                      +  L +                    LGYI+E +                  
Subjt:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN

Query:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS
               S+ + +SF P+  +                                                    D+S  + +C       H    C +SK 
Subjt:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS

Query:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA
                                     S KNKW+LDSG SRHMTGD++KF +L+ KDGG V FGDN K                      GYS+ SKA
Subjt:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA

Query:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG
        YRVFNKRT++++ESMHVVFDE+      NN  +E I  ++   +   + +S+K K ++D  K E        N    + K W    SHPK+LI+G+ E G
Subjt:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG

Query:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE
        V TRS L N+ NN+AF+SQIEPK+  +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTK VFRNK DE G I+RNKARLVAQGY QEEGIDY 
Subjt:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE

Query:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL
        ET+APVARLEAIRMLLAFA +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GKLDTTL
Subjt:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL

Query:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE
        F+     DML+VQIYVDDIIFGSTN  LC+EFSK M  EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLKRF     K   TPMS STKLDKDE
Subjt:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE

Query:  K------------------------GLWYPRNVELNLIGYSDADFA
        K                        GLWYP++   +LI Y+DADFA
Subjt:  K------------------------GLWYPRNVELNLIGYSDADFA

A0A2N9I2B8 CCHC-type domain-containing protein | e value: 4.7e-198 | % identity: 45.88
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE
        M  +E+I+EM TRFTNI N+LK LGK+YTN ENVRKILRSLPK WEAK+TAI EA+DL  L LEE  GSLMT+E+ M   +E+E  K KK+ ALK+   +
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE

Query:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF
         ++ +E   +EE+IA  +R +KKF+K+KK F +    + E+KGE SK +   CY+CKK GH + +CP +   K K KKKA+K TWDDSDES+S       
Subjt:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF

Query:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN
                           D+   DN                      +  L +                    LGYI+E +                  
Subjt:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN

Query:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS
               S+ + +SF P+  +                                                    D+S  + +C       H    C +SK 
Subjt:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS

Query:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA
                                     S K+KW+LDSG SRHMTGD++KF +L+ KDGG V FGDN K                      GYS+ SKA
Subjt:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA

Query:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG
        YRVFNKRT++++ESMHVVFDE+      NN  +E I  ++   +   +  S+K K ++D  K E        N    + K W    SHPK+LI+G+ E G
Subjt:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG

Query:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE
        V TRS L ++ NN+AF+SQIEPK+  +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTK VFRNK DE G I+RNKARLVAQGY QEEGIDY 
Subjt:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE

Query:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL
        ET+APVARLEAIRMLLAFA +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GKLDTTL
Subjt:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL

Query:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE
        F+     DML+VQIYVDDIIFGSTN  LC+EFSK M  EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLKRF     K   TPMS STKLDKDE
Subjt:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE

Query:  K------------------------GLWYPRNVELNLIGYSDADFA
        K                        GLWYP++   +LI Y+DADFA
Subjt:  K------------------------GLWYPRNVELNLIGYSDADFA

A0A2N9IJR3 CCHC-type domain-containing protein | e value: 2.8e-198 | % identity: 45.88
Query:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE
        M  +E+I+EM TRFTNI N+LK LGK+YTN ENVRKILRSLPK WEAK+TAI EA+DL  L LEE  GSLMT+E+ M   +E+E  K KK+ ALK+   +
Subjt:  MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEP-KKKKSIALKTLSLE

Query:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF
         ++ +E   +EE+IA  +R +KKF+K+KK F +    + E+KGE SK +   CY+CKK GH + +CP +   K K KKKA+K TWDDSDES+S       
Subjt:  VESEDEGVLDEEDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSK-KSKKKAMKATWDDSDESESGSDKQTF

Query:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN
                           D+   DN                      +  L +                    LGYI+E +                  
Subjt:  ASWLMVTRVMNKMMRFLEHDSYEKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPN

Query:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS
               S+ + +SF P+  +                                                    D+S  + +C       H    C +SK 
Subjt:  HNMPKIGSKHDKSSFVPICHHCGVEGHIRPNCFKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKS

Query:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA
                                     S K KW+LDSG SRHMTGD++KF +L+ KDGG V FGDN K                      GYS+ SKA
Subjt:  KALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKK----------------------GYSSTSKA

Query:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG
        YRVFNKRT++++ESMHVVFDE+      NN  +E I  ++   +   + +S+K K ++D  K E        N    + K W    SHPK+LI+G+ E G
Subjt:  YRVFNKRTLIIEESMHVVFDES-----CNNVSNESICSDDLERNFGDLLVSDKDK-EIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKDLILGDPEQG

Query:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE
        V TRS L ++ NN+AF+SQIEPK+  +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTK VFRNK DE G I+RNKARLVAQGY QEEGIDY 
Subjt:  VKTRSSL-NLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYE

Query:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL
        ET+APVARLEAIRMLLAFA +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFE+ + PNHV+KL KALYGLKQAPRAWY+RLS FL+E  F  GKLDTTL
Subjt:  ETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTL

Query:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE
        F+     DML+VQIYVDDIIFGSTN  LC+EFSK M  EFEMSMMGEL FFLGLQIKQ +D IF++Q KY  DLLKRF     K   TPMS STKLDKDE
Subjt:  FIKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDE

Query:  K------------------------GLWYPRNVELNLIGYSDADFA
        K                        GLWYP++   +LI Y+DADFA
Subjt:  K------------------------GLWYPRNVELNLIGYSDADFA

SwissProt top hits (columns: hit | e value | % identity | alignment)
P04146 Copia protein | e value: 5.6e-47 | % identity: 30.83
Query:  VNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKKGYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDDL
        VN+  +  + V LK SK+++       SR +   Q++F N SK+   +    D+K+   S +K +   +++ +  E      F        N     D  
Subjt:  VNANLLGPKQVCLKVSKKNKWYLDSGYSRHMTGDQSKFVNLSKKDGGLVTFGDNKKGYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDDL

Query:  ERNFGDLLVSDK---DKEIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKD--LILGDPEQGVKTR---------SSLN-LFNNLAFVSQIEPKSFKD-
        E N   L  S K   D  ++ SK   + NE +E+ ++   KE         D   I+    + +KT+         +SLN +  N   +    P SF + 
Subjt:  ERNFGDLLVSDK---DKEIDSSKQEVSLNEKKENSSSSMSKEWRYAPSHPKD--LILGDPEQGVKTR---------SSLN-LFNNLAFVSQIEPKSFKD-

Query:  --AENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI
           ++   W  A+  ELN  + N  W +  RP N +I+ ++ VF  K +E GN IR KARLVA+G+ Q+  IDYEETFAPVAR+ + R +L+     N  
Subjt:  --AENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFI

Query:  LYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFI--KTKENDMLLVQIYVDDIIFGS
        ++QMDVK+AFLNG + EE+Y+  P G       ++V KL KA+YGLKQA R W++     L E +F    +D  ++I  K   N+ + V +YVDD++  +
Subjt:  LYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFI--KTKENDMLLVQIYVDDIIFGS

Query:  TNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTK---LDKDE
         +      F + +  +F M+ + E+  F+G++I+  +D I++SQ  Y K +L +F         TP+ +      L+ DE
Subjt:  TNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTK---LDKDE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    2.9e-43    35.09
Query:  EPKSFKDA----ENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL
        EP+S K+     E ++  + AMQEE+   ++N  ++LV  P     +  K VF+ K D +  ++R KARLV +G+ Q++GID++E F+PV ++ +IR +L
Subjt:  EPKSFKDA----ENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL

Query:  AFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKT-KENDMLLVQIY
        + A+  +  + Q+DVK+AFL+G + EE+Y+EQP GFE     + V KL K+LYGLKQAPR WY +  +F+    +     D  ++ K   EN+ +++ +Y
Subjt:  AFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKT-KENDMLLVQIY

Query:  VDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQI--KQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDK
        VDD++    +  L  +    +   F+M  +G     LG++I  ++    +++SQEKY + +L+RF     K   TP++   KL K
Subjt:  VDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQI--KQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDK

P25600 Putative transposon Ty5-1 protein YCL074W    2.9e-19    31.36
Query:  MDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCL
        MDV +AFLN  + E +YV+QPPGF +   P++V++L   +YGLKQAP  W + ++N L +  F   + +  L+ ++  +  + + +YVDD++  + +P +
Subjt:  MDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDIIFGSTNPCL

Query:  CEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKD-DIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL
         +   + +   + M  +G++  FLGL I Q  + DI +S + Y        + N  K+ +TP+  S  L
Subjt:  CEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKD-DIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    3.3e-55    40.85
Query:  LAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAI
        ++  ++ EP++   A  DE W  AM  E+N    N  W+LV P PS+V+I+G + +F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +I
Subjt:  LAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAI

Query:  RMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLV
        R++L  A  +++ + Q+DV +AFL G + ++VY+ QPPGF   D PN+V KL+KALYGLKQAPRAWY  L N+LL   F     DT+LF+  +   ++ +
Subjt:  RMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLV

Query:  QIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL
         +YVDDI+    +P L       +   F +    EL +FLG++ K++   + +SQ +Y  DLL R      K   TPM+ S KL
Subjt:  QIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.4e-53    40.07
Query:  EPKSFKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP++   A  D+ W  AM  E+N    N  W+LV P P +V+I+G + +F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A
Subjt:  EPKSFKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDI
          +++ + Q+DV +AFL G + +EVY+ QPPGF   D P++V +L+KA+YGLKQAPRAWY  L  +LL   F     DT+LF+  +   ++ + +YVDDI
Subjt:  SYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYVDDI

Query:  IFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL
        +    +  L +     +   F +    +L +FLG++ K++   + +SQ +YT DLL R      K   TPM+TS KL
Subjt:  IFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL

Arabidopsis top hits    e value    %identity    Alignment
AT4G05360.1 Zinc knuckle (CCHC-type) family protein    4.5e-07    30.6
Query:  KENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPNHNMPK-------------IGSKHDKSS----
        KE E + L+E    +++++ L  G ++L  I+ +GK   DK GLG+      PS S  +FV    I       K               S+ D  +    
Subjt:  KENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPNHNMPK-------------IGSKHDKSS----

Query:  -------------FVPICHHCGVEGHIRPNCFKL
                     F P+CHHCGV GHIRP CF+L
Subjt:  -------------FVPICHHCGVEGHIRPNCFKL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    6.6e-51    38.13
Query:  EPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EP ++ +A+    W  AM +E+   E    WE+   P N   IG K V++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++
Subjt:  EPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYV
          NF L+Q+D+ +AFLNG + EE+Y++ PPG+ +       PN V  LKK++YGLKQA R W+ + S  L+   F     D T F+K      L V +YV
Subjt:  YKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLFIKTKENDMLLVQIYV

Query:  DDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDEKGLWYPRNVELNLIG
        DDII  S N    +E    + S F++  +G L +FLGL+I +    I I Q KY  DLL        K +  PM  S        G +        LIG
Subjt:  DDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDEKGLWYPRNVELNLIG

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.9e-05    34.94
Query:  IYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL
        +YVDDI+   ++  L       + S F M  +G + +FLG+QIK     +F+SQ KY + +L     N G +   PMST   L
Subjt:  IYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    1.0e-19    51.52
Query:  EPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EPKS   A  D  W  AMQEEL+   RNK W LVP P N +I+G K VF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Subjt:  EPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA


Sequences
CDS sequence
ATGGATGCTAATGAATCTATTACCGAGATGTTTACTAGATTTACTAACATTACAAATACTTTGAAAGGTCTTGGTAAAGTTTATACTAACTCGGAAAATGTTAGAAAAAT
TCTTAGATCTCTACCTAAGATTTGGGAAGCTAAGGTGACGGCAATTGAAGAAGCAAAGGATCTCACCAAACTTCCTTTGGAGGAATTTATTGGCTCTCTCATGACACATG
AGATCACTATGAAGGAACACATGGAGGATGAGCCCAAAAAGAAGAAAAGTATTGCTTTAAAGACTTTATCCTTGGAAGTTGAGTCCGAAGATGAGGGCGTTCTTGATGAA
GAAGATATTGCATATTTTTCACGTAAGTATAAAAAGTTCATTAAGAGGAAGAAACAATTCAATAAACATATTTCAAACCAAAAAGAGTCAAAGGGTGAAAAGAGTAAAAA
GGATGAGGTCATATGCTATGAATGCAAGAAATCGGGTCACATAAGAACGGATTGCCCTCTCCTCAAATCATCCAAGAAATCCAAGAAGAAAGCAATGAAAGCCACTTGGG
ATGATAGTGATGAAAGTGAAAGTGGTAGTGACAAGCAAACTTTTGCTTCATGGCTCATGGTGACAAGAGTGATGAACAAGATGATGAGATTTCTTGAGCATGATAGTTAT
GAAAAGGATAATTTGATTAAATTGCTTAAGGAAAATGAATTAAATGCTTTGCAAGAACTTGGTAAGGCAAAAGAGTCCATTAAAAAGTTGACAATAGGTGCTCAAAGATT
GGACAAAATAATTGAAGTTGGCAAACCTTATGGTGATAAAAGAGGTTTAGGCTACATTGATGAATGTTCTACTCCTTCAAGTTCTAAAATCATCTTTGTTAAAGCATCTC
CCATTGTGCCTAATCATAATATGCCTAAGATTGGGTCTAAGCATGATAAATCTAGTTTTGTGCCTATATGTCATCATTGTGGTGTTGAAGGTCATATTAGGCCAAATTGC
TTTAAATTAAAATATGCTCATACTACTTCTTCAAGAAGAAATTTTTCTCAAATAACAAAGTTTCACAATGCTCCAAGAAATAATTTCTCTAAGAAAAGTAGAGTGCATAA
ATTTGTCGTAAGAGATAAATCCTTGCATGATGTTGTTTGTTTCTCATGTGGCGAGTATGGACATAAAGCTTATTTTTGTTACTTGTCTAAATCCAAAGCTTTGAATGTGA
ATGCAAAAATATGGGTTCCCAAGTTTGTAAATGCTAACCTTCTAGGACCCAAACAAGTATGTTTGAAAGTCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTACTCG
AGGCATATGACGGGAGACCAATCCAAGTTTGTCAATCTTTCCAAAAAGGATGGAGGTTTAGTAACTTTTGGTGACAACAAGAAAGGTTATTCATCTACTAGTAAAGCCTA
TAGGGTTTTCAATAAGAGAACTTTAATTATTGAGGAATCTATGCATGTTGTATTTGATGAATCTTGCAATAATGTTTCTAATGAGTCTATTTGCAGTGATGATTTAGAAA
GGAATTTTGGAGATTTACTTGTTAGTGACAAAGACAAGGAAATTGACTCAAGTAAGCAAGAAGTGAGCTTGAATGAAAAGAAGGAAAATAGTTCTTCATCTATGTCTAAA
GAATGGAGGTATGCTCCATCCCATCCTAAGGACTTGATTCTTGGTGATCCCGAACAAGGTGTGAAAACTCGTTCCTCTCTTAATTTGTTTAATAATCTTGCTTTTGTTTC
TCAAATTGAACCTAAAAGTTTTAAAGATGCGGAAAATGATGAATTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGGAACAAAGTTTGGGAATTAGTCC
CTAGGCCTTCTAATGTTTCTATTATTGGGACTAAATGTGTATTTAGAAACAAAATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTAT
TGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCCGTTGCTAGATTAGAAGCTATTAGAATGTTACTTGCTTTTGCTTCCTATAAAAATTTTATCTTGTATCA
AATGGATGTTAAAAGCGCCTTCTTAAATGGTTATATTATGGAGGAAGTTTATGTGGAACAACCTCCGGGCTTTGAAAGTTTTGATTTGCCTAATCATGTCTATAAGTTGA
AAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGGCTTAGTAATTTTCTTCTTGAGAATGATTTTAAAATGGGTAAACTTGACACTACTCTTTTT
ATTAAGACTAAGGAAAATGATATGCTTTTAGTACAAATCTATGTAGATGATATTATCTTTGGTTCTACTAATCCTTGTTTGTGTGAAGAATTTTCTAAATGTATGCATAG
TGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGACTTCAAATCAAACAACTCAAGGATGATATCTTCATAAGTCAAGAGAAATACACAAAAGATTTGC
TCAAAAGGTTCAAGTTCAATGAAGGTAAAATTGCAAAAACTCCTATGAGCACATCCACTAAGCTTGACAAGGATGAAAAAGGTTTATGGTATCCTAGAAATGTTGAGCTT
AATTTGATAGGATATTCCGATGCGGATTTTGCTTAA
mRNA sequence
ATGGATGCTAATGAATCTATTACCGAGATGTTTACTAGATTTACTAACATTACAAATACTTTGAAAGGTCTTGGTAAAGTTTATACTAACTCGGAAAATGTTAGAAAAAT
TCTTAGATCTCTACCTAAGATTTGGGAAGCTAAGGTGACGGCAATTGAAGAAGCAAAGGATCTCACCAAACTTCCTTTGGAGGAATTTATTGGCTCTCTCATGACACATG
AGATCACTATGAAGGAACACATGGAGGATGAGCCCAAAAAGAAGAAAAGTATTGCTTTAAAGACTTTATCCTTGGAAGTTGAGTCCGAAGATGAGGGCGTTCTTGATGAA
GAAGATATTGCATATTTTTCACGTAAGTATAAAAAGTTCATTAAGAGGAAGAAACAATTCAATAAACATATTTCAAACCAAAAAGAGTCAAAGGGTGAAAAGAGTAAAAA
GGATGAGGTCATATGCTATGAATGCAAGAAATCGGGTCACATAAGAACGGATTGCCCTCTCCTCAAATCATCCAAGAAATCCAAGAAGAAAGCAATGAAAGCCACTTGGG
ATGATAGTGATGAAAGTGAAAGTGGTAGTGACAAGCAAACTTTTGCTTCATGGCTCATGGTGACAAGAGTGATGAACAAGATGATGAGATTTCTTGAGCATGATAGTTAT
GAAAAGGATAATTTGATTAAATTGCTTAAGGAAAATGAATTAAATGCTTTGCAAGAACTTGGTAAGGCAAAAGAGTCCATTAAAAAGTTGACAATAGGTGCTCAAAGATT
GGACAAAATAATTGAAGTTGGCAAACCTTATGGTGATAAAAGAGGTTTAGGCTACATTGATGAATGTTCTACTCCTTCAAGTTCTAAAATCATCTTTGTTAAAGCATCTC
CCATTGTGCCTAATCATAATATGCCTAAGATTGGGTCTAAGCATGATAAATCTAGTTTTGTGCCTATATGTCATCATTGTGGTGTTGAAGGTCATATTAGGCCAAATTGC
TTTAAATTAAAATATGCTCATACTACTTCTTCAAGAAGAAATTTTTCTCAAATAACAAAGTTTCACAATGCTCCAAGAAATAATTTCTCTAAGAAAAGTAGAGTGCATAA
ATTTGTCGTAAGAGATAAATCCTTGCATGATGTTGTTTGTTTCTCATGTGGCGAGTATGGACATAAAGCTTATTTTTGTTACTTGTCTAAATCCAAAGCTTTGAATGTGA
ATGCAAAAATATGGGTTCCCAAGTTTGTAAATGCTAACCTTCTAGGACCCAAACAAGTATGTTTGAAAGTCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTACTCG
AGGCATATGACGGGAGACCAATCCAAGTTTGTCAATCTTTCCAAAAAGGATGGAGGTTTAGTAACTTTTGGTGACAACAAGAAAGGTTATTCATCTACTAGTAAAGCCTA
TAGGGTTTTCAATAAGAGAACTTTAATTATTGAGGAATCTATGCATGTTGTATTTGATGAATCTTGCAATAATGTTTCTAATGAGTCTATTTGCAGTGATGATTTAGAAA
GGAATTTTGGAGATTTACTTGTTAGTGACAAAGACAAGGAAATTGACTCAAGTAAGCAAGAAGTGAGCTTGAATGAAAAGAAGGAAAATAGTTCTTCATCTATGTCTAAA
GAATGGAGGTATGCTCCATCCCATCCTAAGGACTTGATTCTTGGTGATCCCGAACAAGGTGTGAAAACTCGTTCCTCTCTTAATTTGTTTAATAATCTTGCTTTTGTTTC
TCAAATTGAACCTAAAAGTTTTAAAGATGCGGAAAATGATGAATTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGGAACAAAGTTTGGGAATTAGTCC
CTAGGCCTTCTAATGTTTCTATTATTGGGACTAAATGTGTATTTAGAAACAAAATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTAT
TGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCCGTTGCTAGATTAGAAGCTATTAGAATGTTACTTGCTTTTGCTTCCTATAAAAATTTTATCTTGTATCA
AATGGATGTTAAAAGCGCCTTCTTAAATGGTTATATTATGGAGGAAGTTTATGTGGAACAACCTCCGGGCTTTGAAAGTTTTGATTTGCCTAATCATGTCTATAAGTTGA
AAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGGCTTAGTAATTTTCTTCTTGAGAATGATTTTAAAATGGGTAAACTTGACACTACTCTTTTT
ATTAAGACTAAGGAAAATGATATGCTTTTAGTACAAATCTATGTAGATGATATTATCTTTGGTTCTACTAATCCTTGTTTGTGTGAAGAATTTTCTAAATGTATGCATAG
TGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGACTTCAAATCAAACAACTCAAGGATGATATCTTCATAAGTCAAGAGAAATACACAAAAGATTTGC
TCAAAAGGTTCAAGTTCAATGAAGGTAAAATTGCAAAAACTCCTATGAGCACATCCACTAAGCTTGACAAGGATGAAAAAGGTTTATGGTATCCTAGAAATGTTGAGCTT
AATTTGATAGGATATTCCGATGCGGATTTTGCTTAA
Protein sequence
MDANESITEMFTRFTNITNTLKGLGKVYTNSENVRKILRSLPKIWEAKVTAIEEAKDLTKLPLEEFIGSLMTHEITMKEHMEDEPKKKKSIALKTLSLEVESEDEGVLDE
EDIAYFSRKYKKFIKRKKQFNKHISNQKESKGEKSKKDEVICYECKKSGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDKQTFASWLMVTRVMNKMMRFLEHDSY
EKDNLIKLLKENELNALQELGKAKESIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSSKIIFVKASPIVPNHNMPKIGSKHDKSSFVPICHHCGVEGHIRPNC
FKLKYAHTTSSRRNFSQITKFHNAPRNNFSKKSRVHKFVVRDKSLHDVVCFSCGEYGHKAYFCYLSKSKALNVNAKIWVPKFVNANLLGPKQVCLKVSKKNKWYLDSGYS
RHMTGDQSKFVNLSKKDGGLVTFGDNKKGYSSTSKAYRVFNKRTLIIEESMHVVFDESCNNVSNESICSDDLERNFGDLLVSDKDKEIDSSKQEVSLNEKKENSSSSMSK
EWRYAPSHPKDLILGDPEQGVKTRSSLNLFNNLAFVSQIEPKSFKDAENDEFWILAMQEELNQFERNKVWELVPRPSNVSIIGTKCVFRNKMDENGNIIRNKARLVAQGY
CQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSNFLLENDFKMGKLDTTLF
IKTKENDMLLVQIYVDDIIFGSTNPCLCEEFSKCMHSEFEMSMMGELSFFLGLQIKQLKDDIFISQEKYTKDLLKRFKFNEGKIAKTPMSTSTKLDKDEKGLWYPRNVEL
NLIGYSDADFA