; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005864 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005864
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold11:178967..190391
RNA-Seq ExpressionSpg005864
SyntenySpg005864
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]1.6e-4226.71Show/hide
Query:  EEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEHT-TIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYDKSMLLLQDPKGDCS
        E++     ++ E I   E+     ++ K++T    N   FK  + + W  +++  ++    NLFL +F   +    ++ +GPW +D+++L+L    G+  
Subjt:  EEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEHT-TIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYDKSMLLLQDPKGDCS

Query:  GEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAVTYEKLPDFCYGCGR
          +++   V+FW+  + LP    S   A ++G+++GN E+VD  DA        LR+K  +D+ KPLKRG  IK     +  W+   YE+LP+FC+ CG+
Subjt:  GEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAVTYEKLPDFCYGCGR

Query:  LGHIIKECE-----EDSGTSDV---DLPYGPMLREPPK---FKGLDSSATMNPESKHW--GWGRGRGRMGSGGRGSWRSADSGAEEILERRKDEEESQLG
        +GH +KECE     +++  SD+      YGP LR  P    F+     A+    SK+      + +G    G +   +  D      +E++    ++   
Subjt:  LGHIIKECE-----EDSGTSDV---DLPYGPMLREPPK---FKGLDSSATMNPESKHW--GWGRGRGRMGSGGRGSWRSADSGAEEILERRKDEEESQLG

Query:  KEGPKVGENGAPAAEKGKAVV--TPRGQGSSEG------GGGEPAMATTASGSTVEKTNLDRENVIINS--SINEGVNL----------------GITKL
         +   V E+    A     VV  T + QG S+G         +P  A TA     E    +  +V I+   +++  + L                 I +L
Subjt:  KEGPKVGENGAPAAEKGKAVV--TPRGQGSSEG------GGGEPAMATTASGSTVEKTNLDRENVIINS--SINEGVNL----------------GITKL

Query:  KFLNGDS---------------------------VLMKEQSLN---GEVKDIAYMELDTNMEGLREYSDGLKVQEVRPDSE---------EKILSNEG-K
        KF  G S                           + +K  SLN   G+  D+   E   ++ G+  Y++  K   VR   +         E  L++ G +
Subjt:  KFLNGDS---------------------------VLMKEQSLN---GEVKDIAYMELDTNMEGLREYSDGLKVQEVRPDSE---------EKILSNEG-K

Query:  GKLKTWKRAQRELKNKEASELTKLS--DIKDKCCFFKVYHLNLLASDH--RPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSG
        G   TW   +    NK+  +   +S  +  ++     V HL    SDH    I  E  T ++ R+  R   + RFEE WT    C  +++  W     S 
Subjt:  GKLKTWKRAQRELKNKEASELTKLS--DIKDKCCFFKVYHLNLLASDH--RPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSG

Query:  HKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKER--ELHDLLNQSVEQSLMEVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCR
          D L + ++    L   S     GSI   I + E+  + HD+ ++S E S+      E  LE LL+++E  W QR+R  WLK GD+NTK+FH KAS+ R
Subjt:  HKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKER--ELHDLLNQSVEQSLMEVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCR

Query:  KINTIRGLLDEEGSW
        K+N I+ L DE G W
Subjt:  KINTIRGLLDEEGSW

KAF4363317.1 hypothetical protein G4B88_011714 [Cannabis sativa]1.6e-3926.81Show/hide
Query:  DEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNTRIKGFIMNSGPWF
        D E++ RLA++ V  E    V  L +  ++  +K++   ++ K+   R  N +  +  M  +W   H     E    N+F   F     +  +   GPW 
Subjt:  DEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNTRIKGFIMNSGPWF

Query:  YDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWI
         DK ++    P G     +M F F SFWI  + +PL C +   A E G  +G +E + +V+ +       ++++V+I++T+PLKRG+ +   + G E  +
Subjt:  YDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWI

Query:  AVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDL----PYGPMLREP-----PKFKG---LDSSATMNP-----ESKHWGWGRGRGRMGSGGRGSWRSA
           YE LPDFCY CG +GH   +C       D        +G  L  P      +F+G    +SS +  P     E+        R R+   G  S   A
Subjt:  AVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDL----PYGPMLREP-----PKFKG---LDSSATMNP-----ESKHWGWGRGRGRMGSGGRGSWRSA

Query:  DSGAEEILERRKDEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAM---------ATTASGSTVEKTNLDRENVIINSSINEGVNLGI
         +   E  ER + +++   G +GP V   G   + K K  + P         G  P +         ++T  GS V + ++      ++ S  EG   G 
Subjt:  DSGAEEILERRKDEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAM---------ATTASGSTVEKTNLDRENVIINSSINEGVNLGI

Query:  TKLKFLNGDSVLMKEQSLNGEVKDIAYMELDTNMEGLREYS--DGLKVQEVRPDSEEKILSNEGKGKLKTWKRAQRELKNKEASELT------KLSDIKD
              N +  ++ E   + + K    +  +   +G+ ++S  D +  +     + + I+  +   K KTWKRA      K  S+L+      KLS +  
Subjt:  TKLKFLNGDSVLMKEQSLNGEVKDIAYMELDTNMEGLREYS--DGLKVQEVRPDSEEKILSNEGKGKLKTWKRAQRELKNKEASELT------KLSDIKD

Query:  KCCFFK---------VYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVW--QNRMRSGHKDILDKTKECLIRLKGWSHQK
            +K         V   + + SDHRP++A        +K  +     RFE  W K E+C  IV Q W   +        I+D    C  RL  W+  K
Subjt:  KCCFFK---------VYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVW--QNRMRSGHKDILDKTKECLIRLKGWSHQK

Query:  YGGSIRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG
        + GSI   +++ +++L DLL+ +     M EV   E  L +LL  +E YW  R+R DWL  GDRNTK+FH KA+  +K N I  ++ ++G
Subjt:  YGGSIRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG

KAF4368982.1 hypothetical protein G4B88_011810 [Cannabis sativa]2.3e-4127.41Show/hide
Query:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT
        +L + ++    S D E++ RL+++ V  E    V  L +  I+  +KK+   ++ K+   R  N E  +  M  +W   H     E    N+F   F   
Subjt:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT

Query:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV
          +  +   GPW  DK ++    P G     +M F F SFWI  + +PLAC +   A E G  +G +E + +V       + +++++V+I++T+PLKRG+
Subjt:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV

Query:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKEC----------EEDSGT--SDVDLPYGPMLREPPKFKGLDSSATMNP-----ESKHWGWGRGRG
         +     G E  +   YE LPDFC+ CG +GH   +C            DSG   S +  P  P        K  ++S +  P     E+        R 
Subjt:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKEC----------EEDSGT--SDVDLPYGPMLREPPKFKGLDSSATMNP-----ESKHWGWGRGRG

Query:  RMGSGGRGSWRSADSGAEEILERRKDEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMA-TTASGSTVEKTNLDRENVIINSSINEG
        R+   G      A +   E  ER  DEE +       K     AP    G + VT    G      G+P ++      S+     +    V++  + +  
Subjt:  RMGSGGRGSWRSADSGAEEILERRKDEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMA-TTASGSTVEKTNLDRENVIINSSINEG

Query:  VNLGITKLKFLNGDSVLMKEQSLNGEVKDIA------YMELDTNMEGLREYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRAQRELKNKEASELTKLSD
          LG      ++G S   K   + GEV           ME+D   E +++    L+ Q      +     N+ +G+    +R  R   N+E   L     
Subjt:  VNLGITKLKFLNGDSVLMKEQSLNGEVKDIA------YMELDTNMEGLREYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRAQRELKNKEASELTKLSD

Query:  IKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQKYGGS
                +V + +LL SDHR ++A        ++  R     RFE  W K +DC  IV++ W     S   D    ILD    C  +L  W+  K+ GS
Subjt:  IKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQKYGGS

Query:  IRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG
        I   +++ +++L DLL+ S     M EV   E  L +LL  +E YW  R+R DWL  GDRNTK+FH KA+  +K N I  ++ E+G
Subjt:  IRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG

KAF4375818.1 hypothetical protein F8388_014540 [Cannabis sativa]2.5e-4026.09Show/hide
Query:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT
        +L +  +    S D E++ RL+++ V  E    V  L +  I+  +KK+   ++ K+   R  N E  +  M  +W   H     E    N+F   F   
Subjt:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT

Query:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV
          +  +   GPW  DK ++    P G     +M F F SFWI  + +PLAC +   A E G  +  +E + +V       + +++++V+I++T+PLKRG+
Subjt:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV

Query:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDLPYGPMLREPPKFKGLDSSATMNPESKHWGWGRGRGRMGSGGRGSWRSADSGA
         +     G E  +   YE LPDFC+ CG +GH   +C                         L     +NP               SG  GSW  A S  
Subjt:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDLPYGPMLREPPKFKGLDSSATMNPESKHWGWGRGRGRMGSGGRGSWRSADSGA

Query:  EEILERRKDEEESQLGKEGPKVGE----NGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLD---RENVIINSSINEGVNLGITKLKFL
               +D   +++   GP + E    +   A E+ + + +       E       + +  +GS + + N D    E V++ SS     ++ +  +  +
Subjt:  EEILERRKDEEESQLGKEGPKVGE----NGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLD---RENVIINSSINEGVNLGITKLKFL

Query:  NGDSVLMKEQ------------SLNGEVKDIAYMELD----TNME---GLREYSDGLKVQEVRPDSEEKILS--NEGKGKLKTWKRAQRELKNKEASELT
          D+   KEQ            S +G+V  +   +L      N +    L+    G +  E+  +S  +  +  N+ +G     +R  R   N+E   L 
Subjt:  NGDSVLMKEQ------------SLNGEVKDIAYMELD----TNME---GLREYSDGLKVQEVRPDSEEKILS--NEGKGKLKTWKRAQRELKNKEASELT

Query:  KLSDIKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQK
                    +V + +L+ SDHRP++A    D   ++  R     RFE  W K +DC  IV++ W     S   D    ILD    C  +L  W+  K
Subjt:  KLSDIKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQK

Query:  YGGSIRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG
        + GSI   +++ +++L DLL+ S     M EV   E  L +LL  +E YW  R+R DWL  GDRNTK+FH KA+  +K   I  ++ E+G
Subjt:  YGGSIRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG

TXG61366.1 hypothetical protein EZV62_012729 [Acer yangbiense]5.6e-4026.25Show/hide
Query:  EEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQ-EHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYD
        E  +++L +     +E  +V ++ EE     E ++   ++ K+ + +K+N + FK+ + ++W+      ++  G N+F+  F+       I   GPW++D
Subjt:  EEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQ-EHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYD

Query:  KSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAV
        KS+L+L+ P+G  +  ++ F  V  WI  H +P+ C +R +A  +   LG V     +D L E    ++ +                          + +
Subjt:  KSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAV

Query:  TYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDL-----PYGPMLREP-PKFKGLDSSATMNPESK-HWGWGRGRGRMGSGGRGSWRSADSGAEEILERRK
         YE+LP+FCY CGR+GH  K+C +    S+ DL      YG  +R   P+ + L     +   SK   G    R      G G   + D   + +     
Subjt:  TYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDL-----PYGPMLREP-PKFKGLDSSATMNPESK-HWGWGRGRGRMGSGGRGSWRSADSGAEEILERRK

Query:  DEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLDRENVII-NSSINEGVNLGITKLKFLNGDSVLMKEQSLNG
          E++ + KE  K  ++G   ++K K + T  G GSS+G G       +  G + E    +++NV+  N S   G+      +  L     L      N 
Subjt:  DEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLDRENVII-NSSINEGVNLGITKLKFLNGDSVLMKEQSLNG

Query:  EVKDIAYM-------------ELDTNMEGL-REYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRA-QRELKNKEASELTKLSDIKDKCCFFKVYHLNL-
         +KD+A +              +  N + + RE  + L   + +P    K+ +   KGK  +   A    + N      T  S +K     F    L + 
Subjt:  EVKDIAYM-------------ELDTNMEGL-REYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRA-QRELKNKEASELTKLSDIKDKCCFFKVYHLNL-

Query:  -------LASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNR-MRSGHKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKERE
                 SDHRPIL ++  D+  R+     S  RFE  W K ED  ++V   W+ + + +  ++ L K  +C   L GWS  ++  ++   I+ K RE
Subjt:  -------LASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNR-MRSGHKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKERE

Query:  LHDLLNQSVEQSLMEV-TEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEGSWK
        +  L   S +  +M V  E EK +E LL+ +EI+W QR+R +WL+ GDRN+K+ H +A+  +K NTI+ LL+ EG ++
Subjt:  LHDLLNQSVEQSLMEV-TEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEGSWK

TrEMBL top hitse value%identityAlignment
A0A2N9F7A6 Uncharacterized protein1.6e-4024Show/hide
Query:  DEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWN-QEHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWFY
        +E +L+      +T +E A  F + ++ +  ++   +N +L ++ T++  N    KS M ++W      TI+  G NLF+ +F N   +  +MN  PW +
Subjt:  DEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWN-QEHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWFY

Query:  DKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIA
        +  ML L +  G C   +++F    FW+  H +PL   ++T+   +G  LG V +VD+++     W   LR ++ +D++KP+ RG  I     G + W++
Subjt:  DKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIA

Query:  VTYEKLPDFCYGCGRLGHIIKE----CEEDSGTSDVDLPYGPMLREPPKFKGLDSSATMNPESKHWGWGRGR----GRMGSGGRGSW-----------RS
          YE+LP  C+ CG +GH  ++    C       +V   YGP LR          +A +N   +  G+GR R    GR  +  +G+             S
Subjt:  VTYEKLPDFCYGCGRLGHIIKE----CEEDSGTSDVDLPYGPMLREPPKFKGLDSSATMNPESKHWGWGRGR----GRMGSGGRGSW-----------RS

Query:  ADSGAEEILERRKD-----------------EEESQLGKEGPKV---GENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTAS----------------
        +  GA  +  + ++                 E  +   KE  KV    E G   +      VT +      G    P++ T  S                
Subjt:  ADSGAEEILERRKD-----------------EEESQLGKEGPKV---GENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTAS----------------

Query:  ----------------------GSTVEKT--------NLDRENVIIN-SSINEGVN---LGITKLKFLNGDSVLMKEQ---SLNGEVK---DIAYMELDT
                              G +V K         NL R  +  + +  NE V    LG  +     G ++L   +    LN   K   D A ++ + 
Subjt:  ----------------------GSTVEKT--------NLDRENVIIN-SSINEGVN---LGITKLKFLNGDSVLMKEQ---SLNGEVK---DIAYMELDT

Query:  -----------NMEGLREYSDGLKVQEVRPDSE---------EKILSNE---GKGKLKTWKRAQRELKNKEASELTKLSDI------------KDKCCFF
                   N E  R       ++ +   +           +IL N    G G  + W+        +EA   ++L D+            +D   F 
Subjt:  -----------NMEGLREYSDGLKVQEVRPDSE---------EKILSNE---GKGKLKTWKRAQRELKNKEASELTKLSDI------------KDKCCFF

Query:  K--------------------VYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGH--KDILDKTKECLIRLK
                             VYHL +  SDH P+L +  +     K  +   + RFE +WTK E CR ++ + W   ++ G     + +K K+C + L 
Subjt:  K--------------------VYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGH--KDILDKTKECLIRLK

Query:  GWSHQKYGGSIRGAIKKKERELHDLLNQSVEQSLMEVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEGSWKTK
         WS +++ GSI  +IK K  +L    N S       + E + +L  LLE +EI+W QR+R  W+  GD+NTK+FH   ++ R+ N I+GL D++  W+T+
Subjt:  GWSHQKYGGSIRGAIKKKERELHDLLNQSVEQSLMEVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEGSWKTK

A0A2Z6NZV1 Uncharacterized protein7.6e-4326.71Show/hide
Query:  EEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEHT-TIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYDKSMLLLQDPKGDCS
        E++     ++ E I   E+     ++ K++T    N   FK  + + W  +++  ++    NLFL +F   +    ++ +GPW +D+++L+L    G+  
Subjt:  EEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEHT-TIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYDKSMLLLQDPKGDCS

Query:  GEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAVTYEKLPDFCYGCGR
          +++   V+FW+  + LP    S   A ++G+++GN E+VD  DA        LR+K  +D+ KPLKRG  IK     +  W+   YE+LP+FC+ CG+
Subjt:  GEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAVTYEKLPDFCYGCGR

Query:  LGHIIKECE-----EDSGTSDV---DLPYGPMLREPPK---FKGLDSSATMNPESKHW--GWGRGRGRMGSGGRGSWRSADSGAEEILERRKDEEESQLG
        +GH +KECE     +++  SD+      YGP LR  P    F+     A+    SK+      + +G    G +   +  D      +E++    ++   
Subjt:  LGHIIKECE-----EDSGTSDV---DLPYGPMLREPPK---FKGLDSSATMNPESKHW--GWGRGRGRMGSGGRGSWRSADSGAEEILERRKDEEESQLG

Query:  KEGPKVGENGAPAAEKGKAVV--TPRGQGSSEG------GGGEPAMATTASGSTVEKTNLDRENVIINS--SINEGVNL----------------GITKL
         +   V E+    A     VV  T + QG S+G         +P  A TA     E    +  +V I+   +++  + L                 I +L
Subjt:  KEGPKVGENGAPAAEKGKAVV--TPRGQGSSEG------GGGEPAMATTASGSTVEKTNLDRENVIINS--SINEGVNL----------------GITKL

Query:  KFLNGDS---------------------------VLMKEQSLN---GEVKDIAYMELDTNMEGLREYSDGLKVQEVRPDSE---------EKILSNEG-K
        KF  G S                           + +K  SLN   G+  D+   E   ++ G+  Y++  K   VR   +         E  L++ G +
Subjt:  KFLNGDS---------------------------VLMKEQSLN---GEVKDIAYMELDTNMEGLREYSDGLKVQEVRPDSE---------EKILSNEG-K

Query:  GKLKTWKRAQRELKNKEASELTKLS--DIKDKCCFFKVYHLNLLASDH--RPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSG
        G   TW   +    NK+  +   +S  +  ++     V HL    SDH    I  E  T ++ R+  R   + RFEE WT    C  +++  W     S 
Subjt:  GKLKTWKRAQRELKNKEASELTKLS--DIKDKCCFFKVYHLNLLASDH--RPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSG

Query:  HKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKER--ELHDLLNQSVEQSLMEVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCR
          D L + ++    L   S     GSI   I + E+  + HD+ ++S E S+      E  LE LL+++E  W QR+R  WLK GD+NTK+FH KAS+ R
Subjt:  HKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKER--ELHDLLNQSVEQSLMEVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCR

Query:  KINTIRGLLDEEGSW
        K+N I+ L DE G W
Subjt:  KINTIRGLLDEEGSW

A0A5C7HYG1 CCHC-type domain-containing protein2.7e-4026.25Show/hide
Query:  EEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQ-EHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYD
        E  +++L +     +E  +V ++ EE     E ++   ++ K+ + +K+N + FK+ + ++W+      ++  G N+F+  F+       I   GPW++D
Subjt:  EEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQ-EHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYD

Query:  KSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAV
        KS+L+L+ P+G  +  ++ F  V  WI  H +P+ C +R +A  +   LG V     +D L E    ++ +                          + +
Subjt:  KSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAV

Query:  TYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDL-----PYGPMLREP-PKFKGLDSSATMNPESK-HWGWGRGRGRMGSGGRGSWRSADSGAEEILERRK
         YE+LP+FCY CGR+GH  K+C +    S+ DL      YG  +R   P+ + L     +   SK   G    R      G G   + D   + +     
Subjt:  TYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDL-----PYGPMLREP-PKFKGLDSSATMNPESK-HWGWGRGRGRMGSGGRGSWRSADSGAEEILERRK

Query:  DEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLDRENVII-NSSINEGVNLGITKLKFLNGDSVLMKEQSLNG
          E++ + KE  K  ++G   ++K K + T  G GSS+G G       +  G + E    +++NV+  N S   G+      +  L     L      N 
Subjt:  DEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLDRENVII-NSSINEGVNLGITKLKFLNGDSVLMKEQSLNG

Query:  EVKDIAYM-------------ELDTNMEGL-REYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRA-QRELKNKEASELTKLSDIKDKCCFFKVYHLNL-
         +KD+A +              +  N + + RE  + L   + +P    K+ +   KGK  +   A    + N      T  S +K     F    L + 
Subjt:  EVKDIAYM-------------ELDTNMEGL-REYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRA-QRELKNKEASELTKLSDIKDKCCFFKVYHLNL-

Query:  -------LASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNR-MRSGHKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKERE
                 SDHRPIL ++  D+  R+     S  RFE  W K ED  ++V   W+ + + +  ++ L K  +C   L GWS  ++  ++   I+ K RE
Subjt:  -------LASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNR-MRSGHKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKERE

Query:  LHDLLNQSVEQSLMEV-TEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEGSWK
        +  L   S +  +M V  E EK +E LL+ +EI+W QR+R +WL+ GDRN+K+ H +A+  +K NTI+ LL+ EG ++
Subjt:  LHDLLNQSVEQSLMEV-TEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEGSWK

A0A7J6FE65 CCHC-type domain-containing protein1.1e-4127.41Show/hide
Query:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT
        +L + ++    S D E++ RL+++ V  E    V  L +  I+  +KK+   ++ K+   R  N E  +  M  +W   H     E    N+F   F   
Subjt:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT

Query:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV
          +  +   GPW  DK ++    P G     +M F F SFWI  + +PLAC +   A E G  +G +E + +V       + +++++V+I++T+PLKRG+
Subjt:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV

Query:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKEC----------EEDSGT--SDVDLPYGPMLREPPKFKGLDSSATMNP-----ESKHWGWGRGRG
         +     G E  +   YE LPDFC+ CG +GH   +C            DSG   S +  P  P        K  ++S +  P     E+        R 
Subjt:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKEC----------EEDSGT--SDVDLPYGPMLREPPKFKGLDSSATMNP-----ESKHWGWGRGRG

Query:  RMGSGGRGSWRSADSGAEEILERRKDEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMA-TTASGSTVEKTNLDRENVIINSSINEG
        R+   G      A +   E  ER  DEE +       K     AP    G + VT    G      G+P ++      S+     +    V++  + +  
Subjt:  RMGSGGRGSWRSADSGAEEILERRKDEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMA-TTASGSTVEKTNLDRENVIINSSINEG

Query:  VNLGITKLKFLNGDSVLMKEQSLNGEVKDIA------YMELDTNMEGLREYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRAQRELKNKEASELTKLSD
          LG      ++G S   K   + GEV           ME+D   E +++    L+ Q      +     N+ +G+    +R  R   N+E   L     
Subjt:  VNLGITKLKFLNGDSVLMKEQSLNGEVKDIA------YMELDTNMEGLREYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRAQRELKNKEASELTKLSD

Query:  IKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQKYGGS
                +V + +LL SDHR ++A        ++  R     RFE  W K +DC  IV++ W     S   D    ILD    C  +L  W+  K+ GS
Subjt:  IKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQKYGGS

Query:  IRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG
        I   +++ +++L DLL+ S     M EV   E  L +LL  +E YW  R+R DWL  GDRNTK+FH KA+  +K N I  ++ E+G
Subjt:  IRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG

A0A7J6FYQ2 CCHC-type domain-containing protein1.2e-4026.09Show/hide
Query:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT
        +L +  +    S D E++ RL+++ V  E    V  L +  I+  +KK+   ++ K+   R  N E  +  M  +W   H     E    N+F   F   
Subjt:  LLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIWNQEH--TTIEYWGFNLFLCKFKNT

Query:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV
          +  +   GPW  DK ++    P G     +M F F SFWI  + +PLAC +   A E G  +  +E + +V       + +++++V+I++T+PLKRG+
Subjt:  RIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGV

Query:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDLPYGPMLREPPKFKGLDSSATMNPESKHWGWGRGRGRMGSGGRGSWRSADSGA
         +     G E  +   YE LPDFC+ CG +GH   +C                         L     +NP               SG  GSW  A S  
Subjt:  FIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDLPYGPMLREPPKFKGLDSSATMNPESKHWGWGRGRGRMGSGGRGSWRSADSGA

Query:  EEILERRKDEEESQLGKEGPKVGE----NGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLD---RENVIINSSINEGVNLGITKLKFL
               +D   +++   GP + E    +   A E+ + + +       E       + +  +GS + + N D    E V++ SS     ++ +  +  +
Subjt:  EEILERRKDEEESQLGKEGPKVGE----NGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLD---RENVIINSSINEGVNLGITKLKFL

Query:  NGDSVLMKEQ------------SLNGEVKDIAYMELD----TNME---GLREYSDGLKVQEVRPDSEEKILS--NEGKGKLKTWKRAQRELKNKEASELT
          D+   KEQ            S +G+V  +   +L      N +    L+    G +  E+  +S  +  +  N+ +G     +R  R   N+E   L 
Subjt:  NGDSVLMKEQ------------SLNGEVKDIAYMELD----TNME---GLREYSDGLKVQEVRPDSEEKILS--NEGKGKLKTWKRAQRELKNKEASELT

Query:  KLSDIKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQK
                    +V + +L+ SDHRP++A    D   ++  R     RFE  W K +DC  IV++ W     S   D    ILD    C  +L  W+  K
Subjt:  KLSDIKDKCCFFKVYHLNLLASDHRPILAEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKD----ILDKTKECLIRLKGWSHQK

Query:  YGGSIRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG
        + GSI   +++ +++L DLL+ S     M EV   E  L +LL  +E YW  R+R DWL  GDRNTK+FH KA+  +K   I  ++ E+G
Subjt:  YGGSIRGAIKKKERELHDLLNQSVEQSLM-EVTEKEKDLENLLEDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41590.1 unknown protein1.4e-0420.97Show/hide
Query:  EEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIW---NQEHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWF
        +E+   + +L++  E+ A    +  E   + E     +++ +    R  N       +P+ W   NQ H  I    +  FL  F+N      +    PW 
Subjt:  EEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIFTNRKINPEIFKSKMPKIW---NQEHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWF

Query:  YDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLK--RGVFIKSAKTGIEK
        ++     +   + + +        +  W+    +PL   S  + MEI   LG V  +D  D       + +R++V+  +T  L+  + +   S +T    
Subjt:  YDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLK--RGVFIKSAKTGIEK

Query:  WIAVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDLPYGP----MLREPPKFKGLDSSATMNPESK
         I   YE+L   C  C R  H    C           PY P    + RE   F+     ++MN +S+
Subjt:  WIAVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDLPYGP----MLREPPKFKGLDSSATMNPESK

AT3G42140.1 zinc ion binding;nucleic acid binding5.2e-0421.97Show/hide
Query:  IMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSA
        I+  GPW ++  M ++Q  +      + EF+ + FWI    +PL   +      IG                                   + G+F+++ 
Subjt:  IMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQVDLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSA

Query:  KTGIEKWIAVTYEKLPDFCYGCGRLGHIIKEC
               +   YEKL +FC  CG L H   EC
Subjt:  KTGIEKWIAVTYEKLPDFCYGCGRLGHIIKEC

AT4G29090.1 Ribonuclease H-like superfamily protein8.6e-0721.83Show/hide
Query:  WTTADYLLWIWKDKKGENLDERRMAMSLV--ICWLIWEHRNNFIHSRQQLDMEKLKFQIQKYSVELFNVEDSHLIHSSAVSLGAARPETSTATEDPRRSV
        W  + Y+   W    G    +   A  LV  + W +W++RN  +   ++ + +++   +++   +L    +   I + A S G  +P+ + ++    R  
Subjt:  WTTADYLLWIWKDKKGENLDERRMAMSLV--ICWLIWEHRNNFIHSRQQLDMEKLKFQIQKYSVELFNVEDSHLIHSSAVSLGAARPETSTATEDPRRSV

Query:  PCGVWRLSCDATWNAGKSRGGIGWIVRDWSGRLIRAGHRSVSQAWKISWLEAYAVCEGLKELPIHSPQTRIETDALQISKLLANEDEDDTELGNFIMEAH
        P    + + DATWN    R GIGW++R+  G +   G R++ +   +   E  A+   +  L        I     Q+   + N DE    L   I +  
Subjt:  PCGVWRLSCDATWNAGKSRGGIGWIVRDWSGRLIRAGHRSVSQAWKISWLEAYAVCEGLKELPIHSPQTRIETDALQISKLLANEDEDDTELGNFIMEAH

Query:  ALMAAHQIESVVHVARSNNEAAHFLARRA
         L++       V + R  N  A  +AR +
Subjt:  ALMAAHQIESVVHVARSNNEAAHFLARRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAGAAAGAGCGGTTGAGGAAAGGGGAGGAGGAATTGATGCAAAGGTTAGCTCCGAATCTAGAAAGAGGGAAGAGAGAGGAAAGAAAGAGAACATGTCTGAAGAG
AGTAGAGTTGAGAGGAGGGTGTACTCCAGCGTTGGGTTGTCCGGTTCTTCTTGCGGCTGAGTTAATGGCGTCAACGAGCAGTGCAGACGAAGAGGTGTTGAAACGATTGG
CTGATCTGAAGGTTACGCCTGAGGAGAAAGCAAGTGTGTTTCAGTTACAGGAAGAAACAATTGATCTTACAGAGAAGAAGCTTGCTAATGCTGTCTTGTGCAAGATATTT
ACGAACAGGAAAATAAATCCAGAAATCTTCAAGTCCAAGATGCCAAAAATCTGGAACCAAGAGCATACAACTATCGAATACTGGGGATTCAATCTTTTCTTATGCAAGTT
TAAAAATACTCGAATCAAGGGATTTATAATGAATTCTGGGCCATGGTTTTATGATAAGTCGATGTTATTACTACAAGACCCGAAAGGAGATTGCAGTGGAGAAGAAATGG
AGTTCAGGTTTGTCTCTTTTTGGATTCATTTCCATAAATTACCTTTGGCTTGTTTTTCCAGGACTTCAGCAATGGAGATTGGGAGCTTACTCGGGAACGTTGAGCAAGTA
GATCTTGTTGACGCATTAGATGAAAATTGGAGCAGTTCGTTGAGGATCAAAGTCCAGATCGATGTTACAAAACCTTTGAAACGTGGTGTTTTTATAAAATCAGCAAAAAC
AGGGATTGAGAAATGGATTGCAGTTACATACGAGAAGCTGCCTGACTTTTGTTATGGGTGTGGGCGACTAGGTCATATTATTAAAGAGTGTGAGGAAGATAGTGGTACGA
GTGATGTTGATCTCCCATATGGTCCGATGCTTAGGGAACCACCTAAGTTCAAAGGATTAGACTCCTCAGCCACAATGAATCCAGAATCTAAGCATTGGGGATGGGGAAGA
GGGAGAGGTAGGATGGGTAGTGGTGGAAGGGGGAGTTGGAGAAGTGCCGATTCTGGTGCAGAAGAAATTCTAGAAAGGCGGAAGGATGAAGAAGAATCACAGTTGGGCAA
AGAAGGGCCAAAAGTGGGTGAAAATGGAGCTCCGGCGGCGGAGAAAGGAAAAGCGGTGGTAACTCCCAGAGGACAAGGTAGCTCCGAAGGCGGCGGCGGAGAACCGGCAA
TGGCAACCACGGCCTCAGGCTCAACGGTCGAAAAAACGAATTTGGACAGAGAGAATGTTATAATTAATTCCAGCATTAATGAGGGAGTAAATTTAGGAATAACGAAGCTG
AAGTTTTTAAACGGTGATTCAGTCTTAATGAAGGAGCAGTCTTTAAATGGGGAGGTCAAAGATATTGCTTATATGGAATTGGATACTAATATGGAAGGTCTGAGAGAATA
TTCTGACGGGCTGAAAGTCCAAGAAGTGCGGCCCGATTCTGAGGAGAAAATATTGTCGAATGAGGGGAAGGGTAAACTTAAAACTTGGAAACGGGCGCAGAGAGAGTTGA
AGAATAAGGAGGCTTCGGAGCTGACGAAATTGAGTGATATAAAAGACAAATGTTGTTTCTTCAAGGTTTATCATTTAAACCTTCTTGCTTCGGATCATAGACCAATTTTA
GCTGAATGGTCAACGGATCAAGAGTTTCGAAAGGGGTCAAGATCAAATAGTATGAGGCGGTTTGAAGAGGTGTGGACTAAATATGAGGATTGTAGACAAATTGTTCGACA
AGTATGGCAGAATAGGATGAGATCGGGACATAAGGACATTCTGGATAAGACTAAAGAATGTTTGATTCGGCTAAAAGGGTGGAGCCACCAGAAATATGGTGGGTCGATTC
GGGGAGCTATTAAGAAGAAAGAAAGAGAGCTTCACGATCTATTAAACCAGTCAGTTGAGCAGAGTTTGATGGAAGTGACTGAGAAAGAAAAGGATCTCGAAAATCTCCTG
GAAGATGATGAGATTTATTGGCATCAAAGAGCTCGTGAGGATTGGTTAAAATGGGGGGATAGGAATACCAAATGGTTCCATATGAAGGCCAGTCGATGTAGGAAAATCAA
TACGATCAGAGGTTTATTGGATGAAGAGGGTAGTTGGAAAACAAAAGATTTGGAGATGGAGTTTATAACTAGTCAGAGAACTTGGAATGAAGATCTCGTGAGAAAGTCCT
TTTTGGAAGCCGATGCTCTGGCCATCCTGAATATTCCGCTTAATCCTTTTCTAAAGGAGGACACAATTATTTGGGATCTCGACTCTAAGGGGAAGTTTTCTGTCAAAAGT
GGATGGACGACGGCAGATTATCTCTTATGGATTTGGAAGGACAAAAAGGGGGAGAACCTAGATGAGAGGCGGATGGCTATGAGCTTGGTGATTTGCTGGTTAATTTGGGA
GCATAGGAACAACTTCATTCACAGTAGACAGCAACTAGATATGGAGAAACTGAAATTTCAAATCCAAAAATACAGTGTAGAGCTTTTCAATGTAGAGGACTCTCACCTGA
TTCATTCCAGTGCTGTGAGTCTCGGAGCTGCCAGACCGGAGACGTCGACTGCAACAGAGGATCCACGGCGTTCTGTTCCTTGCGGCGTTTGGCGCCTCAGCTGCGATGCA
ACGTGGAATGCTGGAAAATCGCGGGGAGGGATTGGCTGGATTGTGAGAGATTGGAGCGGCAGATTGATTCGTGCGGGACACAGAAGCGTGTCTCAAGCTTGGAAAATCAG
TTGGCTGGAGGCTTATGCGGTTTGTGAAGGTCTGAAAGAGTTGCCGATTCACTCTCCTCAAACTCGGATTGAAACTGATGCTTTGCAAATCTCCAAACTGCTGGCCAACG
AGGATGAGGATGATACCGAACTGGGTAACTTCATAATGGAAGCCCATGCCCTAATGGCTGCTCACCAAATTGAATCTGTGGTCCATGTTGCTAGATCTAATAATGAGGCA
GCCCACTTTCTGGCCCGAAGAGCTTGTGATCTAAATGCCAATGAAAGTTGGGCCCAAGATTTTCCTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGAGAAAGAGCGGTTGAGGAAAGGGGAGGAGGAATTGATGCAAAGGTTAGCTCCGAATCTAGAAAGAGGGAAGAGAGAGGAAAGAAAGAGAACATGTCTGAAGAG
AGTAGAGTTGAGAGGAGGGTGTACTCCAGCGTTGGGTTGTCCGGTTCTTCTTGCGGCTGAGTTAATGGCGTCAACGAGCAGTGCAGACGAAGAGGTGTTGAAACGATTGG
CTGATCTGAAGGTTACGCCTGAGGAGAAAGCAAGTGTGTTTCAGTTACAGGAAGAAACAATTGATCTTACAGAGAAGAAGCTTGCTAATGCTGTCTTGTGCAAGATATTT
ACGAACAGGAAAATAAATCCAGAAATCTTCAAGTCCAAGATGCCAAAAATCTGGAACCAAGAGCATACAACTATCGAATACTGGGGATTCAATCTTTTCTTATGCAAGTT
TAAAAATACTCGAATCAAGGGATTTATAATGAATTCTGGGCCATGGTTTTATGATAAGTCGATGTTATTACTACAAGACCCGAAAGGAGATTGCAGTGGAGAAGAAATGG
AGTTCAGGTTTGTCTCTTTTTGGATTCATTTCCATAAATTACCTTTGGCTTGTTTTTCCAGGACTTCAGCAATGGAGATTGGGAGCTTACTCGGGAACGTTGAGCAAGTA
GATCTTGTTGACGCATTAGATGAAAATTGGAGCAGTTCGTTGAGGATCAAAGTCCAGATCGATGTTACAAAACCTTTGAAACGTGGTGTTTTTATAAAATCAGCAAAAAC
AGGGATTGAGAAATGGATTGCAGTTACATACGAGAAGCTGCCTGACTTTTGTTATGGGTGTGGGCGACTAGGTCATATTATTAAAGAGTGTGAGGAAGATAGTGGTACGA
GTGATGTTGATCTCCCATATGGTCCGATGCTTAGGGAACCACCTAAGTTCAAAGGATTAGACTCCTCAGCCACAATGAATCCAGAATCTAAGCATTGGGGATGGGGAAGA
GGGAGAGGTAGGATGGGTAGTGGTGGAAGGGGGAGTTGGAGAAGTGCCGATTCTGGTGCAGAAGAAATTCTAGAAAGGCGGAAGGATGAAGAAGAATCACAGTTGGGCAA
AGAAGGGCCAAAAGTGGGTGAAAATGGAGCTCCGGCGGCGGAGAAAGGAAAAGCGGTGGTAACTCCCAGAGGACAAGGTAGCTCCGAAGGCGGCGGCGGAGAACCGGCAA
TGGCAACCACGGCCTCAGGCTCAACGGTCGAAAAAACGAATTTGGACAGAGAGAATGTTATAATTAATTCCAGCATTAATGAGGGAGTAAATTTAGGAATAACGAAGCTG
AAGTTTTTAAACGGTGATTCAGTCTTAATGAAGGAGCAGTCTTTAAATGGGGAGGTCAAAGATATTGCTTATATGGAATTGGATACTAATATGGAAGGTCTGAGAGAATA
TTCTGACGGGCTGAAAGTCCAAGAAGTGCGGCCCGATTCTGAGGAGAAAATATTGTCGAATGAGGGGAAGGGTAAACTTAAAACTTGGAAACGGGCGCAGAGAGAGTTGA
AGAATAAGGAGGCTTCGGAGCTGACGAAATTGAGTGATATAAAAGACAAATGTTGTTTCTTCAAGGTTTATCATTTAAACCTTCTTGCTTCGGATCATAGACCAATTTTA
GCTGAATGGTCAACGGATCAAGAGTTTCGAAAGGGGTCAAGATCAAATAGTATGAGGCGGTTTGAAGAGGTGTGGACTAAATATGAGGATTGTAGACAAATTGTTCGACA
AGTATGGCAGAATAGGATGAGATCGGGACATAAGGACATTCTGGATAAGACTAAAGAATGTTTGATTCGGCTAAAAGGGTGGAGCCACCAGAAATATGGTGGGTCGATTC
GGGGAGCTATTAAGAAGAAAGAAAGAGAGCTTCACGATCTATTAAACCAGTCAGTTGAGCAGAGTTTGATGGAAGTGACTGAGAAAGAAAAGGATCTCGAAAATCTCCTG
GAAGATGATGAGATTTATTGGCATCAAAGAGCTCGTGAGGATTGGTTAAAATGGGGGGATAGGAATACCAAATGGTTCCATATGAAGGCCAGTCGATGTAGGAAAATCAA
TACGATCAGAGGTTTATTGGATGAAGAGGGTAGTTGGAAAACAAAAGATTTGGAGATGGAGTTTATAACTAGTCAGAGAACTTGGAATGAAGATCTCGTGAGAAAGTCCT
TTTTGGAAGCCGATGCTCTGGCCATCCTGAATATTCCGCTTAATCCTTTTCTAAAGGAGGACACAATTATTTGGGATCTCGACTCTAAGGGGAAGTTTTCTGTCAAAAGT
GGATGGACGACGGCAGATTATCTCTTATGGATTTGGAAGGACAAAAAGGGGGAGAACCTAGATGAGAGGCGGATGGCTATGAGCTTGGTGATTTGCTGGTTAATTTGGGA
GCATAGGAACAACTTCATTCACAGTAGACAGCAACTAGATATGGAGAAACTGAAATTTCAAATCCAAAAATACAGTGTAGAGCTTTTCAATGTAGAGGACTCTCACCTGA
TTCATTCCAGTGCTGTGAGTCTCGGAGCTGCCAGACCGGAGACGTCGACTGCAACAGAGGATCCACGGCGTTCTGTTCCTTGCGGCGTTTGGCGCCTCAGCTGCGATGCA
ACGTGGAATGCTGGAAAATCGCGGGGAGGGATTGGCTGGATTGTGAGAGATTGGAGCGGCAGATTGATTCGTGCGGGACACAGAAGCGTGTCTCAAGCTTGGAAAATCAG
TTGGCTGGAGGCTTATGCGGTTTGTGAAGGTCTGAAAGAGTTGCCGATTCACTCTCCTCAAACTCGGATTGAAACTGATGCTTTGCAAATCTCCAAACTGCTGGCCAACG
AGGATGAGGATGATACCGAACTGGGTAACTTCATAATGGAAGCCCATGCCCTAATGGCTGCTCACCAAATTGAATCTGTGGTCCATGTTGCTAGATCTAATAATGAGGCA
GCCCACTTTCTGGCCCGAAGAGCTTGTGATCTAAATGCCAATGAAAGTTGGGCCCAAGATTTTCCTAATTGA
Protein sequenceShow/hide protein sequence
MKEKERLRKGEEELMQRLAPNLERGKREERKRTCLKRVELRGGCTPALGCPVLLAAELMASTSSADEEVLKRLADLKVTPEEKASVFQLQEETIDLTEKKLANAVLCKIF
TNRKINPEIFKSKMPKIWNQEHTTIEYWGFNLFLCKFKNTRIKGFIMNSGPWFYDKSMLLLQDPKGDCSGEEMEFRFVSFWIHFHKLPLACFSRTSAMEIGSLLGNVEQV
DLVDALDENWSSSLRIKVQIDVTKPLKRGVFIKSAKTGIEKWIAVTYEKLPDFCYGCGRLGHIIKECEEDSGTSDVDLPYGPMLREPPKFKGLDSSATMNPESKHWGWGR
GRGRMGSGGRGSWRSADSGAEEILERRKDEEESQLGKEGPKVGENGAPAAEKGKAVVTPRGQGSSEGGGGEPAMATTASGSTVEKTNLDRENVIINSSINEGVNLGITKL
KFLNGDSVLMKEQSLNGEVKDIAYMELDTNMEGLREYSDGLKVQEVRPDSEEKILSNEGKGKLKTWKRAQRELKNKEASELTKLSDIKDKCCFFKVYHLNLLASDHRPIL
AEWSTDQEFRKGSRSNSMRRFEEVWTKYEDCRQIVRQVWQNRMRSGHKDILDKTKECLIRLKGWSHQKYGGSIRGAIKKKERELHDLLNQSVEQSLMEVTEKEKDLENLL
EDDEIYWHQRAREDWLKWGDRNTKWFHMKASRCRKINTIRGLLDEEGSWKTKDLEMEFITSQRTWNEDLVRKSFLEADALAILNIPLNPFLKEDTIIWDLDSKGKFSVKS
GWTTADYLLWIWKDKKGENLDERRMAMSLVICWLIWEHRNNFIHSRQQLDMEKLKFQIQKYSVELFNVEDSHLIHSSAVSLGAARPETSTATEDPRRSVPCGVWRLSCDA
TWNAGKSRGGIGWIVRDWSGRLIRAGHRSVSQAWKISWLEAYAVCEGLKELPIHSPQTRIETDALQISKLLANEDEDDTELGNFIMEAHALMAAHQIESVVHVARSNNEA
AHFLARRACDLNANESWAQDFPN