; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022283 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022283
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold2:5288234..5293992
RNA-Seq ExpressionSpg022283
SyntenySpg022283
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]6.7e-4025.42Show/hide
Query:  TVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVCGW
        TVA LI     W E ++ + F  +DA  I+ IP+    + D++IW +D +G +SV+S Y++ ++++       SC    + LW+F WK  IP K+K+  W
Subjt:  TVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVCGW

Query:  KIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGW-LEESRI--AESLLICW
        +  +D+LPT  NL  + V   P+C  C    ET++H + EC   R IW +   L  +  G+ R        C+I+W    Q W  + +++  AE   + W
Subjt:  KIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGW-LEESRI--AESLLICW

Query:  QVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIG
         +WK RN   +  +  +   + ++ + ++ ++ K+ + +            V    G+ ER  +         W  PP G  K+N D + + E +  G+G
Subjt:  QVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIG

Query:  WILRRWDGTPISAGFRVIHKCWKISWLEAFAV-----------------------VVNLLLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMA
         ++R  DG   +A  + +     ++  EA A+                       V++L+  +    TE+     + +  L  FQ     H PR  N  A
Subjt:  WILRRWDGTPISAGFRVIHKCWKISWLEAFAV-----------------------VVNLLLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMA

Query:  HQLARRAWAYNSSEVWHGSFP
        H LA+ A     + +W    P
Subjt:  HQLARRAWAYNSSEVWHGSFP

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]4.3e-3926.22Show/hide
Query:  GRRKRIRRFEECWTKYEECREIVAQVWEAHNLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDS
        G  K+I  F + W    E    +  +    +  VA LI+ +  W+E  +++ F + D ++IL IP+      DE++W +D RG +SV+S Y+L   L+  
Subjt:  GRRKRIRRFEECWTKYEECREIVAQVWEAHNLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDS

Query:  LEASSSCLLQKERLWKFFWKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWAT
           S+SC     + W   W  ++P K+K+  W+  N++LP+  NL  R V   P C  C+   ETI+H + ECK  R IW    Q P     L+ ++   
Subjt:  LEASSSCLLQKERLWKFFWKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWAT

Query:  ADYCEIMWKGSNQGWLEESRIAESLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVS
            + M K      L +S +   + +CW  W  RN   ++ +  +     +  + VL+ + +V    R+ +Q  +              S+  + ++  
Subjt:  ADYCEIMWKGSNQGWLEESRIAESLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVS

Query:  SFWLRPPMGIMKLNCDGSWNEEKKTGGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNLLLGREEDFTELSLFFD-------------------
          WL PP  + K+N D ++N +  + G+G ++R  +G  ++AG          S  EA AV+  L L R  D + L +  D                   
Subjt:  SFWLRPPMGIMKLNCDGSWNEEKKTGGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNLLLGREEDFTELSLFFD-------------------

Query:  ----EARGLLSGFQIHSISHIPRRQNQMAHQLARRAWAYNSSEVWHGSFP
              +  +  FQ   ++HIPR  N  AH LA+ A    S  +W G+ P
Subjt:  ----EARGLLSGFQIHSISHIPRRQNQMAHQLARRAWAYNSSEVWHGSFP

XP_030502765.1 uncharacterized protein LOC115717936 [Cannabis sativa]3.3e-3927.46Show/hide
Query:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC
        NL VA LI     W+   ++  FS+ D   IL IP++     D +IW     GI++V+S Y+L + L +  + +SS  +  E  W  FWK  IPPK+++ 
Subjt:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC

Query:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ
         WK+++  LP    L  R +  +P C +C +  E+I H ++ C   + +W  L  L  DF  L ++  A+AD   I+   S    L  S   + L+ICW 
Subjt:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ

Query:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW
         W  RN +++ N + S +++ ++      +YL   ++ R ++ Q  P T  +AA      S EF +   +  W  PP G +KLN + + +  +   GIG 
Subjt:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW

Query:  ILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNL-----------------------LLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMAH
         LR  DG  ++A  + +   +K   +EA  + ++L                       LL  +   +      +    L+S F    I+H+ R  N  AH
Subjt:  ILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNL-----------------------LLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMAH

Query:  QLARRAWAYNSSEVWHGSFPDWFLNL
         L + A   ++  +W  +FP   + L
Subjt:  QLARRAWAYNSSEVWHGSFPDWFLNL

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]2.9e-4327.93Show/hide
Query:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC
        NL VA LI  +  W+   ++  F++ D + +L IP++P    D +IW     G+++V+S Y   + L +  +  S+C    E  W  FWK  +PPK+++ 
Subjt:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC

Query:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ
         WK+++  LP    L  R +  +P C +C +  ET++H ++ C   + +W +L     DF  ++RS  +TAD   ++        L  S +   L++CW 
Subjt:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ

Query:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW
        +W  RN +++ N + +  ++ ++    L+ + +    + +      P T   AA  S   S EF +   +  W  PP G +KLN D + ++E+ T GIG 
Subjt:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW

Query:  ILRRWDGTPISAGFRVIHKCWKISWLEA--FAVVVNLLLGR--EEDFTE----------------LSLF---FDEARGLLSGFQIHSISHIPRRQNQMAH
        +LR  DG  ++A  +     +K   +EA   A+ +N LL      DF E                LS F    +    L+S F    I H+ R  N  AH
Subjt:  ILRRWDGTPISAGFRVIHKCWKISWLEA--FAVVVNLLLGR--EEDFTE----------------LSLF---FDEARGLLSGFQIHSISHIPRRQNQMAH

Query:  QLARRAWAYNSSEVWHGSFPDWFLNL
         LA+ A   ++  +W G+FP   + L
Subjt:  QLARRAWAYNSSEVWHGSFPDWFLNL

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]1.0e-4027.7Show/hide
Query:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC
        NL VA LI     W+   ++  FS+ D   IL IP++     D +IW     GI++V+S Y+L +   +  + +SS  +  E  W  FWK  +PPK+++ 
Subjt:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC

Query:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ
         WK+++  LP    L  R +  +P C +C +  E+INH +++C   + +W +L  L  DF+ L +S  A+AD   I+   S    L  S     L++CW 
Subjt:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ

Query:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW
         W  RN +++ N + S +++ S+      +YL   +  R ++ Q  P + + A D     S EF +   +  W  PP G +KLN D + ++ +   GIG 
Subjt:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW

Query:  ILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNL-----------------------LLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMAH
         LR  DG  ++A  + +   +K   +EA  + ++L                       LL  +   +      +    L+S F    I+H+ R  N  AH
Subjt:  ILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNL-----------------------LLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMAH

Query:  QLARRAWAYNSSEVWHGSFPDWFLNL
         LA+ A   ++  +W  +FP   + L
Subjt:  QLARRAWAYNSSEVWHGSFPDWFLNL

TrEMBL top hitse value%identityAlignment
A0A803NML1 Uncharacterized protein2.3e-3828.47Show/hide
Query:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC
        +L V+H I  N  WN  ++   F + D   IL IP+      D ++W   P GI+SV++ + L   L+D  + SSS   ++   WKFFW   +PPKI++ 
Subjt:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC

Query:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLL-ICW
         WK++ +ILPT   L  R V  +  C LC +  E+I H ++ CK  + IW K+ +   DF           DY        +   + +    E+LL + W
Subjt:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLL-ICW

Query:  QVWKYRNDVFYNNQIPSQESIHSHIQKVLSTY----LKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKT
         +W  RN V +  Q     +I  +  K    +    L++       +    P+T   A D   +R            W  P     KLN D + N E+K 
Subjt:  QVWKYRNDVFYNNQIPSQESIHSHIQKVLSTY----LKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKT

Query:  GGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAV-----------------------VVNLLLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQ
         GIG ILR   GT ++A  + +   +K   +EA A+                       V N L     D +  S    + R LLS F    ++H+ R  
Subjt:  GGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAV-----------------------VVNLLLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQ

Query:  NQMAHQLARRAWAYNSSEVWHGSFP
        NQ AH LA+ A   +   VW G  P
Subjt:  NQMAHQLARRAWAYNSSEVWHGSFP

A0A803NML1 Uncharacterized protein2.3e-0927.57Show/hide
Query:  EEKASVFQLQDGEIDRFELHLAKSILCKIYTNKKINIEIFKSKMPKIWIQEQTVITNVGFKSFL----CKFKNVRI---------------------KGW
        E++ SVF+  D E     L +   +  KI T KK+ +   +++M + W     V  +     F+    C+   +R+                     K +
Subjt:  EEKASVFQLQDGEIDRFELHLAKSILCKIYTNKKINIEIFKSKMPKIWIQEQTVITNVGFKSFL----CKFKNVRI---------------------KGW

Query:  VLDTWDLGFMAKRYYYSRNPREASAVTKWISAEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDK-WIPVTYEKLPDFC
          D  DL F        R P  +   T    A A+G+++G+ + V   D L+  WG  L +++ +DV KPLKRG  + S +  KDK W+   YE+LP++C
Subjt:  VLDTWDLGFMAKRYYYSRNPREASAVTKWISAEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDK-WIPVTYEKLPDFC

Query:  YGCGWLGHTNKECN
          CG +GH   +C+
Subjt:  YGCGWLGHTNKECN

A0A803P5M6 Uncharacterized protein7.9e-3929.22Show/hide
Query:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC
        +L V+H I  N  WN  ++   F + D   IL IP+      D ++W   P GI+SV++ + L   L+D  + SSS   ++   WKFFW   +PPKI++ 
Subjt:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC

Query:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLL-ICW
         WK++ +ILPT   L  R V  +  C LC +  E+I H ++ CK  + IW KL +   DF           DY        +   + +    E+LL + W
Subjt:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLL-ICW

Query:  QVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIG
         +W  RN V +  Q     +I  +  K    + K +  +R      V AT    +  S   + +   R     W  P     KLN D + N E+K  GIG
Subjt:  QVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIG

Query:  WILRRWDGTPISAGFRVIHKCWKISWLEAFAV-----------------------VVNLLLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMA
         ILR   GT ++A  + +   +K   +EA A+                       V N L     D +  S    + R LLS F    ++H+ R  NQ A
Subjt:  WILRRWDGTPISAGFRVIHKCWKISWLEAFAV-----------------------VVNLLLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMA

Query:  HQLARRAWAYNSSEVWHGSFP
        H LA+ A   +   VW G  P
Subjt:  HQLARRAWAYNSSEVWHGSFP

A0A803P5M6 Uncharacterized protein2.0e-1028.1Show/hide
Query:  EEKASVFQLQDGEIDRFELHLAKSILCKIYTNKKINIEIFKSKMPKIWIQEQTVITNVGFKSFLCKFKNVRIKGWVLDTWDLGFMAKRYYYSRNPREASA
        E++ SVFQ  D E     L +   +  KI T KK+ +   +++M + W     V  +     F+  F     K  VLD     F    +     P     
Subjt:  EEKASVFQLQDGEIDRFELHLAKSILCKIYTNKKINIEIFKSKMPKIWIQEQTVITNVGFKSFLCKFKNVRIKGWVLDTWDLGFMAKRYYYSRNPREASA

Query:  VTK--------WIS-------------AEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDK-WIPVTYEKLPDFCYGCG
         T         W+              A A+G+++G+ + V   D L+  WG  L +++ +DV KPLKRG  + S +  KDK W+   YE+LP++C  CG
Subjt:  VTK--------WIS-------------AEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDK-WIPVTYEKLPDFCYGCG

Query:  WLGHTNKECN
         +GH   +C+
Subjt:  WLGHTNKECN

A0A803P5M6 Uncharacterized protein1.4e-3823.6Show/hide
Query:  EELAQQIADLRVTLEEKASVFQLQDGEIDRFELHLAKSILCKIYTNKKINIEIFKSKMPKIWIQEQTV-ITNVGFKSFLCKF------KNVRIKG-WVLD
        EELA++   +R++  E++++ +L+ G+I + +     S+L ++ T++  N E FK+ +  +W     + + ++    FL  F      + V I+  W  D
Subjt:  EELAQQIADLRVTLEEKASVFQLQDGEIDRFELHLAKSILCKIYTNKKINIEIFKSKMPKIWIQEQTV-ITNVGFKSFLCKF------KNVRIKG-WVLD

Query:  --------------TWDLGFMAKRYYYSRNPREASAVTKWISAEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDKWIP
                        D+ F    ++         ++ K +  E IG+ +G + +VD+  E    WGR L I+++IDV KPL RG  L+ G  GK  W+ 
Subjt:  --------------TWDLGFMAKRYYYSRNPREASAVTKWISAEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDKWIP

Query:  VTYEKLPDFCYGCGWLGHTNKECNDPSGSNEHDLL----YGAWLR-------EPIRLRIE----EDSPYGRRAQNRGRGRGWYGGRGNWSQFDGDGEEEE
          YE LP FCY CG +GH+  EC +   S    ++    +G+WLR       +P R  IE    +D   G  +Q                   GDG+ E 
Subjt:  VTYEKLPDFCYGCGWLGHTNKECNDPSGSNEHDLL----YGAWLR-------EPIRLRIE----EDSPYGRRAQNRGRGRGWYGGRGNWSQFDGDGEEEE

Query:  EHGGQQEVEAVQNRPAEPIPQVAPAPLTDTIQATAKYSKLESCKLVEKSINENNENHGESERNLKGKNKAGVNHGGEMVGILNGKLRDDAAEIRNNGNKL
          G  Q VEA+                       A    +E     E S+    +   + +                M G+ +GK      EI       
Subjt:  EHGGQQEVEAVQNRPAEPIPQVAPAPLTDTIQATAKYSKLESCKLVEKSINENNENHGESERNLKGKNKAGVNHGGEMVGILNGKLRDDAAEIRNNGNKL

Query:  DSSYIEVDATAVGGENKLGNPSGKAYTENEDLNNGSANKGLIKTKGWKRLPRELDR-FLINAKIQSRCNVFKVQHLARVASDHRPILADWKEEPPDQKRG
              + A  +   +    P G  ++E   +    A K     +G +R    L +  L +AK +    V     + RV                    G
Subjt:  DSSYIEVDATAVGGENKLGNPSGKAYTENEDLNNGSANKGLIKTKGWKRLPRELDR-FLINAKIQSRCNVFKVQHLARVASDHRPILADWKEEPPDQKRG

Query:  RRKRIRRFEECWTKYEECREIVA--QVWEAHNLTVAHLI-QPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQ
          + I+ +++ W     C +I++  +V +A N TVA LI    G WN  ++ ++F   DA  I  IP++     D++IW  + RG+FSV++AY   +Q  
Subjt:  RRKRIRRFEECWTKYEECREIVA--QVWEAHNLTVAHLI-QPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQ

Query:  DSLEASSSCLLQKERLWKFFWKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKL-FQLPNDFYGLDRST
        + +  SSS      + W   W   +  KI+   W+   +ILPT T L ++ +  +  C  C  + ET +H++W C   + +WS     +P    G+D + 
Subjt:  DSLEASSSCLLQKERLWKFFWKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKL-FQLPNDFYGLDRST

Query:  WATADYCEIMWKGSNQGWLEESRIAESLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENR
         + +D+ +   +      L    I   + + W +W  RN++ +       E IHS++  + +             +  V A E L   G CE S+  E  
Subjt:  WATADYCEIMWKGSNQGWLEESRIAESLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENR

Query:  RVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNLLLGREEDFTEL-----------SLFFDEARGL
          S  W  P  G  KL+    +  +    G+G +LR  +G  + A   V+ K          AV   L L  E  FT+L            +  D+    
Subjt:  RVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNLLLGREEDFTEL-----------SLFFDEARGL

Query:  LSGFQIHSISHIPRRQNQMAHQLARRAWAYNSSEVWHGSFPDWFL
           F       I    N+ +  LA  A +    +VW    P   L
Subjt:  LSGFQIHSISHIPRRQNQMAHQLARRAWAYNSSEVWHGSFPDWFL

A0A803Q2K8 Uncharacterized protein5.0e-4127.7Show/hide
Query:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC
        NL VA LI     W+   ++  FS+ D   IL IP++     D +IW     GI++V+S Y+L +   +  + +SS  +  E  W  FWK  +PPK+++ 
Subjt:  NLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVC

Query:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ
         WK+++  LP    L  R +  +P C +C +  E+INH +++C   + +W +L  L  DF+ L +S  A+AD   I+   S    L  S     L++CW 
Subjt:  GWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQ

Query:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW
         W  RN +++ N + S +++ S+      +YL   +  R ++ Q  P + + A D     S EF +   +  W  PP G +KLN D + ++ +   GIG 
Subjt:  VWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKKTGGIGW

Query:  ILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNL-----------------------LLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMAH
         LR  DG  ++A  + +   +K   +EA  + ++L                       LL  +   +      +    L+S F    I+H+ R  N  AH
Subjt:  ILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNL-----------------------LLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMAH

Query:  QLARRAWAYNSSEVWHGSFPDWFLNL
         LA+ A   ++  +W  +FP   + L
Subjt:  QLARRAWAYNSSEVWHGSFPDWFLNL

A0A803Q2K8 Uncharacterized protein3.6e-0735.71Show/hide
Query:  ISAEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDKWIPVTYEKLPDFCYGCGWLGHTNKECN
        + AE  G+++G    V   D L+  WG  L  +  ID+ KPL RG  +K     ++ W+   YE+LP+FC+ CG +GH  + C+
Subjt:  ISAEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDKWIPVTYEKLPDFCYGCGWLGHTNKECN

A0A803Q2K8 Uncharacterized protein7.9e-3927.44Show/hide
Query:  NLTVAHLIQPNG-GWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQL---QDSLEASSSCLLQ-KERLWKFFWKTDIPP
        N+ V+ LI  +G  W + ++  L    D +DI  IP++     D ++W F   G+FSVRSAY + + +   ++S E S     +  ++LWK  W   IP 
Subjt:  NLTVAHLIQPNG-GWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQL---QDSLEASSSCLLQ-KERLWKFFWKTDIPP

Query:  KIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSK-LFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAES
        KIK+  W+   DILPT T L  R + V+  C LC  ++ET  HL  +C   R +W   +F LP D   L        D+C++     +   L+ + +   
Subjt:  KIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSK-LFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAES

Query:  LLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKK
         ++ W VW +RN + ++ +      + S   +    YLK E  D + K  +VP   V     S               W+ P  G+ KLN DGSW   + 
Subjt:  LLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGSWNEEKK

Query:  TGGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNLLLGRE-----------------------EDFTELSLFFDEARGLLSGFQIHSISHIPRR
        TGG+G ++R W G  I    + + +C      +A A++  +L  R+                       +D ++L    D+ +  L+ F    ++H+ R 
Subjt:  TGGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNLLLGRE-----------------------EDFTELSLFFDEARGLLSGFQIHSISHIPRR

Query:  QNQMAHQLARRAWAYNSSEVWHGSFPDWF---LNLNILDVG
         N +AH LA    + +S ++     P  F   LN +   VG
Subjt:  QNQMAHQLARRAWAYNSSEVWHGSFPDWF---LNLNILDVG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein2.6e-1822.32Show/hide
Query:  WKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEE
        WK  + PKIK   W+     L T T L +R ++ +P+C  C  + ETI+H+M+ C  T+ +W     +  + +G   S     +    + K      L+ 
Subjt:  WKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEE

Query:  SRIAESLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGS
                I W++WK RN   +  +  S +       +  + +L   ET           T V  A    + S     RR SS W  PP G +K N D  
Subjt:  SRIAESLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGS

Query:  WNEEKKTGGIGWILRRWDGTPISAG------------------FRVIHKCW----KISWLEAFAVVVNLLLGREEDFTELSLFFDEARGLLSGFQIHSIS
        + +       GW +R  +G  +  G                     +   W    +  W E+ +  +  L+   ED + L     + R  +      S+ 
Subjt:  WNEEKKTGGIGWILRRWDGTPISAG------------------FRVIHKCW----KISWLEAFAVVVNLLLGREEDFTELSLFFDEARGLLSGFQIHSIS

Query:  HIPRRQNQMAHQLARRAWAYNSSEVWHGSFPDWFLN
         + R +N  A  LA    A +     + + P W +N
Subjt:  HIPRRQNQMAHQLARRAWAYNSSEVWHGSFPDWFLN

AT2G46460.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.4e-0525.85Show/hide
Query:  WLRPPMGIMKLNCDGSWNEEKKTGGIGWILRRWDGTPISAG------------------FRVIHKCW----KISWLEAFAVVVNLLLGREEDFTELSLFF
        W RPP G  K N DGS+N E  +   GWI+R   G  +SAG                     +  CW    +  W E     V  +L R +   ++  + 
Subjt:  WLRPPMGIMKLNCDGSWNEEKKTGGIGWILRRWDGTPISAG------------------FRVIHKCW----KISWLEAFAVVVNLLLGREEDFTELSLFF

Query:  DEARGLLSGFQIHSISHIPRRQNQMAHQLARRAWAYNSSEVWHGSFP
         + +     FQ    + I R+QN+ A  LA+     N    ++   P
Subjt:  DEARGLLSGFQIHSISHIPRRQNQMAHQLARRAWAYNSSEVWHGSFP

AT3G09510.1 Ribonuclease H-like superfamily protein3.2e-2422.12Show/hide
Query:  LTVAHLIQPNGG---WNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQD----SLEASSSCLLQKERLWKFFWKTDIP
        +T+ +L +  G    W++S + +   + D   I  I +    + D+IIW ++  G ++VRS Y L          ++      +  K R+W       I 
Subjt:  LTVAHLIQPNGG---WNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRGIFSVRSAYRLRIQLQD----SLEASSSCLLQKERLWKFFWKTDIP

Query:  PKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAE-
        PK+K   W+  +  L T   L  RG+ ++P C  C  ++E+INH ++ C      W    +L +    L R+   + D+ E +    N  +++++ +++ 
Subjt:  PKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAE-

Query:  ----SLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEE-TDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGS
             + + W++WK RN+V +N      +   S  + VLS   +  +  +  +  +  P+     A+   E             W  PP   +K N D  
Subjt:  ----SLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEE-TDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFWLRPPMGIMKLNCDGS

Query:  WNEEKKTGGIGWILRRWDGTPISAG------------------FRVIHKCWKISWLEAF-----AVVVNLLLGREEDFTELSLFFDEARGLLSGFQIHSI
        ++ +K     GWI+R   GTPIS G                     + + W   + + F       ++NL+ G     + L+   ++     + F     
Subjt:  WNEEKKTGGIGWILRRWDGTPISAG------------------FRVIHKCWKISWLEAF-----AVVVNLLLGREEDFTELSLFFDEARGLLSGFQIHSI

Query:  SHIPRRQNQMAHQLARRAWAYNSSEVWHGSFPDW
          I R+ N++AH LA+    Y++     GS P W
Subjt:  SHIPRRQNQMAHQLARRAWAYNSSEVWHGSFPDW

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-0640Show/hide
Query:  WKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWEC
        W   I PKIK+  WK  N+ LP    L++R + + P C  CR   ETI H+++ C
Subjt:  WKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWEC

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.8e-0625.89Show/hide
Query:  WKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQV
        W    D LPT   L++ G++++PLC LC    ET +HL+  C  +  IW+ + Q       +    W +      +   S+   +   ++     +C  +
Subjt:  WKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGLDRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQV

Query:  WKYRNDVFYNNQ
        WK RN++ +N Q
Subjt:  WKYRNDVFYNNQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGCAGACGGAAGAGGAGTTAGCACAGCAGATTGCTGATCTAAGAGTCACACTTGAGGAAAAAGCAAGTGTCTTTCAGCTTCAGGATGGCGAAATTGATCGGTT
TGAGCTTCATCTGGCGAAATCGATCCTATGCAAAATTTACACAAATAAAAAGATCAACATTGAGATTTTCAAATCAAAGATGCCCAAGATCTGGATTCAAGAGCAAACAG
TCATTACAAATGTTGGGTTTAAATCATTTCTTTGTAAGTTCAAGAATGTCCGTATCAAAGGTTGGGTGCTGGACACATGGGACCTTGGTTTTATGGCAAAGCGCTACTAT
TATTCGAGGAACCCAAGGGAGGCATCTGCAGTAACGAAATGGATTTCAGCAGAAGCAATTGGTAGCTTACTTGGGAAGGTAGAAAAAGTAGACATCACGGATGAATTAGA
CTCGACATGGGGGAGGTCGCTTATGATTAAAATTCAAATTGATGTTATTAAGCCATTGAAGAGGGGAATTTTCTTGAAATCAGGTACCAAAGGGAAGGATAAGTGGATTC
CGGTAACATATGAAAAACTACCGGACTTTTGTTACGGGTGTGGTTGGTTAGGTCATACTAATAAAGAATGCAATGATCCGTCGGGGTCAAATGAACATGATTTACTATAT
GGGGCGTGGCTTCGTGAACCTATAAGATTAAGAATTGAAGAGGACAGTCCGTATGGGAGACGTGCTCAGAATAGAGGTAGAGGAAGGGGCTGGTATGGGGGTAGAGGGAA
TTGGAGTCAATTCGATGGTGATGGGGAGGAAGAGGAGGAACATGGAGGACAGCAAGAGGTGGAGGCGGTGCAGAATCGACCGGCCGAGCCAATTCCACAAGTCGCTCCGG
CGCCGTTGACGGATACAATCCAAGCAACGGCTAAATATTCAAAATTGGAAAGTTGTAAGTTAGTGGAAAAATCTATTAATGAGAATAACGAGAATCATGGGGAGTCTGAA
AGGAATCTAAAGGGAAAGAATAAGGCAGGCGTAAATCATGGAGGTGAGATGGTTGGAATATTAAATGGGAAATTAAGAGATGATGCTGCGGAAATCCGTAATAATGGCAA
CAAATTAGACAGCTCATACATAGAGGTGGATGCAACAGCCGTTGGGGGTGAAAATAAATTGGGTAATCCAAGTGGCAAGGCTTATACAGAAAATGAGGATTTAAATAATG
GGTCTGCTAATAAGGGGCTTATTAAGACTAAAGGATGGAAAAGGCTTCCTAGAGAACTAGATCGATTTCTCATTAATGCGAAGATACAGAGCCGTTGCAATGTGTTTAAG
GTTCAGCATTTGGCACGAGTAGCGTCAGATCATAGGCCTATTTTAGCAGATTGGAAGGAAGAACCCCCAGACCAGAAAAGAGGTAGACGTAAACGAATTAGGAGATTTGA
GGAGTGCTGGACAAAGTATGAGGAGTGCAGGGAGATTGTGGCCCAGGTTTGGGAAGCTCATAACCTTACTGTGGCTCATCTTATTCAACCGAATGGGGGATGGAATGAGT
CGATGGTTAAAGAGTTGTTTTCTGAAAAAGATGCAAGTGATATCCTTGACATCCCAATAAACCCAATGAACAGGGTAGATGAGATTATCTGGAAGTTTGACCCAAGAGGG
ATCTTCTCAGTTCGAAGTGCTTATCGATTGCGAATCCAGCTACAAGACTCCTTAGAGGCTTCGAGTTCGTGCTTACTGCAGAAGGAACGATTGTGGAAATTTTTTTGGAA
AACTGACATCCCCCCTAAAATCAAAGTTTGTGGATGGAAAATCTATAATGATATCCTTCCAACTTGCACTAATTTGATCAATAGGGGAGTGGAAGTTAATCCACTGTGTC
TGCTTTGCAGGGCCAAATCTGAGACAATAAATCACCTAATGTGGGAATGCAAGGTAACTAGGGGTATTTGGTCTAAACTTTTCCAACTTCCTAATGATTTCTATGGTCTT
GACAGGAGCACTTGGGCGACAGCGGACTACTGTGAGATCATGTGGAAAGGTAGCAACCAAGGATGGTTGGAGGAAAGCAGAATAGCAGAGAGTTTATTAATATGCTGGCA
AGTGTGGAAATACAGGAATGATGTGTTTTACAATAACCAAATCCCTAGCCAAGAGTCAATTCATTCTCATATTCAAAAAGTACTTTCAACATACCTTAAAGTAGAGGAGA
CAGATCGGCGGGAGAAGCAGCAGATGGTGCCGGCGACTGAAGTCCTTGCAGCGGATGGTTCGTGCGAGAGATCTCTGGAGTTTGAGAATCGGAGAGTGTCGAGTTTCTGG
TTGAGGCCGCCTATGGGAATCATGAAGCTTAACTGTGATGGATCGTGGAATGAAGAAAAGAAAACCGGCGGGATTGGCTGGATTTTGCGTCGCTGGGATGGAACTCCAAT
CTCGGCTGGGTTTCGAGTCATTCACAAATGTTGGAAGATCAGTTGGTTGGAAGCCTTTGCAGTGGTGGTAAATTTACTTTTGGGAAGAGAGGAGGATTTTACTGAACTCT
CGCTCTTCTTTGATGAAGCAAGAGGACTTCTATCAGGGTTCCAAATCCATTCGATCAGTCATATTCCCAGGCGCCAAAACCAAATGGCCCATCAATTGGCCCGACGGGCC
TGGGCTTACAATTCTTCTGAAGTTTGGCATGGATCTTTCCCCGATTGGTTTTTAAACCTAAATATTTTAGATGTTGGAATTGATTCGAGTAGTTTTGGGGGTGCCTGTCC
CAGGGCTGGAAGTGTTTTGGGAGTTGTTGCTCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGCAGACGGAAGAGGAGTTAGCACAGCAGATTGCTGATCTAAGAGTCACACTTGAGGAAAAAGCAAGTGTCTTTCAGCTTCAGGATGGCGAAATTGATCGGTT
TGAGCTTCATCTGGCGAAATCGATCCTATGCAAAATTTACACAAATAAAAAGATCAACATTGAGATTTTCAAATCAAAGATGCCCAAGATCTGGATTCAAGAGCAAACAG
TCATTACAAATGTTGGGTTTAAATCATTTCTTTGTAAGTTCAAGAATGTCCGTATCAAAGGTTGGGTGCTGGACACATGGGACCTTGGTTTTATGGCAAAGCGCTACTAT
TATTCGAGGAACCCAAGGGAGGCATCTGCAGTAACGAAATGGATTTCAGCAGAAGCAATTGGTAGCTTACTTGGGAAGGTAGAAAAAGTAGACATCACGGATGAATTAGA
CTCGACATGGGGGAGGTCGCTTATGATTAAAATTCAAATTGATGTTATTAAGCCATTGAAGAGGGGAATTTTCTTGAAATCAGGTACCAAAGGGAAGGATAAGTGGATTC
CGGTAACATATGAAAAACTACCGGACTTTTGTTACGGGTGTGGTTGGTTAGGTCATACTAATAAAGAATGCAATGATCCGTCGGGGTCAAATGAACATGATTTACTATAT
GGGGCGTGGCTTCGTGAACCTATAAGATTAAGAATTGAAGAGGACAGTCCGTATGGGAGACGTGCTCAGAATAGAGGTAGAGGAAGGGGCTGGTATGGGGGTAGAGGGAA
TTGGAGTCAATTCGATGGTGATGGGGAGGAAGAGGAGGAACATGGAGGACAGCAAGAGGTGGAGGCGGTGCAGAATCGACCGGCCGAGCCAATTCCACAAGTCGCTCCGG
CGCCGTTGACGGATACAATCCAAGCAACGGCTAAATATTCAAAATTGGAAAGTTGTAAGTTAGTGGAAAAATCTATTAATGAGAATAACGAGAATCATGGGGAGTCTGAA
AGGAATCTAAAGGGAAAGAATAAGGCAGGCGTAAATCATGGAGGTGAGATGGTTGGAATATTAAATGGGAAATTAAGAGATGATGCTGCGGAAATCCGTAATAATGGCAA
CAAATTAGACAGCTCATACATAGAGGTGGATGCAACAGCCGTTGGGGGTGAAAATAAATTGGGTAATCCAAGTGGCAAGGCTTATACAGAAAATGAGGATTTAAATAATG
GGTCTGCTAATAAGGGGCTTATTAAGACTAAAGGATGGAAAAGGCTTCCTAGAGAACTAGATCGATTTCTCATTAATGCGAAGATACAGAGCCGTTGCAATGTGTTTAAG
GTTCAGCATTTGGCACGAGTAGCGTCAGATCATAGGCCTATTTTAGCAGATTGGAAGGAAGAACCCCCAGACCAGAAAAGAGGTAGACGTAAACGAATTAGGAGATTTGA
GGAGTGCTGGACAAAGTATGAGGAGTGCAGGGAGATTGTGGCCCAGGTTTGGGAAGCTCATAACCTTACTGTGGCTCATCTTATTCAACCGAATGGGGGATGGAATGAGT
CGATGGTTAAAGAGTTGTTTTCTGAAAAAGATGCAAGTGATATCCTTGACATCCCAATAAACCCAATGAACAGGGTAGATGAGATTATCTGGAAGTTTGACCCAAGAGGG
ATCTTCTCAGTTCGAAGTGCTTATCGATTGCGAATCCAGCTACAAGACTCCTTAGAGGCTTCGAGTTCGTGCTTACTGCAGAAGGAACGATTGTGGAAATTTTTTTGGAA
AACTGACATCCCCCCTAAAATCAAAGTTTGTGGATGGAAAATCTATAATGATATCCTTCCAACTTGCACTAATTTGATCAATAGGGGAGTGGAAGTTAATCCACTGTGTC
TGCTTTGCAGGGCCAAATCTGAGACAATAAATCACCTAATGTGGGAATGCAAGGTAACTAGGGGTATTTGGTCTAAACTTTTCCAACTTCCTAATGATTTCTATGGTCTT
GACAGGAGCACTTGGGCGACAGCGGACTACTGTGAGATCATGTGGAAAGGTAGCAACCAAGGATGGTTGGAGGAAAGCAGAATAGCAGAGAGTTTATTAATATGCTGGCA
AGTGTGGAAATACAGGAATGATGTGTTTTACAATAACCAAATCCCTAGCCAAGAGTCAATTCATTCTCATATTCAAAAAGTACTTTCAACATACCTTAAAGTAGAGGAGA
CAGATCGGCGGGAGAAGCAGCAGATGGTGCCGGCGACTGAAGTCCTTGCAGCGGATGGTTCGTGCGAGAGATCTCTGGAGTTTGAGAATCGGAGAGTGTCGAGTTTCTGG
TTGAGGCCGCCTATGGGAATCATGAAGCTTAACTGTGATGGATCGTGGAATGAAGAAAAGAAAACCGGCGGGATTGGCTGGATTTTGCGTCGCTGGGATGGAACTCCAAT
CTCGGCTGGGTTTCGAGTCATTCACAAATGTTGGAAGATCAGTTGGTTGGAAGCCTTTGCAGTGGTGGTAAATTTACTTTTGGGAAGAGAGGAGGATTTTACTGAACTCT
CGCTCTTCTTTGATGAAGCAAGAGGACTTCTATCAGGGTTCCAAATCCATTCGATCAGTCATATTCCCAGGCGCCAAAACCAAATGGCCCATCAATTGGCCCGACGGGCC
TGGGCTTACAATTCTTCTGAAGTTTGGCATGGATCTTTCCCCGATTGGTTTTTAAACCTAAATATTTTAGATGTTGGAATTGATTCGAGTAGTTTTGGGGGTGCCTGTCC
CAGGGCTGGAAGTGTTTTGGGAGTTGTTGCTCCGTGA
Protein sequenceShow/hide protein sequence
MEKQTEEELAQQIADLRVTLEEKASVFQLQDGEIDRFELHLAKSILCKIYTNKKINIEIFKSKMPKIWIQEQTVITNVGFKSFLCKFKNVRIKGWVLDTWDLGFMAKRYY
YSRNPREASAVTKWISAEAIGSLLGKVEKVDITDELDSTWGRSLMIKIQIDVIKPLKRGIFLKSGTKGKDKWIPVTYEKLPDFCYGCGWLGHTNKECNDPSGSNEHDLLY
GAWLREPIRLRIEEDSPYGRRAQNRGRGRGWYGGRGNWSQFDGDGEEEEEHGGQQEVEAVQNRPAEPIPQVAPAPLTDTIQATAKYSKLESCKLVEKSINENNENHGESE
RNLKGKNKAGVNHGGEMVGILNGKLRDDAAEIRNNGNKLDSSYIEVDATAVGGENKLGNPSGKAYTENEDLNNGSANKGLIKTKGWKRLPRELDRFLINAKIQSRCNVFK
VQHLARVASDHRPILADWKEEPPDQKRGRRKRIRRFEECWTKYEECREIVAQVWEAHNLTVAHLIQPNGGWNESMVKELFSEKDASDILDIPINPMNRVDEIIWKFDPRG
IFSVRSAYRLRIQLQDSLEASSSCLLQKERLWKFFWKTDIPPKIKVCGWKIYNDILPTCTNLINRGVEVNPLCLLCRAKSETINHLMWECKVTRGIWSKLFQLPNDFYGL
DRSTWATADYCEIMWKGSNQGWLEESRIAESLLICWQVWKYRNDVFYNNQIPSQESIHSHIQKVLSTYLKVEETDRREKQQMVPATEVLAADGSCERSLEFENRRVSSFW
LRPPMGIMKLNCDGSWNEEKKTGGIGWILRRWDGTPISAGFRVIHKCWKISWLEAFAVVVNLLLGREEDFTELSLFFDEARGLLSGFQIHSISHIPRRQNQMAHQLARRA
WAYNSSEVWHGSFPDWFLNLNILDVGIDSSSFGGACPRAGSVLGVVAP