CuGenDBv2

Spg019100 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019100
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold1:46327850..46335683
RNA-Seq Expression: Spg019100
Synteny: Spg019100
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type
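The genome location field of this record packs a sequence name and a coordinate range into one string. A minimal Python sketch of how such a string might be split into its parts follows. The names `GenomeLocation` and `parse_location` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a location string such as 'scaffold1:46327850..46335683'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(seqid, start, end)


loc = parse_location("scaffold1:46327850..46335683")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, `length` gives the span of the gene on the scaffold, which can be longer than the CDS when the gene model contains introns or untranslated regions.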


Homology
GenBank top hits | e value | % identity | Alignment
KMS97072.1 hypothetical protein BVRB_7g179330 [Beta vulgaris subsp. vulgaris] | e value 1.0e-35 | % identity 22.92
Query:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS
        L+D+G+    FTW  +  D ++++ERLDR LA+    D++ +  + +L    SDH PIV++L  +  +       +  +FE  W++  ++  ++K  W  
Subjt:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS

Query:  GVELGIHNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSRKW
           LG+  +   +K C  KL +W+                                                ++Y K   R  WLK            + 
Subjt:  GVELGIHNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSRKW

Query:  RWKRVADLLD-DMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTS-----VWKSIRRANN
           RVADL+D +   W ++ +   F   D   I  +    +   D I W  +  G+F VR AY L         A E++N+A++S     +WK I     
Subjt:  RWKRVADLLD-DMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTS-----VWKSIRRANN

Query:  IPKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDRAITIL
         PKV    W+   DI+P   N+ KK       C  C    E ++H F  C++ ++ W +              +W     W  +     KE+Q+  +  +
Subjt:  IPKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDRAITIL

Query:  WSLWNTRN---ISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCK
        W LW  RN     N   S     +    I R    ++  K+  +K ++ +       W   +P   K+N D +   +  + G+G   RD NG ++    +
Subjt:  WSLWNTRN---ISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCK

Query:  QFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEI-HRISETEQISFAKCPRTVNSLAHELARAVTLNS
             W  +  EA A+L   E  +     A+    +V+ESD+  +I A+N  ++   ++    E++ + +   E I F+ C R  N +AH LA+  T N 
Subjt:  QFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEI-HRISETEQISFAKCPRTVNSLAHELARAVTLNS

Query:  NWEVFFGNSLPLCEEDEAFWREFEFPFWFCDLLAKET
          EV+                    P W  DL+  ++
Subjt:  NWEVFFGNSLPLCEEDEAFWREFEFPFWFCDLLAKET

PWA36168.1 hypothetical protein CTI12_AA602590 [Artemisia annua] | e value 3.4e-44 | % identity 23.8
Query:  REEQYWQRLQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYRE
        RE   +  L+D  +     TW    R    +++RLDRFL  +   D++ D   ++L +  SDH PI+  L+   K K     ++  +FE  W+  +    
Subjt:  REEQYWQRLQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYRE

Query:  IIKNHWNSGVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRN
        ++++ W  G+  G+ H+    +  C ++L+DWNK R  G +  +IK K+R +Q++ +  D     +      ++  LL ++EL WK RSR +WL+ GD+N
Subjt:  IIKNHWNSGVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRN

Query:  TKWFHSR----------------KWRW----------------------------------------------------KRVADLLDDMGDWIEDDVQRA
        T++FH+R                  RW                                                      V DLL+  GD    ++  +
Subjt:  TKWFHSR----------------KWRW----------------------------------------------------KRVADLLDDMGDWIEDDVQRA

Query:  FLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWKILKDIIPTKANISKKGINT
          P +  + +     SK  ND + W+ NP G F+ +SAY LA +       + + +++    W+ + +A    KVK+  W+   + +PT  N+  +G+N 
Subjt:  FLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWKILKDIIPTKANISKKGINT

Query:  NGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNK--EEQDRAITILWSLWNTRNISNQTQSPPKFGQTCRSIY
          +C  C    E+ VH+ +KC  +K +W     N  +F     +    I + D    IL K   E +  + ILW LW  RN     Q   + G       
Subjt:  NGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNK--EEQDRAITILWSLWNTRNISNQTQSPPKFGQTCRSIY

Query:  RFLEGYSEGKKTNLKSSQLKNHSSHQ-CWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQ
          L  Y +  +    S     H+ H   W     +  K+N D +W ++S K G+G+  R+  G ++  G +        +C  +      +EA  KA + 
Subjt:  RFLEGYSEGKKTNLKSSQLKNHSSHQ-CWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQ

Query:  AERRF------KLVVESDSSDVIGALNRDIEDLSEMACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFGNSLPLC
        A  R        ++ E++S  ++ AL   +  L   + F+E + ++      +++   R  N LAH +A ++ L+ + +     S+P C
Subjt:  AERRF------KLVVESDSSDVIGALNRDIEDLSEMACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFGNSLPLC

PWA65244.1 hypothetical protein CTI12_AA315820 [Artemisia annua] | e value 1.0e-35 | % identity 26.2
Query:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS
        L+D+ +    +TW    R  S +K+RLDRFLA  +  ++       +L +  SDH PI+  L    K K      K  +FE  W+  E   E+++  W S
Subjt:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS

Query:  GVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSRK
         +  G+ H+  + +K C  KL  WN     G +  +IK K+  ++     S V+  +  D         +   E +W    R    K       +     
Subjt:  GVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSRK

Query:  WRWKRVADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVK
             V DLL+D GD    ++  +  P D    +     S+ + D I W  +  G F+ ++AY LA D+    + +++E      +W +I +AN   K+K
Subjt:  WRWKRVADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVK

Query:  ITAWKILKDIIPTKANISKKGINTNGACCLCMAHL-EDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRIL---NKEEQDRAITILW
        +  W  L++ +PT  N+  + +  + A C C   L ED +H+ ++C  +K++W+    + L        + N  D  + +C ++   N    +  I ILW
Subjt:  ITAWKILKDIIPTKANISKKGINTNGACCLCMAHL-EDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRIL---NKEEQDRAITILW

Query:  SLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLK--SSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQF
         LW  RN     Q   +  Q        L  Y    K NL   S+++K   S   W+   P+  K NSD SW +++ +  +G+  R+ NG ++  G +  
Subjt:  SLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLK--SSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQF

Query:  VRKWNIKCMEAKAILEGIEAYLKASNQAERR--FKLVVESDSSDVIGALNRDIEDLSEMAC
        V  + +  +EA+A     +A + A   A  R    +V ESDS  ++ AL R+   L ++AC
Subjt:  VRKWNIKCMEAKAILEGIEAYLKASNQAERR--FKLVVESDSSDVIGALNRDIEDLSEMAC

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value 1.2e-36 | % identity 28.82
Query:  VADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWK
        VADL+D    W  D +++ F+ ED E IL++   S +  D ++W+ + KG ++V+S Y LA + +  ++   S +S+   +WK     +   KVKI  W+
Subjt:  VADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWK

Query:  ILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKE---EQDRAITILWSLWNTR
         LK+I+PT  N+ K+       C  C   +E   H+  +CK ++KIW     +L   +     D N  D++  +  + ++    E +  I   W +W+ R
Subjt:  ILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKE---EQDRAITILWSLWNTR

Query:  N--ISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNI
        N  I    +S  +F          L+ Y    K              Q WKP + +  KLN D +   K +K G+G  +RD+ G ++ VG KQ   +  +
Subjt:  N--ISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNI

Query:  KCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEIHRIS-ETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFG
           EA+AI  G++   + S+ +     L+VESD  +V+  LN      +E+     ++ R S E +Q+ F+  PRT N+ AH LA+    NS+ +V+ G
Subjt:  KCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEIHRIS-ETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFG

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value 2.1e-17 | % identity 29.87
Query:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSL-TCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWN
        L D+G     FTW  +    ++I+ERLDR L + +    F++L    L    SDH PI+  +  C  K   K+       +E  W  +E    I+++ W 
Subjt:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSL-TCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWN

Query:  S----GVELGIHNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGS-----DVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDR
        S      E  +  F    K  +  L  W+K   +G      K+K+ E+   L  +       I   ++ + E ++  +L  +E+YWK RSR DWLK GD+
Subjt:  S----GVELGIHNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGS-----DVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDR

Query:  NTKWFHSR---KWRWKRVADLLDDMGDWIED
        NTK+FHS+   + R  ++  + DD G+W++D
Subjt:  NTKWFHSR---KWRWKRVADLLDDMGDWIED

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value 7.6e-36 | % identity 24.01
Query:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS
        L D+G     FTW  K      + ERLD+  A  E R+ F    + H +   SDH PI++SL      +   K+H+  +FE  W   E+   IIK  W +
Subjt:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS

Query:  GVELGIH-NFGSKLKSCIHKLNDWNKIRLQG--SITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFH-
                     L  C   L  W++ +        S +K + + IQ   A        + ++   +L+ L EQDE+YW  RSR ++LK+GD ++++FH 
Subjt:  GVELGIH-NFGSKLKSCIHKLNDWNKIRLQG--SITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFH-

Query:  --SRKWRWKRVADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSA--NTSVWKSIRRA
          +++ +   +  L +D  +W+                       K  +DA++W  +  G +TV+  Y LA   +  ++  +  +S   +T++WK I R 
Subjt:  --SRKWRWKRVADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSA--NTSVWKSIRRA

Query:  NNIPKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDRAIT
         + P+ K   W+ +++ I TK N+  +    +  C LC   +E   H+  +C++++ +W     + LSF +  +N  +   + +G+      +  D   T
Subjt:  NNIPKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDRAIT

Query:  ---ILWSLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNH--SSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIE
           I W++W  RN    +Q  P   +T +  ++ + G       + ++   +NH  S+   W+    D WK N D ++    +       +RD+NGS++E
Subjt:  ---ILWSLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNH--SSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIE

Query:  VGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALN
        V   + ++ ++    EA AI   +     +  +A  +   ++ESD   ++  L+
Subjt:  VGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALN

TrEMBL top hits | e value | % identity | Alignment
A0A2U1KHJ0 CCHC-type domain-containing protein | e value 1.7e-44 | % identity 23.8
Query:  REEQYWQRLQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYRE
        RE   +  L+D  +     TW    R    +++RLDRFL  +   D++ D   ++L +  SDH PI+  L+   K K     ++  +FE  W+  +    
Subjt:  REEQYWQRLQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYRE

Query:  IIKNHWNSGVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRN
        ++++ W  G+  G+ H+    +  C ++L+DWNK R  G +  +IK K+R +Q++ +  D     +      ++  LL ++EL WK RSR +WL+ GD+N
Subjt:  IIKNHWNSGVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRN

Query:  TKWFHSR----------------KWRW----------------------------------------------------KRVADLLDDMGDWIEDDVQRA
        T++FH+R                  RW                                                      V DLL+  GD    ++  +
Subjt:  TKWFHSR----------------KWRW----------------------------------------------------KRVADLLDDMGDWIEDDVQRA

Query:  FLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWKILKDIIPTKANISKKGINT
          P +  + +     SK  ND + W+ NP G F+ +SAY LA +       + + +++    W+ + +A    KVK+  W+   + +PT  N+  +G+N 
Subjt:  FLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWKILKDIIPTKANISKKGINT

Query:  NGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNK--EEQDRAITILWSLWNTRNISNQTQSPPKFGQTCRSIY
          +C  C    E+ VH+ +KC  +K +W     N  +F     +    I + D    IL K   E +  + ILW LW  RN     Q   + G       
Subjt:  NGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNK--EEQDRAITILWSLWNTRNISNQTQSPPKFGQTCRSIY

Query:  RFLEGYSEGKKTNLKSSQLKNHSSHQ-CWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQ
          L  Y +  +    S     H+ H   W     +  K+N D +W ++S K G+G+  R+  G ++  G +        +C  +      +EA  KA + 
Subjt:  RFLEGYSEGKKTNLKSSQLKNHSSHQ-CWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQ

Query:  AERRF------KLVVESDSSDVIGALNRDIEDLSEMACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFGNSLPLC
        A  R        ++ E++S  ++ AL   +  L   + F+E + ++      +++   R  N LAH +A ++ L+ + +     S+P C
Subjt:  AERRF------KLVVESDSSDVIGALNRDIEDLSEMACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFGNSLPLC

A0A2U1MVM0 Uncharacterized protein | e value 4.8e-36 | % identity 26.2
Query:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS
        L+D+ +    +TW    R  S +K+RLDRFLA  +  ++       +L +  SDH PI+  L    K K      K  +FE  W+  E   E+++  W S
Subjt:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS

Query:  GVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSRK
         +  G+ H+  + +K C  KL  WN     G +  +IK K+  ++     S V+  +  D         +   E +W    R    K       +     
Subjt:  GVELGI-HNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSRK

Query:  WRWKRVADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVK
             V DLL+D GD    ++  +  P D    +     S+ + D I W  +  G F+ ++AY LA D+    + +++E      +W +I +AN   K+K
Subjt:  WRWKRVADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVK

Query:  ITAWKILKDIIPTKANISKKGINTNGACCLCMAHL-EDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRIL---NKEEQDRAITILW
        +  W  L++ +PT  N+  + +  + A C C   L ED +H+ ++C  +K++W+    + L        + N  D  + +C ++   N    +  I ILW
Subjt:  ITAWKILKDIIPTKANISKKGINTNGACCLCMAHL-EDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRIL---NKEEQDRAITILW

Query:  SLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLK--SSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQF
         LW  RN     Q   +  Q        L  Y    K NL   S+++K   S   W+   P+  K NSD SW +++ +  +G+  R+ NG ++  G +  
Subjt:  SLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLK--SSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQF

Query:  VRKWNIKCMEAKAILEGIEAYLKASNQAERR--FKLVVESDSSDVIGALNRDIEDLSEMAC
        V  + +  +EA+A     +A + A   A  R    +V ESDS  ++ AL R+   L ++AC
Subjt:  VRKWNIKCMEAKAILEGIEAYLKASNQAERR--FKLVVESDSSDVIGALNRDIEDLSEMAC

A0A803P5M6 Uncharacterized protein | e value 5.3e-35 | % identity 22.67
Query:  FTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNSGVELGIHNFG
        FTW +   + + +KERLD     +    IFK +   HL+ ++SDHR I V++   + ++ +       +FE  W++  D   II+ HW+     G+  F 
Subjt:  FTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNSGVELGIHNFG

Query:  SKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSD--VIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFH----SRK-----
        S L+SC   L  W+ IR  G++   I   ++++  +   +D  V    +L  +E  LD LLEQ+E YW  RSR DWL+ GD+NT +FH    SRK     
Subjt:  SKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSD--VIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFH----SRK-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------WR----W--------------------------------KRVADLLDDMG--------DWIEDD--------VQ
                                  W+    W                                K+  D L+  G         WI           + 
Subjt:  --------------------------WR----W--------------------------------KRVADLLDDMG--------DWIEDD--------VQ

Query:  RAFLPE----------------------DAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVK
         A LP                       D + IL +P       D ++W+ +P GI++V++ +HLA  L    Q S S ++  +  WK        PK++
Subjt:  RAFLPE----------------------DAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVK

Query:  ITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSF---LNSGRNDWNPIDYWDGMCRILNKEEQDRAITILWS
        I AWK+ ++I+PT   + K+ +  +G C LC ++ E   H  + CK +K IW      L  F    +   N +N  DY   +  I  + + +  + +LW 
Subjt:  ITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSF---LNSGRNDWNPIDYWDGMCRILNKEEQDRAITILWS

Query:  LWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNHSSH----------QCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLI
        +W  RN       P       +   +F E +++ K  N   +   +H S           Q W+P   + +KLN D +   + +K G+G  +RD  G+++
Subjt:  LWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNHSSH----------QCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLI

Query:  EVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLV-VESDSSDVIGALNRDIEDLSEMACFTEEIHRI----SETEQISFAKCPRTVNSLAHE
            K     +    MEAKA+   +         ++ +F L  +E+D+S V  ALNR   DLS   CF++ I  I    S   Q+      RT N  AH 
Subjt:  EVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLV-VESDSSDVIGALNRDIEDLSEMACFTEEIHRI----SETEQISFAKCPRTVNSLAHE

Query:  LAR
        LA+
Subjt:  LAR

A0A803P5M6 Uncharacterized protein | e value 2.5e-08 | % identity 42.5
Query:  AVALAKSIGEFVEAESDEKGKMEGETLRVRVKLNVSKPLRRGTNIKAGTMAEKKWIRVTYEKLPDFCYYCGRLGHVDQEC
        A+AL   IGE+ +   D   +  G  LRVRV L+VSKPL+RG  I    + +K W+   YE+LP++C  CG +GH   +C
Subjt:  AVALAKSIGEFVEAESDEKGKMEGETLRVRVKLNVSKPLRRGTNIKAGTMAEKKWIRVTYEKLPDFCYYCGRLGHVDQEC

A0A803P5M6 Uncharacterized protein | e value 2.2e-33 | % identity 24.35
Query:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS
        L+D+ +    FTW    R  + +++RLDRFL   +  D+F      +L +  SDH PI+  L    K           +FE  W+  E   E++++ W +
Subjt:  LQDVGSGAGIFTWERKTRDGSWIKERLDRFLATNELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNS

Query:  GVELGIHNFGSKL-KSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSR-
         +  G+ N    + + C  +L++WNK    G +   +K K+R +Q +    D     +      E+  LL ++E  WK RSR  WL  GD+NT++FHSR 
Subjt:  GVELGIHNFGSKL-KSCIHKLNDWNKIRLQGSITSAIKRKEREIQSILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSR-

Query:  --KWRWKRVADLLDDMGDWIEDDVQRAFL-------------PEDAETILRMPRCSKRTNDAIIWNLNPKGIFT----------------VRSAYH----
          + +  R+  L D+ G W+E+D     L             P+D ++++     S   ND I   ++                      VR   H    
Subjt:  --KWRWKRVADLLDDMGDWIEDDVQRAFL-------------PEDAETILRMPRCSKRTNDAIIWNLNPKGIFT----------------VRSAYH----

Query:  ------LARDLSATSQASESENSANTSVWKSIRRANNI---PKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEF
              + +DL          +  N +VW+    A+     PK   T    ++D++ ++                     ED VH+ +KC  +K +W   
Subjt:  ------LARDLSATSQASESENSANTSVWKSIRRANNI---PKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEF

Query:  FPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQ---DRAITILWSLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSE-GKKTNLKSSQLKNHSSHQCWK
          +  SF      D          CR++        D  + ILW LW  RN     Q   +         + L  YS   KK   +   L   +S   WK
Subjt:  FPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQ---DRAITILWSLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSE-GKKTNLKSSQLKNHSSHQCWK

Query:  PSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEM
            D  K N D +W ++S K G+G+  R+ NG ++  G K      +    EAKAI     A + A N+      +V ESDS  +I AL      L  +
Subjt:  PSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEM

Query:  ACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFGNSLPLC
        + F + + +  +     ++   R  N +AH +A +  L     V     +P C
Subjt:  ACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWEVFFGNSLPLC

A0A803QEG9 Uncharacterized protein | e value 1.5e-32 | % identity 27.68
Query:  VADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSV---WKSIRRANNIPKVKIT
        VA+L+ D   W    +Q+ F P D E IL +P     T D +IW+ +  G FTV+SAYHL     ATS  +E  +S++TS    WK         KVKI 
Subjt:  VADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSV---WKSIRRANNIPKVKIT

Query:  AWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDRAITILWSLWNTR
        AW+++ D +P   ++ ++ I T+  C +C    E + H  + CK++K +W  F  N   F  S        DY   +  I NK E +    I+W++W  R
Subjt:  AWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDRAITILWSLWNTR

Query:  NISNQTQSPPKFGQTCRSIYRFLEGYSEGK-----------------KTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGS
        N     ++  +          FL+ +   +                  +   S      ++   W+P A DC+KLN+D +    S   GVG  +RD++GS
Subjt:  NISNQTQSPPKFGQTCRSIYRFLEGYSEGK-----------------KTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGS

Query:  LIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEIHR-ISETEQISFAKCPRTVNSLAHELA
        +        +  +    MEAKA+   +   L+            +E+D+  V+ AL       SE +    ++H  +S    +S +   RT N+ AH LA
Subjt:  LIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEIHR-ISETEQISFAKCPRTVNSLAHELA

Query:  R
        +
Subjt:  R

A0A803QEG9 Uncharacterized protein | e value 1.3e-09 | % identity 31.48
Query:  FKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHW-NSGVELGIHNFGSKLKSCIHKLNDWNKIRLQGSITSAIKR
        F   ++ HL+   SDHR +  S    S      K    ++FE  W+   + ++II   W N      I    +    C  KL  W+  +  G +   IK 
Subjt:  FKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHW-NSGVELGIHNFGSKLKSCIHKLNDWNKIRLQGSITSAIKR

Query:  KEREIQSI--LAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSR
         + +++ +            DL  AE  LD LLEQ+E+YW+ RSR DWL  GD+NTK+FH++
Subjt:  KEREIQSI--LAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSR

A0A803QEG9 Uncharacterized protein | e value 3.9e-09 | % identity 40.59
Query:  KKYAVALAKSIGEFVEAESDEKGKMEGETLRVRVKLNVSKPLRRGTNIKAGTMAEKKWIRVTYEKLPDFCYYCGRLGHVDQEC----EEEGSDNNSKRDY
        K  A AL   IGEF+E   D   +  G  LRVRVKL  +KPL RG  I+   + ++ W+   YE+LP+FC+ CG LGH  + C    E   + N+   +Y
Subjt:  KKYAVALAKSIGEFVEAESDEKGKMEGETLRVRVKLNVSKPLRRGTNIKAGTMAEKKWIRVTYEKLPDFCYYCGRLGHVDQEC----EEEGSDNNSKRDY

Query:  G
        G
Subjt:  G

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value 1.5e-10 | % identity 21.65
Query:  ACCLCMAHLEDSVHLFWKCKFSKKIW----IEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQ--------DRAITILWSLWNTRNISNQTQSPPKF
        +C  C    E   HL +KC F++ +W    I  +P           +W     +  +  +LN E +        +    +LW LW +RN           
Subjt:  ACCLCMAHLEDSVHLFWKCKFSKKIW----IEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQ--------DRAITILWSLWNTRNISNQTQSPPKF

Query:  GQTCRSIYRFLEGYSEGKKTNLKSS--QLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGI
         +  R      E +S  ++   K+S  Q++ + S Q WK       K N+D +W  ++ + G+GW +R+ +G ++ +G +   R  N+   E +A+   +
Subjt:  GQTCRSIYRFLEGYSEGKKTNLKSS--QLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGI

Query:  EAYLKASNQAERRFKLVVESDSSDVIGALNRD---------IEDLSEMACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWE
            + + +     +++ ESD+  ++  LN D         +ED+ ++         +   E++ F   PR  N +A  +AR     SN++
Subjt:  EAYLKASNQAERRFKLVVESDSSDVIGALNRD---------IEDLSEMACFTEEIHRISETEQISFAKCPRTVNSLAHELARAVTLNSNWE

AT3G09510.1 Ribonuclease H-like superfamily protein | e value 2.5e-16 | % identity 22.92
Query:  WIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWKILKDIIPTKA
        W +  + +     D   I R+     +  D IIWN N  G +TVRS Y L     +T+  + +    +  +   I     +PK+K   W+ L   + T  
Subjt:  WIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKGIFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWKILKDIIPTKA

Query:  NISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDR--------AITILWSLWNTRN---I
         ++ +G+  + +C  C    E   H  + C F+   W       LS  +  RN     D+ + +  ILN  +            + ++W +W  RN    
Subjt:  NISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRNDWNPIDYWDGMCRILNKEEQDR--------AITILWSLWNTRN---I

Query:  SNQTQSPPKFGQTCRS-IYRFLEGYSEGKKTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCM
        +   +SP K   + ++  + +L      KKT   + Q+  +     W+       K N D  +  +  +   GW IR+  G+ I  G  +     N    
Subjt:  SNQTQSPPKFGQTCRS-IYRFLEGYSEGKKTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEVGCKQFVRKWNIKCM

Query:  EAKAILEGI-EAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEI-HRISETEQISFAKCPRTVNSLAHELAR
        E KA+L  + + +++   Q      + +E D   +I  +N  I   S +A   E+I    ++   I F    R  N LAH LA+
Subjt:  EAKAILEGI-EAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEI-HRISETEQISFAKCPRTVNSLAHELAR

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value 2.6e-05 | % identity 23.08
Query:  SATSQASESENSANTSVW-KSIRRANNIPKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRN
        SA    S     ++T  W K++   N++PK     W +  + + T+  +   G++    C LC AH +   HLF++C+FS  +W         F  +  N
Subjt:  SATSQASESENSANTSVW-KSIRRANNIPKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNSGRN

Query:  DWNPIDYWDGMCRILNKEEQDRAITIL--------WSLWNTRN------ISNQTQS
           P    D +  +L+   +     I+        +++W  RN      +S  T+S
Subjt:  DWNPIDYWDGMCRILNKEEQDRAITIL--------WSLWNTRN------ISNQTQS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value 6.5e-09 | % identity 24.88
Query:  ILWSLWNTRN--ISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKN--HSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEV
        ++W +W + N  + N T++  KF  T        + + +   TN + +  +N   S +  W P   D  K N D S  E++   G+GW +R+S G++IE 
Subjt:  ILWSLWNTRN--ISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKN--HSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGVGWAIRDSNGSLIEV

Query:  GCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEIHR-ISETEQISFAKCPRTVNSLAHELAR-AV
        G  +F  +   +  E   ++  I+A     ++     K++ E D+  +   +N    +   +  F + I   I   E I F+   R  N  A  LA+ A+
Subjt:  GCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEIHR-ISETEQISFAKCPRTVNSLAHELAR-AV

Query:  TLNSNWEVF
          N+ W +F
Subjt:  TLNSNWEVF


Sequences
CDS sequence
ATGGGTGGGAATAGTAGGTATTTGGGTTGGATTCGGAATGGGAATGACAATGGGAATGGTGATTGGGGTACTGAGGGTTACAGTTGGAGGACTTTTGCAGATTGGAGGGA
CAGCATCTTGAGTTTCACGATGGAGAGGACGGATACTGCAGAGGAAGTTCAACGGCAAATGGAAAAGCTGGGATTAGAAGAGGAAGAAAGCGGAAGGGTGGTCGAGATTA
AAGATGACGATATCGACGAAACAGACAAAGACCACCAAACTGCTACGGCCTGCAAAATCTTATCAACACAAACAATCAATGCAGATGGATTCTCGAATCTCATGCCCAAA
ATATGGGGATTGAAGGGGAACGCCAAGATCGTAAAGAAATATGCGGTGGCCTTGGCAAAATCGATAGGAGAATTTGTGGAGGCAGAATCAGATGAAAAAGGGAAAATGGA
GGGGGAAACTCTACGAGTTAGAGTTAAGCTCAACGTCTCCAAACCGTTAAGAAGAGGGACAAACATCAAAGCTGGAACAATGGCGGAAAAGAAATGGATAAGAGTCACCT
ACGAAAAATTGCCAGACTTTTGTTATTACTGTGGTAGACTCGGCCATGTCGATCAGGAATGCGAAGAGGAAGGATCTGACAACAATTCAAAAAGGGACTATGGGGTGGAT
CTAAGAGAAACACACAGCAACGAAAAAGATAAAAGATCTGAGGAAAGCGAGGAAAGACCTGAAAACACCGTCTGGGAAGAAGAGGCAACGCCGGAAATTCTTGATCAGAG
ACTGGGGGAGGAAGGTACGGAGGCGAGGAGAGAGAACAGAAACCAGACAAATCGAATAGGTCTGACAACCGGATCAGGGGATAAGACAGAGGAAAGAGAGGAACAGTACT
GGCAAAGATTACAAGATGTGGGTAGTGGTGCTGGGATATTTACTTGGGAAAGGAAAACAAGAGATGGGTCCTGGATAAAAGAGAGGCTGGACAGATTTTTAGCTACCAAT
GAATTAAGAGATATATTCAAAGATCTCAGAATTGACCATTTGAACAAGCACAATTCGGACCACAGACCCATTGTGGTGTCCTTAACCTGTAATTCTAAGGATAAGGGCAA
GAGGAAGCTGCATAAAAATATTAAGTTTGAAGGGGGCTGGGTGGAGTTTGAAGACTACAGAGAGATCATAAAGAATCATTGGAATAGCGGCGTTGAGTTGGGGATTCATA
ATTTCGGCAGCAAGCTCAAGTCTTGTATCCACAAGCTAAATGATTGGAATAAAATCAGATTGCAAGGGTCGATCACCTCAGCCATAAAAAGGAAAGAAAGGGAGATCCAG
AGCATTTTAGCAGGCAGCGATGTGATTAAAGACAGGGACCTTGACAGGGCAGAAAGAGAGTTGGATTATCTTCTTGAGCAAGATGAGCTTTATTGGAAGTTTAGATCTCG
CGAGGATTGGCTAAAATGGGGAGACCGTAACACGAAGTGGTTTCATTCTAGGAAATGGAGATGGAAAAGAGTTGCAGACCTCCTTGACGATATGGGCGATTGGATTGAAG
ATGATGTCCAAAGGGCGTTCCTCCCTGAAGACGCAGAAACAATCCTAAGAATGCCTAGATGTAGCAAGAGAACGAACGATGCGATCATTTGGAACCTCAATCCCAAAGGC
ATATTTACGGTCAGAAGTGCATATCATTTGGCTAGGGATCTTAGTGCGACCTCTCAAGCTTCAGAATCAGAAAATTCAGCCAACACATCAGTATGGAAATCTATACGGAG
AGCCAACAACATCCCTAAAGTGAAAATCACGGCGTGGAAGATCTTAAAAGACATAATCCCTACTAAAGCTAATATTAGTAAAAAGGGGATCAATACTAATGGTGCTTGTT
GTTTGTGCATGGCTCATTTGGAGGACTCAGTGCATCTCTTCTGGAAGTGTAAGTTTTCTAAGAAAATTTGGATTGAATTTTTTCCTAATCTTCTCTCTTTTCTGAATTCT
GGCAGAAACGATTGGAACCCTATCGACTACTGGGATGGGATGTGCAGAATTTTAAACAAAGAAGAGCAGGATCGAGCGATTACAATTCTTTGGTCTCTTTGGAACACCAG
AAACATCAGCAACCAGACTCAAAGCCCCCCAAAATTTGGGCAAACGTGCAGATCAATTTATAGATTTCTTGAAGGCTATTCAGAGGGAAAGAAGACTAACCTGAAATCGT
CTCAGTTGAAGAACCATTCGAGTCACCAATGTTGGAAGCCGTCGGCCCCCGACTGCTGGAAATTAAACTCTGACACGTCTTGGTGCGAGAAATCGAGGAAAGGTGGGGTG
GGGTGGGCCATTCGTGACTCTAATGGGTCTTTGATCGAAGTTGGATGCAAGCAATTCGTCAGAAAATGGAATATCAAGTGTATGGAGGCCAAAGCGATATTGGAAGGGAT
TGAAGCTTACCTGAAAGCTAGCAACCAAGCGGAAAGAAGATTCAAGCTCGTTGTTGAATCAGATTCTTCGGATGTCATCGGAGCCCTGAATCGCGACATCGAAGACCTCT
CGGAGATGGCTTGCTTCACTGAGGAAATTCACAGAATCTCGGAGACGGAGCAAATTTCGTTCGCCAAATGCCCTAGGACTGTTAACTCCCTTGCCCACGAACTCGCTCGT
GCGGTGACTCTGAACAGCAACTGGGAAGTTTTTTTTGGTAACTCTCTTCCATTGTGCGAGGAAGATGAAGCGTTTTGGAGGGAATTCGAGTTCCCCTTTTGGTTTTGTGA
TTTATTGGCTAAAGAAACCGGTGTACCTAACTTCCCGTTTATTTAA
mRNA sequence
ATGGGTGGGAATAGTAGGTATTTGGGTTGGATTCGGAATGGGAATGACAATGGGAATGGTGATTGGGGTACTGAGGGTTACAGTTGGAGGACTTTTGCAGATTGGAGGGA
CAGCATCTTGAGTTTCACGATGGAGAGGACGGATACTGCAGAGGAAGTTCAACGGCAAATGGAAAAGCTGGGATTAGAAGAGGAAGAAAGCGGAAGGGTGGTCGAGATTA
AAGATGACGATATCGACGAAACAGACAAAGACCACCAAACTGCTACGGCCTGCAAAATCTTATCAACACAAACAATCAATGCAGATGGATTCTCGAATCTCATGCCCAAA
ATATGGGGATTGAAGGGGAACGCCAAGATCGTAAAGAAATATGCGGTGGCCTTGGCAAAATCGATAGGAGAATTTGTGGAGGCAGAATCAGATGAAAAAGGGAAAATGGA
GGGGGAAACTCTACGAGTTAGAGTTAAGCTCAACGTCTCCAAACCGTTAAGAAGAGGGACAAACATCAAAGCTGGAACAATGGCGGAAAAGAAATGGATAAGAGTCACCT
ACGAAAAATTGCCAGACTTTTGTTATTACTGTGGTAGACTCGGCCATGTCGATCAGGAATGCGAAGAGGAAGGATCTGACAACAATTCAAAAAGGGACTATGGGGTGGAT
CTAAGAGAAACACACAGCAACGAAAAAGATAAAAGATCTGAGGAAAGCGAGGAAAGACCTGAAAACACCGTCTGGGAAGAAGAGGCAACGCCGGAAATTCTTGATCAGAG
ACTGGGGGAGGAAGGTACGGAGGCGAGGAGAGAGAACAGAAACCAGACAAATCGAATAGGTCTGACAACCGGATCAGGGGATAAGACAGAGGAAAGAGAGGAACAGTACT
GGCAAAGATTACAAGATGTGGGTAGTGGTGCTGGGATATTTACTTGGGAAAGGAAAACAAGAGATGGGTCCTGGATAAAAGAGAGGCTGGACAGATTTTTAGCTACCAAT
GAATTAAGAGATATATTCAAAGATCTCAGAATTGACCATTTGAACAAGCACAATTCGGACCACAGACCCATTGTGGTGTCCTTAACCTGTAATTCTAAGGATAAGGGCAA
GAGGAAGCTGCATAAAAATATTAAGTTTGAAGGGGGCTGGGTGGAGTTTGAAGACTACAGAGAGATCATAAAGAATCATTGGAATAGCGGCGTTGAGTTGGGGATTCATA
ATTTCGGCAGCAAGCTCAAGTCTTGTATCCACAAGCTAAATGATTGGAATAAAATCAGATTGCAAGGGTCGATCACCTCAGCCATAAAAAGGAAAGAAAGGGAGATCCAG
AGCATTTTAGCAGGCAGCGATGTGATTAAAGACAGGGACCTTGACAGGGCAGAAAGAGAGTTGGATTATCTTCTTGAGCAAGATGAGCTTTATTGGAAGTTTAGATCTCG
CGAGGATTGGCTAAAATGGGGAGACCGTAACACGAAGTGGTTTCATTCTAGGAAATGGAGATGGAAAAGAGTTGCAGACCTCCTTGACGATATGGGCGATTGGATTGAAG
ATGATGTCCAAAGGGCGTTCCTCCCTGAAGACGCAGAAACAATCCTAAGAATGCCTAGATGTAGCAAGAGAACGAACGATGCGATCATTTGGAACCTCAATCCCAAAGGC
ATATTTACGGTCAGAAGTGCATATCATTTGGCTAGGGATCTTAGTGCGACCTCTCAAGCTTCAGAATCAGAAAATTCAGCCAACACATCAGTATGGAAATCTATACGGAG
AGCCAACAACATCCCTAAAGTGAAAATCACGGCGTGGAAGATCTTAAAAGACATAATCCCTACTAAAGCTAATATTAGTAAAAAGGGGATCAATACTAATGGTGCTTGTT
GTTTGTGCATGGCTCATTTGGAGGACTCAGTGCATCTCTTCTGGAAGTGTAAGTTTTCTAAGAAAATTTGGATTGAATTTTTTCCTAATCTTCTCTCTTTTCTGAATTCT
GGCAGAAACGATTGGAACCCTATCGACTACTGGGATGGGATGTGCAGAATTTTAAACAAAGAAGAGCAGGATCGAGCGATTACAATTCTTTGGTCTCTTTGGAACACCAG
AAACATCAGCAACCAGACTCAAAGCCCCCCAAAATTTGGGCAAACGTGCAGATCAATTTATAGATTTCTTGAAGGCTATTCAGAGGGAAAGAAGACTAACCTGAAATCGT
CTCAGTTGAAGAACCATTCGAGTCACCAATGTTGGAAGCCGTCGGCCCCCGACTGCTGGAAATTAAACTCTGACACGTCTTGGTGCGAGAAATCGAGGAAAGGTGGGGTG
GGGTGGGCCATTCGTGACTCTAATGGGTCTTTGATCGAAGTTGGATGCAAGCAATTCGTCAGAAAATGGAATATCAAGTGTATGGAGGCCAAAGCGATATTGGAAGGGAT
TGAAGCTTACCTGAAAGCTAGCAACCAAGCGGAAAGAAGATTCAAGCTCGTTGTTGAATCAGATTCTTCGGATGTCATCGGAGCCCTGAATCGCGACATCGAAGACCTCT
CGGAGATGGCTTGCTTCACTGAGGAAATTCACAGAATCTCGGAGACGGAGCAAATTTCGTTCGCCAAATGCCCTAGGACTGTTAACTCCCTTGCCCACGAACTCGCTCGT
GCGGTGACTCTGAACAGCAACTGGGAAGTTTTTTTTGGTAACTCTCTTCCATTGTGCGAGGAAGATGAAGCGTTTTGGAGGGAATTCGAGTTCCCCTTTTGGTTTTGTGA
TTTATTGGCTAAAGAAACCGGTGTACCTAACTTCCCGTTTATTTAA
Protein sequence
MGGNSRYLGWIRNGNDNGNGDWGTEGYSWRTFADWRDSILSFTMERTDTAEEVQRQMEKLGLEEEESGRVVEIKDDDIDETDKDHQTATACKILSTQTINADGFSNLMPK
IWGLKGNAKIVKKYAVALAKSIGEFVEAESDEKGKMEGETLRVRVKLNVSKPLRRGTNIKAGTMAEKKWIRVTYEKLPDFCYYCGRLGHVDQECEEEGSDNNSKRDYGVD
LRETHSNEKDKRSEESEERPENTVWEEEATPEILDQRLGEEGTEARRENRNQTNRIGLTTGSGDKTEEREEQYWQRLQDVGSGAGIFTWERKTRDGSWIKERLDRFLATN
ELRDIFKDLRIDHLNKHNSDHRPIVVSLTCNSKDKGKRKLHKNIKFEGGWVEFEDYREIIKNHWNSGVELGIHNFGSKLKSCIHKLNDWNKIRLQGSITSAIKRKEREIQ
SILAGSDVIKDRDLDRAERELDYLLEQDELYWKFRSREDWLKWGDRNTKWFHSRKWRWKRVADLLDDMGDWIEDDVQRAFLPEDAETILRMPRCSKRTNDAIIWNLNPKG
IFTVRSAYHLARDLSATSQASESENSANTSVWKSIRRANNIPKVKITAWKILKDIIPTKANISKKGINTNGACCLCMAHLEDSVHLFWKCKFSKKIWIEFFPNLLSFLNS
GRNDWNPIDYWDGMCRILNKEEQDRAITILWSLWNTRNISNQTQSPPKFGQTCRSIYRFLEGYSEGKKTNLKSSQLKNHSSHQCWKPSAPDCWKLNSDTSWCEKSRKGGV
GWAIRDSNGSLIEVGCKQFVRKWNIKCMEAKAILEGIEAYLKASNQAERRFKLVVESDSSDVIGALNRDIEDLSEMACFTEEIHRISETEQISFAKCPRTVNSLAHELAR
AVTLNSNWEVFFGNSLPLCEEDEAFWREFEFPFWFCDLLAKETGVPNFPFI
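
A CDS and its protein sequence, as listed in this record, can be cross-checked with a few simple structural tests. The Python sketch below is one possible way to do that and is not part of the database. The function name `check_cds` is hypothetical, and the sketch assumes the standard stop codons (TAA, TAG, TGA) and that the CDS includes the terminal stop codon while the protein sequence does not.

```python
from typing import List

# Standard stop codons (assumption: the standard genetic code applies).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> List[str]:
    """Return a list of problems found; an empty list means all checks passed."""
    # Remove line breaks and other whitespace from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    # One codon per residue, plus one stop codon that is not translated.
    if len(cds) != 3 * (len(protein) + 1):
        problems.append("CDS length does not match protein length plus stop codon")
    return problems


# Toy example: ATG GGT TAA encodes M, G, then stop.
print(check_cds("ATGGGTTAA", "MG"))  # prints []
```

To apply the check to this record, pass the full CDS and protein sequence text (line breaks included) as the two arguments. The function only tests lengths and boundary codons and does not translate the sequence.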