; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg008857 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg008857
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold10:33646578..33653882
RNA-Seq ExpressionSpg008857
SyntenySpg008857
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU38731.1 hypothetical protein TSUD_208420 [Trifolium subterraneum]7.3e-4826.94Show/hide
Query:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL
        R TG YG P  S   ++W  +++L +   + W + GDFN+I+ + EK G   R +  +  FR+A+    + D  + G  +TW  +      + E+LDR +
Subjt:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL

Query:  INPRMNGWCSVFK---VFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTE---------TNRDAHQYSV-
         N   + WC++F+   V  L   ASDH P+L E   +P     H+ ++  +FE  W    E    VK  W      T T          T+   H++S+ 
Subjt:  INPRMNGWCSVFK---VFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTE---------TNRDAHQYSV-

Query:  -AHFI---------------KPTGV-------------------WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQS
          H I               KP  +                   WD  L++ +  +  A+ IL+ PL    + D I W ++  G +TVKSAYR  I    
Subjt:  -AHFI---------------KPTGV-------------------WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQS

Query:  SNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWA
         +   G    + E  W   W+T + PKIK   WRI  + LPTR  L  +G+     C+LC    E   HL + C+ +   W +     ++    N     
Subjt:  SNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWA

Query:  AADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLT--RGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAG
          +   +M +        E + A    V W IW  RN ++  N+   +++     E+A +LLT  R   E  DR   Q       H P+    W  P AG
Subjt:  AADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLT--RGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAG

Query:  YWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIP--HDSPRLFIELDSIQVVN
         WK N D S++  +   G+G  IR   G  V A          +   EA+ ++  ++ +   H +  +F ELDS +VV+
Subjt:  YWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIP--HDSPRLFIELDSIQVVN

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]2.7e-6631.08Show/hide
Query:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL
        R TG+YG+P  +   ETW L++ L     M WV  GDFNEI    EK G +++    MR F++AI  C +   GF G  +TWCN       + ERLDR +
Subjt:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL

Query:  INPRMNGWCSVF---KVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTETNRDAHQYSVAHFI-KPTGV
             + WC +F    V HL+   SDH P+L  + +E P     K  R  RFE +W    EC +I+   W+   + +     RDA    V+  I K    
Subjt:  INPRMNGWCSVF---KVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTETNRDAHQYSVAHFI-KPTGV

Query:  WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHL---QSSNAASGSCLCQKEDL----WKEYWKTPILPKIKVCGWRIYH
        W+  L+  +F+  +AE I +IPLS     D  +WH+ SKG F+V+SAY L   L   +S+ ++S S L     L    W + W+  I PK+K+  W++  
Subjt:  WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHL---QSSNAASGSCLCQKEDL----WKEYWKTPILPKIKVCGWRIYH

Query:  DILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRN
        +ILP R NL K+ + V+++C +C ++ ETI H+L  C   + +W        +  L  R    +AD          +  G+EG  +   ++ W IW +RN
Subjt:  DILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRN

Query:  TILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIH
          + +       N  Q+  K    L        DR  P+  S +          W+ PP   +K+N DG+ + E +  GVG V+R   G ++ A  + I 
Subjt:  TILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIH

Query:  KSWHISWLEAIAVMEGIR-SIPHDSPRLFIELDSIQVVNLLEGKEED
         +   + +EAIA  EG++ ++      + +E DS+  +  L  +EE+
Subjt:  KSWHISWLEAIAVMEGIR-SIPHDSPRLFIELDSIQVVNLLEGKEED

KMS97072.1 hypothetical protein BVRB_7g179330 [Beta vulgaris subsp. vulgaris]2.0e-5327.47Show/hide
Query:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL
        RFTGIYG    S   +TW +M+ LG+  ++AW+L GDFNE++   EK      + A ++ FR  +D   + D G +G  +TW N   +   + ERLDR L
Subjt:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL

Query:  INPRMNGWCSVFKVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSE-------------------------------
         +       S   V +L    SDH PI+        +     G +  RFE  W + A   +++K VW E                               
Subjt:  INPRMNGWCSVFKVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSE-------------------------------

Query:  ----RSRNTNTETNRDAHQYSVAHFIKPTGV-WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK
            R      E   DA    VA  I    V W + ++ ++F + D   I NI LS    ED I W     G F V+ AYRL I  ++ N AS S     
Subjt:  ----RSRNTNTETNRDAHQYSVAHFIKPTGV-WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK

Query:  EDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGG
        + +WK  W   I PK+    WR   DILP   NL KK    +  C+ C    ET  H   +C+  +  WAK   +     + N K W A  +        
Subjt:  EDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGG

Query:  SEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEK
         E   KE       +  WQ+W  RN ++  N      +     E+A+++L         +RE            R   RW+LP  G+ K+N D + N E 
Subjt:  SEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEK

Query:  KIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGI-RSIPHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPR
           G+G V R   G ++ A  R +   W     EA AV+     +I      + +E D+  ++N +  + +   D+   +++  + +P  +   F +  R
Subjt:  KIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGI-RSIPHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPR

Query:  LQNQLAHHLARRAWATDSSESWSRNFPNWLLHLNSQD
          N++AH LA+ A +    E W    P+W+  L   D
Subjt:  LQNQLAHHLARRAWATDSSESWSRNFPNWLLHLNSQD

RYQ92149.1 hypothetical protein Ahy_B09g098304 [Arachis hypogaea]7.8e-5025.76Show/hide
Query:  IYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLINPR
        +YGNPV     + W  +       +   V  GDFN+I++  EK G L + +  +  FR  +D   + D    G KYTW +N  N     ERLDR L+N  
Subjt:  IYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLINPR

Query:  MNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTN---------TETNRDAHQYSVAHFIK
           W  +++   L     I+SDH  ++ + +      R  K      FE  W ++ ECK+++K  W +   N N             R+  ++S   F +
Subjt:  MNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTN---------TETNRDAHQYSVAHFIK

Query:  PT-----------GVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSS------NAASGSCLCQKEDLWKEYWKTP
                      + +  +++++F  + AE+I   P+S ++++D +IW Y ++G +TVKS YR     + +      N AS S      ++W   W+ P
Subjt:  PT-----------GVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSS------NAASGSCLCQKEDLWKEYWKTP

Query:  ILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGN
        +  KI++  W+    ILP  +NL K+   V   C +C+D+ ET++H L  C  T+ +W  +     P  + +++   W      ++    G +   +E  
Subjt:  ILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGN

Query:  FAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVI
          K   VCW IW  RN  +        IN +Q +  A  L        T     Q  S       +  + W  PP    K+N D ++ ++      G VI
Subjt:  FAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVI

Query:  RCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK----EED--LTDLAKFIDEARSSLPVCQPHSFFYVPRLQN
        R W G  +T G      +   +  EA A  E +  I +       IE D +Q+V +++ +    E D  L D+ + ++EA +           + PR  N
Subjt:  RCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK----EED--LTDLAKFIDEARSSLPVCQPHSFFYVPRLQN

Query:  QLAHHLARRAWATDSSESWSRNFPN
         +AH LA  A     +  W+ N PN
Subjt:  QLAHHLARRAWATDSSESWSRNFPN

RYR18269.1 hypothetical protein Ahy_B03g062876 [Arachis hypogaea]1.9e-4824.11Show/hide
Query:  IYGNPVRSLHSETW--TLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLIN
        +YGNPV       W    +  +  ++  A++  GDFN+I++  EK G   +    +  FR  +D  ++ D    G KYTW +N  N     +RLDR L+N
Subjt:  IYGNPVRSLHSETW--TLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLIN

Query:  PRMNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKH--VWSERSRN-------TNTETNRDAHQYSVAHF
         +   W  +++  +L     + SDH  ++ + ++     ++ K      FE  W ++ ECK++++    W + SRN             R+  ++S   F
Subjt:  PRMNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKH--VWSERSRN-------TNTETNRDAHQYSVAHF

Query:  IKPTGVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK----EDLWKEYWKTPILPKIKVCGWR
         +     +   ++   ++  AE I   P+S ++++D  +W +   G +TV++ Y +    + S      C         ++W+  W+ P+  K+++  W+
Subjt:  IKPTGVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK----EDLWKEYWKTPILPKIKVCGWR

Query:  IYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQI
          H ILP  TNL ++ + +   C +C+ + ETI+H L  C  T+ +W  +    +P  + + + + W       +    GSE   +E    K   VCW I
Subjt:  IYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQI

Query:  WSYRN--TILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVT
        W  RN      T   P K+  Q +   A     +   E +    P    G      R  + W  PP    K+N D ++ ++     +   +R W G ++T
Subjt:  WSYRN--TILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVT

Query:  AGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK------EEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARR
         G  A  K+      EA A  E +  I +   P   IE DS+ +V  ++ +      +  + D+ + ++EA             + PR  N+LAH LA  
Subjt:  AGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK------EEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARR

Query:  AWATDSSESWSRNFPNWL
        A   +    W+ N P  L
Subjt:  AWATDSSESWSRNFPNWL

TrEMBL top hitse value%identityAlignment
A0A0J8BAU9 Uncharacterized protein9.6e-5427.47Show/hide
Query:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL
        RFTGIYG    S   +TW +M+ LG+  ++AW+L GDFNE++   EK      + A ++ FR  +D   + D G +G  +TW N   +   + ERLDR L
Subjt:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL

Query:  INPRMNGWCSVFKVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSE-------------------------------
         +       S   V +L    SDH PI+        +     G +  RFE  W + A   +++K VW E                               
Subjt:  INPRMNGWCSVFKVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSE-------------------------------

Query:  ----RSRNTNTETNRDAHQYSVAHFIKPTGV-WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK
            R      E   DA    VA  I    V W + ++ ++F + D   I NI LS    ED I W     G F V+ AYRL I  ++ N AS S     
Subjt:  ----RSRNTNTETNRDAHQYSVAHFIKPTGV-WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK

Query:  EDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGG
        + +WK  W   I PK+    WR   DILP   NL KK    +  C+ C    ET  H   +C+  +  WAK   +     + N K W A  +        
Subjt:  EDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGG

Query:  SEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEK
         E   KE       +  WQ+W  RN ++  N      +     E+A+++L         +RE            R   RW+LP  G+ K+N D + N E 
Subjt:  SEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEK

Query:  KIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGI-RSIPHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPR
           G+G V R   G ++ A  R +   W     EA AV+     +I      + +E D+  ++N +  + +   D+   +++  + +P  +   F +  R
Subjt:  KIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGI-RSIPHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPR

Query:  LQNQLAHHLARRAWATDSSESWSRNFPNWLLHLNSQD
          N++AH LA+ A +    E W    P+W+  L   D
Subjt:  LQNQLAHHLARRAWATDSSESWSRNFPNWLLHLNSQD

A0A2Z6N4T0 Uncharacterized protein3.5e-4826.94Show/hide
Query:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL
        R TG YG P  S   ++W  +++L +   + W + GDFN+I+ + EK G   R +  +  FR+A+    + D  + G  +TW  +      + E+LDR +
Subjt:  RFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFL

Query:  INPRMNGWCSVFK---VFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTE---------TNRDAHQYSV-
         N   + WC++F+   V  L   ASDH P+L E   +P     H+ ++  +FE  W    E    VK  W      T T          T+   H++S+ 
Subjt:  INPRMNGWCSVFK---VFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTE---------TNRDAHQYSV-

Query:  -AHFI---------------KPTGV-------------------WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQS
          H I               KP  +                   WD  L++ +  +  A+ IL+ PL    + D I W ++  G +TVKSAYR  I    
Subjt:  -AHFI---------------KPTGV-------------------WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQS

Query:  SNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWA
         +   G    + E  W   W+T + PKIK   WRI  + LPTR  L  +G+     C+LC    E   HL + C+ +   W +     ++    N     
Subjt:  SNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWA

Query:  AADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLT--RGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAG
          +   +M +        E + A    V W IW  RN ++  N+   +++     E+A +LLT  R   E  DR   Q       H P+    W  P AG
Subjt:  AADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLT--RGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAG

Query:  YWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIP--HDSPRLFIELDSIQVVN
         WK N D S++  +   G+G  IR   G  V A          +   EA+ ++  ++ +   H +  +F ELDS +VV+
Subjt:  YWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIP--HDSPRLFIELDSIQVVN

A0A444XQY7 RNase H domain-containing protein3.8e-5025.76Show/hide
Query:  IYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLINPR
        +YGNPV     + W  +       +   V  GDFN+I++  EK G L + +  +  FR  +D   + D    G KYTW +N  N     ERLDR L+N  
Subjt:  IYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLINPR

Query:  MNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTN---------TETNRDAHQYSVAHFIK
           W  +++   L     I+SDH  ++ + +      R  K      FE  W ++ ECK+++K  W +   N N             R+  ++S   F +
Subjt:  MNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTN---------TETNRDAHQYSVAHFIK

Query:  PT-----------GVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSS------NAASGSCLCQKEDLWKEYWKTP
                      + +  +++++F  + AE+I   P+S ++++D +IW Y ++G +TVKS YR     + +      N AS S      ++W   W+ P
Subjt:  PT-----------GVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSS------NAASGSCLCQKEDLWKEYWKTP

Query:  ILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGN
        +  KI++  W+    ILP  +NL K+   V   C +C+D+ ET++H L  C  T+ +W  +     P  + +++   W      ++    G +   +E  
Subjt:  ILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGN

Query:  FAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVI
          K   VCW IW  RN  +        IN +Q +  A  L        T     Q  S       +  + W  PP    K+N D ++ ++      G VI
Subjt:  FAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVI

Query:  RCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK----EED--LTDLAKFIDEARSSLPVCQPHSFFYVPRLQN
        R W G  +T G      +   +  EA A  E +  I +       IE D +Q+V +++ +    E D  L D+ + ++EA +           + PR  N
Subjt:  RCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK----EED--LTDLAKFIDEARSSLPVCQPHSFFYVPRLQN

Query:  QLAHHLARRAWATDSSESWSRNFPN
         +AH LA  A     +  W+ N PN
Subjt:  QLAHHLARRAWATDSSESWSRNFPN

A0A444ZVS3 Uncharacterized protein9.3e-4924.11Show/hide
Query:  IYGNPVRSLHSETW--TLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLIN
        +YGNPV       W    +  +  ++  A++  GDFN+I++  EK G   +    +  FR  +D  ++ D    G KYTW +N  N     +RLDR L+N
Subjt:  IYGNPVRSLHSETW--TLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLIN

Query:  PRMNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKH--VWSERSRN-------TNTETNRDAHQYSVAHF
         +   W  +++  +L     + SDH  ++ + ++     ++ K      FE  W ++ ECK++++    W + SRN             R+  ++S   F
Subjt:  PRMNGWCSVFKVFHL---ARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKH--VWSERSRN-------TNTETNRDAHQYSVAHF

Query:  IKPTGVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK----EDLWKEYWKTPILPKIKVCGWR
         +     +   ++   ++  AE I   P+S ++++D  +W +   G +TV++ Y +    + S      C         ++W+  W+ P+  K+++  W+
Subjt:  IKPTGVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK----EDLWKEYWKTPILPKIKVCGWR

Query:  IYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQI
          H ILP  TNL ++ + +   C +C+ + ETI+H L  C  T+ +W  +    +P  + + + + W       +    GSE   +E    K   VCW I
Subjt:  IYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQI

Query:  WSYRN--TILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVT
        W  RN      T   P K+  Q +   A     +   E +    P    G      R  + W  PP    K+N D ++ ++     +   +R W G ++T
Subjt:  WSYRN--TILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVT

Query:  AGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK------EEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARR
         G  A  K+      EA A  E +  I +   P   IE DS+ +V  ++ +      +  + D+ + ++EA             + PR  N+LAH LA  
Subjt:  AGFRAIHKSWHISWLEAIAVMEGIRSIPH-DSPRLFIELDSIQVVNLLEGK------EEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARR

Query:  AWATDSSESWSRNFPNWL
        A   +    W+ N P  L
Subjt:  AWATDSSESWSRNFPNWL

A0A445DNV3 Uncharacterized protein1.3e-4724.75Show/hide
Query:  IYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLINPR
        IYGNP      E W  +    +      +  GDFN+I+   EK G   + +  +R+FR   D   + D    G ++TW  N  N     ER+DR L+N  
Subjt:  IYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDNGEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLINPR

Query:  MNGWCSVFKVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTETNRDAHQYSVAHFIKPTGVWDEALVKD
                 +  L  I+SDH P++ +  +     R  K     +FE  WT + +C++ VK  W +       E  +      +   +K    W++  ++ 
Subjt:  MNGWCSVFKVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWTKYAECKDIVKHVWSERSRNTNTETNRDAHQYSVAHFIKPTGVWDEALVKD

Query:  MFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHL-QSSNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGM
         F +   + IL+ P+S +++ED + W +   G +++K+ Y     + Q+S   + S    K ++W+E W+  +  KI++  W+  HDILP  +NL K+ M
Subjt:  MFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHL-QSSNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGM

Query:  EVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKI
          + +  +C   PET++H L  C   +  W  A+    P V  +S+   W   +  + +  GG ED   E   +K   + W+IW  RN  +   Q+   +
Subjt:  EVDHMCLLCRDKPETIQHLLWECKITKGMW--AKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKI

Query:  NLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIA
        N +  + +A   L +   +  +++  Q   G+        ++W  PP  + K N D ++ ++   G +  VIR   G I+  GF    ++   +  EA A
Subjt:  NLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIA

Query:  VMEGIRSIPH-DSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRAWATDSSESWSRNFP
        V + +  + +    R  IE D++++   ++ K   L +      + +  +         + PR  N+LAH +A+ A A     +WS   P
Subjt:  VMEGIRSIPH-DSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRAWATDSSESWSRNFP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.9e-1724.18Show/hide
Query:  CLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSE----DAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQ
        C+ C D  ET+ HLL++C   + +WA  SP+P          W  + Y  + W    E      GK GN     +  W++W  RN ++   +       +
Subjt:  CLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSE----DAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQ

Query:  QQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVME
            + L        E + RRE +  +     E    ++W  PP  + K N D +W  E    G+GW++R  +G ++  G RA+ ++ ++   E  A+  
Subjt:  QQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVME

Query:  GIRSIPH-DSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRA
         + ++   +  R+  E D+  +VNLL   ++    L   +++ +  L   +   F + PR  N++A  +AR +
Subjt:  GIRSIPH-DSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRA

AT2G46460.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.3e-0628.03Show/hide
Query:  WVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSI-PHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKF
        W  PP G+ K N DGS+N E      GW+IR  +G  V+AG    + + +    E  A++  ++S   H   +++ E D+ +V+ +L  + +   D+  +
Subjt:  WVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAGFRAIHKSWHISWLEAIAVMEGIRSI-PHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKF

Query:  IDEARSSLPVCQPHSFFYVPRLQNQLAHHLAR
        I + ++     Q   F ++ R QN+ A  LA+
Subjt:  IDEARSSLPVCQPHSFFYVPRLQNQLAHHLAR

AT3G09510.1 Ribonuclease H-like superfamily protein2.8e-2925.06Show/hide
Query:  WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRT
        WD++ +     +SD   I  I L+   + D IIW+Y++ G +TV+S Y L  H  S+N  + +      DL    W  PI+PK+K   WR     L T  
Subjt:  WDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQKEDLWKEYWKTPILPKIKVCGWRIYHDILPTRT

Query:  NLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTI-----------VCWQIW
         L  +GM +D  C  C  + E+I H L+ C      W     L +   + N          ++M     E+     NF + T            + W+IW
Subjt:  NLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNFAKSTI-----------VCWQIW

Query:  SYRNTILHT--NQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTA
          RN ++     ++P+K  L  + E    L        + ++ P P    A ++    + W  PPA Y K N D  ++ +K     GW+IR   G+ ++ 
Subjt:  SYRNTILHT--NQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTA

Query:  GFRAIHKSWHISWLEAIAVMEGIRSI-PHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRAWATDSS
        G   +  + +    E  A++  ++        ++F+E D   ++NL+ G     + LA  +++            F ++ R  N+LAH LA+      + 
Subjt:  GFRAIHKSWHISWLEAIAVMEGIRSI-PHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRAWATDSS

Query:  ESWSRNFPNWL
         S S + P WL
Subjt:  ESWSRNFPNWL

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.3e-0624.49Show/hide
Query:  KEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW----AKFSPLPNVFFLSNRKGWAAADYCEVMWRGG
        K  W    +P+  +  W  + + LPTR  L   GM +    +LC +  ET  HL +EC  +  +W    +KF P P  F L     W      ++  R  
Subjt:  KEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMW----AKFSPLPNVFFLSNRKGWAAADYCEVMWRGG

Query:  SEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKAL
        S    K     +S +  + +W  RN  + T+ + +  +L+  +++ +
Subjt:  SEDAGKEGNFAKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKAL

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-0824.41Show/hide
Query:  WQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIV
        W+IW   N ++    N  +   Q  VE ALN  T+   + T   E Q  +G+   +P    +W  P     K N D S +E   + G+GW++R   G+++
Subjt:  WQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIV

Query:  TAGFRAIHKSWHISWLEAIAVMEGIR-SIPHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRAWATD
          G             E   ++  I+ S      ++  E D+  +  ++  K  +   L  F+D  +S +P  +   F +  R QN  A  LA++A   +
Subjt:  TAGFRAIHKSWHISWLEAIAVMEGIR-SIPHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRAWATD

Query:  SSESWSRNFPNWL
        +  S   + P +L
Subjt:  SSESWSRNFPNWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGACCTTGCCAAGCAACTTGCAGATCTACGTGTTACTCCTGAGGAGAAAGCTAGTGTTTTCAAACTTCAGGAACGGGAAATTGACCACTCTGAACAATTTTT
GGCGAACTCGATATTATTCAAAAATGCACGAATAAAAGGATGGGTCCTAGAGATGGGTCCATGGTTTTATGACAAGGCGATGCTATTATTCGAGGAACCAAAAGGAGACA
TCTGCAGCGAGGAAATGGAGTTCAGGGCATCCGCGGAGGCTATTGGAGGATTGCTCGGGAAGGTTGAAAAAGTGGACATAGGGGACGACTCAGATCCGGATTGGGGGAGA
TCATTGAGGATCAAAATTCAACTCAACGTCCGGAGACCTTTGAAGCGGGGAATTTTCTTGCAATCAGAGAGCTCAGGGAGAGACAAGTGGATTCCCATTACATATGAGAA
GCTCCCGGATTTTTGTTATGGCTGCGGTCGGTTGGGTCATACTATAAAAGAATGTGAGGATCAAATTAGCTCATCGGAAGAGGAGATGTTGTATGGCCCTACCCTTCGTG
AACCGGCCAGATTAAAATCTGAGGAGGCTGAATTTGCTACAAGAAATATCCAAGGCCGGGGGAGAGGAAGAGGAGGGTATGGAGGACGAGGTGGGTGGCGATACGTGGAC
GTTGTACAGGAGGAGGCTGAAAATCAAGAACCCGACAAACTGCATCCAGAGAAGGAAGAGAGGGTGCCGCAGGTCGAAGCTCATGGGCTGCCAGTGCAGGCGGAGGAGTC
TCCACCGCCGGTTGACAATCGGCACAGCACAACGGTTGAATTTTCCAAAAAGGAAAGAAGAGTGGGAATGGAAAATTCAATAAAGGCTAACGGCGAAATTTTAGGAATTT
CTGAGGTAGAGACCGTTGGGGACGAAATCAACGGAGAGTTAGGAATCTCAGCTTTAAATGGGGAGAGAACAGAGGTGTGTGCCAACGATTTCCTACAATACAGCTCACAA
ATGGAAATTGATTCCCACCAACAAGTGGCGGAAGATAATGAGAAGGATCCCAACTCTCAGGTGTTTAAAGATTCATTGAATGATACCAATGAACCCAAATGTGGGCCCAC
AAAGATTGGAACAATAAAGACCAAGGGCTGGAAAAGAATACAGAGGGATAATAAAACAAATGATGTTAGTGCTGAGTACTCCACGAAGAGATTCACTGGCATATATGGAA
ATCCGGTTAGAAGTTTGCATTCTGAAACCTGGACGTTAATGAAACGGTTGGGTGATCAGCTAGATATGGCATGGGTCCTTGGAGGCGACTTTAACGAGATCATTGACAAT
GGAGAAAAGTATGGCGGTTTAGACAGGAATGAAGCAGACATGAGGGATTTCCGGGATGCTATTGACTACTGTGAGGTTTTCGACCCTGGCTTCAATGGACCTAAATATAC
ATGGTGTAATAACCACGTTAATCGAGACAGGATATGGGAGAGACTCGATCGTTTTCTCATTAACCCGAGGATGAACGGATGGTGCAGTGTCTTTAAAGTATTCCATCTAG
CTAGGATTGCGTCAGATCACAGACCCATTCTGGCAGAATGGAAGGAAGAACCTCCAGATTGTAGAAACCACAAGGGCATTCGTCCAAGACGATTCGAGGAGGTCTGGACA
AAGTATGCAGAATGTAAAGATATTGTAAAGCACGTGTGGAGTGAGAGGTCGCGCAATACAAATACTGAAACTAACAGAGATGCACATCAATATTCAGTAGCCCATTTTAT
CAAACCTACGGGGGTGTGGGATGAGGCGTTAGTAAAGGATATGTTTCTGGAGAGTGATGCAGAAGCTATTCTTAACATCCCCCTGAGCTCAATGCATAGAGAAGATACAA
TTATATGGCACTATGACTCTAAAGGTTTCTTTACGGTGAAGAGTGCATACAGATTGGGGATTCATCTTCAAAGTTCCAATGCTGCATCAGGATCGTGCCTATGCCAGAAG
GAAGATCTGTGGAAAGAATACTGGAAAACTCCCATCCTCCCTAAAATTAAAGTTTGTGGGTGGAGAATTTATCATGATATTCTCCCAACTCGCACAAACCTAATTAAAAA
GGGGATGGAGGTGGATCATATGTGTTTGTTGTGCAGGGATAAACCAGAAACAATCCAACATCTTCTGTGGGAGTGTAAAATTACTAAAGGTATGTGGGCTAAATTTTCCC
CTCTTCCTAATGTTTTCTTTCTCTCTAACAGGAAGGGATGGGCGGCTGCTGACTACTGCGAAGTGATGTGGAGAGGGGGCAGCGAAGATGCAGGGAAGGAAGGAAATTTC
GCGAAAAGTACAATAGTATGTTGGCAAATATGGTCATACAGAAACACCATCCTTCATACCAATCAGAATCCAAACAAAATTAACCTCCAGCAGCAAGTTGAGAAGGCGTT
GAACTTACTTACCCGAGGGGAAGGAGAGCCGACGGACAGACGCGAACCCCAGCCGCCGAGTGGAAGTGCTCCGCACGAGCCTCGTCCCTGTCTCAGATGGGTTCTTCCGC
CGGCGGGCTATTGGAAGCTAAATTGCGATGGATCGTGGAATGAAGAGAAGAAAATCGGAGGAGTTGGCTGGGTCATCAGATGTTGGACAGGAAGCATCGTTACTGCAGGA
TTCCGAGCAATCCACAAATCGTGGCATATTAGTTGGCTTGAAGCCATTGCTGTGATGGAAGGAATACGATCGATCCCCCATGACTCTCCCAGGCTGTTTATCGAGCTTGA
TTCCATTCAGGTGGTAAACTTGCTTGAGGGAAAGGAGGAAGATCTCACTGACCTGGCAAAGTTCATTGATGAAGCCAGAAGTTCCCTTCCTGTGTGCCAGCCTCATTCGT
TTTTTTACGTGCCTCGGCTGCAAAACCAATTGGCCCACCATCTGGCCCGTCGGGCTTGGGCCACGGACTCATCTGAAAGTTGGAGTAGGAATTTTCCCAATTGGTTGTTA
CATTTAAACTCTCAGGATATTGGGCCTTCGCCCATCAGTTATGGGGGCTCCTGTCCCATGGCTGGTAGCCTTTCGGGAGCTATTGCTCTTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGACCTTGCCAAGCAACTTGCAGATCTACGTGTTACTCCTGAGGAGAAAGCTAGTGTTTTCAAACTTCAGGAACGGGAAATTGACCACTCTGAACAATTTTT
GGCGAACTCGATATTATTCAAAAATGCACGAATAAAAGGATGGGTCCTAGAGATGGGTCCATGGTTTTATGACAAGGCGATGCTATTATTCGAGGAACCAAAAGGAGACA
TCTGCAGCGAGGAAATGGAGTTCAGGGCATCCGCGGAGGCTATTGGAGGATTGCTCGGGAAGGTTGAAAAAGTGGACATAGGGGACGACTCAGATCCGGATTGGGGGAGA
TCATTGAGGATCAAAATTCAACTCAACGTCCGGAGACCTTTGAAGCGGGGAATTTTCTTGCAATCAGAGAGCTCAGGGAGAGACAAGTGGATTCCCATTACATATGAGAA
GCTCCCGGATTTTTGTTATGGCTGCGGTCGGTTGGGTCATACTATAAAAGAATGTGAGGATCAAATTAGCTCATCGGAAGAGGAGATGTTGTATGGCCCTACCCTTCGTG
AACCGGCCAGATTAAAATCTGAGGAGGCTGAATTTGCTACAAGAAATATCCAAGGCCGGGGGAGAGGAAGAGGAGGGTATGGAGGACGAGGTGGGTGGCGATACGTGGAC
GTTGTACAGGAGGAGGCTGAAAATCAAGAACCCGACAAACTGCATCCAGAGAAGGAAGAGAGGGTGCCGCAGGTCGAAGCTCATGGGCTGCCAGTGCAGGCGGAGGAGTC
TCCACCGCCGGTTGACAATCGGCACAGCACAACGGTTGAATTTTCCAAAAAGGAAAGAAGAGTGGGAATGGAAAATTCAATAAAGGCTAACGGCGAAATTTTAGGAATTT
CTGAGGTAGAGACCGTTGGGGACGAAATCAACGGAGAGTTAGGAATCTCAGCTTTAAATGGGGAGAGAACAGAGGTGTGTGCCAACGATTTCCTACAATACAGCTCACAA
ATGGAAATTGATTCCCACCAACAAGTGGCGGAAGATAATGAGAAGGATCCCAACTCTCAGGTGTTTAAAGATTCATTGAATGATACCAATGAACCCAAATGTGGGCCCAC
AAAGATTGGAACAATAAAGACCAAGGGCTGGAAAAGAATACAGAGGGATAATAAAACAAATGATGTTAGTGCTGAGTACTCCACGAAGAGATTCACTGGCATATATGGAA
ATCCGGTTAGAAGTTTGCATTCTGAAACCTGGACGTTAATGAAACGGTTGGGTGATCAGCTAGATATGGCATGGGTCCTTGGAGGCGACTTTAACGAGATCATTGACAAT
GGAGAAAAGTATGGCGGTTTAGACAGGAATGAAGCAGACATGAGGGATTTCCGGGATGCTATTGACTACTGTGAGGTTTTCGACCCTGGCTTCAATGGACCTAAATATAC
ATGGTGTAATAACCACGTTAATCGAGACAGGATATGGGAGAGACTCGATCGTTTTCTCATTAACCCGAGGATGAACGGATGGTGCAGTGTCTTTAAAGTATTCCATCTAG
CTAGGATTGCGTCAGATCACAGACCCATTCTGGCAGAATGGAAGGAAGAACCTCCAGATTGTAGAAACCACAAGGGCATTCGTCCAAGACGATTCGAGGAGGTCTGGACA
AAGTATGCAGAATGTAAAGATATTGTAAAGCACGTGTGGAGTGAGAGGTCGCGCAATACAAATACTGAAACTAACAGAGATGCACATCAATATTCAGTAGCCCATTTTAT
CAAACCTACGGGGGTGTGGGATGAGGCGTTAGTAAAGGATATGTTTCTGGAGAGTGATGCAGAAGCTATTCTTAACATCCCCCTGAGCTCAATGCATAGAGAAGATACAA
TTATATGGCACTATGACTCTAAAGGTTTCTTTACGGTGAAGAGTGCATACAGATTGGGGATTCATCTTCAAAGTTCCAATGCTGCATCAGGATCGTGCCTATGCCAGAAG
GAAGATCTGTGGAAAGAATACTGGAAAACTCCCATCCTCCCTAAAATTAAAGTTTGTGGGTGGAGAATTTATCATGATATTCTCCCAACTCGCACAAACCTAATTAAAAA
GGGGATGGAGGTGGATCATATGTGTTTGTTGTGCAGGGATAAACCAGAAACAATCCAACATCTTCTGTGGGAGTGTAAAATTACTAAAGGTATGTGGGCTAAATTTTCCC
CTCTTCCTAATGTTTTCTTTCTCTCTAACAGGAAGGGATGGGCGGCTGCTGACTACTGCGAAGTGATGTGGAGAGGGGGCAGCGAAGATGCAGGGAAGGAAGGAAATTTC
GCGAAAAGTACAATAGTATGTTGGCAAATATGGTCATACAGAAACACCATCCTTCATACCAATCAGAATCCAAACAAAATTAACCTCCAGCAGCAAGTTGAGAAGGCGTT
GAACTTACTTACCCGAGGGGAAGGAGAGCCGACGGACAGACGCGAACCCCAGCCGCCGAGTGGAAGTGCTCCGCACGAGCCTCGTCCCTGTCTCAGATGGGTTCTTCCGC
CGGCGGGCTATTGGAAGCTAAATTGCGATGGATCGTGGAATGAAGAGAAGAAAATCGGAGGAGTTGGCTGGGTCATCAGATGTTGGACAGGAAGCATCGTTACTGCAGGA
TTCCGAGCAATCCACAAATCGTGGCATATTAGTTGGCTTGAAGCCATTGCTGTGATGGAAGGAATACGATCGATCCCCCATGACTCTCCCAGGCTGTTTATCGAGCTTGA
TTCCATTCAGGTGGTAAACTTGCTTGAGGGAAAGGAGGAAGATCTCACTGACCTGGCAAAGTTCATTGATGAAGCCAGAAGTTCCCTTCCTGTGTGCCAGCCTCATTCGT
TTTTTTACGTGCCTCGGCTGCAAAACCAATTGGCCCACCATCTGGCCCGTCGGGCTTGGGCCACGGACTCATCTGAAAGTTGGAGTAGGAATTTTCCCAATTGGTTGTTA
CATTTAAACTCTCAGGATATTGGGCCTTCGCCCATCAGTTATGGGGGCTCCTGTCCCATGGCTGGTAGCCTTTCGGGAGCTATTGCTCTTCCTTAA
Protein sequenceShow/hide protein sequence
MEEDLAKQLADLRVTPEEKASVFKLQEREIDHSEQFLANSILFKNARIKGWVLEMGPWFYDKAMLLFEEPKGDICSEEMEFRASAEAIGGLLGKVEKVDIGDDSDPDWGR
SLRIKIQLNVRRPLKRGIFLQSESSGRDKWIPITYEKLPDFCYGCGRLGHTIKECEDQISSSEEEMLYGPTLREPARLKSEEAEFATRNIQGRGRGRGGYGGRGGWRYVD
VVQEEAENQEPDKLHPEKEERVPQVEAHGLPVQAEESPPPVDNRHSTTVEFSKKERRVGMENSIKANGEILGISEVETVGDEINGELGISALNGERTEVCANDFLQYSSQ
MEIDSHQQVAEDNEKDPNSQVFKDSLNDTNEPKCGPTKIGTIKTKGWKRIQRDNKTNDVSAEYSTKRFTGIYGNPVRSLHSETWTLMKRLGDQLDMAWVLGGDFNEIIDN
GEKYGGLDRNEADMRDFRDAIDYCEVFDPGFNGPKYTWCNNHVNRDRIWERLDRFLINPRMNGWCSVFKVFHLARIASDHRPILAEWKEEPPDCRNHKGIRPRRFEEVWT
KYAECKDIVKHVWSERSRNTNTETNRDAHQYSVAHFIKPTGVWDEALVKDMFLESDAEAILNIPLSSMHREDTIIWHYDSKGFFTVKSAYRLGIHLQSSNAASGSCLCQK
EDLWKEYWKTPILPKIKVCGWRIYHDILPTRTNLIKKGMEVDHMCLLCRDKPETIQHLLWECKITKGMWAKFSPLPNVFFLSNRKGWAAADYCEVMWRGGSEDAGKEGNF
AKSTIVCWQIWSYRNTILHTNQNPNKINLQQQVEKALNLLTRGEGEPTDRREPQPPSGSAPHEPRPCLRWVLPPAGYWKLNCDGSWNEEKKIGGVGWVIRCWTGSIVTAG
FRAIHKSWHISWLEAIAVMEGIRSIPHDSPRLFIELDSIQVVNLLEGKEEDLTDLAKFIDEARSSLPVCQPHSFFYVPRLQNQLAHHLARRAWATDSSESWSRNFPNWLL
HLNSQDIGPSPISYGGSCPMAGSLSGAIALP