; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G15890 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G15890
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRibonuclease H
Genome locationChr6:14297016..14298647
RNA-Seq ExpressionCSPI06G15890
SyntenyCSPI06G15890
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]8.6e-22484.99Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----
        YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALT+PDD                     EVNMATSYLIDEEDWRQPIIE     
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----

Query:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
         LP  SR                        +G F          K LKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
Subjt:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI

Query:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
        HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKK NVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
Subjt:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF

Query:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
        KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
Subjt:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR

Query:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK
        LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLA+RR IITTRHTGNKFTPKWDGPYIVK+
Subjt:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK

XP_031737372.1 uncharacterized protein LOC116402244 [Cucumis sativus]8.6e-22484.99Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----
        YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALT+PDD                     EVNMATSYLIDEEDWRQPIIE     
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----

Query:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
         LP  SR                        +G F          K LKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
Subjt:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI

Query:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
        HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKK NVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
Subjt:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF

Query:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
        KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
Subjt:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR

Query:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK
        LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLA+RR IITTRHTGNKFTPKWDGPYIVK+
Subjt:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]8.6e-22484.99Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----
        YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALT+PDD                     EVNMATSYLIDEEDWRQPIIE     
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----

Query:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
         LP  SR                        +G F          K LKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
Subjt:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI

Query:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
        HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKK NVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
Subjt:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF

Query:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
        KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
Subjt:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR

Query:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK
        LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLA+RR IITTRHTGNKFTPKWDGPYIVK+
Subjt:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]8.6e-22484.99Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----
        YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALT+PDD                     EVNMATSYLIDEEDWRQPIIE     
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----

Query:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
         LP  SR                        +G F          K LKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
Subjt:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI

Query:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
        HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKK NVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
Subjt:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF

Query:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
        KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
Subjt:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR

Query:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK
        LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLA+RR IITTRHTGNKFTPKWDGPYIVK+
Subjt:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK

XP_031742888.1 uncharacterized protein LOC116404510 [Cucumis sativus]8.6e-22484.99Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----
        YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALT+PDD                     EVNMATSYLIDEEDWRQPIIE     
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----

Query:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
         LP  SR                        +G F          K LKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
Subjt:  ALPSMSR------------------------KGRF---------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI

Query:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
        HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKK NVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
Subjt:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF

Query:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
        KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
Subjt:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR

Query:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK
        LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLA+RR IITTRHTGNKFTPKWDGPYIVK+
Subjt:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK

TrEMBL top hitse value%identityAlignment
A0A5A7SPV8 Ribonuclease H3.2e-20076.53Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----
        YDVKHEDLKPYF YARQLME+FD+VML+HVPR ENKRADALANLATAL +PD+                     E N+  S+LI+EEDW QPIIE     
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE-----

Query:  ALPSMSR---------------KG----RF--------------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI
         LP  SR               KG    RF               K L+E HAGVCGAHQSG KLQFQLRRMGYYWPKM+QDS+DY KKCE CQYHANFI
Subjt:  ALPSMSR---------------KG----RF--------------SKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFI

Query:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF
        HQP EPLHP+VASW FEAWGLDLVGPITPKSSAGHSYILAAT+YFS+WAEAI LREAKK NV +FIRTHIIYRYGIPHRIVTDNG+QFSNSM+DKLCEKF
Subjt:  HQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKF

Query:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR
        KF+QYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQE+I EALWAYRTTHRT T VTPYSLVYGVE  LPLEREIPSLRM VQEGLTTEDNVKLR
Subjt:  KFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLR

Query:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK
        LQELE LDEKRLEAQQ LECYQARMSKAFDKHVKPRSFQVGDLVLA+RR IITTRH GNKFTPKWDGPYIVK+
Subjt:  LQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK

A0A5A7T485 Reverse transcriptase2.9e-20985.78Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDDEVNMATSYLIDEEDWRQPIIE-----ALPSMSR---KGRFSKTLKEV
        YDVKHE+LKPYF YARQLME FD+VMLEHVPR+ENKRADALANLATAL +PD+EVN+ TS+LIDEED RQ IIE      LP  SR   +    K L+E 
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDDEVNMATSYLIDEEDWRQPIIE-----ALPSMSR---KGRFSKTLKEV

Query:  HAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEA
        HAGVCGAHQSGPKLQFQLRRM YYWPKM+QDS+DY KKCE CQYHANFIHQP EPLHPT+ASWPFEAWGLDLVGPITPKSSAGH YILAATDYFS+W EA
Subjt:  HAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEA

Query:  ISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAY
        I LREAKK NVA+FIRTHIIYRYGIPHRIVTDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAAANGLAE F+KTLCNLLKKIV KSKRDWQE+IGEALWAY
Subjt:  ISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAY

Query:  RTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLI
        RTTHRTPT VTPYSLVYG++AVLPLEREIPSLRMAVQE LT EDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLA+RR I
Subjt:  RTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLI

Query:  ITTRHTGNKFTPKWDGPYIVKK
        +TTRHTGNKFTPKWDGPYIVK+
Subjt:  ITTRHTGNKFTPKWDGPYIVKK

A0A5D3BV77 Reverse transcriptase2.9e-20985.78Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDDEVNMATSYLIDEEDWRQPIIE-----ALPSMSR---KGRFSKTLKEV
        YDVKHE+LKPYF YARQLME FD+VMLEHVPR+ENKRADALANLATAL +PD+EVN+ TS+LIDEED RQ IIE      LP  SR   +    K L+E 
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDDEVNMATSYLIDEEDWRQPIIE-----ALPSMSR---KGRFSKTLKEV

Query:  HAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEA
        HAGVCGAHQSGPKLQFQLRRM YYWPKM+QDS+DY KKCE CQYHANFIHQP EPLHPT+ASWPFEAWGLDLVGPITPKSSAGH YILAATDYFS+W EA
Subjt:  HAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEA

Query:  ISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAY
        I LREAKK NVA+FIRTHIIYRYGIPHRIVTDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAAANGLAE F+KTLCNLLKKIV KSKRDWQE+IGEALWAY
Subjt:  ISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAY

Query:  RTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLI
        RTTHRTPT VTPYSLVYG++AVLPLEREIPSLRMAVQE LT EDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLA+RR I
Subjt:  RTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLI

Query:  ITTRHTGNKFTPKWDGPYIVKK
        +TTRHTGNKFTPKWDGPYIVK+
Subjt:  ITTRHTGNKFTPKWDGPYIVKK

A0A5D3C8N8 Ribonuclease H3.4e-20280.54Show/hide
Query:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIEAL--P
        YDVKHEDLKPYF YARQLME+FD+VMLEHVPR+ENKRAD L NLATAL + D+                     EVN+ TS+LID+EDWRQPIIE L   
Subjt:  YDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIEAL--P

Query:  SMSRKGRFS------KTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPK
         +S+  R        K L+E HAGVCGAHQS  KLQFQLRRM YYWPKM+QDS+DY KK +  QYHANFIHQPPEP HPTVASWPFEAWGLDL GPITPK
Subjt:  SMSRKGRFS------KTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPK

Query:  SSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLK
        SS GHSYILAAT YFS+WAE I LREAKK NVA+FIRTHIIYRYGIPHRI+TDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLK
Subjt:  SSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLK

Query:  KIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFD
        KIVSK  RDWQE+IGEALWAYRTTHRTP GVTPYSLVYGVE VLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQAL+CYQARMSKAFD
Subjt:  KIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFD

Query:  KHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVK
        KHVKPRSFQV DLVLA+RR IITTRHTGNKFTPKWDGPYIVK
Subjt:  KHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVK

A0A5D3D1E5 Ribonuclease H2.6e-19771.16Show/hide
Query:  HTYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE---
        + Y+VKH+DLKPYF+YAR+LM++FD+++LEH+PR ENK+ADALANLATALT+ +D                     E ++ + Y IDEEDWRQPII+   
Subjt:  HTYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDD---------------------EVNMATSYLIDEEDWRQPIIE---

Query:  --ALPSMSR---------------------------------KGRFSKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHAN
           LP+  R                                 K   +K L+E H+G+CGAHQSGPKLQ+QL+RMGYYWP MI DS+ + K CE CQ+HAN
Subjt:  --ALPSMSR---------------------------------KGRFSKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHAN

Query:  FIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCE
        FIHQPPEPLHPT+ASWPFEAWGLDLVGPITPKS+AGHSYILA TDYFS+WAEA+ LREAKK N+ +F++THIIYRYGIPHRIVTDNG+QF+N++MDKLCE
Subjt:  FIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCE

Query:  KFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVK
        KF FKQ+KSSMYNAAANGLAEAFNKTLC+LLKK+VSK+KRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMA+QEGLTTEDN +
Subjt:  KFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVK

Query:  LRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK
        LRLQELEALDEKRLEAQQALECYQARMSKAFDK V+PRSFQVGDLVLA+RR IITTRHTGNKFTPKWDGPYIVK+
Subjt:  LRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKK

SwissProt top hitse value%identityAlignment
A4FUB7 Gypsy retrotransposon integrase-like protein 12.6e-2125.67Show/hide
Query:  KTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYF
        K L+E H    GAH  G      L    YYW  +  D   +V  C+ CQ   N +   P+  H      P+    +DL+GP    S+  H Y +  TD F
Subjt:  KTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYF

Query:  SRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIG
        ++W   + L +   + ++  I  +I + YG P +I+ D   +F + +  +LCE F  KQ   S  +   N  AE+   T+   L K       DW + + 
Subjt:  SRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIG

Query:  EALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKL--RLQELEALDEKRLEAQQALE---CYQARMSKAF----DKHVKPR
           +A+  TH  PT  TPY  ++     +P   +I  +     +G  T    K+   ++E + + E +  +   +E   C++   SK       K   P 
Subjt:  EALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKL--RLQELEALDEKRLEAQQALE---CYQARMSKAF----DKHVKPR

Query:  SFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV
          +VG  VL  R+          +F  +W GP ++
Subjt:  SFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV

P03360 Gag-Pol polyprotein (Fragment)1.7e-1727.24Show/hide
Query:  PFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAA
        P E W +D    IT K   G+ Y+L   D FS W EA   +      V   +   II R+G+P +I +DNG  F   +  +LCE           Y   +
Subjt:  PFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAA

Query:  NGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEA
        +G  E  N+TL   + K+  ++  DW   + +AL   R T     G++P+ ++YG++  +     +P +       +T +      L+ L+AL   R  A
Subjt:  NGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEA

Query:  QQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV
        +  L   + ++ +   +  +   FQ GDLV          +H   +  P+WDGPY V
Subjt:  QQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV

Q5DTZ0 Protein NYNRIN3.7e-1224.86Show/hide
Query:  RQPIIEALPSMSRKGRFSKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIH------QPPEPLHPTVASWPFEAWGL
        R P +  +P   R+      +  VH    G HQ        +R +G +WP M     DY + C  C    N I       + P PL  T    P+ +  +
Subjt:  RQPIIEALPSMSRKGRFSKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIH------QPPEPLHPTVASWPFEAWGL

Query:  DLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMM--------DKLCEKFKFKQYKSSMYNAA
        ++VGP+T  S  GH ++L   D  +RW EA  L+      VA  +  H+  R+G+P R+    G QF+  ++         ++    +  Q+   M + A
Subjt:  DLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMM--------DKLCEKFKFKQYKSSMYNAA

Query:  ANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLE
              A        LK+ +    + W   +     A+R    + T  TP+ ++ G E  L +E     +  A  EGL  +  +   ++EL  LD     
Subjt:  ANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLE

Query:  AQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV
        A++A E  +   ++ F +  +   + VGD VL    L+   R   N  + KW GP+ +
Subjt:  AQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV

Q9P2P1 Protein NYNRIN5.8e-1325.42Show/hide
Query:  RQPIIEALPSMSRKGRFSKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIH------QPPEPLHPTVASWPFEAWGL
        ++P +  +P+  R+      +  VH    GAHQ   +   +LR +G +WP M +   DY + C  C    N I       + P PL  T    P+    +
Subjt:  RQPIIEALPSMSRKGRFSKTLKEVHAGVCGAHQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIH------QPPEPLHPTVASWPFEAWGL

Query:  DLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYN-----AAANG
        ++VGP+T  S  GH ++L   D  +RW EA  L+      VA  +  H+  R+G+P R+    G QF+  ++   C      Q  S   +       ++G
Subjt:  DLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYN-----AAANG

Query:  LAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVL--PLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDE-KRLE
            F +     LK+ +    + W   +     A+R    + T  TP+ ++ G E+ L  PL  E+ S  +   EGL      K+ +  L+ + E   L 
Subjt:  LAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVL--PLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDE-KRLE

Query:  AQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV
         + A +  +   ++ F +  + + + VGD VL    L+   R   N  + KW GP+ +
Subjt:  AQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIV

Q9TTC1 Gag-Pol polyprotein8.9e-1425.08Show/hide
Query:  HQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAK
        H    KL   + R  ++ P +     +   KC+ C    N +    EP        P   W +D    + P    G+ Y+L   D FS W EA   +   
Subjt:  HQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAK

Query:  KANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKI-VSKSKRDWQEKIGEALWAYRTTHRT
           V   I   I+ R+GIP  + +DNG  F   +   L  +          Y   ++G  E  N+T+   L K+ +    +DW   +  AL   R T   
Subjt:  KANVADFIRTHIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKI-VSKSKRDWQEKIGEALWAYRTTHRT

Query:  PTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEA-QQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRH
          G+TPY +++G    +    E+           +  D   +    L+AL+  R +   Q  E Y+            P  FQVGD VL         RH
Subjt:  PTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEA-QQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRH

Query:  TGNKFTPKWDGPYIV
              P+W GPY+V
Subjt:  TGNKFTPKWDGPYIV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAAGTCTTCATGTTTCACGTCATACTTATGACGTGAAACATGAAGACTTGAAGCCATATTTTGCTTATGCTCGACAACTGATGGAAAAGTTTGATAATGTGAT
GTTAGAACATGTCCCTAGAGTAGAAAATAAGAGAGCGGATGCATTGGCAAATTTAGCCACGGCCTTGACCATTCCAGATGATGAAGTGAACATGGCAACATCCTATTTGA
TTGATGAAGAAGATTGGCGTCAACCCATCATAGAGGCTCTTCCTTCGATGTCTCGAAAAGGAAGATTCAGTAAAACTCTAAAGGAAGTACATGCAGGTGTTTGTGGAGCA
CATCAATCGGGACCAAAGCTTCAATTCCAGCTAAGAAGAATGGGCTACTACTGGCCTAAGATGATCCAAGATTCAATAGACTATGTGAAGAAGTGTGAGCCTTGTCAATA
CCATGCAAACTTCATACACCAACCTCCAGAACCTCTTCATCCAACTGTGGCTTCTTGGCCTTTTGAGGCTTGGGGACTCGATCTGGTTGGCCCCATTACACCAAAATCAT
CAGCAGGACATTCTTATATCCTAGCAGCAACAGACTATTTTTCAAGGTGGGCTGAGGCCATTTCCTTGAGAGAAGCCAAGAAGGCGAACGTGGCAGACTTTATTCGAACA
CACATCATCTATCGATACGGTATTCCACATCGAATCGTGACGGATAATGGAAAGCAATTCTCCAATAGTATGATGGACAAGTTATGTGAAAAATTCAAATTCAAGCAATA
TAAGTCATCCATGTACAACGCAGCTGCGAATGGACTAGCAGAAGCATTCAACAAAACGTTGTGTAATCTTTTAAAGAAAATTGTCTCCAAGTCAAAGAGGGATTGGCAAG
AAAAGATCGGCGAGGCATTATGGGCTTATCGGACGACTCATCGCACCCCTACAGGGGTTACACCATATTCGCTTGTTTACGGTGTGGAGGCTGTCCTTCCTCTCGAAAGG
GAAATTCCGTCACTAAGAATGGCAGTACAAGAGGGATTGACTACCGAAGATAATGTGAAGTTACGTCTTCAAGAATTAGAAGCACTTGACGAAAAGCGATTAGAGGCTCA
GCAAGCATTGGAATGTTATCAAGCGAGAATGTCCAAAGCTTTTGATAAACACGTTAAACCTCGCTCCTTTCAAGTTGGTGATCTAGTACTTGCCATAAGAAGACTGATCA
TCACAACAAGGCATACAGGAAATAAGTTCACACCTAAATGGGATGGACCCTACATTGTTAAAAAAAGTTTATATAAATGGCGCATGCAAGATCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAAGTCTTCATGTTTCACGTCATACTTATGACGTGAAACATGAAGACTTGAAGCCATATTTTGCTTATGCTCGACAACTGATGGAAAAGTTTGATAATGTGAT
GTTAGAACATGTCCCTAGAGTAGAAAATAAGAGAGCGGATGCATTGGCAAATTTAGCCACGGCCTTGACCATTCCAGATGATGAAGTGAACATGGCAACATCCTATTTGA
TTGATGAAGAAGATTGGCGTCAACCCATCATAGAGGCTCTTCCTTCGATGTCTCGAAAAGGAAGATTCAGTAAAACTCTAAAGGAAGTACATGCAGGTGTTTGTGGAGCA
CATCAATCGGGACCAAAGCTTCAATTCCAGCTAAGAAGAATGGGCTACTACTGGCCTAAGATGATCCAAGATTCAATAGACTATGTGAAGAAGTGTGAGCCTTGTCAATA
CCATGCAAACTTCATACACCAACCTCCAGAACCTCTTCATCCAACTGTGGCTTCTTGGCCTTTTGAGGCTTGGGGACTCGATCTGGTTGGCCCCATTACACCAAAATCAT
CAGCAGGACATTCTTATATCCTAGCAGCAACAGACTATTTTTCAAGGTGGGCTGAGGCCATTTCCTTGAGAGAAGCCAAGAAGGCGAACGTGGCAGACTTTATTCGAACA
CACATCATCTATCGATACGGTATTCCACATCGAATCGTGACGGATAATGGAAAGCAATTCTCCAATAGTATGATGGACAAGTTATGTGAAAAATTCAAATTCAAGCAATA
TAAGTCATCCATGTACAACGCAGCTGCGAATGGACTAGCAGAAGCATTCAACAAAACGTTGTGTAATCTTTTAAAGAAAATTGTCTCCAAGTCAAAGAGGGATTGGCAAG
AAAAGATCGGCGAGGCATTATGGGCTTATCGGACGACTCATCGCACCCCTACAGGGGTTACACCATATTCGCTTGTTTACGGTGTGGAGGCTGTCCTTCCTCTCGAAAGG
GAAATTCCGTCACTAAGAATGGCAGTACAAGAGGGATTGACTACCGAAGATAATGTGAAGTTACGTCTTCAAGAATTAGAAGCACTTGACGAAAAGCGATTAGAGGCTCA
GCAAGCATTGGAATGTTATCAAGCGAGAATGTCCAAAGCTTTTGATAAACACGTTAAACCTCGCTCCTTTCAAGTTGGTGATCTAGTACTTGCCATAAGAAGACTGATCA
TCACAACAAGGCATACAGGAAATAAGTTCACACCTAAATGGGATGGACCCTACATTGTTAAAAAAAGTTTATATAAATGGCGCATGCAAGATCATTGA
Protein sequenceShow/hide protein sequence
MASSLHVSRHTYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTIPDDEVNMATSYLIDEEDWRQPIIEALPSMSRKGRFSKTLKEVHAGVCGA
HQSGPKLQFQLRRMGYYWPKMIQDSIDYVKKCEPCQYHANFIHQPPEPLHPTVASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSRWAEAISLREAKKANVADFIRT
HIIYRYGIPHRIVTDNGKQFSNSMMDKLCEKFKFKQYKSSMYNAAANGLAEAFNKTLCNLLKKIVSKSKRDWQEKIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
EIPSLRMAVQEGLTTEDNVKLRLQELEALDEKRLEAQQALECYQARMSKAFDKHVKPRSFQVGDLVLAIRRLIITTRHTGNKFTPKWDGPYIVKKSLYKWRMQDH