CuGenDBv2

Lag0032853 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032853
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: IRK-interacting protein-like
Genome location: chr11:38219693..38222745
RNA-Seq Expression: Lag0032853
Synteny: Lag0032853
Gene Ontology terms:
GO:0016310 - phosphorylation (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0016301 - kinase activity (molecular function)
InterPro domains:
IPR001363 - Proteinase inhibitor I25C, fetuin, conserved site
IPR021109 - Aspartic peptidase domain superfamily
IPR042316 - IRK-interacting protein-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579443.1 IRK-interacting protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.1e-232; identity 86.97%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS
        MA PSPTS +SS  S S APT HHSRSHFTPIQECEREE+E++ AA NRT+N AA    DRGTSP  H TPLNR          +TVK+RSESESDSVSS
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS

Query:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL
        SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTR+SPKSINESSALTAREEQWR AVTELSQKLVQ TRKRDEAIMEASRL
Subjt:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL

Query:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
        KYAMAELEKKL+KLETYCHSLKSG+EECS   GNSPCQIG Y++IQ   QMNQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
Subjt:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY

Query:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW
        D+KISFSKNPRSLLFYLEALLN+AFFEDFESVGFQKNASTQ+LNPIERC+ANFGCFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAW
Subjt:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW

Query:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA
        PEPLLQAFFGASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDM G+KARKLIPSVVRIM+APGFYVYGSVVKCKVLCRYNAA + A
Subjt:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA

XP_004147608.3 IRK-interacting protein [Cucumis sativus]; e value 2.0e-231; identity 86.21%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG
        MAAPSPTS  SS              SHFTPIQEC+REE+E+ SA    T      DRGTSPKH+PTPLN    R+KNGKSQT+K+RSESESDSVSSSDG
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG

Query:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA
        PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQ TRKRDEA+MEASRLKYA
Subjt:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA

Query:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK
        MAELEKKLDKLETYCHSLKSG+EECS   GNSPCQIGKYN+IQSF Q NQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYD+K
Subjt:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK

Query:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP
         SFSKNPRS+LFYLEALLN+AFFEDFES+GFQKNASTQVLNPIERCEANF CFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAWPEP
Subjt:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP

Query:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAATLSAS
        LLQAFF ASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDMGGDKARKLIPS+VRIM+APGFYVYGSVVKCKVLCRYNAA + AT +A+
Subjt:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAATLSAS

XP_022922117.1 IRK-interacting protein-like [Cucurbita moschata]; e value 2.4e-232; identity 86.97%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS
        MA PSPTS +SS  S S APT HHSRSHFTPIQECEREE+E++ AA NRT+N AA    DRGTSP  H TPLNR          +TVK+RSESESDSVSS
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS

Query:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL
        SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTR+SPKSINESSALTAREEQWR AVTELSQKLVQ TRKRDEAIMEASRL
Subjt:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL

Query:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
        KYAMAELEKKL+KLETYCHSLKSG+EECS   GNSPCQIG Y++IQ   QMNQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
Subjt:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY

Query:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW
        D+KISFSKNPRSLLFYLEALLN+AFFEDFESVGFQKNASTQ+LNPIERC+ANFGCFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAW
Subjt:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW

Query:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA
        PEPLLQAFFGASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDM G+KARKLIPSVVRIM+APGFYVYGSVVKCKVLCRYNAA + A
Subjt:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA

XP_023549684.1 IRK-interacting protein-like [Cucurbita pepo subsp. pepo]; e value 2.2e-230; identity 86.56%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS
        MA PSPT SF SSPS S A   HHSR HFTPIQECEREE+E++ AA NRT+N AA    DRGTSP  H TPLNR          +TVK+RSESESDSVSS
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS

Query:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL
        SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTR+SPKSINESSALTAREEQWR AVTELSQKLVQ TRKRDEAIMEASRL
Subjt:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL

Query:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
        KYAMAELEKKL+KLETYCHSLKSG+EECS    NSPCQIG Y++IQ   QMNQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
Subjt:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY

Query:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW
        D+KISFSKNPRSLLFYLEALLN+AFFEDFESVGFQKNASTQ+LNPIERC+ANFGCFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAW
Subjt:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW

Query:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA
        PEPLLQAFFGASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDM G+KARKLIPSVVRIM+APGFYVYGSVVKCKVLCRYNAA + A
Subjt:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA

XP_038875926.1 IRK-interacting protein [Benincasa hispida]; e value 3.1e-232; identity 86.92%
Query:  AAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA-----ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVS
        AAPSPT SF+S    SPAPT HH  S FTPIQE ERE +E+     NRT+N AA      DRGTSPKHH TPLN    R+KNGKS TVK+RSESESDSVS
Subjt:  AAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA-----ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVS

Query:  SSDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASR
        SSDGPVSCNRCRPH REKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQ TRKRDEAIMEASR
Subjt:  SSDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASR

Query:  LKYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQP
        LKYAMAELEKKLDKLETYCH LKSG+EECS   GNSPCQIG YN+IQSF Q NQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQP
Subjt:  LKYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQP

Query:  YDVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRA
        YDVKISFSKNPRS+LFYLEALLN+AFFEDFES+GFQKNASTQVLNPIERCE NF CFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRA
Subjt:  YDVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRA

Query:  WPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAATLSAS
        WPEPLLQAFF ASKSV+LLHLLANSVHPNLPIFRV+KEA+FDSVYMEDMGGDKARKLIPS+VRIM+APGFYVYGSVVKCKVLCRYNAA +AAT +AS
Subjt:  WPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAATLSAS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMS3 Uncharacterized protein; e value 9.9e-232; identity 86.21%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG
        MAAPSPTS  SS              SHFTPIQEC+REE+E+ SA    T      DRGTSPKH+PTPLN    R+KNGKSQT+K+RSESESDSVSSSDG
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG

Query:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA
        PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQ TRKRDEA+MEASRLKYA
Subjt:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA

Query:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK
        MAELEKKLDKLETYCHSLKSG+EECS   GNSPCQIGKYN+IQSF Q NQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYD+K
Subjt:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK

Query:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP
         SFSKNPRS+LFYLEALLN+AFFEDFES+GFQKNASTQVLNPIERCEANF CFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAWPEP
Subjt:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP

Query:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAATLSAS
        LLQAFF ASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDMGGDKARKLIPS+VRIM+APGFYVYGSVVKCKVLCRYNAA + AT +A+
Subjt:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAATLSAS

A0A1S3ATV6 IRK-interacting protein; e value 1.7e-228; identity 86.07%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG
        MAAPSPTS  SS              SHFTPIQEC+REE+E+ SAAA         D GTSPK +PTPLN    R+ NGKSQTVK+RSESESDSVSSSDG
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG

Query:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA
        PVSCNRCRPHAREKISVVPLDN+NGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQ TRKRDEA+MEASRLKYA
Subjt:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA

Query:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK
        MAELEKKLDKLETYCHSLKSG+EECS   GNSPCQIGKYN+IQSF Q NQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERIS+LLQPYD+K
Subjt:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK

Query:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP
        ISFSKNPRS+LFYLEALLN+AFFEDFES+GFQKNASTQVLNPIERCEANF CFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAWPEP
Subjt:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP

Query:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA
        LLQAFF ASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDMGGDKARKLIPS+VRIM++PGFYVYGSVVKCKVLCRYNAA ++A
Subjt:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA

A0A5A7TLI4 IRK-interacting protein; e value 1.7e-228; identity 86.07%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG
        MAAPSPTS  SS              SHFTPIQEC+REE+E+ SAAA         D GTSPK +PTPLN    R+ NGKSQTVK+RSESESDSVSSSDG
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDG

Query:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA
        PVSCNRCRPHAREKISVVPLDN+NGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQ TRKRDEA+MEASRLKYA
Subjt:  PVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYA

Query:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK
        MAELEKKLDKLETYCHSLKSG+EECS   GNSPCQIGKYN+IQSF Q NQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERIS+LLQPYD+K
Subjt:  MAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVK

Query:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP
        ISFSKNPRS+LFYLEALLN+AFFEDFES+GFQKNASTQVLNPIERCEANF CFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAWPEP
Subjt:  ISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEP

Query:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA
        LLQAFF ASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDMGGDKARKLIPS+VRIM++PGFYVYGSVVKCKVLCRYNAA ++A
Subjt:  LLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA

A0A6J1E2B7 IRK-interacting protein-like; e value 1.2e-232; identity 86.97%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS
        MA PSPTS +SS  S S APT HHSRSHFTPIQECEREE+E++ AA NRT+N AA    DRGTSP  H TPLNR          +TVK+RSESESDSVSS
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS

Query:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL
        SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTR+SPKSINESSALTAREEQWR AVTELSQKLVQ TRKRDEAIMEASRL
Subjt:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL

Query:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
        KYAMAELEKKL+KLETYCHSLKSG+EECS   GNSPCQIG Y++IQ   QMNQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
Subjt:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY

Query:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW
        D+KISFSKNPRSLLFYLEALLN+AFFEDFESVGFQKNASTQ+LNPIERC+ANFGCFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAW
Subjt:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW

Query:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA
        PEPLLQAFFGASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDM G+KARKLIPSVVRIM+APGFYVYGSVVKCKVLCRYNAA + A
Subjt:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA

A0A6J1I269 IRK-interacting protein-like; e value 4.1e-230; identity 86.15%
Query:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS
        MA PSPT SF SSPS S A   HHSR HFTPIQECEREE+E++ AA NRT+N AA    DRGTSP  H TPLNR          +TVK+RSESESDSVSS
Subjt:  MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAA---ADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSS

Query:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL
        SDGPVSCNRCRPHAREK SVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTR+SPKSINESSALTAREEQWR AVTELSQKLVQ TRKRDEAIMEASRL
Subjt:  SDGPVSCNRCRPHAREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRL

Query:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
        KYAMAELEKKL+KLETYCHSLKSG+EECS   GNSPCQIG Y++IQ   QMNQKQV+E+FLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY
Subjt:  KYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPY

Query:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW
        ++KISFSKNPRSLLFYLEALLN+A FEDFESVGFQKNASTQ+LNPIERC+ANFGCFNFLHELTWEEVL+KGTKHFSE+FSRFCDRKMSEIVAMLGWNRAW
Subjt:  DVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAW

Query:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA
        PEPLLQAFFGASKSV+LLHLLANSVHPNLPIFRVEKEA+FDSVYMEDM G+KARKLIPSVVRIM+APGFYVYGSVVKCKVLCRYNAA + A
Subjt:  PEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAA

SwissProt top hits (e value, % identity, alignment)
Q766C2 Aspartic proteinase nepenthesin-2; e value 4.5e-08; identity 59.18%
Query:  TPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG
        T LI +  +P++YY++LQGITVG   L IPSS F+L  DG+GG+IIDSG
Subjt:  TPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG

Q766C3 Aspartic proteinase nepenthesin-1; e value 2.8e-05; identity 40.79%
Query:  SIDNTTNRAIEVTPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNT-DGSGGVIIDSG-ALSIFGSMQHQNM
        S+ N+       T LIQ+   P+FYY++L G++VG   L I  S F LN+ +G+GG+IIDSG  L+ F +  +Q++
Subjt:  SIDNTTNRAIEVTPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNT-DGSGGVIIDSG-ALSIFGSMQHQNM

Q940R4 Probable aspartyl protease At4g16563; e value 3.6e-05; identity 42.86%
Query:  TPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG
        T +++NP  P FY +SLQGI++G++ +  P+   R++ +G GGV++DSG
Subjt:  TPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2; e value 6.1e-05; identity 45.83%
Query:  PLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG
        PL++NP  PSFYY+ L+G+ VG   + +P   F L   G GGV++D+G
Subjt:  PLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG

Q9LXU9 IRK-interacting protein; e value 1.2e-64; identity 40.57%
Query:  RKSPKSINESSALTAREEQWR-------AAVTELSQKLVQTTRKRDEAIMEASRLKYAMAELEKKLDKLETYCHSLKSGLEECS------GGGGNSPCQI
        ++   SI  S ++T + E+         + V +L ++L++  R RD A+ + S +K ++ EL +KL  LE+YC +LK  L E +        GG S    
Subjt:  RKSPKSINESSALTAREEQWR-------AAVTELSQKLVQTTRKRDEAIMEASRLKYAMAELEKKLDKLETYCHSLKSGLEECS------GGGGNSPCQI

Query:  GKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNAS
        GK N   S   ++++ +VE FL  VSE+R SI+   ++L  ++    + +   I+ LLQP+++  + SK  + + ++LEA+++++ ++DFE+  FQKN  
Subjt:  GKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNAS

Query:  TQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAE
         ++L+P +  +ANF  F  L  L+W EVL KGTK++S+EFSRFCD KMS I+  L W R W E +LQAFF A+K V+LLHLLA S +P L I RVE+  E
Subjt:  TQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAE

Query:  FDSVYMEDMGGDKARKLI---PSVVRIMVAPGFYVYGSVVKCKVLCRYNA
        F+S +MEDMG D+ R  +   P+ V++MV PGFYV   V++CKVLCRY +
Subjt:  FDSVYMEDMGGDKARKLI---PSVVRIMVAPGFYVYGSVVKCKVLCRYNA

Arabidopsis top hits (e value, % identity, alignment)
AT1G12330.1 unknown protein; e value 1.6e-141; identity 57.63%
Query:  SFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRA--AADRGTS--PKH-HPTPLNRVDDRS-------KNGKSQTVKRRSESESDSVS
        S SSSPS SP P+ H S  HFTPI ECE ++ +E+     R KNRA  ++D G+S  P H H    N  ++          NGK QT KR  +++ D   
Subjt:  SFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRA--AADRGTS--PKH-HPTPLNRVDDRS-------KNGKSQTVKRRSESESDSVS

Query:  SSDGPVSCNRCRPH--AREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPK-----------SINESSALTAREEQWRAAVTELSQKLVQT
           G VSCN+CRPH   R+K SVVPL+++N  +      ++SPN I KS+  SLTR+SPK           S + S+A  +REEQWR AV ELS KL+Q 
Subjt:  SSDGPVSCNRCRPH--AREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPK-----------SINESSALTAREEQWRAAVTELSQKLVQT

Query:  TRKRDEAIMEASRLKYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVG
        T+K+++A++EASRLK +MAELEKKL+KLE YCH+LKSGL+ECS    + P +   +N+          ++++ FLVSVSESRSSIR LSRSL  QLR VG
Subjt:  TRKRDEAIMEASRLKYAMAELEKKLDKLETYCHSLKSGLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVG

Query:  AKVYERISVLLQPYDVKI-SFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDR
         KVYER+S+LLQP+DVKI SF+KNP+SL+FYLEA+L+RAFFEDFE+ GFQKN ST++LNPI+RCE+N+  FN L ELTW+EVL++GTKHFSEEFSRFCDR
Subjt:  AKVYERISVLLQPYDVKI-SFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDR

Query:  KMSEIVAMLGWNRAWPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCR
        KMS++V+ML WNRAWPEPLLQAFFGASKSV+L+HLLANSV+P L IFRVEK+  FD +YME+ GG++ +    S+VR MV PGFYVYGSVVKCKV+C+
Subjt:  KMSEIVAMLGWNRAWPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCR

AT1G25510.1 Eukaryotic aspartyl protease family protein; e value 6.1e-08; identity 44.78%
Query:  ESSTSIDNTTNRAIE--VTPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG
        +S++++D  T+ + +  V PL++N    +FYYL L GI+VG + L IP S F ++  GSGG+IIDSG
Subjt:  ESSTSIDNTTNRAIE--VTPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG

AT2G03200.1 Eukaryotic aspartyl protease family protein; e value 1.1e-12; identity 29.19%
Query:  ESSTSIDNTTNRAIEVTPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG-----------------------------------
        ++  S+D    + +    L++NP  PSFYYL LQGITVG K L +  S F L  DG+GG+IIDSG                                   
Subjt:  ESSTSIDNTTNRAIEVTPLIQNPFDPSFYYLSLQGITVGQKFLLIPSSWFRLNTDGSGGVIIDSG-----------------------------------

Query:  ---------------------------------------------------ALSIFGSMQHQNMLVLHDLKKEVVSFVPTQCARL
                                                            +SIFG++Q QN  VLHDL+KE VSFVPT+C +L
Subjt:  ---------------------------------------------------ALSIFGSMQHQNMLVLHDLKKEVVSFVPTQCARL

AT2G45260.1 Plant protein of unknown function (DUF641); e value 8.8e-07; identity 40.26%
Query:  QAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKA--RKLIPSVVRIMVAPGFYVYGSVVKCKV
        QAF   +KS+++LH LA S  P   IF+V+K +EF   YME +  +     K     V +MV PGF++ GSV++ +V
Subjt:  QAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSVYMEDMGGDKA--RKLIPSVVRIMVAPGFYVYGSVVKCKV

AT5G12900.1 unknown protein; e value 8.4e-66; identity 40.57%
Query:  RKSPKSINESSALTAREEQWR-------AAVTELSQKLVQTTRKRDEAIMEASRLKYAMAELEKKLDKLETYCHSLKSGLEECS------GGGGNSPCQI
        ++   SI  S ++T + E+         + V +L ++L++  R RD A+ + S +K ++ EL +KL  LE+YC +LK  L E +        GG S    
Subjt:  RKSPKSINESSALTAREEQWR-------AAVTELSQKLVQTTRKRDEAIMEASRLKYAMAELEKKLDKLETYCHSLKSGLEECS------GGGGNSPCQI

Query:  GKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNAS
        GK N   S   ++++ +VE FL  VSE+R SI+   ++L  ++    + +   I+ LLQP+++  + SK  + + ++LEA+++++ ++DFE+  FQKN  
Subjt:  GKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVKISFSKNPRSLLFYLEALLNRAFFEDFESVGFQKNAS

Query:  TQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAE
         ++L+P +  +ANF  F  L  L+W EVL KGTK++S+EFSRFCD KMS I+  L W R W E +LQAFF A+K V+LLHLLA S +P L I RVE+  E
Subjt:  TQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAE

Query:  FDSVYMEDMGGDKARKLI---PSVVRIMVAPGFYVYGSVVKCKVLCRYNA
        F+S +MEDMG D+ R  +   P+ V++MV PGFYV   V++CKVLCRY +
Subjt:  FDSVYMEDMGGDKARKLI---PSVVRIMVAPGFYVYGSVVKCKVLCRYNA


Sequences
CDS sequence
ATGGCTGCTCCTTCTCCGACTTCTTCCTTTTCTTCTTCTCCTTCTTTCTCTCCTGCTCCGACCCACCACCATTCACGCTCCCATTTCACGCCGATTCAAGAATGCGAGAG
GGAAGAGCAGGAGGAGGATTCTGCTGCAGCCAATCGGACGAAGAACAGAGCCGCCGCGGACCGGGGAACGAGTCCGAAGCACCACCCGACGCCGCTGAATCGTGTCGATG
ACCGGAGCAAGAATGGGAAATCGCAGACGGTTAAGAGACGGTCGGAGTCGGAGTCGGATTCCGTGTCGAGTTCCGACGGGCCGGTGTCGTGCAACCGGTGCCGACCTCAT
GCGAGGGAGAAGATCTCCGTCGTTCCTCTGGACAACAACAATGGCGTTAATAAACAAACGTATTTCTCCATGGCGAGCCCTAATGGCATTTTCAAATCCCTAATTTCGTC
GTTGACGCGGAAGAGTCCGAAATCGATAAACGAATCTTCGGCCCTGACGGCTCGAGAGGAGCAATGGAGGGCCGCCGTGACGGAGCTCTCGCAGAAGCTGGTTCAGACGA
CGAGGAAGAGAGACGAGGCTATTATGGAAGCTTCTAGGTTAAAATACGCCATGGCCGAATTGGAGAAGAAACTCGACAAACTCGAGACGTATTGCCACAGTTTGAAATCC
GGACTCGAAGAATGCAGCGGCGGCGGCGGGAATTCGCCTTGCCAAATTGGGAAGTATAATGAAATCCAGAGTTTTCACCAGATGAATCAGAAGCAAGTAGTCGAAAATTT
CTTAGTTTCAGTATCGGAATCTCGATCGTCGATCCGGCTTCTCAGCCGGTCACTCACTCTGCAACTCCGCCACGTCGGAGCCAAAGTATACGAAAGAATCTCCGTTCTTC
TTCAACCTTACGACGTCAAAATCTCGTTCTCGAAGAACCCTAGAAGCTTGCTCTTCTACCTGGAAGCCCTTCTGAACCGAGCTTTCTTCGAGGATTTCGAATCGGTAGGG
TTTCAGAAGAACGCCTCGACTCAGGTTCTCAATCCGATTGAAAGATGCGAAGCGAATTTCGGATGTTTCAATTTCCTCCATGAATTAACGTGGGAGGAGGTTCTAACGAA
AGGAACGAAGCATTTCAGCGAGGAATTCAGCCGGTTCTGCGACCGGAAAATGAGCGAGATCGTGGCGATGCTGGGATGGAACAGAGCCTGGCCGGAGCCGCTGCTGCAGG
CGTTCTTCGGCGCGTCGAAGAGCGTGTTTCTTCTGCACCTCCTGGCCAACTCCGTTCATCCGAACCTTCCGATTTTCAGAGTGGAGAAAGAAGCCGAATTCGATTCCGTG
TACATGGAGGACATGGGCGGCGACAAGGCCAGAAAGCTGATTCCGTCGGTGGTCAGAATCATGGTCGCCCCTGGTTTCTACGTTTACGGCAGCGTAGTTAAATGCAAGGT
GTTGTGCAGATACAACGCCGCCGTAAGCGCCGCCACATTGTCAGCATCCAAGGAAGATGAAGGTTGCACCCATAATTTAGAATCTTCAACAAGTATCGACAACACTACGA
ATCGAGCCATCGAGGTCACGCCACTGATACAAAATCCATTTGACCCATCTTTCTACTATCTATCCCTACAAGGAATCACAGTAGGCCAGAAGTTCTTGCTCATCCCGTCC
TCGTGGTTTAGACTAAACACCGATGGCAGCGGCGGCGTGATTATTGATTCTGGAGCTTTGTCAATTTTCGGCAGCATGCAACACCAGAATATGTTGGTTCTTCATGATCT
CAAGAAGGAAGTTGTATCATTTGTTCCCACACAGTGTGCTCGGCTCGGTAACTAG
mRNA sequence
ATGGCTGCTCCTTCTCCGACTTCTTCCTTTTCTTCTTCTCCTTCTTTCTCTCCTGCTCCGACCCACCACCATTCACGCTCCCATTTCACGCCGATTCAAGAATGCGAGAG
GGAAGAGCAGGAGGAGGATTCTGCTGCAGCCAATCGGACGAAGAACAGAGCCGCCGCGGACCGGGGAACGAGTCCGAAGCACCACCCGACGCCGCTGAATCGTGTCGATG
ACCGGAGCAAGAATGGGAAATCGCAGACGGTTAAGAGACGGTCGGAGTCGGAGTCGGATTCCGTGTCGAGTTCCGACGGGCCGGTGTCGTGCAACCGGTGCCGACCTCAT
GCGAGGGAGAAGATCTCCGTCGTTCCTCTGGACAACAACAATGGCGTTAATAAACAAACGTATTTCTCCATGGCGAGCCCTAATGGCATTTTCAAATCCCTAATTTCGTC
GTTGACGCGGAAGAGTCCGAAATCGATAAACGAATCTTCGGCCCTGACGGCTCGAGAGGAGCAATGGAGGGCCGCCGTGACGGAGCTCTCGCAGAAGCTGGTTCAGACGA
CGAGGAAGAGAGACGAGGCTATTATGGAAGCTTCTAGGTTAAAATACGCCATGGCCGAATTGGAGAAGAAACTCGACAAACTCGAGACGTATTGCCACAGTTTGAAATCC
GGACTCGAAGAATGCAGCGGCGGCGGCGGGAATTCGCCTTGCCAAATTGGGAAGTATAATGAAATCCAGAGTTTTCACCAGATGAATCAGAAGCAAGTAGTCGAAAATTT
CTTAGTTTCAGTATCGGAATCTCGATCGTCGATCCGGCTTCTCAGCCGGTCACTCACTCTGCAACTCCGCCACGTCGGAGCCAAAGTATACGAAAGAATCTCCGTTCTTC
TTCAACCTTACGACGTCAAAATCTCGTTCTCGAAGAACCCTAGAAGCTTGCTCTTCTACCTGGAAGCCCTTCTGAACCGAGCTTTCTTCGAGGATTTCGAATCGGTAGGG
TTTCAGAAGAACGCCTCGACTCAGGTTCTCAATCCGATTGAAAGATGCGAAGCGAATTTCGGATGTTTCAATTTCCTCCATGAATTAACGTGGGAGGAGGTTCTAACGAA
AGGAACGAAGCATTTCAGCGAGGAATTCAGCCGGTTCTGCGACCGGAAAATGAGCGAGATCGTGGCGATGCTGGGATGGAACAGAGCCTGGCCGGAGCCGCTGCTGCAGG
CGTTCTTCGGCGCGTCGAAGAGCGTGTTTCTTCTGCACCTCCTGGCCAACTCCGTTCATCCGAACCTTCCGATTTTCAGAGTGGAGAAAGAAGCCGAATTCGATTCCGTG
TACATGGAGGACATGGGCGGCGACAAGGCCAGAAAGCTGATTCCGTCGGTGGTCAGAATCATGGTCGCCCCTGGTTTCTACGTTTACGGCAGCGTAGTTAAATGCAAGGT
GTTGTGCAGATACAACGCCGCCGTAAGCGCCGCCACATTGTCAGCATCCAAGGAAGATGAAGGTTGCACCCATAATTTAGAATCTTCAACAAGTATCGACAACACTACGA
ATCGAGCCATCGAGGTCACGCCACTGATACAAAATCCATTTGACCCATCTTTCTACTATCTATCCCTACAAGGAATCACAGTAGGCCAGAAGTTCTTGCTCATCCCGTCC
TCGTGGTTTAGACTAAACACCGATGGCAGCGGCGGCGTGATTATTGATTCTGGAGCTTTGTCAATTTTCGGCAGCATGCAACACCAGAATATGTTGGTTCTTCATGATCT
CAAGAAGGAAGTTGTATCATTTGTTCCCACACAGTGTGCTCGGCTCGGTAACTAG
Protein sequence
MAAPSPTSSFSSSPSFSPAPTHHHSRSHFTPIQECEREEQEEDSAAANRTKNRAAADRGTSPKHHPTPLNRVDDRSKNGKSQTVKRRSESESDSVSSSDGPVSCNRCRPH
AREKISVVPLDNNNGVNKQTYFSMASPNGIFKSLISSLTRKSPKSINESSALTAREEQWRAAVTELSQKLVQTTRKRDEAIMEASRLKYAMAELEKKLDKLETYCHSLKS
GLEECSGGGGNSPCQIGKYNEIQSFHQMNQKQVVENFLVSVSESRSSIRLLSRSLTLQLRHVGAKVYERISVLLQPYDVKISFSKNPRSLLFYLEALLNRAFFEDFESVG
FQKNASTQVLNPIERCEANFGCFNFLHELTWEEVLTKGTKHFSEEFSRFCDRKMSEIVAMLGWNRAWPEPLLQAFFGASKSVFLLHLLANSVHPNLPIFRVEKEAEFDSV
YMEDMGGDKARKLIPSVVRIMVAPGFYVYGSVVKCKVLCRYNAAVSAATLSASKEDEGCTHNLESSTSIDNTTNRAIEVTPLIQNPFDPSFYYLSLQGITVGQKFLLIPS
SWFRLNTDGSGGVIIDSGALSIFGSMQHQNMLVLHDLKKEVVSFVPTQCARLGN
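
The CDS and protein sequences listed for Lag0032853 can be cross-checked against each other, and the genome location can be turned into a span length. The following is a minimal Python sketch of both checks, assuming the standard genetic code (NCBI translation table 1) and 1-based inclusive coordinates. The function names translate and span_length are illustrative and are not part of CuGenDB. Only the first 30 and last 9 nucleotides of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length of a 'chrom:start..end' span, assuming inclusive coordinates."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


if __name__ == "__main__":
    # First 30 nt and last 9 nt of the CDS listed in this record.
    print(translate("ATGGCTGCTCCTTCTCCGACTTCTTCCTTT"))  # MAAPSPTSSF
    print(translate("GGTAACTAG"))                       # GN
    print(span_length("chr11:38219693..38222745"))      # 3053
```

The translated fragments agree with the start (MAAPSPTSSF) and end (GN) of the protein sequence above. Passing the full CDS to translate would allow a complete comparison.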