; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012681 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012681
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAT-rich interactive domain-containing protein 2
Genome locationtig00153490:4182..12890
RNA-Seq ExpressionSgr012681
SyntenySgr012681
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000118 - histone deacetylase complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003712 - transcription coregulator activity (molecular function)
InterPro domainsIPR001606 - ARID DNA-binding domain
IPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
IPR036431 - ARID DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044821.1 AT-rich interactive domain-containing protein 2 [Cucumis melo var. makuwa]1.7e-14470.18Show/hide
Query:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY
        MGRWPISSN SILDCNKD+DP  SNG CIAPDCLVEGS ANVD+DDCKAT+RC FEKIL VFLKEI  RG +RPVPALLGEGGSLD+FELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY

Query:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
         VVSEKELW  VV ELGLD+ LSASVKLIY KYLSELEKWLMVR GGTKLENGNSD Y+  KSF  L+ELEAKIK ML GVL  KSIY E  GFKSNK  
Subjt:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GN+N A  A EKEIKFPK++KKE +LH DVT  Q+ C+ETPR +GE N+IHV  DCRSLD+VNVETE DS  R RESLLRMLKWVR+TAK P + SNGT+
Subjt:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP
        P  SKWK YAS +ALWLQV++A+DALL RKDVD+  EKRL                L  + +  L   +WEKLPPAI+TSG H  KQAA Y FY  V+P
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP

TYK16643.1 AT-rich interactive domain-containing protein 2 [Cucumis melo var. makuwa]1.2e-14570.43Show/hide
Query:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY
        MGRWPISSN SILDCNKD+DP  SNG CIAPDCLVEGS ANVD+DDCKAT+RC FEKIL VFLKEI  RG +RPVPALLGEGGSLD+FELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY

Query:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
         VVSEKELW  VV ELGLD+ LSASVKLIY KYLSELEKWLMVR GGTKLENGNSD Y+  KSF  L+ELEAKIK ML GVL  KSIY E  GFKSNK  
Subjt:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GN+N A  A EKEIKFPK++KKE +LH DVT  Q+ C+ETPR +GE N+IHV  DCRSLD+VNVETE DS  R RESLLRMLKWVR+TAK P +PSNGT+
Subjt:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP
        P  SKWK YAS +ALWLQV++A+DALL RKDVD+  EKRL                L  + +  L   +WEKLPPAI+TSG H  KQAA Y FY  V+P
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP

XP_008452043.1 PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo]9.1e-13875.22Show/hide
Query:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY
        MGRWPISSN SILDCNKD+DP  SNG CIAPDCLVEGS ANVD+DDCKAT+RC FEKIL VFLKEI  RG +RPVPALLGEGGSLD+FELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY

Query:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
         VVSEKELW  VV ELGLD+ LSASVKLIY KYLSELEKWLMVR GGTKLENGNSD Y+  KSF  L+ELEAKIK ML GVL  KSIY E  GFKSNK  
Subjt:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GN+N A  A EKEIKFPK++KKE +LH DVT  Q+ C+ETPR +GE N+IHV  DCRSLD+VNVETE DS  R RESLLRMLKWVR+TAK P +PSNGT+
Subjt:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQAAYVR
        P  SKWK YAS +ALWLQV++A+DALL RKDVD+  EKRLL    VR
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQAAYVR

XP_022136609.1 AT-rich interactive domain-containing protein 2 [Momordica charantia]8.2e-14779.24Show/hide
Query:  MGRWPISSNASILDCNKDIDP-YTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGG
        MGRWPISSNASILDC+KDIDP Y+SNGCCIAPDCLVE SYA+VDY DCKA LRC FEKILS FLKEIG RGIVRPVPALLGEGGSLD+FELFMVVRDKGG
Subjt:  MGRWPISSNASILDCNKDIDP-YTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGG

Query:  YHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
        Y VVSEKELW  VV ELGLD+ELSASVKL+YSKYLSELEKWLMVRCGG KLENGNSDYHCEKSF F SEL AKIKGML GVL  KS+Y E SGF S+KQ+
Subjt:  YHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNIN--GAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GNIN   AAVEK+IK P++ KKE +L+G VTQSQ+  S+TP+DD     I VNEDCRSL SVNVET+ DS E  RESLLRMLKW RQ AK P DPSNGTI
Subjt:  GNIN--GAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQ
        PGPSKWK+Y S N  WLQVVRA+DALLIRKDV+EN EKRLLQ
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQ

XP_038893741.1 AT-rich interactive domain-containing protein 2 [Benincasa hispida]1.1e-14376.01Show/hide
Query:  ISGFEFMGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVV
        +SG EFMGRWPISSNASI+DCNKD+DP  SNGCCIAPDCLVEGSYANV+YDDCKAT+RC FEKIL VFLKEIG RG +RPV ALLGEGGSLD+FELFMVV
Subjt:  ISGFEFMGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVV

Query:  RDKGGYHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFK
        RDKGGY VVSEKELW  VV ELGLD+ LSASVKLIYSKYLS+LEKWLMVR GGTKLENGNSDYH  KSF FLSELEAK+K ML         Y E SGFK
Subjt:  RDKGGYHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFK

Query:  SNKQLGNIN--GAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDP
        SNK  GN+N   AA+EKEIKFPKLKK+E +LHGDVT  Q+ C+ETPRD+GE ++IHV EDCRSL +VN+ETE+D+  RYRESLLRMLKW R+TAK P +P
Subjt:  SNKQLGNIN--GAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDP

Query:  SNGTIPGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLL
        SN T+PG SKWK YAS +ALWLQV+RA+DALL RKDVD   EKRLL
Subjt:  SNGTIPGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLL

TrEMBL top hitse value%identityAlignment
A0A0A0KZM1 ARID domain-containing protein1.1e-13372.62Show/hide
Query:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY
        MGRWPISSN SILDCNKD+DP  S G CIAPDCLVEGS ANVD+DDCKAT+RC FEK+L VFLKE   RG +RPVPALLGEG SLD+FELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY

Query:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
         VVSEKELW  VV ELGLD+ LSASVKLIY KYLS+LEKWLMVR GGTKLENGNSD Y+  K+F  L+ELEAKIK +L GVL  KSIY E SGFKSNK  
Subjt:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GN+N A  A EKEIK PK++KKE +LH DVT  Q+ C+ETPRD+G+ N+IHV  DCRS D+VNVETE DS    RESL RMLKWVR+TAK P +PSNGT+
Subjt:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQAAYVR
        PG SKWK YAS +ALWLQV++A+DALL RKDVD+  EKRLL    VR
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQAAYVR

A0A1S3BSW2 AT-rich interactive domain-containing protein 24.4e-13875.22Show/hide
Query:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY
        MGRWPISSN SILDCNKD+DP  SNG CIAPDCLVEGS ANVD+DDCKAT+RC FEKIL VFLKEI  RG +RPVPALLGEGGSLD+FELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY

Query:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
         VVSEKELW  VV ELGLD+ LSASVKLIY KYLSELEKWLMVR GGTKLENGNSD Y+  KSF  L+ELEAKIK ML GVL  KSIY E  GFKSNK  
Subjt:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GN+N A  A EKEIKFPK++KKE +LH DVT  Q+ C+ETPR +GE N+IHV  DCRSLD+VNVETE DS  R RESLLRMLKWVR+TAK P +PSNGT+
Subjt:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQAAYVR
        P  SKWK YAS +ALWLQV++A+DALL RKDVD+  EKRLL    VR
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQAAYVR

A0A5A7TTH9 AT-rich interactive domain-containing protein 28.3e-14570.18Show/hide
Query:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY
        MGRWPISSN SILDCNKD+DP  SNG CIAPDCLVEGS ANVD+DDCKAT+RC FEKIL VFLKEI  RG +RPVPALLGEGGSLD+FELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY

Query:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
         VVSEKELW  VV ELGLD+ LSASVKLIY KYLSELEKWLMVR GGTKLENGNSD Y+  KSF  L+ELEAKIK ML GVL  KSIY E  GFKSNK  
Subjt:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GN+N A  A EKEIKFPK++KKE +LH DVT  Q+ C+ETPR +GE N+IHV  DCRSLD+VNVETE DS  R RESLLRMLKWVR+TAK P + SNGT+
Subjt:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP
        P  SKWK YAS +ALWLQV++A+DALL RKDVD+  EKRL                L  + +  L   +WEKLPPAI+TSG H  KQAA Y FY  V+P
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP

A0A5D3CXM4 AT-rich interactive domain-containing protein 25.7e-14670.43Show/hide
Query:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY
        MGRWPISSN SILDCNKD+DP  SNG CIAPDCLVEGS ANVD+DDCKAT+RC FEKIL VFLKEI  RG +RPVPALLGEGGSLD+FELFMVVRDKGGY
Subjt:  MGRWPISSNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGY

Query:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
         VVSEKELW  VV ELGLD+ LSASVKLIY KYLSELEKWLMVR GGTKLENGNSD Y+  KSF  L+ELEAKIK ML GVL  KSIY E  GFKSNK  
Subjt:  HVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSD-YHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GN+N A  A EKEIKFPK++KKE +LH DVT  Q+ C+ETPR +GE N+IHV  DCRSLD+VNVETE DS  R RESLLRMLKWVR+TAK P +PSNGT+
Subjt:  GNINGA--AVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP
        P  SKWK YAS +ALWLQV++A+DALL RKDVD+  EKRL                L  + +  L   +WEKLPPAI+TSG H  KQAA Y FY  V+P
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL----------------LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFY--VIP

A0A6J1C4T3 AT-rich interactive domain-containing protein 24.0e-14779.24Show/hide
Query:  MGRWPISSNASILDCNKDIDP-YTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGG
        MGRWPISSNASILDC+KDIDP Y+SNGCCIAPDCLVE SYA+VDY DCKA LRC FEKILS FLKEIG RGIVRPVPALLGEGGSLD+FELFMVVRDKGG
Subjt:  MGRWPISSNASILDCNKDIDP-YTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGG

Query:  YHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL
        Y VVSEKELW  VV ELGLD+ELSASVKL+YSKYLSELEKWLMVRCGG KLENGNSDYHCEKSF F SEL AKIKGML GVL  KS+Y E SGF S+KQ+
Subjt:  YHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQL

Query:  GNIN--GAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI
        GNIN   AAVEK+IK P++ KKE +L+G VTQSQ+  S+TP+DD     I VNEDCRSL SVNVET+ DS E  RESLLRMLKW RQ AK P DPSNGTI
Subjt:  GNIN--GAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTI

Query:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQ
        PGPSKWK+Y S N  WLQVVRA+DALLIRKDV+EN EKRLLQ
Subjt:  PGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRLLQ

SwissProt top hitse value%identityAlignment
Q84JT7 AT-rich interactive domain-containing protein 14.2e-2133.46Show/hide
Query:  FEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGYHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGN
        F  +L  FL E  S     P+PA+ GEG ++D+F LF+ V  KGG+  VSE   W  VV E GL+   SAS KLIY KYL    +WL       ++  G+
Subjt:  FEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGYHVVSEKELWCLVVGELGLDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGN

Query:  SDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQLGNINGAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCR
        +D    +       L A++ G L     VK  Y    G +  K+LG         E+K+   K K R     V +            G K      E   
Subjt:  SDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQLGNINGAAVEKEIKFPKLKKKERNLHGDVTQSQKICSETPRDDGEKNRIHVNEDCR

Query:  SLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTIPGPSKWKQYASINALWLQVVRAR
         L+SV  E     + R RE  L  LKW+   AK P DPS G +P  S+W  Y S    W Q++  R
Subjt:  SLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTIPGPSKWKQYASINALWLQVVRAR

Q9LDD4 AT-rich interactive domain-containing protein 21.8e-3232.89Show/hide
Query:  SYANVD---YDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGYHVVSEKELWCLVVGELGLDVELSASVKLIYSKYL
        SY +V+    D+C+  LR  F++ L VFL+E GS   ++P+PA++G+G ++D+F+LF++VR++ G+  VS K LW +V  +LG D  L  S+ LIY KYL
Subjt:  SYANVD---YDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGYHVVSEKELWCLVVGELGLDVELSASVKLIYSKYL

Query:  SELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQLGNINGAAVEKEIKFPKLKKKERNLHGDVTQSQKIC
        + +EKW +        +N +S+                 KG   G+L     +  G+GFKS    G              K +K+ R +       ++ C
Subjt:  SELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQLGNINGAAVEKEIKFPKLKKKERNLHGDVTQSQKIC

Query:  SETPRDDGEKNRIHVNEDCRSLDSVNVETEV----------DSRERYRESLLRMLKWVRQTAKRPEDPSNGTIPGPSKWKQYASINALWLQVVRARDALL
        SE  R          ++    L SV +  E           D     R+ L  MLKW+   A  P DP+ G IP  SKWKQY   N  WLQV RA+++LL
Subjt:  SETPRDDGEKNRIHVNEDCRSLDSVNVETEV----------DSRERYRESLLRMLKWVRQTAKRPEDPSNGTIPGPSKWKQYASINALWLQVVRARDALL

Query:  IRKD
        +++D
Subjt:  IRKD

Arabidopsis top hitse value%identityAlignment
AT1G19330.1 unknown protein3.1e-3552.38Show/hide
Query:  PRIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGH-DFENSSCSSDIGEK
        P+I S + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D   DFEN+  +    + 
Subjt:  PRIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGH-DFENSSCSSDIGEK

Query:  DNNFSSSIVFHKLSKPKVRHIRPWAPSSSAKSAGRGSYGEIQSIHMPKPETVKCEFSLISGFEFMGRW
          +F +S    K  K K+R  R    S    S    S  + +S     PE +K + S +     +  W
Subjt:  DNNFSSSIVFHKLSKPKVRHIRPWAPSSSAKSAGRGSYGEIQSIHMPKPETVKCEFSLISGFEFMGRW

AT1G19330.2 unknown protein2.4e-3552.66Show/hide
Query:  PRIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGH-DFENSSCS-SDIGE
        P+I S + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D   DFEN+  + SD+  
Subjt:  PRIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGH-DFENSSCS-SDIGE

Query:  KDNNFSSSIVFHKLSKPKVRHIRPWAPSSSAKSAGRGSYGEIQSIHMPKPETVKCEFSLISGFEFMGRW
        +D          K  K K+R  R    S    S    S  + +S     PE +K + S +     +  W
Subjt:  KDNNFSSSIVFHKLSKPKVRHIRPWAPSSSAKSAGRGSYGEIQSIHMPKPETVKCEFSLISGFEFMGRW

AT1G19330.3 unknown protein1.3e-3355.56Show/hide
Query:  PRIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGH-DFENSSCSSDIGEK
        P+I S + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D   DFEN+  +    + 
Subjt:  PRIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGH-DFENSSCSSDIGEK

Query:  DNNFSSSIVFHKLSKPKVRHIRPWAPSSSAKSAGRGSYGEIQSIHMPKPETVK
          +F +S    K  K K+R  R    S    S    S  + +S     PE ++
Subjt:  DNNFSSSIVFHKLSKPKVRHIRPWAPSSSAKSAGRGSYGEIQSIHMPKPETVK

AT1G75060.1 unknown protein3.2e-3265.49Show/hide
Query:  RIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGHDFENSSCSSDIGEKDN
        ++ S F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLEHPTGNE D +D E    +      D 
Subjt:  RIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGHDFENSSCSSDIGEKDN

Query:  NFSSSIVFHKLSK
            ++  HK  K
Subjt:  NFSSSIVFHKLSK

AT1G75060.2 unknown protein3.2e-3265.49Show/hide
Query:  RIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGHDFENSSCSSDIGEKDN
        ++ S F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLEHPTGNE D +D E    +      D 
Subjt:  RIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDGHDFENSSCSSDIGEKDN

Query:  NFSSSIVFHKLSK
            ++  HK  K
Subjt:  NFSSSIVFHKLSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTACGGCATATGGCAGCCTGGCAGGTCGCTTCCTTGGCAGTTAGCAAGCATTTAGTGTGTGCGCTGCGCTCCTCCTACCACTCTCTCTCGCTCTTCTTTCTCTC
TCCTCTTGCATATTTCGATGTTTCATACTCTGAAACCTCTGTTTCTTTGTCCCATTCATCACTCTCTTCTCCGGACTTCTTCTTTCCCAAACTTTTACATCCAAATTCCA
CCTCTTCTTCCCTTCTAAATCTCTGCTTTCCTGTTTCTTATCTTCCGACCTCCCATTGCCGCTTCCACTCAAGCTTCTCCTGCAGAATCCATGATGCCCTACTGATCAGT
GAAAGTGGGTTGTTGGTTATTATTGCTGGAATTGAGCATGAATTCTTGCACTATATGGACCACAGGGCTTTGGTTCTACTCACTTCATCACTACCCGGAGGAAACACACT
TGAAAATCTTCAAAGAATGCTGGAACCAGAGTTTTGTTCTCCTAGAATACCGTCTCCTTTTCGCGAGGAGAGTGGGGATGAAGAGCTTTCAGTTCTTCCAAGGCACACTA
AAGTTATTGTTACTGGAAATAACAGAACGAAGTCTGTTTTGTTGGGATTGCAAGGCGTAGTTAAGAAGGCTGTTGGCCTCGGAGGCTGGCACTGGCTGGTATTGAAAAAT
GGGGTTGAAGTGAAGCTGCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGGTCATGATTTTGAAAACTCAAGCTGTAGCTCCGACATTGG
TGAGAAGGACAATAATTTCTCTAGCAGCATTGTTTTCCACAAACTTAGCAAACCAAAAGTGAGGCATATAAGGCCATGGGCTCCATCTTCATCAGCAAAGTCAGCAGGCC
GGGGCAGCTATGGAGAAATTCAATCTATTCACATGCCAAAGCCGGAAACCGTGAAGTGCGAATTTTCTTTAATAAGTGGGTTTGAGTTCATGGGGAGATGGCCTATTTCA
TCCAATGCTTCCATTTTAGACTGCAACAAAGATATTGATCCTTATACCAGTAATGGATGTTGCATTGCCCCTGATTGTTTGGTAGAGGGAAGTTATGCGAATGTTGATTA
TGATGATTGCAAAGCAACACTTAGATGCTGTTTTGAGAAAATTCTTTCGGTTTTTCTAAAGGAAATTGGTAGCAGAGGAATTGTTAGGCCAGTGCCAGCGTTACTTGGTG
AAGGAGGATCTTTGGATATGTTTGAATTGTTCATGGTAGTCAGAGATAAAGGTGGTTATCATGTGGTTTCAGAAAAGGAATTATGGTGCTTAGTGGTTGGGGAATTAGGT
TTGGATGTTGAGCTTTCGGCTTCGGTGAAATTGATTTATTCCAAGTATTTAAGTGAGTTAGAGAAATGGCTTATGGTGAGATGCGGAGGTACAAAACTGGAAAATGGGAA
CTCTGATTATCACTGCGAGAAAAGTTTTCATTTTTTGTCGGAACTGGAGGCAAAGATTAAGGGTATGTTATGTGGTGTGCTGGGAGTAAAGAGCATATATGGTGAAGGTT
CTGGATTCAAATCTAACAAACAGCTTGGGAACATTAATGGCGCTGCAGTGGAGAAGGAAATAAAATTCCCTAAATTAAAGAAGAAAGAACGTAATCTACATGGGGATGTT
ACACAAAGCCAAAAAATTTGTAGTGAGACACCTCGGGATGATGGCGAAAAAAATCGTATCCATGTTAATGAAGATTGTAGAAGTTTGGATTCTGTTAATGTTGAAACTGA
AGTAGACTCTCGTGAGAGATATCGAGAATCTTTATTACGAATGTTGAAGTGGGTGAGACAGACCGCAAAGCGTCCTGAAGATCCATCTAATGGCACAATACCGGGGCCAT
CCAAGTGGAAACAGTATGCTAGCATCAATGCATTATGGCTTCAGGTAGTCAGGGCTAGGGATGCACTTTTAATCAGGAAGGATGTTGACGAAAACACTGAGAAACGTCTG
TTACAGGCTGCATATGTGCGCAAACTGACCTACGCAGAATGGGAAAAGTTACCTCCTGCTATTAGAACCTCGGGTCTCCATCCTCGTAAGCAAGCAGCAAACTACTCCTT
CTATGTCATCCCATATTGCGTCCTACTTGATAACATGTTGGACGCAGGTATAGTCTTTGCTCTTGGTGGGGGGAGCTTCAAGGATGCTCGGACGGTTTCAAACTTATCAG
AAATAAAATCTTGGAGATCGATAGGACTGAAAGGAAGAGAGTTTCATCTGAGGTTTGTGAGGAGGATAGAAGAAGGCTCAAGAGATAGTTGGAGGGGCTGCATATTAGTG
AGAGCTCTAGTTGGAAACAGAATTCCAAAGTCCAGTGGATTTAGGAGGGTGAGTGCTAATAGTGCCAGGGGGAATGGTAACCTAGTGAATGTGATTAAGACTGAAGATGG
CTTGTCTTTCACTAAGGAGGAGGACAAGGAAAGAGGAGGGCTGTCTTTTAACGTATGGAGTGGGCCTTTAAGACAGGGAGGGCTGTCTCAATTCTCTGATTTGTTTTCTC
GGACCTCCGGGATCCTTGCTCAGGTGTTGGTTGGTAAGGTAGGGGGTCACGGTGAATATGGCATTCATATATATCTGGGGAAGGTAGACAACTCTTGGATGAAGTTCTTG
TGGCATATGACTACGGATCATTTTGATTCTCACCTGGTTAGTTGGAATAGGATGGTGTGGCCTTTTGCTTCGATGATGAGGTATCGGAAACTTGGAAACCAAGAACATTA
TTTTGCTGGATACGTGGCTTTAGAGATTCTAGCTGGAGGTGGTTTGGCTTTGGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTACGGCATATGGCAGCCTGGCAGGTCGCTTCCTTGGCAGTTAGCAAGCATTTAGTGTGTGCGCTGCGCTCCTCCTACCACTCTCTCTCGCTCTTCTTTCTCTC
TCCTCTTGCATATTTCGATGTTTCATACTCTGAAACCTCTGTTTCTTTGTCCCATTCATCACTCTCTTCTCCGGACTTCTTCTTTCCCAAACTTTTACATCCAAATTCCA
CCTCTTCTTCCCTTCTAAATCTCTGCTTTCCTGTTTCTTATCTTCCGACCTCCCATTGCCGCTTCCACTCAAGCTTCTCCTGCAGAATCCATGATGCCCTACTGATCAGT
GAAAGTGGGTTGTTGGTTATTATTGCTGGAATTGAGCATGAATTCTTGCACTATATGGACCACAGGGCTTTGGTTCTACTCACTTCATCACTACCCGGAGGAAACACACT
TGAAAATCTTCAAAGAATGCTGGAACCAGAGTTTTGTTCTCCTAGAATACCGTCTCCTTTTCGCGAGGAGAGTGGGGATGAAGAGCTTTCAGTTCTTCCAAGGCACACTA
AAGTTATTGTTACTGGAAATAACAGAACGAAGTCTGTTTTGTTGGGATTGCAAGGCGTAGTTAAGAAGGCTGTTGGCCTCGGAGGCTGGCACTGGCTGGTATTGAAAAAT
GGGGTTGAAGTGAAGCTGCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGGTCATGATTTTGAAAACTCAAGCTGTAGCTCCGACATTGG
TGAGAAGGACAATAATTTCTCTAGCAGCATTGTTTTCCACAAACTTAGCAAACCAAAAGTGAGGCATATAAGGCCATGGGCTCCATCTTCATCAGCAAAGTCAGCAGGCC
GGGGCAGCTATGGAGAAATTCAATCTATTCACATGCCAAAGCCGGAAACCGTGAAGTGCGAATTTTCTTTAATAAGTGGGTTTGAGTTCATGGGGAGATGGCCTATTTCA
TCCAATGCTTCCATTTTAGACTGCAACAAAGATATTGATCCTTATACCAGTAATGGATGTTGCATTGCCCCTGATTGTTTGGTAGAGGGAAGTTATGCGAATGTTGATTA
TGATGATTGCAAAGCAACACTTAGATGCTGTTTTGAGAAAATTCTTTCGGTTTTTCTAAAGGAAATTGGTAGCAGAGGAATTGTTAGGCCAGTGCCAGCGTTACTTGGTG
AAGGAGGATCTTTGGATATGTTTGAATTGTTCATGGTAGTCAGAGATAAAGGTGGTTATCATGTGGTTTCAGAAAAGGAATTATGGTGCTTAGTGGTTGGGGAATTAGGT
TTGGATGTTGAGCTTTCGGCTTCGGTGAAATTGATTTATTCCAAGTATTTAAGTGAGTTAGAGAAATGGCTTATGGTGAGATGCGGAGGTACAAAACTGGAAAATGGGAA
CTCTGATTATCACTGCGAGAAAAGTTTTCATTTTTTGTCGGAACTGGAGGCAAAGATTAAGGGTATGTTATGTGGTGTGCTGGGAGTAAAGAGCATATATGGTGAAGGTT
CTGGATTCAAATCTAACAAACAGCTTGGGAACATTAATGGCGCTGCAGTGGAGAAGGAAATAAAATTCCCTAAATTAAAGAAGAAAGAACGTAATCTACATGGGGATGTT
ACACAAAGCCAAAAAATTTGTAGTGAGACACCTCGGGATGATGGCGAAAAAAATCGTATCCATGTTAATGAAGATTGTAGAAGTTTGGATTCTGTTAATGTTGAAACTGA
AGTAGACTCTCGTGAGAGATATCGAGAATCTTTATTACGAATGTTGAAGTGGGTGAGACAGACCGCAAAGCGTCCTGAAGATCCATCTAATGGCACAATACCGGGGCCAT
CCAAGTGGAAACAGTATGCTAGCATCAATGCATTATGGCTTCAGGTAGTCAGGGCTAGGGATGCACTTTTAATCAGGAAGGATGTTGACGAAAACACTGAGAAACGTCTG
TTACAGGCTGCATATGTGCGCAAACTGACCTACGCAGAATGGGAAAAGTTACCTCCTGCTATTAGAACCTCGGGTCTCCATCCTCGTAAGCAAGCAGCAAACTACTCCTT
CTATGTCATCCCATATTGCGTCCTACTTGATAACATGTTGGACGCAGGTATAGTCTTTGCTCTTGGTGGGGGGAGCTTCAAGGATGCTCGGACGGTTTCAAACTTATCAG
AAATAAAATCTTGGAGATCGATAGGACTGAAAGGAAGAGAGTTTCATCTGAGGTTTGTGAGGAGGATAGAAGAAGGCTCAAGAGATAGTTGGAGGGGCTGCATATTAGTG
AGAGCTCTAGTTGGAAACAGAATTCCAAAGTCCAGTGGATTTAGGAGGGTGAGTGCTAATAGTGCCAGGGGGAATGGTAACCTAGTGAATGTGATTAAGACTGAAGATGG
CTTGTCTTTCACTAAGGAGGAGGACAAGGAAAGAGGAGGGCTGTCTTTTAACGTATGGAGTGGGCCTTTAAGACAGGGAGGGCTGTCTCAATTCTCTGATTTGTTTTCTC
GGACCTCCGGGATCCTTGCTCAGGTGTTGGTTGGTAAGGTAGGGGGTCACGGTGAATATGGCATTCATATATATCTGGGGAAGGTAGACAACTCTTGGATGAAGTTCTTG
TGGCATATGACTACGGATCATTTTGATTCTCACCTGGTTAGTTGGAATAGGATGGTGTGGCCTTTTGCTTCGATGATGAGGTATCGGAAACTTGGAAACCAAGAACATTA
TTTTGCTGGATACGTGGCTTTAGAGATTCTAGCTGGAGGTGGTTTGGCTTTGGCATAA
Protein sequenceShow/hide protein sequence
MAVRHMAAWQVASLAVSKHLVCALRSSYHSLSLFFLSPLAYFDVSYSETSVSLSHSSLSSPDFFFPKLLHPNSTSSSLLNLCFPVSYLPTSHCRFHSSFSCRIHDALLIS
ESGLLVIIAGIEHEFLHYMDHRALVLLTSSLPGGNTLENLQRMLEPEFCSPRIPSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKN
GVEVKLQRNALSVLEHPTGNENDGHDFENSSCSSDIGEKDNNFSSSIVFHKLSKPKVRHIRPWAPSSSAKSAGRGSYGEIQSIHMPKPETVKCEFSLISGFEFMGRWPIS
SNASILDCNKDIDPYTSNGCCIAPDCLVEGSYANVDYDDCKATLRCCFEKILSVFLKEIGSRGIVRPVPALLGEGGSLDMFELFMVVRDKGGYHVVSEKELWCLVVGELG
LDVELSASVKLIYSKYLSELEKWLMVRCGGTKLENGNSDYHCEKSFHFLSELEAKIKGMLCGVLGVKSIYGEGSGFKSNKQLGNINGAAVEKEIKFPKLKKKERNLHGDV
TQSQKICSETPRDDGEKNRIHVNEDCRSLDSVNVETEVDSRERYRESLLRMLKWVRQTAKRPEDPSNGTIPGPSKWKQYASINALWLQVVRARDALLIRKDVDENTEKRL
LQAAYVRKLTYAEWEKLPPAIRTSGLHPRKQAANYSFYVIPYCVLLDNMLDAGIVFALGGGSFKDARTVSNLSEIKSWRSIGLKGREFHLRFVRRIEEGSRDSWRGCILV
RALVGNRIPKSSGFRRVSANSARGNGNLVNVIKTEDGLSFTKEEDKERGGLSFNVWSGPLRQGGLSQFSDLFSRTSGILAQVLVGKVGGHGEYGIHIYLGKVDNSWMKFL
WHMTTDHFDSHLVSWNRMVWPFASMMRYRKLGNQEHYFAGYVALEILAGGGLALA