; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010868 (gene) of Chayote v1 genome

Gene IDSed0010868
OrganismSechium edule (Chayote v1)
DescriptionHydroxyproline-rich glycoprotein family protein, putative isoform 2
Genome locationLG06:1633756..1641976
RNA-Seq ExpressionSed0010868
SyntenySed0010868
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599881.1 RNA demethylase ALKBH10B, partial [Cucurbita argyrosperma subsp. sororia]7.0e-30579.68Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL
        MAMPSGNVGVSDKV FQS GGV VSGGGG GG E+HQH PR W+PDERDG I+W R EFAASNAIIDALCHHLRAVGEPGEYD VIGCIQQRR NW PVL
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL

Query:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED
        HMQQYF VAEVM+ LQQVTSRRQQR+ DP+KVG KL+RRPGP FKQQ+QQ G+R E TVKEEMVTCA+S N GNSSSFVGSRKVEQVSN CEES   GED
Subjt:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED

Query:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE
         K NDKDS SAED KDT+GKDQ++SKPK AE L+DNASNKESQVEP DDG SSS RDKE  SVQSRNG QYAA  PRTFVANEIFDG TVNVMDGLK YE
Subjt:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE

Query:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI
        +LLDD EVS+LLSLVNDLRASGKRGQ QG TY+VSKRPMKGHGREMIQLGFPIADA +DDDNSS LSKDRRIESIPSLLQDLID LV  QVM VKPDSCI
Subjt:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI

Query:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG
        IDFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIGSDHSGNYRGAKTL+L PG+LLV +GKSA+FAKHAIPA+RKQRILVTLTKS P R   ADG
Subjt:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG

Query:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPGT
        QRTS NVG YSSWGPPS RSP++R  PG KHYP   STGVL  P IR Q+PPPNGIP I+V PVAP PMPFPP+PIP GP AWPA H RHPPPRLPVPGT
Subjt:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPGT

Query:  GVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG
        GVFLPPGSSS+PS Q +P   VETS++AEKENG T+SD+N GASPGEKSE+KPQ+QECNG +DG+GS K T+EE+ +QQ+     EN+E QNAG
Subjt:  GVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG

XP_022987072.1 uncharacterized protein LOC111484609 [Cucurbita maxima]6.3e-29879.8Show/hide
Query:  MAMPSGNVGVSDKVSFQS--SGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAP
        MAMPSGNVGV DKVSFQS   GGV VSGGGG    E+HQHHPR WFPDERDGFI+WLR+EFAA+NA+IDALCHHLRAVGEPGEYD VIGCIQQRR NW P
Subjt:  MAMPSGNVGVSDKVSFQS--SGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAP

Query:  VLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFK-QQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI
        V+HMQQYF VA+V Y LQQV SRRQQRY DPVKVGPK YRRPGPGFK QQ QQQG+R+E TVKEEMVTCA+S N GNSSSFVGSRKVEQVSN CEESK  
Subjt:  VLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFK-QQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI

Query:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK
        GEDE  NDKDS SAEDRKDT+GKDQ +SKPK AE+LEDNASNKES VEP DDG SSS+R+KE  SVQS+NG QYAA TPRTFVANE+ DG  VNVMDGLK
Subjt:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK

Query:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD
         +ED LDDAEVSKLLSLVNDLRASGKRGQFQG TYVVSKRPMKGHGREMIQLGFPIAD P+DDDNSS LSKDRRIESIPSLLQDLIDRLVG Q+M+VKPD
Subjt:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD

Query:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRA--
        SCIIDFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIG+DHSGNY+GA  L+L PGTLLV EGKSA+FAKHAIPAIRKQRILVTLTKS PKRA  
Subjt:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRA--

Query:  -DGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPF-PPMPIPAGPSAWPATHQRHPPPRLP
         DGQRTS NVG ++SWGPPSVRSP+ RLSPG KHY ++ STGVL +P IR QMPPPNGIP +IV PVA  PMPF PP+PIP GP  WP  H RHPPPRLP
Subjt:  -DGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPF-PPMPIPAGPSAWPATHQRHPPPRLP

Query:  VPGTGVFL-PPGSSSAPSSQP--LP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ--ENLEAQNAG
        VPGTGVFL PPGSSSAPS  P  LP   VET +++EKENGSTKSD+N GAS GEKSEAKPQ+QECN    G+ S KV +EEQQEQQ+  ENL+AQ+AG
Subjt:  VPGTGVFL-PPGSSSAPSSQP--LP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ--ENLEAQNAG

XP_023542424.1 uncharacterized protein LOC111802330 [Cucurbita pepo subsp. pepo]8.6e-30380.03Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL
        MAMPSGNVGVSDKV FQS GGV VSGGGG GG E+HQH PR W+PDERDG I+W R EFAASNAIIDALCHHLRAVGEPGEYD VIGCIQQRR NW PVL
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL

Query:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED
        HMQQYF VAEVM  LQQVTSRRQQR+ DP+KVG KL+RRPGP FKQQ+QQ G+RVE TVKEEMVTCA+S N GNSSSFVG RKVEQVSN CEES   GED
Subjt:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED

Query:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE
         K NDKDS SAED KDT+GKDQ++SKPK AE L+DNASNKESQVEP DDG SSS RDKE  SVQSRNG QYAA  PRTFVANEIFDG T+NVMDGLK YE
Subjt:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE

Query:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI
        +LLDD EVSKLLSLVNDLRASGKRGQ QG TY+VSKRPMKGHGREMIQLGFPIADA +DDDNSS LSKDRRIESIPSLLQDLID LV  Q M VKPDSCI
Subjt:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI

Query:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG
        IDFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIGSDHSGNYRGA TL+L PG+LLV +GKSA+FAKHAIPA+RKQRILVTLTKS PKR   ADG
Subjt:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG

Query:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVA-PQPMPFPP-MPIPAGPSAWPATHQRHPPPRLPVP
        QRTS NVG YSSWGPPS RSP++R  PG KHYP   STGVL  P IR Q+PPPNGIP I+V PVA P PMPFPP +PIP GP AWPA H RHPPPRLPVP
Subjt:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVA-PQPMPFPP-MPIPAGPSAWPATHQRHPPPRLPVP

Query:  GTGVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG
        GTGVFLPPGSSSAPS Q +P   VETS++AEKENG T+SD+N GASPGEKSEAKPQ+QECNG +DG+GS K T+EEQ +QQQ     EN+EAQNAG
Subjt:  GTGVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG

XP_038892010.1 RNA demethylase ALKBH10B isoform X1 [Benincasa hispida]1.4e-30079.91Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL
        MAMPSGNVGV DKVS+QS GG V   GGGG   E+HQHHPR WFPDERDGFI+WLR EFAASNAIIDALCHHLRAVGEPGEYD VIGCIQQRR NW PVL
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL

Query:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED
        HMQQYF VAEVMY LQQVTSRRQQRY DPVKVG KLYRRPGPGFK   QQQG+RVE TVKEE+ TCA+S N  NSSS VG RKVEQVSN C+ESK  GED
Subjt:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED

Query:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE
         K NDKDS SAED KDT+GKDQ++SKPK AE LEDNASNKESQVEP DDG SSSHRDKE  SVQS+NG QYAA TPRTFVANE+FDG  VNVMDGLK +E
Subjt:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE

Query:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSK----DRRIESIPSLLQDLIDRLVGGQVMAVKP
        +LLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVV KRPMKGHGREMIQLGFPIADAP+DDDNSS LSK    DRRIESIPSLLQDLIDRLVG QVM VKP
Subjt:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSK----DRRIESIPSLLQDLIDRLVGGQVMAVKP

Query:  DSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR--
        DSCI+DFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIG+DHSGNYRGA  L+LTPGTLLV +GKSA+FAKHAIPAIRKQRILVTLTKS PKR  
Subjt:  DSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR--

Query:  -ADGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLP
         ADGQRTS N+G +SSWGPPSVRSP+ RLSPG K YPT+ STGVL +P IR QM PPNGIP +IVPPVAP PMPFPP+PIP GPSAWP  H RHPPPRLP
Subjt:  -ADGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLP

Query:  VPGTGVFL-PPGSSSAPSSQP-----LPVETSAIAEKENGSTKSDNNTGASPGEKSEAK-PQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQN
        VPGTGVFL PPGSSSAP+  P       VET +++EKENGSTKSD+N+G SPGEK EAK PQ+QECNG +DG+GS KV +EEQQ QQQ     EN +AQN
Subjt:  VPGTGVFL-PPGSSSAPSSQP-----LPVETSAIAEKENGSTKSDNNTGASPGEKSEAK-PQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQN

Query:  AG
        AG
Subjt:  AG

XP_038892011.1 RNA demethylase ALKBH10B isoform X2 [Benincasa hispida]2.5e-30280.37Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL
        MAMPSGNVGV DKVS+QS GG V   GGGG   E+HQHHPR WFPDERDGFI+WLR EFAASNAIIDALCHHLRAVGEPGEYD VIGCIQQRR NW PVL
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL

Query:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED
        HMQQYF VAEVMY LQQVTSRRQQRY DPVKVG KLYRRPGPGFK   QQQG+RVE TVKEE+ TCA+S N  NSSS VG RKVEQVSN C+ESK  GED
Subjt:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED

Query:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE
         K NDKDS SAED KDT+GKDQ++SKPK AE LEDNASNKESQVEP DDG SSSHRDKE  SVQS+NG QYAA TPRTFVANE+FDG  VNVMDGLK +E
Subjt:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE

Query:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI
        +LLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVV KRPMKGHGREMIQLGFPIADAP+DDDNSS LSKDRRIESIPSLLQDLIDRLVG QVM VKPDSCI
Subjt:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI

Query:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG
        +DFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIG+DHSGNYRGA  L+LTPGTLLV +GKSA+FAKHAIPAIRKQRILVTLTKS PKR   ADG
Subjt:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG

Query:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPGT
        QRTS N+G +SSWGPPSVRSP+ RLSPG K YPT+ STGVL +P IR QM PPNGIP +IVPPVAP PMPFPP+PIP GPSAWP  H RHPPPRLPVPGT
Subjt:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPGT

Query:  GVFL-PPGSSSAPSSQP-----LPVETSAIAEKENGSTKSDNNTGASPGEKSEAK-PQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG
        GVFL PPGSSSAP+  P       VET +++EKENGSTKSD+N+G SPGEK EAK PQ+QECNG +DG+GS KV +EEQQ QQQ     EN +AQNAG
Subjt:  GVFL-PPGSSSAPSSQP-----LPVETSAIAEKENGSTKSDNNTGASPGEKSEAK-PQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG

TrEMBL top hitse value%identityAlignment
A0A5D3E038 Hydroxyproline-rich glycoprotein family protein, putative isoform 21.2e-28977.92Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQ-HHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPV
        MA+PSGNVGV DKVSFQS GG V   GGGG   E+HQ HHPR WFPDERDGFI+WLR EFAASNA+IDALCHHLRAVGEPGEYD VIGCIQQRR NW PV
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQ-HHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPV

Query:  LHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGE
        LHMQQYF VAEVMY LQQVTSRRQQRY DPVKVGPKLYRRPGPGFK   QQQG+R E TVKEE +TCA+S N GNSSSFV SRKVEQVSN C+ESK  GE
Subjt:  LHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGE

Query:  DEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSY
        DEK ++KDS SAED KDT+GKDQ++SK K AE LEDNA NK+SQVEP DDG SSSHRDKE  SVQS+NG Q+AA TPRTFVANE+FDG  VNVMDGLK +
Subjt:  DEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSY

Query:  EDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSC
        E+LLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRP KGHGREMIQLGFPIADAPY+DDNS  LSKDRRIE IPSLLQDLIDRLVG QVM VKPDSC
Subjt:  EDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---AD
        IIDFYNEGDHSQPHVWP WF RPV VLLLTECE+TFGRVIG+DHSGNYRGA  L+LTPG LLV +GKSA+FAKHAIPAIRKQRILVTLTKS PKR   AD
Subjt:  IIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---AD

Query:  GQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPG
        GQR+S NVG +S WGPPS RSP+ RLSPG K Y  + STGVL  P IR QM PPNGIP +IVP VAP PMPF P+PIP GPS WP  H RHPPPRLPVPG
Subjt:  GQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPG

Query:  TGVFL-PPGSSSAPSSQP---LP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQENLEAQNA
        TGVFL PPGSSSAPS  P   LP   +E  +++EKENG TKSD+N+G  PGEK EAK Q+QECNG +DG+G+ KV +EEQQ+QQ+E   AQNA
Subjt:  TGVFL-PPGSSSAPSSQP---LP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQENLEAQNA

A0A6J1FRA9 uncharacterized protein LOC1114474805.8e-29778.53Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL
        MAMPSGNVGVSDKV FQS GGV VSGGGG GG E+HQH PR W+PDERDG I+W R EFAASNAIIDALCHHLRAVGEPGEYD VIGCIQQRR NW PVL
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL

Query:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED
        HMQQYF VAEVM+ LQQVTSRRQQR+ DP+KVG KL+RRPGP FKQQ+QQ G+RVE TVKEEMVTCA+S N GNSS FVGSRKVEQVSN C+ES   GED
Subjt:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGED

Query:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE
         K NDKDS SAED KDT+GKDQ++SKPK AE L+DNASNKESQVEP DDG SSS RDKE  SVQSRNG QYAA  PRTFVANEIFDG TVNVMDGLK YE
Subjt:  EKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYE

Query:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI
        +LLDD EVSKLLSLVNDLRASGKRGQ QG TY+VSKRPMKGHGREMIQLGFPIADA +DDDNSS LSKDRRIESIPSLLQDLID LV  QVM VKPDSCI
Subjt:  DLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCI

Query:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG
        IDFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIGSDHSGNYRGAKTL+L PG+LLV +GKSA+FAKHAIPA+RKQRILVTLTKS P+R   ADG
Subjt:  IDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---ADG

Query:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPGT
        QRTS NVG YSSWGPPS RSP++R  PG KHYP   STGVL  P IR Q+PPPNGIP I+V PVAP PMPFPP+PIP GP AWPA H RHPPPRLPVPGT
Subjt:  QRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPGT

Query:  GVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG
        GVFLPPGSSSAPS Q +P   VETS++AEKENG T++D+N GASPGEK             +DG+GS K T+EEQ +QQ+     EN+EAQNAG
Subjt:  GVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ-----ENLEAQNAG

A0A6J1FUM5 uncharacterized protein LOC1114474302.6e-29779.06Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGG---GGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWA
        MAMPSGNVGV DKV FQS GG    GGG    GGG E+HQHHPR WFPDERDGFI+WLR+EFAA+NA+IDALCHHLRAVGEPGEYD VIGCIQQRR NW 
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGG---GGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWA

Query:  PVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI
        PV+HMQQYF VA+V Y LQQV SRRQQRY DPVKVGPK YRRPGPGFKQQ QQQG+R+ETTVKEEMVTCA+S N GNSSSFVGSRKVE VSN CE+SK  
Subjt:  PVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI

Query:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK
        GEDE  NDKDS SAEDRKDT+GKDQ++SKPK AE+LEDNASNKES VEP DDG SSS+R+KE  SVQ++NG QYAA TPRTFVANE+ DG  VNVMDGLK
Subjt:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK

Query:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD
         +ED LDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIAD P+DDDNSS LSKDRRIESIPSLLQDLIDRLVG Q+M+VKPD
Subjt:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD

Query:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRA--
        SCIIDFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIG+DHSGNY+GA  L+L PGTLLV EGKSA+FAKHAIPAIRKQRILVTLTKS PKRA  
Subjt:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRA--

Query:  -DGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPF-PPMPIPAGPSAWPATHQRHPPPRLP
         DGQRTS NVG +SSWGPPS RSP+ RLSPGHKHYP++ STGVL +P IR QMPPPNGIP +IV PVA  PMPF PP+PIP GP  WP  H RHPPPRLP
Subjt:  -DGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPF-PPMPIPAGPSAWPATHQRHPPPRLP

Query:  VPGTGVFL-PPGSSS-----APSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ--ENL-EAQN
        VPGTGVFL PPGSSS     APS Q LP   VET +++EKENGSTKSD+NTGA  GEKSEAKPQ+QECN    G+ S KV +EEQQ++Q+  ENL EAQ+
Subjt:  VPGTGVFL-PPGSSS-----APSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ--ENL-EAQN

Query:  AG
        AG
Subjt:  AG

A0A6J1ISJ2 uncharacterized protein LOC111478296 isoform X15.8e-29778.54Show/hide
Query:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL
        MAMPSGNVGVSDKV FQSSGGV VSGG      E+HQH PR W+PDERDG I+W R EFAASNAIIDALCHHLRAVGEPGEYD VIGCIQQRR NW PVL
Subjt:  MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVL

Query:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQY---QQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI
        HMQQYF VAEVM+ LQQVTSRRQQR+ DP+KVG KL+RRPGP FKQQ+   QQ G+R+E TVKEEMVTCA+S N GNSSSFVGSRKVEQVSN CEES   
Subjt:  HMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQY---QQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI

Query:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDK--ESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK
        GED K NDKDS SAED KDT+GKDQ++SKPK AE LEDNASNKESQVEP DDG SSS RDK  +SVQSRN  QYAA  PRTF ANEIFDG TVNVMDGLK
Subjt:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDK--ESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK

Query:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD
         YE+LLDD EVSKLLSLVNDLRASGKRGQ QG TY+VSKRPMKGHGREMIQLGFPIADA +DD NSS LSKDRRIESIPSLLQDLID LV  QVM VKPD
Subjt:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD

Query:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---
        SCIIDFYNEGDHSQPHVWPPWF RPV VLLLTECEM+FGRV+GSDHSGNYRGAKTL+L PG+LLV +GKSA+FAKHAIPA+RKQRILVTLTKS PKR   
Subjt:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKR---

Query:  ADGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPP-MPIPAGPSAWPATHQRHPPPRLP
        ADGQRTS N+G YSSWGPPS RSP++R  PG KHYP   STGVL  P IR Q+PP NGIP I+V PVAP PMPFPP +PIP GP AWPA H RHPPPRLP
Subjt:  ADGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPT-LSTGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPP-MPIPAGPSAWPATHQRHPPPRLP

Query:  VPGTGVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ------ENLEAQNAG
        VPGTGVFLPPG+SSAPS Q +P   VETS++AEKENG T+SD+N GASPGEKSEAKPQ+QECNG +DG+GS K T+EEQ +QQQ      EN+EAQNAG
Subjt:  VPGTGVFLPPGSSSAPSSQPLP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ------ENLEAQNAG

A0A6J1J9C0 uncharacterized protein LOC1114846093.1e-29879.8Show/hide
Query:  MAMPSGNVGVSDKVSFQS--SGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAP
        MAMPSGNVGV DKVSFQS   GGV VSGGGG    E+HQHHPR WFPDERDGFI+WLR+EFAA+NA+IDALCHHLRAVGEPGEYD VIGCIQQRR NW P
Subjt:  MAMPSGNVGVSDKVSFQS--SGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAP

Query:  VLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFK-QQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI
        V+HMQQYF VA+V Y LQQV SRRQQRY DPVKVGPK YRRPGPGFK QQ QQQG+R+E TVKEEMVTCA+S N GNSSSFVGSRKVEQVSN CEESK  
Subjt:  VLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFK-QQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVI

Query:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK
        GEDE  NDKDS SAEDRKDT+GKDQ +SKPK AE+LEDNASNKES VEP DDG SSS+R+KE  SVQS+NG QYAA TPRTFVANE+ DG  VNVMDGLK
Subjt:  GEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKE--SVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLK

Query:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD
         +ED LDDAEVSKLLSLVNDLRASGKRGQFQG TYVVSKRPMKGHGREMIQLGFPIAD P+DDDNSS LSKDRRIESIPSLLQDLIDRLVG Q+M+VKPD
Subjt:  SYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPD

Query:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRA--
        SCIIDFYNEGDHSQPHVWPPWF RPV VLLLTECEMTFGRVIG+DHSGNY+GA  L+L PGTLLV EGKSA+FAKHAIPAIRKQRILVTLTKS PKRA  
Subjt:  SCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRA--

Query:  -DGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPF-PPMPIPAGPSAWPATHQRHPPPRLP
         DGQRTS NVG ++SWGPPSVRSP+ RLSPG KHY ++ STGVL +P IR QMPPPNGIP +IV PVA  PMPF PP+PIP GP  WP  H RHPPPRLP
Subjt:  -DGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL-STGVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPF-PPMPIPAGPSAWPATHQRHPPPRLP

Query:  VPGTGVFL-PPGSSSAPSSQP--LP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ--ENLEAQNAG
        VPGTGVFL PPGSSSAPS  P  LP   VET +++EKENGSTKSD+N GAS GEKSEAKPQ+QECN    G+ S KV +EEQQEQQ+  ENL+AQ+AG
Subjt:  VPGTGVFL-PPGSSSAPSSQP--LP---VETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGKVTKEEQQEQQQ--ENLEAQNAG

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B9.0e-2931.48Show/hide
Query:  RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIP
        + F+  E   G  VNV+DGL+ +  +    E  +++  V  L+  G+RG+ + +T+    + M+G GRE IQ G     AP    N   + +   ++ +P
Subjt:  RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIP

Query:  SLLQDLIDRLVGGQVM--AVKPDSCIIDFYNEGDHSQPHVWPPWFVRP-VSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKH
         L + +I +L+   V+     PDSCI++ Y+EGD   PH+    F+RP  ++  L+EC++ FG  +  +  G++ G+ ++ L  G++LV  G  A+ AKH
Subjt:  SLLQDLIDRLVGGQVM--AVKPDSCIIDFYNEGDHSQPHVWPPWFVRP-VSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKH

Query:  AIPAIRKQRILVTLTK
         +PA+  +RI +T  K
Subjt:  AIPAIRKQRILVTLTK

Q9ZT92 RNA demethylase ALKBH10B2.7e-5731.34Show/hide
Query:  RDGFIAWLRAEFAASNAIIDALCHHLRAVGEP---GEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGF
        +D  I+W R EFAA+NAIIDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y  +AEV   LQ+V +++ +                    
Subjt:  RDGFIAWLRAEFAASNAIIDALCHHLRAVGEP---GEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGF

Query:  KQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQV
        KQ+  ++    E  +KE + T                 + E+V   C   + + E++ + D +             D  D  P S   + D+ S+++   
Subjt:  KQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQV

Query:  EPIDDGSSS----SHRDKESVQSRNGNQYAAITP-RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKG
          + D +      SH D ++       +   I P + F A E   G+TVNV+ GLK YE+LL + E+SKLL  V +LR +G  G+  G+++++  + +KG
Subjt:  EPIDDGSSS----SHRDKESVQSRNGNQYAAITP-RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKG

Query:  HGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMA--VKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVI
        + RE+IQLG PI      D+NS+  +    IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+S L+L+E  M +GR++
Subjt:  HGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMA--VKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVI

Query:  GSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQ---RTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL--ST
         SD+ GN+RG  TL+L  G+LLV  G SA+ A+H +   + +R+ +T  +  P          +  N G  + W P  + +P+  L+ G+ H   +    
Subjt:  GSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQ---RTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL--ST

Query:  GVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSA----WPAT-----HQRHPPPRLPVPGTGVFLPPGSSSAPS
        GVL  P +   M P         PPV P  +P P +    G +     W +      H +H PPR       + LPP +SS+P+
Subjt:  GVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSA----WPAT-----HQRHPPPRLPVPGTGVFLPPGSSSAPS

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein2.8e-12646.61Show/hide
Query:  PRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQ-----QRYSDPVKVGP
        P  W PDERDGFI+WLRAEFAA+NAIID+LC HL+AVG+  EY++VIG I  RR  W+ VL MQQ+FPVA+V Y LQQ+  +RQ     QR+ +  +VG 
Subjt:  PRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQ-----QRYSDPVKVGP

Query:  KLYRRPGPGFKQQY-QQQGYRVETTVKEEMVTCADS-SNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEY
           RR GPGF + +    GYR            ADS + +G++ + V S +VE      EE+K+      S+ K  S AE+++D + K ++DSK +  + 
Subjt:  KLYRRPGPGFKQQY-QQQGYRVETTVKEEMVTCADS-SNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEY

Query:  LEDNASNKE-SQVEPIDDGSSSSHRDKESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYV
        LE++ + +E  +    + GS  +    E  Q  N  +  A   +TFV  E++D   VNV++GLK Y+ +LD  EVS+L+SLV +LR +G+RGQ Q + YV
Subjt:  LEDNASNKE-SQVEPIDDGSSSSHRDKESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYV

Query:  VSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEM
          KRP +GHGREMIQLG PIAD P DDD+     KDRRIE IPS L D+I+RLV  Q++ VKPD+CIIDF++EGDHSQPH++ PWF RP+SVL L+EC+ 
Subjt:  VSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEM

Query:  TFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQRTSFNVGQYSSWGPPSVRSPSSRL---SPGHKHYP
        TFGRVI S++ G+Y+G+  L+LTPG++L+ EGKSA  AK+AI A RKQRIL++  KS P+              S+WGPP  RSP+  +   +   KHYP
Subjt:  TFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQRTSFNVGQYSSWGPPSVRSPSSRL---SPGHKHYP

Query:  TL--STGVLTSPTIRTQMPPPNG-IPSIIVPPVAP--QPMPFPPMPIPAGPSAWP-----ATHQRHPPPRLPVPGTGVFLPPGSSSAPSSQPLPVETSAI
         +  STGVL +P+ R    PPNG +  I +PP  P   PMPFP   +P GP  WP       HQ  P PR+P+PGTGVFLPPG     S+Q L   ++  
Subjt:  TL--STGVLTSPTIRTQMPPPNG-IPSIIVPPVAP--QPMPFPPMPIPAGPSAWP-----ATHQRHPPPRLPVPGTGVFLPPGSSSAPSSQPLPVETSAI

Query:  AEKENGSTKSDNNTGASPGE
          K +   K +   G   GE
Subjt:  AEKENGSTKSDNNTGASPGE

AT1G14710.2 hydroxyproline-rich glycoprotein family protein2.8e-12646.61Show/hide
Query:  PRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQ-----QRYSDPVKVGP
        P  W PDERDGFI+WLRAEFAA+NAIID+LC HL+AVG+  EY++VIG I  RR  W+ VL MQQ+FPVA+V Y LQQ+  +RQ     QR+ +  +VG 
Subjt:  PRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQ-----QRYSDPVKVGP

Query:  KLYRRPGPGFKQQY-QQQGYRVETTVKEEMVTCADS-SNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEY
           RR GPGF + +    GYR            ADS + +G++ + V S +VE      EE+K+      S+ K  S AE+++D + K ++DSK +  + 
Subjt:  KLYRRPGPGFKQQY-QQQGYRVETTVKEEMVTCADS-SNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEY

Query:  LEDNASNKE-SQVEPIDDGSSSSHRDKESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYV
        LE++ + +E  +    + GS  +    E  Q  N  +  A   +TFV  E++D   VNV++GLK Y+ +LD  EVS+L+SLV +LR +G+RGQ Q + YV
Subjt:  LEDNASNKE-SQVEPIDDGSSSSHRDKESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYV

Query:  VSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEM
          KRP +GHGREMIQLG PIAD P DDD+     KDRRIE IPS L D+I+RLV  Q++ VKPD+CIIDF++EGDHSQPH++ PWF RP+SVL L+EC+ 
Subjt:  VSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEM

Query:  TFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQRTSFNVGQYSSWGPPSVRSPSSRL---SPGHKHYP
        TFGRVI S++ G+Y+G+  L+LTPG++L+ EGKSA  AK+AI A RKQRIL++  KS P+              S+WGPP  RSP+  +   +   KHYP
Subjt:  TFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQRTSFNVGQYSSWGPPSVRSPSSRL---SPGHKHYP

Query:  TL--STGVLTSPTIRTQMPPPNG-IPSIIVPPVAP--QPMPFPPMPIPAGPSAWP-----ATHQRHPPPRLPVPGTGVFLPPGSSSAPSSQPLPVETSAI
         +  STGVL +P+ R    PPNG +  I +PP  P   PMPFP   +P GP  WP       HQ  P PR+P+PGTGVFLPPG     S+Q L   ++  
Subjt:  TL--STGVLTSPTIRTQMPPPNG-IPSIIVPPVAP--QPMPFPPMPIPAGPSAWP-----ATHQRHPPPRLPVPGTGVFLPPGSSSAPSSQPLPVETSAI

Query:  AEKENGSTKSDNNTGASPGE
          K +   K +   G   GE
Subjt:  AEKENGSTKSDNNTGASPGE

AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.4e-3031.48Show/hide
Query:  RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIP
        + F+  E   G  VNV+DGL+ +  +    E  +++  V  L+  G+RG+ + +T+    + M+G GRE IQ G     AP    N   + +   ++ +P
Subjt:  RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIP

Query:  SLLQDLIDRLVGGQVM--AVKPDSCIIDFYNEGDHSQPHVWPPWFVRP-VSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKH
         L + +I +L+   V+     PDSCI++ Y+EGD   PH+    F+RP  ++  L+EC++ FG  +  +  G++ G+ ++ L  G++LV  G  A+ AKH
Subjt:  SLLQDLIDRLVGGQVM--AVKPDSCIIDFYNEGDHSQPHVWPPWFVRP-VSVLLLTECEMTFGRVIGSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKH

Query:  AIPAIRKQRILVTLTK
         +PA+  +RI +T  K
Subjt:  AIPAIRKQRILVTLTK

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein2.1e-4927.84Show/hide
Query:  RDGFIAWLRAEFAASNAIIDALCHHL-RAVGEPGEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQ
        +D  + W R EFAA+NAIIDALC HL +A G   +Y++V+  + +RR NW PVL MQ+Y  +++V   LQQ  ++                     GF  
Subjt:  RDGFIAWLRAEFAASNAIIDALCHHL-RAVGEPGEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQ

Query:  QYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEP
                             D  +  +  +  GSR+ E +S  C                            K +++ + + A  L+ +          
Subjt:  QYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQVEP

Query:  IDDGSSSSHRDKESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQ
                                    + F A E   G+T NV+ GLK Y+D+    ++SKLL  +N LR +G+  Q  G+T+V+  +  KG  RE++Q
Subjt:  IDDGSSSSHRDKESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQ

Query:  LGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMA--VKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGN
        LG PI     D+ +         +E IP+L+Q +ID L+  +++    +P+ C+I+F++E +HSQP   PP   +P+S L+L+E  M FG  +G D+ GN
Subjt:  LGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMA--VKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSDHSGN

Query:  YRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQ----------------------RTSFNVGQYSSWGPPSVRSPSSRLSP
        +RG+ TL L  G+LLV  G SA+ A+H +     +R+ +T  K  P     Q                      R     G +  W PP  R P+  L P
Subjt:  YRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQ----------------------RTSFNVGQYSSWGPPSVRSPSSRLSP

Query:  GHKHYPTLST
          +    LS+
Subjt:  GHKHYPTLST

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.9e-5831.34Show/hide
Query:  RDGFIAWLRAEFAASNAIIDALCHHLRAVGEP---GEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGF
        +D  I+W R EFAA+NAIIDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y  +AEV   LQ+V +++ +                    
Subjt:  RDGFIAWLRAEFAASNAIIDALCHHLRAVGEP---GEYDNVIGCIQQRRGNWAPVLHMQQYFPVAEVMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGF

Query:  KQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQV
        KQ+  ++    E  +KE + T                 + E+V   C   + + E++ + D +             D  D  P S   + D+ S+++   
Subjt:  KQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGKDQNDSKPKSAEYLEDNASNKESQV

Query:  EPIDDGSSS----SHRDKESVQSRNGNQYAAITP-RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKG
          + D +      SH D ++       +   I P + F A E   G+TVNV+ GLK YE+LL + E+SKLL  V +LR +G  G+  G+++++  + +KG
Subjt:  EPIDDGSSS----SHRDKESVQSRNGNQYAAITP-RTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKG

Query:  HGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMA--VKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVI
        + RE+IQLG PI      D+NS+  +    IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+S L+L+E  M +GR++
Subjt:  HGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMA--VKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVI

Query:  GSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQ---RTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL--ST
         SD+ GN+RG  TL+L  G+LLV  G SA+ A+H +   + +R+ +T  +  P          +  N G  + W P  + +P+  L+ G+ H   +    
Subjt:  GSDHSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQ---RTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTL--ST

Query:  GVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSA----WPAT-----HQRHPPPRLPVPGTGVFLPPGSSSAPS
        GVL  P +   M P         PPV P  +P P +    G +     W +      H +H PPR       + LPP +SS+P+
Subjt:  GVLTSPTIRTQMPPPNGIPSIIVPPVAPQPMPFPPMPIPAGPSA----WPAT-----HQRHPPPRLPVPGTGVFLPPGSSSAPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGCCATCGGGAAATGTGGGTGTATCCGATAAGGTTTCGTTTCAGAGCAGTGGCGGAGTTGTGGTGAGCGGTGGCGGAGGCGGTGGGGGTGCTGAGATGCATCA
GCATCACCCCCGCCAATGGTTTCCTGATGAGCGTGATGGATTTATCGCATGGTTGCGAGCTGAATTTGCCGCTTCAAATGCAATCATTGATGCCCTTTGCCATCATCTGC
GTGCTGTGGGAGAGCCTGGGGAATATGATAATGTTATTGGGTGTATTCAGCAACGACGGGGTAATTGGGCGCCGGTGCTTCATATGCAACAGTACTTTCCAGTGGCAGAA
GTGATGTATGGCCTTCAGCAGGTAACCTCGAGGAGGCAGCAGAGGTATAGTGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCAGGGTTTAAGCAGCA
GTATCAACAACAGGGCTATCGTGTTGAAACCACAGTCAAGGAAGAGATGGTTACTTGTGCAGATTCTAGTAATAGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGAAAGG
TGGAGCAAGTAAGTAATGCATGTGAGGAAAGTAAGGTAATAGGGGAGGATGAAAAATCGAATGACAAAGATTCAAGCTCAGCCGAGGATAGAAAAGATACTAATGGGAAA
GACCAAAATGATAGCAAACCGAAGTCTGCCGAATATTTAGAAGACAATGCAAGTAATAAAGAATCTCAAGTTGAACCTATTGATGATGGTAGCTCTTCTAGTCACAGAGA
TAAGGAATCTGTTCAAAGCCGGAATGGAAATCAGTATGCTGCCATAACCCCAAGAACCTTTGTTGCCAATGAGATATTTGATGGAAATACGGTTAATGTGATGGATGGAT
TGAAATCATATGAAGATTTATTGGATGATGCTGAGGTTTCAAAGCTTCTTTCATTAGTGAATGATTTGAGAGCTTCTGGGAAGAGAGGACAATTTCAAGGTCAGACATAT
GTAGTCTCAAAACGACCGATGAAGGGTCACGGGAGAGAAATGATCCAACTAGGCTTTCCAATTGCAGATGCTCCTTATGATGATGACAATTCTTCAAGGCTCTCAAAAGA
TAGAAGAATAGAATCCATCCCCTCATTGCTTCAAGATCTCATTGATCGCTTGGTTGGGGGGCAAGTAATGGCAGTGAAGCCAGATTCCTGCATCATTGACTTCTATAACG
AGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGTGAGGCCTGTTAGTGTCCTCCTTTTGACTGAATGTGAAATGACTTTTGGTAGAGTAATTGGTTCCGAC
CATTCTGGCAACTATAGAGGTGCAAAGACCCTGGCTCTTACGCCCGGAACCCTTCTTGTGGCCGAAGGAAAATCTGCAGAGTTTGCTAAGCATGCAATTCCTGCTATCCG
CAAGCAACGGATACTAGTTACCTTGACCAAATCAGTACCAAAAAGAGCTGATGGGCAACGCACATCTTTCAACGTAGGTCAATATTCCAGTTGGGGCCCTCCATCTGTTA
GATCACCGAGCTCTCGTCTTTCCCCGGGACACAAGCATTACCCTACACTGTCGACCGGGGTGCTAACCTCCCCAACCATCCGCACCCAAATGCCACCACCAAATGGCATA
CCCTCCATAATTGTCCCTCCTGTAGCACCCCAACCTATGCCTTTTCCTCCCATGCCGATCCCAGCCGGTCCATCAGCATGGCCTGCCACACATCAAAGGCACCCTCCACC
TCGTCTCCCCGTCCCAGGCACCGGAGTATTCCTTCCTCCAGGTTCTTCCAGCGCTCCATCTTCTCAACCGTTGCCAGTTGAGACAAGCGCCATTGCAGAAAAGGAAAACG
GTTCGACAAAATCTGATAACAATACAGGCGCCTCTCCGGGAGAAAAATCTGAAGCAAAACCTCAAAAGCAAGAATGCAATGGAGGGGTGGATGGAAATGGGAGTGGTAAA
GTGACAAAGGAAGAACAACAAGAGCAGCAACAGGAGAATTTAGAGGCTCAAAATGCAGGATCCCAAGTCTAG
mRNA sequenceShow/hide mRNA sequence
TACCCTTTTTGAGCTCAAGAATTTGTGAGCATCTCTTACACTGCTCCCTTTTCGATTCCCAGTTTTTTCTTTTCTTTTTTTTTTAATTTTTTTGAAAAAATTTTCTTCAT
CAGTGAAGGAAAAAAAACCATTTAACATTCTTGTTCATTATGACCATTCACAATTTTCACTTTCCAAACCCTAGATCTTAATTCCCACAATTCCCACAATTCCCACAATT
CCCTTTTTCTTCTTTGTATTTCACTGTGCGTTGTTTTCAAGCACAAACTCTAAAAAGAGCTTTTTCTCAAACCCCATTTTGCAGATCTGATACAATTGGGCTCCATCGCC
ACCCAAATTCTCCTGTAGCTGTTTATGAATTCCCAGAATTTAGCTTCTTAATTTATCGGTTCAGATTTGAGGATTTACTACTCTGTATATGTGGTTTTAGTGTTTATTCT
GGTTAGAGGATTCTCATGGCAATGCCATCGGGAAATGTGGGTGTATCCGATAAGGTTTCGTTTCAGAGCAGTGGCGGAGTTGTGGTGAGCGGTGGCGGAGGCGGTGGGGG
TGCTGAGATGCATCAGCATCACCCCCGCCAATGGTTTCCTGATGAGCGTGATGGATTTATCGCATGGTTGCGAGCTGAATTTGCCGCTTCAAATGCAATCATTGATGCCC
TTTGCCATCATCTGCGTGCTGTGGGAGAGCCTGGGGAATATGATAATGTTATTGGGTGTATTCAGCAACGACGGGGTAATTGGGCGCCGGTGCTTCATATGCAACAGTAC
TTTCCAGTGGCAGAAGTGATGTATGGCCTTCAGCAGGTAACCTCGAGGAGGCAGCAGAGGTATAGTGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCC
AGGGTTTAAGCAGCAGTATCAACAACAGGGCTATCGTGTTGAAACCACAGTCAAGGAAGAGATGGTTACTTGTGCAGATTCTAGTAATAGTGGGAATTCTTCAAGTTTTG
TAGGCTCTAGAAAGGTGGAGCAAGTAAGTAATGCATGTGAGGAAAGTAAGGTAATAGGGGAGGATGAAAAATCGAATGACAAAGATTCAAGCTCAGCCGAGGATAGAAAA
GATACTAATGGGAAAGACCAAAATGATAGCAAACCGAAGTCTGCCGAATATTTAGAAGACAATGCAAGTAATAAAGAATCTCAAGTTGAACCTATTGATGATGGTAGCTC
TTCTAGTCACAGAGATAAGGAATCTGTTCAAAGCCGGAATGGAAATCAGTATGCTGCCATAACCCCAAGAACCTTTGTTGCCAATGAGATATTTGATGGAAATACGGTTA
ATGTGATGGATGGATTGAAATCATATGAAGATTTATTGGATGATGCTGAGGTTTCAAAGCTTCTTTCATTAGTGAATGATTTGAGAGCTTCTGGGAAGAGAGGACAATTT
CAAGGTCAGACATATGTAGTCTCAAAACGACCGATGAAGGGTCACGGGAGAGAAATGATCCAACTAGGCTTTCCAATTGCAGATGCTCCTTATGATGATGACAATTCTTC
AAGGCTCTCAAAAGATAGAAGAATAGAATCCATCCCCTCATTGCTTCAAGATCTCATTGATCGCTTGGTTGGGGGGCAAGTAATGGCAGTGAAGCCAGATTCCTGCATCA
TTGACTTCTATAACGAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGTGAGGCCTGTTAGTGTCCTCCTTTTGACTGAATGTGAAATGACTTTTGGTAGA
GTAATTGGTTCCGACCATTCTGGCAACTATAGAGGTGCAAAGACCCTGGCTCTTACGCCCGGAACCCTTCTTGTGGCCGAAGGAAAATCTGCAGAGTTTGCTAAGCATGC
AATTCCTGCTATCCGCAAGCAACGGATACTAGTTACCTTGACCAAATCAGTACCAAAAAGAGCTGATGGGCAACGCACATCTTTCAACGTAGGTCAATATTCCAGTTGGG
GCCCTCCATCTGTTAGATCACCGAGCTCTCGTCTTTCCCCGGGACACAAGCATTACCCTACACTGTCGACCGGGGTGCTAACCTCCCCAACCATCCGCACCCAAATGCCA
CCACCAAATGGCATACCCTCCATAATTGTCCCTCCTGTAGCACCCCAACCTATGCCTTTTCCTCCCATGCCGATCCCAGCCGGTCCATCAGCATGGCCTGCCACACATCA
AAGGCACCCTCCACCTCGTCTCCCCGTCCCAGGCACCGGAGTATTCCTTCCTCCAGGTTCTTCCAGCGCTCCATCTTCTCAACCGTTGCCAGTTGAGACAAGCGCCATTG
CAGAAAAGGAAAACGGTTCGACAAAATCTGATAACAATACAGGCGCCTCTCCGGGAGAAAAATCTGAAGCAAAACCTCAAAAGCAAGAATGCAATGGAGGGGTGGATGGA
AATGGGAGTGGTAAAGTGACAAAGGAAGAACAACAAGAGCAGCAACAGGAGAATTTAGAGGCTCAAAATGCAGGATCCCAAGTCTAGGGAGAGAAGAAGATACATCATTA
GAGAGAGAAAATGCAGACTTGAATGAGTTCCAAAGCAAAATGTAGAGAGCAGCAACATTCAAAGACTGATGATTGTCACACACACTAATCACTACTACCAAGGAGAACAG
GTTCATTCCTTTTTTTTTTGGTTGTCCTAAAAAACATATTGGTTAGATGGGAAAACTCATTCCTAATGGAACTTTGGATCTTACTTATCATCTTCTTTAACACAGCTTTT
TAAAAGTCAGAATCAATAGGGATTACTTTTTTTTGAAACAGTTGAGAAAAAAAAATGCTAACCTTTTGAGTTATCAGGAATTGGATCTTTTCTCTGATATGTGGAATACA
TGATCAGTTTTATACTTCCCAAAACTCTGTTAAACCATGGATTTTAAGGTTAATTCCACTCGTTAACGTGATTTCAAATCGAC
Protein sequenceShow/hide protein sequence
MAMPSGNVGVSDKVSFQSSGGVVVSGGGGGGGAEMHQHHPRQWFPDERDGFIAWLRAEFAASNAIIDALCHHLRAVGEPGEYDNVIGCIQQRRGNWAPVLHMQQYFPVAE
VMYGLQQVTSRRQQRYSDPVKVGPKLYRRPGPGFKQQYQQQGYRVETTVKEEMVTCADSSNSGNSSSFVGSRKVEQVSNACEESKVIGEDEKSNDKDSSSAEDRKDTNGK
DQNDSKPKSAEYLEDNASNKESQVEPIDDGSSSSHRDKESVQSRNGNQYAAITPRTFVANEIFDGNTVNVMDGLKSYEDLLDDAEVSKLLSLVNDLRASGKRGQFQGQTY
VVSKRPMKGHGREMIQLGFPIADAPYDDDNSSRLSKDRRIESIPSLLQDLIDRLVGGQVMAVKPDSCIIDFYNEGDHSQPHVWPPWFVRPVSVLLLTECEMTFGRVIGSD
HSGNYRGAKTLALTPGTLLVAEGKSAEFAKHAIPAIRKQRILVTLTKSVPKRADGQRTSFNVGQYSSWGPPSVRSPSSRLSPGHKHYPTLSTGVLTSPTIRTQMPPPNGI
PSIIVPPVAPQPMPFPPMPIPAGPSAWPATHQRHPPPRLPVPGTGVFLPPGSSSAPSSQPLPVETSAIAEKENGSTKSDNNTGASPGEKSEAKPQKQECNGGVDGNGSGK
VTKEEQQEQQQENLEAQNAGSQV