; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G007310 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G007310
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide (PPR) repeat protein-like
Genome locationCmo_Chr20:3623481..3630422
RNA-Seq ExpressionCmoCh20G007310
SyntenyCmoCh20G007310
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR001958 - Tetracycline resistance protein TetA/multidrug resistance protein MdtG
IPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011701 - Major facilitator superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR020846 - Major facilitator superfamily domain
IPR036063 - Smr domain superfamily
IPR036259 - MFS transporter superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571003.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.5e-23592.44Show/hide
Query:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
        MELRFCPPPYVIGDSVRLFSKAPKRYD FCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKS+TLNVLSDILSSRTAQPG
Subjt:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG

Query:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
        LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG AYA LLELLY SSSIYVKR
Subjt:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR

Query:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
        RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADM           
Subjt:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA

Query:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
                         ITSMLQDKS DLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
Subjt:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC

Query:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVCLR
        VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNT CFVAKGKAVKNWVCLR
Subjt:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVCLR

KAG7010834.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]2.2e-25398.47Show/hide
Query:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
        MELRFCPPPYVIGDSVRLFSKAPKRYD FCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKS+TLNVLSDILSSRTAQPG
Subjt:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG

Query:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
        LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG AYA LLELLY SSSIYVKR
Subjt:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR

Query:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
        RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Subjt:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA

Query:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
        LPFSVRTYNSVLNSC KITSMLQDKS DLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
Subjt:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC

Query:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
        VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Subjt:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW

XP_022943803.1 pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita moschata]2.7e-259100Show/hide
Query:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
        MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
Subjt:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG

Query:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
        LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
Subjt:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR

Query:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
        RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Subjt:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA

Query:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
        LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
Subjt:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC

Query:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
        VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Subjt:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW

XP_022985638.1 pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita maxima]1.5e-24996.95Show/hide
Query:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
        MELR CPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
Subjt:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG

Query:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
        LCSVA  LYSRITETSWF WNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGF  AYACL ELLY SSSIYVKR
Subjt:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR

Query:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
        RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKND IALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Subjt:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA

Query:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
        LPFSVRTYNSVLNSCPKITSMLQDKS DLPVLIEDLI+VLDGDEALLVEELVGSSVL+EVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFE+ESC
Subjt:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC

Query:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
        VIPAQVTVICGSGNHSIVR ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Subjt:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW

XP_023512520.1 pentatricopeptide repeat-containing protein At2g17033 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-25498.26Show/hide
Query:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
        MELR CPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
Subjt:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG

Query:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
        LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFG AYACLLELLY SSSIYVKR
Subjt:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR

Query:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
        RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYG HNKLADMVLWLQIMKTSA
Subjt:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA

Query:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
        LPFSVRTYNSVLNSCPKITS+LQDKS DLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHG HVGAAYVIILEWMKEMRLKFEDESC
Subjt:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC

Query:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
        VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCF+AKGKAVKNW
Subjt:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW

TrEMBL top hitse value%identityAlignment
A0A6J1FTY6 uncharacterized protein LOC111448122 isoform X23.1e-22191.67Show/hide
Query:  MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT
        MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT
Subjt:  MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT

Query:  TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ
        TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ
Subjt:  TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ

Query:  PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP
        PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP
Subjt:  PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP

Query:  ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPI----------------------------IFNIASSQVGPSEQGKAQGYI
        ALALAIRQERLLSIGLWASIIN           VPYALRAFTIFTILVSPI                            IFNIASSQVGPSEQGKAQGYI
Subjt:  ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPI----------------------------IFNIASSQVGPSEQGKAQGYI

Query:  SGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV
        SGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV
Subjt:  SGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV

A0A6J1FXE0 pentatricopeptide repeat-containing protein At2g17033 isoform X11.3e-259100Show/hide
Query:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
        MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
Subjt:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG

Query:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
        LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
Subjt:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR

Query:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
        RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Subjt:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA

Query:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
        LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
Subjt:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC

Query:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
        VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Subjt:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW

A0A6J1FXN8 uncharacterized protein LOC111448122 isoform X32.7e-21297.12Show/hide
Query:  MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT
        MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT
Subjt:  MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT

Query:  TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ
        TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ
Subjt:  TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ

Query:  PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP
        PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP
Subjt:  PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP

Query:  ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAP
        ALALAIRQERLLSIGLWASIIN           VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAP
Subjt:  ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAP

Query:  FDFPGFGILCIGLASL
        FDFPGFGILCIGLAS+
Subjt:  FDFPGFGILCIGLASL

A0A6J1JBV8 hippocampus abundant transcript 1 protein-like isoform X33.0e-21693.41Show/hide
Query:  MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT
        MEKMMSLNHLFVTTFIGSLSMFMVIP+IVD+TMEFVCPHQDHCSIAIYLSG+QQAIVGLGA+VITPVIGNLSDRYGRK MLTIP+TFSIIPLAIMGYRRT
Subjt:  MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRT

Query:  TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ
        TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTF ARLLSTATVFQVAAFMSVLAVVHMR FLKESIPDQNELTQ
Subjt:  TNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ

Query:  PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP
        PI DENLSGGDDENGPELPT TQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMII+G+AGA+SLFLLMP
Subjt:  PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMP

Query:  ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAP
        ALALAIRQERLLSIGLWASIIN           VPYALRA TIFTILVSPIIFNIASSQVGPSEQGKAQG ISGINSLANI SPLLFSPLIALFLSKDAP
Subjt:  ALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAP

Query:  FDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV
        FDFPGFGILCIGLASLIGF LSLMIRVDPFI IQKIKNLV
Subjt:  FDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLV

A0A6J1JE75 pentatricopeptide repeat-containing protein At2g17033 isoform X17.2e-25096.95Show/hide
Query:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
        MELR CPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG
Subjt:  MELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPG

Query:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR
        LCSVA  LYSRITETSWF WNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGF  AYACL ELLY SSSIYVKR
Subjt:  LCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKR

Query:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
        RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKND IALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA
Subjt:  RAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSA

Query:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC
        LPFSVRTYNSVLNSCPKITSMLQDKS DLPVLIEDLI+VLDGDEALLVEELVGSSVL+EVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFE+ESC
Subjt:  LPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESC

Query:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
        VIPAQVTVICGSGNHSIVR ESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW
Subjt:  VIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNW

SwissProt top hitse value%identityAlignment
A4IF94 Hippocampus abundant transcript-like protein 17.4e-1022.78Show/hide
Query:  HQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQ
        H+        ++G+ Q + GL + +  P+IG LSD +GRK  L   + F+  P+ +M   R + ++Y  + M +++ + S   T S+  AYVAD T E +
Subjt:  HQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQ

Query:  RISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDV
        R +A+G +S   +   V     G +L+     + V  VA  +++L +  +   + ES+P++           LS G            ++S   +     
Subjt:  RISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPTRTQLSIGMSSIRDV

Query:  ISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQERLLSIGL--------WASIINVPYA
        +  +   +T        F + L E G  +S   +L+    F   + A  + + G+   V+  + + +L  ++  +  + +GL        W    +  + 
Subjt:  ISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQERLLSIGL--------WASIINVPYA

Query:  LRAFTIFTILVS---PIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF
        + A  I   + S   P +  + S     ++QG AQG I+GI  L N   P L+  +  +F
Subjt:  LRAFTIFTILVS---PIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF

P02982 Tetracycline resistance protein, class A1.6e-1224.54Show/hide
Query:  TTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKT
        T  + ++ + +++P +  L  + V     H +      G+  A+  L      PV+G LSDR+GR+ +L + +  + +  AIM    T  F +  YI + 
Subjt:  TTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIMKT

Query:  LTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLL---STATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDENLSG
        +  +   G T ++A AY+AD T  D+R   FG +S     G V G  L  L+   S    F  AA ++ L  +     L ES                  
Subjt:  LTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLL---STATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDENLSG

Query:  GDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQE
             G   P R +    ++S R         T  +    V F   L  +   A    F + RFH+D       +   G+  +++  ++   +A  + + 
Subjt:  GDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQE

Query:  RLLSIGLWA---SIINVPYALRAFTIFTILV--------SPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIA
        R L +G+ A     I + +A R +  F I+V         P +  + S QV    QG+ QG ++ + SL +I  PLLF+ + A
Subjt:  RLLSIGLWA---SIINVPYALRAFTIFTILV--------SPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIA

P70187 Hippocampus abundant transcript 1 protein7.9e-1222.42Show/hide
Query:  SLNHLFVTTFIGSLSM-FMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFF
        S+ H  +  F+   +   +  PT+V L       H+        ++G+ Q + GL + +  P+IG LSD +GRK+ L + + F+  P+ +M   + + ++
Subjt:  SLNHLFVTTFIGSLSM-FMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFF

Query:  YAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ
        Y   I  +    V    T S+  AYVAD T E +R  A+G++S   +   V     G +L R+   + V  +A  +++L +  +   + ES+P++     
Subjt:  YAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ

Query:  ---PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFL
           PI  E             P  +   +G  SI  +I +              F + L E G  +S   +L+    F     A  + + G+   ++  +
Subjt:  ---PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFL

Query:  LMPALALAIRQERLLSIGLWASIINVP-----------YALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF
        ++  L  +I  +  + +GL   I+ +            +A  A    + +  P +  + S      +QG  QG I+GI  L N   P L+  +  +F
Subjt:  LMPALALAIRQERLLSIGLWASIINVP-----------YALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF

Q8GWA9 Pentatricopeptide repeat-containing protein At2g170332.5e-13056.59Show/hide
Query:  LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETL
        L K   RFLS+L++ A AGD SA NR I+KFVA+SPKS+ LNVLS +LS +T+ P L   AL+LYS ITE SWF WN KL+A+L+A L+K  +  +SETL
Subjt:  LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETL

Query:  ISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYA
        +S A+S+L   ER    F C LVES SK GS +GF  A   L E++ +SSS+YVK +AY+SMV+GLC+M +P +AE +++EM+ +   P  FEY+S++Y 
Subjt:  ISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYA

Query:  YGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE
        YG LGLF+DM R +  M  +   +DTVCSNMVLSSYG H+ L  M  WLQ +K   +PFS+RTYNSVLNSCP I SML+D  D  PV + +L T L+ DE
Subjt:  YGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE

Query:  ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRK
        ALLV EL  SSVL E + W+A+E KLDLHG H+ ++Y+I+L+WM E RL+F +E CVIPA++ V+ GSG HS VRGESPVKAL+++IM RT SP+RIDRK
Subjt:  ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRK

Query:  NTGCFVAKGKAVKNWVC
        N G F+AKGK VK W+C
Subjt:  NTGCFVAKGKAVKNWVC

Q96MC6 Hippocampus abundant transcript 1 protein7.9e-1222.42Show/hide
Query:  SLNHLFVTTFIGSLSM-FMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFF
        S+ H  +  F+   +   +  PT+V L       H+        ++G+ Q + GL + +  P+IG LSD +GRK+ L + + F+  P+ +M   + + ++
Subjt:  SLNHLFVTTFIGSLSM-FMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFF

Query:  YAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ
        Y   I  +    V    T S+  AYVAD T E +R  A+G++S   +   V     G +L R+   + V  +A  +++L +  +   + ES+P++     
Subjt:  YAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYV----CGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQ

Query:  ---PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFL
           PI  E             P  +   +G  SI  +I +              F + L E G  +S   +L+    F     A  + + G+   ++  +
Subjt:  ---PIFDENLSGGDDENGPELPTRTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFL

Query:  LMPALALAIRQERLLSIGLWASIINVP-----------YALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF
        ++  L  +I  +  + +GL   I+ +            +A  A    + +  P +  + S      +QG  QG I+GI  L N   P L+  +  +F
Subjt:  LMPALALAIRQERLLSIGLWASIINVP-----------YALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALF

Arabidopsis top hitse value%identityAlignment
AT2G16980.2 Major facilitator superfamily protein3.8e-9445.84Show/hide
Query:  KMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPH-QDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTT
        ++  L HL VT F+  L+ +++ P + D+T+  VC    D CS+A+YL+GVQQ  VG+G +V+ PVIGNLSDRYG KAMLT+PM  S++P AI+GYRR T
Subjt:  KMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPH-QDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTT

Query:  NFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQP
        NFFYAFY++KTL DMV +GT   LA AYVA      +RIS FGIL+GV S+  VC +  AR LS A+ FQVAA    + +V+MR FLKE + D ++    
Subjt:  NFFYAFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQP

Query:  IFDENLSGG-----DDENGPEL-----------PTRTQL-SIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLM
          DE  SGG     +  NG +L           PT+T + +   SS +D++SLI +ST   QA  V+FF + +E G  ++L YFLKARF F+KN FA+L 
Subjt:  IFDENLSGG-----DDENGPEL-----------PTRTQL-SIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLM

Query:  IIEGVAGAVSLFLLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASP
        ++  + G++S   ++P L+  I + ++LS GL     N           VPYA+       + V P +  IAS QVG SEQGK QG ISG+ + A + +P
Subjt:  IIEGVAGAVSLFLLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASP

Query:  LLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDP
         ++SPL ALFLS++APF FPGF ILCI ++ +IGF  SL+I+  P
Subjt:  LLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDP

AT2G16990.1 Major facilitator superfamily protein1.4e-9346.19Show/hide
Query:  LNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCP-HQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFY
        L H+  T F+ + + FMV+P I D+T+  VC    D CS+A+YL+G QQ  +G+G +++ PVIGNLSDRYG K +LT+PM  SI+P  I+GYRR   FFY
Subjt:  LNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCP-HQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFY

Query:  AFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDE
         FYI K LT MV EGT   LA AYVA       RISAFGIL+G++++  + GT +AR L  A  FQV+A    + +V+MR FLKE + D  +        
Subjt:  AFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDE

Query:  NLSGGDDENGPEL--------PTRTQL-SIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLF
        +    D  N   L        P +TQ+     SS++D+ISL+ +ST F QA  V+FF+S ++ GM+++  YFLKARF FDK QFADL+++  + G++S  
Subjt:  NLSGGDDENGPEL--------PTRTQL-SIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLF

Query:  LLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLS
         ++P  A AI + +LLS GL+   IN           VPY    F    + V P +  IAS QVGP EQGK QG ISG+ S   + +P +FSPL ALFLS
Subjt:  LLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLS

Query:  KDAPFDFPGFGILCIGLASLIGFTLSLMIRVDP
        K+APF FPGF +LCI L+SLIGF  SL+I+  P
Subjt:  KDAPFDFPGFGILCIGLASLIGFTLSLMIRVDP

AT2G16990.2 Major facilitator superfamily protein1.4e-9346.19Show/hide
Query:  LNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCP-HQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFY
        L H+  T F+ + + FMV+P I D+T+  VC    D CS+A+YL+G QQ  +G+G +++ PVIGNLSDRYG K +LT+PM  SI+P  I+GYRR   FFY
Subjt:  LNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCP-HQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFY

Query:  AFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDE
         FYI K LT MV EGT   LA AYVA       RISAFGIL+G++++  + GT +AR L  A  FQV+A    + +V+MR FLKE + D  +        
Subjt:  AFYIMKTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDE

Query:  NLSGGDDENGPEL--------PTRTQL-SIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLF
        +    D  N   L        P +TQ+     SS++D+ISL+ +ST F QA  V+FF+S ++ GM+++  YFLKARF FDK QFADL+++  + G++S  
Subjt:  NLSGGDDENGPEL--------PTRTQL-SIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLF

Query:  LLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLS
         ++P  A AI + +LLS GL+   IN           VPY    F    + V P +  IAS QVGP EQGK QG ISG+ S   + +P +FSPL ALFLS
Subjt:  LLMPALALAIRQERLLSIGLWASIIN-----------VPYALRAFTIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLS

Query:  KDAPFDFPGFGILCIGLASLIGFTLSLMIRVDP
        K+APF FPGF +LCI L+SLIGF  SL+I+  P
Subjt:  KDAPFDFPGFGILCIGLASLIGFTLSLMIRVDP

AT2G17033.1 pentatricopeptide (PPR) repeat-containing protein1.7e-13156.59Show/hide
Query:  LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETL
        L K   RFLS+L++ A AGD SA NR I+KFVA+SPKS+ LNVLS +LS +T+ P L   AL+LYS ITE SWF WN KL+A+L+A L+K  +  +SETL
Subjt:  LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETL

Query:  ISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYA
        +S A+S+L   ER    F C LVES SK GS +GF  A   L E++ +SSS+YVK +AY+SMV+GLC+M +P +AE +++EM+ +   P  FEY+S++Y 
Subjt:  ISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYA

Query:  YGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE
        YG LGLF+DM R +  M  +   +DTVCSNMVLSSYG H+ L  M  WLQ +K   +PFS+RTYNSVLNSCP I SML+D  D  PV + +L T L+ DE
Subjt:  YGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE

Query:  ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRK
        ALLV EL  SSVL E + W+A+E KLDLHG H+ ++Y+I+L+WM E RL+F +E CVIPA++ V+ GSG HS VRGESPVKAL+++IM RT SP+RIDRK
Subjt:  ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRK

Query:  NTGCFVAKGKAVKNWVC
        N G F+AKGK VK W+C
Subjt:  NTGCFVAKGKAVKNWVC

AT2G17033.2 pentatricopeptide (PPR) repeat-containing protein1.7e-13156.59Show/hide
Query:  LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETL
        L K   RFLS+L++ A AGD SA NR I+KFVA+SPKS+ LNVLS +LS +T+ P L   AL+LYS ITE SWF WN KL+A+L+A L+K  +  +SETL
Subjt:  LTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDILSSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETL

Query:  ISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYA
        +S A+S+L   ER    F C LVES SK GS +GF  A   L E++ +SSS+YVK +AY+SMV+GLC+M +P +AE +++EM+ +   P  FEY+S++Y 
Subjt:  ISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRAYESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYA

Query:  YGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE
        YG LGLF+DM R +  M  +   +DTVCSNMVLSSYG H+ L  M  WLQ +K   +PFS+RTYNSVLNSCP I SML+D  D  PV + +L T L+ DE
Subjt:  YGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVLNSCPKITSMLQDKSDDLPVLIEDLITVLDGDE

Query:  ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRK
        ALLV EL  SSVL E + W+A+E KLDLHG H+ ++Y+I+L+WM E RL+F +E CVIPA++ V+ GSG HS VRGESPVKAL+++IM RT SP+RIDRK
Subjt:  ALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGESPVKALIREIMFRTQSPLRIDRK

Query:  NTGCFVAKGKAVKNWVC
        N G F+AKGK VK W+C
Subjt:  NTGCFVAKGKAVKNWVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGATGATGAGTTTAAACCATCTGTTCGTGACGACGTTCATCGGAAGCTTGTCGATGTTCATGGTCATTCCGACCATTGTTGACTTAACAATGGAGTTTGTGTG
TCCTCACCAGGATCACTGTTCCATCGCCATTTATCTCTCTGGTGTCCAGCAGGCGATTGTAGGGCTTGGAGCAGTGGTGATAACACCAGTAATTGGGAATCTATCAGACA
GATACGGAAGGAAAGCAATGCTGACTATCCCAATGACGTTTTCAATCATACCGCTCGCCATAATGGGTTATAGAAGAACTACCAACTTCTTCTATGCATTTTACATCATG
AAAACTCTCACAGACATGGTTTCAGAAGGCACTACAGTTTCACTTGCTCTTGCTTATGTGGCAGACAAAACTTCAGAGGATCAGAGGATCTCGGCGTTCGGAATCCTATC
TGGGGTCAGATCTGTAGGTTATGTGTGTGGAACCTTTTTGGCTCGGCTCCTTTCAACTGCTACAGTGTTTCAGGTGGCTGCTTTCATGTCAGTGCTTGCGGTAGTGCACA
TGAGGACTTTTCTCAAGGAAAGTATTCCAGATCAGAATGAGTTGACTCAACCGATCTTCGACGAAAACTTAAGTGGTGGTGATGATGAAAATGGACCAGAATTGCCTACA
AGAACTCAGTTATCGATAGGGATGTCTTCTATAAGAGACGTTATCTCCTTAATCACGAGTAGCACAACATTTTCACAAGCAGCAAGAGTTTCCTTCTTCAATAGTCTAGC
AGAGAAGGGGATGCAAGCATCACTAGCGTATTTCTTAAAGGCCCGTTTTCACTTCGACAAAAACCAGTTTGCTGACTTGATGATAATTGAGGGGGTTGCCGGGGCCGTTT
CACTGTTTCTTTTGATGCCCGCTTTGGCACTAGCTATAAGACAGGAGAGGTTGCTATCAATAGGGCTGTGGGCGAGCATTATAAATGTTCCTTATGCTTTAAGAGCATTT
ACAATTTTTACAATTCTGGTCAGTCCAATAATATTCAACATTGCATCGAGTCAAGTTGGACCGAGTGAGCAGGGGAAGGCCCAAGGATACATCTCAGGCATTAATTCCCT
TGCGAACATTGCTTCTCCATTACTTTTCAGTCCCTTGATAGCTCTTTTCCTCTCCAAGGATGCACCATTTGACTTCCCCGGCTTCGGTATTTTGTGCATTGGGCTCGCTT
CGTTGATTGGCTTCACTCTAAGCCTGATGATCCGTGTAGACCCGTTCATTTTCATTCAGAAAATCAAAAACTTAGTATACTGTGAATTTGAAGCAACACAGAGACAACAG
ATACTCAAACTCAAAGCAACTACAGCAGAAAATGCGATTTACTTTTGGCGCCTAATGGAGCTCCGCTTTTGCCCGCCGCCGTACGTGATTGGGGATAGCGTTCGACTCTT
CTCAAAGGCACCTAAACGCTACGACGGCTTCTGCAGTTACCATTTCCGGCCAAATCTGCAGGTCAAATGTGCTACACTCACCAAACAAAGTCACCGATTCCTCTCTACTT
TGGCCACAACCGCCGCCGCCGGCGACCATTCAGCTACCAATCGTTTGATTCGGAAGTTTGTTGCGAGTTCTCCGAAATCTATTACTCTCAATGTCCTCTCCGATATCCTT
TCCTCTCGCACGGCTCAACCTGGACTCTGCTCTGTTGCTCTCACCTTATATTCCAGAATTACTGAGACGTCCTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTGT
TGCCTTCCTCGATAAAAATGGACAGATTGTTGACTCGGAAACCCTAATTTCCGAGGCAATTTCGAAATTAGGGATTCAAGAAAGAAAGCTTGTAAACTTCTACTGTCAGC
TGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATTTGGTATCGCATATGCTTGTCTTCTTGAGCTTCTTTATAAGTCGTCCTCGATTTATGTGAAACGTCGAGCT
TATGAATCAATGGTTACTGGTTTGTGCTCCATGAAAAGGCCTCAGGAAGCTGAGAGTTTGGTAAAAGAAATGAAAGCCAAAGGATTTGCTCCTGCTGCATTTGAATACAG
GTCCATTATTTACGCATATGGAACATTGGGGTTGTTTGAAGATATGAAGAGGAGTTTGGAAGAGATGAAGAACGATGATATTGCTTTAGACACAGTTTGTTCTAACATGG
TGCTTTCATCATATGGAGTTCATAATAAGCTTGCAGATATGGTTCTATGGCTTCAAATAATGAAAACTTCTGCTCTTCCTTTCTCGGTTCGAACGTACAATTCTGTCTTG
AATTCATGTCCGAAGATTACGTCGATGCTACAAGACAAGAGCGACGATCTTCCAGTTTTGATTGAAGACTTGATCACGGTTCTGGACGGGGACGAGGCTTTGTTGGTTGA
AGAGTTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATGCAATGGAGATGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATTTTGG
AGTGGATGAAGGAGATGAGACTGAAGTTTGAGGATGAGAGCTGTGTGATTCCAGCACAAGTTACAGTGATTTGTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCT
CCTGTAAAAGCTCTAATTAGAGAGATTATGTTTCGGACACAAAGTCCGCTGAGAATTGATCGCAAGAACACTGGTTGCTTTGTCGCCAAAGGAAAAGCGGTAAAGAATTG
GGTATGTTTGAGGTGA
mRNA sequenceShow/hide mRNA sequence
TGAACTTCTTATAATAACAAGAATGGAGCTGTTGCTTTGCTTTTTCTACTAAGAGCAACAGAGATGGAGAAGATGATGAGTTTAAACCATCTGTTCGTGACGACGTTCAT
CGGAAGCTTGTCGATGTTCATGGTCATTCCGACCATTGTTGACTTAACAATGGAGTTTGTGTGTCCTCACCAGGATCACTGTTCCATCGCCATTTATCTCTCTGGTGTCC
AGCAGGCGATTGTAGGGCTTGGAGCAGTGGTGATAACACCAGTAATTGGGAATCTATCAGACAGATACGGAAGGAAAGCAATGCTGACTATCCCAATGACGTTTTCAATC
ATACCGCTCGCCATAATGGGTTATAGAAGAACTACCAACTTCTTCTATGCATTTTACATCATGAAAACTCTCACAGACATGGTTTCAGAAGGCACTACAGTTTCACTTGC
TCTTGCTTATGTGGCAGACAAAACTTCAGAGGATCAGAGGATCTCGGCGTTCGGAATCCTATCTGGGGTCAGATCTGTAGGTTATGTGTGTGGAACCTTTTTGGCTCGGC
TCCTTTCAACTGCTACAGTGTTTCAGGTGGCTGCTTTCATGTCAGTGCTTGCGGTAGTGCACATGAGGACTTTTCTCAAGGAAAGTATTCCAGATCAGAATGAGTTGACT
CAACCGATCTTCGACGAAAACTTAAGTGGTGGTGATGATGAAAATGGACCAGAATTGCCTACAAGAACTCAGTTATCGATAGGGATGTCTTCTATAAGAGACGTTATCTC
CTTAATCACGAGTAGCACAACATTTTCACAAGCAGCAAGAGTTTCCTTCTTCAATAGTCTAGCAGAGAAGGGGATGCAAGCATCACTAGCGTATTTCTTAAAGGCCCGTT
TTCACTTCGACAAAAACCAGTTTGCTGACTTGATGATAATTGAGGGGGTTGCCGGGGCCGTTTCACTGTTTCTTTTGATGCCCGCTTTGGCACTAGCTATAAGACAGGAG
AGGTTGCTATCAATAGGGCTGTGGGCGAGCATTATAAATGTTCCTTATGCTTTAAGAGCATTTACAATTTTTACAATTCTGGTCAGTCCAATAATATTCAACATTGCATC
GAGTCAAGTTGGACCGAGTGAGCAGGGGAAGGCCCAAGGATACATCTCAGGCATTAATTCCCTTGCGAACATTGCTTCTCCATTACTTTTCAGTCCCTTGATAGCTCTTT
TCCTCTCCAAGGATGCACCATTTGACTTCCCCGGCTTCGGTATTTTGTGCATTGGGCTCGCTTCGTTGATTGGCTTCACTCTAAGCCTGATGATCCGTGTAGACCCGTTC
ATTTTCATTCAGAAAATCAAAAACTTAGTATACTGTGAATTTGAAGCAACACAGAGACAACAGATACTCAAACTCAAAGCAACTACAGCAGAAAATGCGATTTACTTTTG
GCGCCTAATGGAGCTCCGCTTTTGCCCGCCGCCGTACGTGATTGGGGATAGCGTTCGACTCTTCTCAAAGGCACCTAAACGCTACGACGGCTTCTGCAGTTACCATTTCC
GGCCAAATCTGCAGGTCAAATGTGCTACACTCACCAAACAAAGTCACCGATTCCTCTCTACTTTGGCCACAACCGCCGCCGCCGGCGACCATTCAGCTACCAATCGTTTG
ATTCGGAAGTTTGTTGCGAGTTCTCCGAAATCTATTACTCTCAATGTCCTCTCCGATATCCTTTCCTCTCGCACGGCTCAACCTGGACTCTGCTCTGTTGCTCTCACCTT
ATATTCCAGAATTACTGAGACGTCCTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTGTTGCCTTCCTCGATAAAAATGGACAGATTGTTGACTCGGAAACCCTAA
TTTCCGAGGCAATTTCGAAATTAGGGATTCAAGAAAGAAAGCTTGTAAACTTCTACTGTCAGCTGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATTTGGTATC
GCATATGCTTGTCTTCTTGAGCTTCTTTATAAGTCGTCCTCGATTTATGTGAAACGTCGAGCTTATGAATCAATGGTTACTGGTTTGTGCTCCATGAAAAGGCCTCAGGA
AGCTGAGAGTTTGGTAAAAGAAATGAAAGCCAAAGGATTTGCTCCTGCTGCATTTGAATACAGGTCCATTATTTACGCATATGGAACATTGGGGTTGTTTGAAGATATGA
AGAGGAGTTTGGAAGAGATGAAGAACGATGATATTGCTTTAGACACAGTTTGTTCTAACATGGTGCTTTCATCATATGGAGTTCATAATAAGCTTGCAGATATGGTTCTA
TGGCTTCAAATAATGAAAACTTCTGCTCTTCCTTTCTCGGTTCGAACGTACAATTCTGTCTTGAATTCATGTCCGAAGATTACGTCGATGCTACAAGACAAGAGCGACGA
TCTTCCAGTTTTGATTGAAGACTTGATCACGGTTCTGGACGGGGACGAGGCTTTGTTGGTTGAAGAGTTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATG
CAATGGAGATGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATTTTGGAGTGGATGAAGGAGATGAGACTGAAGTTTGAGGATGAGAGCTGTGTG
ATTCCAGCACAAGTTACAGTGATTTGTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCTCCTGTAAAAGCTCTAATTAGAGAGATTATGTTTCGGACACAAAGTCC
GCTGAGAATTGATCGCAAGAACACTGGTTGCTTTGTCGCCAAAGGAAAAGCGGTAAAGAATTGGGTATGTTTGAGGTGAATATAGAGAGATGTTGTCTTCTTTGGACTTT
TTCTTTTGGGTTTTCCATCAAAGTTTCTAAAACGCGCTAGGAAGAGGTTTCACACTCTCATATAGAAGGTAGGAAGAGGTTGTTGAACACAACTCACTGTGGGAAACACT
CGCTCTCTTTATTAAGACCAATCGAGAAGAGAATACAAGACACTCTGTAGAATACTTCTGCTTTTTATTGTTTTTAGATTTCTTGGATGAATAACCTAGGTAGGGTGGGG
GTATTTATACTAAGAGTAAACAATCTATATTTAACCAATCTAAATCTAGACCTAACCGATTTGAATCTAGA
Protein sequenceShow/hide protein sequence
MEKMMSLNHLFVTTFIGSLSMFMVIPTIVDLTMEFVCPHQDHCSIAIYLSGVQQAIVGLGAVVITPVIGNLSDRYGRKAMLTIPMTFSIIPLAIMGYRRTTNFFYAFYIM
KTLTDMVSEGTTVSLALAYVADKTSEDQRISAFGILSGVRSVGYVCGTFLARLLSTATVFQVAAFMSVLAVVHMRTFLKESIPDQNELTQPIFDENLSGGDDENGPELPT
RTQLSIGMSSIRDVISLITSSTTFSQAARVSFFNSLAEKGMQASLAYFLKARFHFDKNQFADLMIIEGVAGAVSLFLLMPALALAIRQERLLSIGLWASIINVPYALRAF
TIFTILVSPIIFNIASSQVGPSEQGKAQGYISGINSLANIASPLLFSPLIALFLSKDAPFDFPGFGILCIGLASLIGFTLSLMIRVDPFIFIQKIKNLVYCEFEATQRQQ
ILKLKATTAENAIYFWRLMELRFCPPPYVIGDSVRLFSKAPKRYDGFCSYHFRPNLQVKCATLTKQSHRFLSTLATTAAAGDHSATNRLIRKFVASSPKSITLNVLSDIL
SSRTAQPGLCSVALTLYSRITETSWFTWNSKLVADLVAFLDKNGQIVDSETLISEAISKLGIQERKLVNFYCQLVESQSKHGSERGFGIAYACLLELLYKSSSIYVKRRA
YESMVTGLCSMKRPQEAESLVKEMKAKGFAPAAFEYRSIIYAYGTLGLFEDMKRSLEEMKNDDIALDTVCSNMVLSSYGVHNKLADMVLWLQIMKTSALPFSVRTYNSVL
NSCPKITSMLQDKSDDLPVLIEDLITVLDGDEALLVEELVGSSVLKEVMVWDAMEMKLDLHGAHVGAAYVIILEWMKEMRLKFEDESCVIPAQVTVICGSGNHSIVRGES
PVKALIREIMFRTQSPLRIDRKNTGCFVAKGKAVKNWVCLR