; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034826 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034826
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSelenoprotein O
Genome locationchr3:11212884..11243013
RNA-Seq ExpressionLag0034826
SyntenyLag0034826
Gene Ontology termsGO:0009249 - protein lipoylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0033819 - lipoyl(octanoyl) transferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0102555 - octanoyl transferase activity (acting on glycine-cleavage complex H protein) (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR003846 - Protein adenylyltransferase SelO


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6590231.1 hypothetical protein SDJN03_15654, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091.6Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS
        MSL+SHLFPKLSLLPHISL CHG  HRLGLVRR S+ LI RRP  S+ SPPSPL GHSRHGRRRVSMDSASPE+SASVDSVA+GLKNQSLNSDDR   GS
Subjt:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS

Query:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG
         VEH AKKKLE+LNWDNSFVRELPGDPRTD++PR+VLHACYSNVLPSV V+SPQLVAWSESVA+LLDLD QEF+RPDFPLLFSGASPLVGVSPYAQCYGG
Subjt:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG

Query:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA
        HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGN KEEPGA
Subjt:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA

Query:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL
        IVCRVAQSFLRFGS+QIHASRGKDDYKIVRALADYAIRHHFPH ENMSSSQSLSFST ++DSSV+DLTSNKYAAW VEVAERTASLIASWQGVGFTHGVL
Subjt:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL

Query:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK
        NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPD+GLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKY+K
Subjt:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK

Query:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI
        QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELL+PLKAVLLD+GKERKEAWVSWVK YI ELA SGISDEERKASMDA+NPKYILRNYLCQTAI
Subjt:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI

Query:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        DAAEQGDFGEVRRLLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

XP_004149028.1 uncharacterized protein LOC101218327 [Cucumis sativus]0.0e+0091.58Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRR-PSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV
        MSL+SHLFPK S+  +ISL CHGHRLGLVRRRS+LLIRR P  S  S PSPL  HSRHGRR++SMDSASPE+SASVDSVAEGLKNQSLN+DDR DGGSS+
Subjt:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRR-PSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV

Query:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ
         H  KKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYS VLPSV V SPQLVAWSESVA+LLDLDPQEF+RPDFPLLFSGASPLVG SPYAQCYGGHQ
Subjt:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ

Query:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
        FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
Subjt:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV

Query:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
        CRVAQSFLRFGSYQIHASRGKDD+KIVRALADY IRHHFPHLENMSSSQS+SFST N DSSV+DLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
Subjt:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT

Query:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL
        DNMSILGLTIDYGPFGFLDAFDPS+TPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQ IMTKKIGLPKY+KQL
Subjt:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL

Query:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA
        ISKLLNNMAVDKVDYTNFFRSLSN+KADPS PEEELL+PLKAVLLDIGKERKEAWVSWVKTY+EELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA
Subjt:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA

Query:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        AEQGDFGEVR+LLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

XP_022154095.1 uncharacterized protein LOC111021431 [Momordica charantia]0.0e+0091.74Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPE--ISASVDSVAEGLKNQSLNSDDRDDGGSS
        MSL SH FPKLSLLPHISL CHGHRLGLVRR SSLLIR  S+SLVSPP PLAGH  H RRRVSMDSASPE  +S SVDSVA+ LKNQSLNSDD +  GSS
Subjt:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPE--ISASVDSVAEGLKNQSLNSDDRDDGGSS

Query:  VEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGH
        ++++ KKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSV V+SPQLVAWSESVA+LL+LDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGH
Subjt:  VEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGH

Query:  QFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAI
        QFGMWAGQLGDGRAITLGEILNS+SERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSE+MH LGIP+TRALC++TTGT VTRDMFYDGNPKEEPGAI
Subjt:  QFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAI

Query:  VCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLN
        VCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAI HHFPHLENMSSSQSLSFST N+D SV+DLTSNKYAAWTVEVAERTASL+ASWQGVGFTHGVLN
Subjt:  VCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLN

Query:  TDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ
        TDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQF+STLSAAELINDKEANYAMERYG KFMDDYQTIMTKKIGLPKY+KQ
Subjt:  TDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ

Query:  LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAID
        LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAID
Subjt:  LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAID

Query:  AAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        AAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  AAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

XP_022960705.1 uncharacterized protein LOC111461421 [Cucurbita moschata]0.0e+0091.3Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS
        MSL+SHLFPKLSLLPHISL CHG  HRLGLVRR S+ LI RRP  S+ S PSPL GHSRHGRRRVSMDSASPE+SASVDSVA+GLKNQSLNSDDR   GS
Subjt:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS

Query:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG
         VEH AKKKLE+LNWDNSFVRELPGDPRTD++PR+VLHACYSNVLPSV V+SPQLVAWSESVA+LLDLD QEF+RPDFPLLFSGASPLVGVSPYAQCYGG
Subjt:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG

Query:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA
        HQFGMWAGQLGDGRAITLGEILN RSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGN KEEPGA
Subjt:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA

Query:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL
        IVCRVAQSFLRFGS+QIHASRGKDDYKIVRALADYAIRHHFPH ENMSSSQSLSFST ++DSSV+DLTSNKYAAW VEVAERTASLIASWQGVGFTHGVL
Subjt:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL

Query:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK
        NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPD+GLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKY+K
Subjt:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK

Query:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI
        QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELL+PLKAVLLD+GKERKEAWVSWVK YI ELA SGISDEERKASMDA+NPKYILRNYLCQTAI
Subjt:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI

Query:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        DAAEQGDFGEVRRLLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

XP_038878878.1 protein adenylyltransferase SelO [Benincasa hispida]0.0e+0093.57Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV
        MSL+SHLFPKLS+LPHISL CHGHRLGLVRRRS+LLI RRP  S +SPPSPLAGHSRHGRRRVSMDSASPE+SASVDSVAEGLKNQSLNSD+  DGGSSV
Subjt:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV

Query:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ
        +HEAKKKLEDLNWDNSFVRELPGDPR DIIPREVLHACYSNVLPSV V+SPQLVAWSESVA+LLDLDPQEF+RPDFPLLFSGA+PLVG SPYAQCYGGHQ
Subjt:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ

Query:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
        FGMWAGQLGDGRAITLGEI+NSRSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
Subjt:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV

Query:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
        CRVAQSFLRFGSYQIHASRGKDDYKIV ALADY IRHHFPHLENMSSSQSLSFST N DSSV+DLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
Subjt:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT

Query:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL
        DNMSILGLTIDYGPFGFLDAFDPS+TPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQ IMTKKIGLPKY+KQL
Subjt:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL

Query:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA
        ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELL+PLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKY+LRNYLCQTAIDA
Subjt:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA

Query:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        AEQGDFGEVRRLLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

TrEMBL top hitse value%identityAlignment
A0A0A0LXE5 Selenoprotein O0.0e+0091.58Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRR-PSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV
        MSL+SHLFPK S+  +ISL CHGHRLGLVRRRS+LLIRR P  S  S PSPL  HSRHGRR++SMDSASPE+SASVDSVAEGLKNQSLN+DDR DGGSS+
Subjt:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRR-PSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV

Query:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ
         H  KKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYS VLPSV V SPQLVAWSESVA+LLDLDPQEF+RPDFPLLFSGASPLVG SPYAQCYGGHQ
Subjt:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ

Query:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
        FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
Subjt:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV

Query:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
        CRVAQSFLRFGSYQIHASRGKDD+KIVRALADY IRHHFPHLENMSSSQS+SFST N DSSV+DLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
Subjt:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT

Query:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL
        DNMSILGLTIDYGPFGFLDAFDPS+TPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQ IMTKKIGLPKY+KQL
Subjt:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL

Query:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA
        ISKLLNNMAVDKVDYTNFFRSLSN+KADPS PEEELL+PLKAVLLDIGKERKEAWVSWVKTY+EELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA
Subjt:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA

Query:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        AEQGDFGEVR+LLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

A0A5D3DUC9 Selenoprotein O0.0e+0091.42Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRRPS-TSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV
        MSL+SHLFPK S+  +ISL CHGHRLGLV RRS+LLIRR S +S  S PSPL  HSRHGRR++SMDSASPE+SASVDSVAEGLKNQSLN+DDR DGGSS+
Subjt:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRRPS-TSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSV

Query:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ
         H  KKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYS VLPSV V SPQLVAWSESVANLLDLDPQEF+RPDFPLLFSGASPLVG SPYAQCYGGHQ
Subjt:  EHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQ

Query:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
        FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGNPKEEPGAIV
Subjt:  FGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIV

Query:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
        CRVAQSFLRFGSYQIHASRGKDDYKIVRALADY I HHFPHLENMSSSQS+SFST N DSSV+DLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT
Subjt:  CRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT

Query:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL
        DNMSILGLTIDYGPFGFLDAFDPS+TPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQ IMTKKIGLPKY+KQL
Subjt:  DNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQL

Query:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA
        ISKLLNNMAVDKVDYTNFFRSLSNIKAD S PEEELL+PLKAVLLDIGKERKEAWVSWVKTY+EELAGSGISDEERKASMD VNPKYILRNYLCQTAIDA
Subjt:  ISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDA

Query:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        AEQGDFGEVR+LLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  AEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

A0A6J1DIN2 Selenoprotein O0.0e+0091.74Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPE--ISASVDSVAEGLKNQSLNSDDRDDGGSS
        MSL SH FPKLSLLPHISL CHGHRLGLVRR SSLLIR  S+SLVSPP PLAGH  H RRRVSMDSASPE  +S SVDSVA+ LKNQSLNSDD +  GSS
Subjt:  MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPE--ISASVDSVAEGLKNQSLNSDDRDDGGSS

Query:  VEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGH
        ++++ KKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSV V+SPQLVAWSESVA+LL+LDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGH
Subjt:  VEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGH

Query:  QFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAI
        QFGMWAGQLGDGRAITLGEILNS+SERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSE+MH LGIP+TRALC++TTGT VTRDMFYDGNPKEEPGAI
Subjt:  QFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAI

Query:  VCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLN
        VCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAI HHFPHLENMSSSQSLSFST N+D SV+DLTSNKYAAWTVEVAERTASL+ASWQGVGFTHGVLN
Subjt:  VCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLN

Query:  TDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ
        TDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQF+STLSAAELINDKEANYAMERYG KFMDDYQTIMTKKIGLPKY+KQ
Subjt:  TDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ

Query:  LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAID
        LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAID
Subjt:  LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAID

Query:  AAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        AAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  AAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

A0A6J1H9V3 Selenoprotein O0.0e+0091.3Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS
        MSL+SHLFPKLSLLPHISL CHG  HRLGLVRR S+ LI RRP  S+ S PSPL GHSRHGRRRVSMDSASPE+SASVDSVA+GLKNQSLNSDDR   GS
Subjt:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS

Query:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG
         VEH AKKKLE+LNWDNSFVRELPGDPRTD++PR+VLHACYSNVLPSV V+SPQLVAWSESVA+LLDLD QEF+RPDFPLLFSGASPLVGVSPYAQCYGG
Subjt:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG

Query:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA
        HQFGMWAGQLGDGRAITLGEILN RSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGN KEEPGA
Subjt:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA

Query:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL
        IVCRVAQSFLRFGS+QIHASRGKDDYKIVRALADYAIRHHFPH ENMSSSQSLSFST ++DSSV+DLTSNKYAAW VEVAERTASLIASWQGVGFTHGVL
Subjt:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL

Query:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK
        NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPD+GLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKY+K
Subjt:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK

Query:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI
        QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELL+PLKAVLLD+GKERKEAWVSWVK YI ELA SGISDEERKASMDA+NPKYILRNYLCQTAI
Subjt:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI

Query:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        DAAEQGDFGEVRRLLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

A0A6J1JKI6 Selenoprotein O0.0e+0090.99Show/hide
Query:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS
        MSL+SHLFPKLSLLPHISL C+G  HRLGLVRR S+ LI RRP  ++ SPPSPL GHSRHGRRRVSMDSASPE+SASVDSVAEGLKNQ+LNSDDR   GS
Subjt:  MSLLSHLFPKLSLLPHISLPCHG--HRLGLVRRRSSLLI-RRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGS

Query:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG
         VEH A+KKLE+LNWDNSFVRELPGDPRTD+IPR+VLHACYSNVLPSV V+SPQLVAWSESVA+LLDLD QEF+RPDFPLLFSGASPLVGVSPYAQCYGG
Subjt:  SVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGG

Query:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA
        HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMHSLGIP+TRALCLLTTGTFVTRDMFYDGN KEEPGA
Subjt:  HQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGA

Query:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL
        IVCRVAQSFLRFGS+QIHASRGKDDYKIVRALADYAIRHHFPH ENMSSSQSLSFST ++DSSV+DLTSNKYAAW VEVAERTASLIASWQGVGFTHGVL
Subjt:  IVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVL

Query:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK
        NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPD+GLWNI+QFASTLSAAELINDKEANYAMERYGDKFMD+YQTIMTKKIGLPKY+K
Subjt:  NTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSK

Query:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI
        QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELL+PLKAVLLD+GKERKEAWVSWVK YI ELA SGISDEERKASMDA+NPKYILRNYLCQTAI
Subjt:  QLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAI

Query:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        DAAEQGDFGEVRRLLKIMERP+DEQPGMEKYARLPPAWAYRPGVC     +SCS+
Subjt:  DAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

SwissProt top hitse value%identityAlignment
A1K5T6 Protein adenylyltransferase SelO2.2e-13848.74Show/hide
Query:  LEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQ
        +  L +DN FVRELP DP T    R+V  A YS V P+  V +P LVA S  VA LL  D  +   P+F  +F G   L G+ PYA CYGGHQFG WAGQ
Subjt:  LEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQ

Query:  LGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSF
        LGDGRAITLGE+LN +  RWELQLKGAG TPYSR ADG AVLRSS+REFLCSEAMH LG+P+TRAL L+ TG  V RDMFYDGNP+ EPGAIVCRVA SF
Subjt:  LGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSF

Query:  LRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILG
        +RFG++++ A+RG  D  ++  L D+ I   FP +E  +                     +K A W   V  RTA+++A W  VGF HGV+NTDNMSILG
Subjt:  LRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILG

Query:  LTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYA-MERYGDKFMDDYQTIMTKKIGLPKYSK---QLISK
        LTIDYGP+G++D FDP +TPNTTD  GRRY F +QP I  WN+ Q A+ L  A      EA  A +  Y + +  + + +   K+GL   +     ++  
Subjt:  LTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYA-MERYGDKFMDDYQTIMTKKIGLPKYSK---QLISK

Query:  LLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDI------GKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTA
        L   M   +VD T FFR+L+         E +LL P  A+ LD         E  E +  W++ Y +     G+  ++R+A M+A NP+Y++RNYL Q A
Subjt:  LLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDI------GKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTA

Query:  IDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        IDAAEQGD+G VR LL +M RPYDEQP    YA+  P WA     C   S +SCS+
Subjt:  IDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

C4LAV8 Protein adenylyltransferase SelO1.7e-13046.07Show/hide
Query:  LNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGD
        L++DN F+RELPGDP T   PR+V HA + + +    V  PQL+A S  VA LL +   E Q+P +    SG   L G+SP+A CYGGHQFG WAGQLGD
Subjt:  LNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGD

Query:  GRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF
        GRAI+LGE++++   RWELQLKGAG TPYSR  DG AVLRSS+REFLCSEAM  LG+P+TRAL L+ TG  + RDMFYDGNP++EPGAIVCRVA SF+RF
Subjt:  GRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF

Query:  GSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTI
        G +Q+ A RG+ D  ++  L D+ I   FPHL    S+Q  +                +   W  EV   TA L+  W  VGF HGV+NTDNMSILGLTI
Subjt:  GSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTI

Query:  DYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKY---SKQLISKLLNN
        DYGP+G++D FD ++TPNTTD  G RYCF  QP I  WN+ + A  L    + +       +E + + F  +   ++  K+G  ++     +L+++L + 
Subjt:  DYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKY---SKQLISKLLNN

Query:  MAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFG
        +   +VD T FFR L+ +  D S P+  +L        D+  + + A+  W+  Y + +   G+   ER A M+ VNP Y+LRNYL Q  IDAAEQG++ 
Subjt:  MAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFG

Query:  EVRRLLKIMERPYDEQPGMEKYARLPPAWA-YRPGVCFILSSISCSA
         +  LL+++ +PY EQ G E YA+  P WA ++PG     S +SCS+
Subjt:  EVRRLLKIMERPYDEQPGMEKYARLPPAWA-YRPGVCFILSSISCSA

Q1H0D2 Protein adenylyltransferase SelO3.6e-13646.45Show/hide
Query:  LNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGD
        L +DN F+RELPGDP T    R+V  AC+S V+P+  V SP+L+A+S  +   L+L  +E + P +    +G   + G+ PYA CYGGHQFG WAGQLGD
Subjt:  LNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGD

Query:  GRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF
        GRAI+LGE++N + +RWELQLKGAG TPYSR ADG AVLRSSVREFLCSEAMH LGIP+TRAL L+ TG  V RDMFYDG+P+ E GAIVCRV+ SF+RF
Subjt:  GRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF

Query:  GSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTI
        G+++I A R  DD + ++ L D+ I   FP L N    + L                   A W   +  RTA LIA W  VGF HGV+NTDNMSILGLTI
Subjt:  GSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTI

Query:  DYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWN---IAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ---LISKL
        DYGP+G++D FDP +TPNTTD  GRRYCF  QPDI  WN   +AQ   TL     I D+     +  Y   + +++  ++  K G   +  +   L++++
Subjt:  DYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWN---IAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ---LISKL

Query:  LNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQG
           M   ++D T FFR L+ +  D + P+  +L    A    + +  K  +  W+  Y +     G    ER+ +M+ VNP+Y+LRNYL Q AID A+ G
Subjt:  LNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQG

Query:  DFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        D   +  L+ ++ +PYDEQPG E++A L P WA     C   S +SCS+
Subjt:  DFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

Q5NYD9 Protein adenylyltransferase SelO6.5e-13846.87Show/hide
Query:  LEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQ
        +++L  DN FV ELPGDP      R+V  ACYS V+P+  V +P L+AWS  VA LL  D  + + P+F  +F+G + + G+ PYA CYGGHQFG WAGQ
Subjt:  LEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQ

Query:  LGDGRAITLGEILNSRSE----RWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRV
        LGDGRAITLGE + +R +    RWELQLKGAG TPYSR ADG AVLRSS+REFLCSEAMH LG+P+TRALCL+ TG  V RDMFYDG PK EPGA+VCRV
Subjt:  LGDGRAITLGEILNSRSE----RWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRV

Query:  AQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNM
        A SF+RFG+++I  SRG  D  ++  L D+ I   FP L    ++                    + A W  +V ERTA +IA W  VGF HGV+NTDNM
Subjt:  AQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNM

Query:  SILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTL----SAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ
        SILGLTIDYGP+G++D FDP +TPNTTD  G+RY F NQP I  WN+ Q A+ L     AAE +++      ++ Y   F ++ + ++  K+G   +  +
Subjt:  SILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTL----SAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQ

Query:  ---LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKE--RKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLC
           L+  L   +   +VD T FFR L+++       E   + PL+       K    +    SW+  Y +         ++R+  M+AVNP+++LRNYL 
Subjt:  ---LISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKE--RKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLC

Query:  QTAIDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA
        Q AIDAAEQG++  V  LL +M  PYDEQPG E++A   P WA     C   S +SCS+
Subjt:  QTAIDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSA

Q7UKT5 Protein adenylyltransferase SelO1.5e-13145.47Show/hide
Query:  DLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLG
        DL +DN F R+LP D       R+V  A +S V P+  V +P+ VA S+ VA L+ LDP+     +   + +G +   G+ P+A CYGGHQFG WAGQLG
Subjt:  DLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLG

Query:  DGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLR
        DGRAI LGE++ +  + W LQLKGAG TPYSR ADGLAVLRSSVREFLCSEAMH LG+P+TRAL L+ TG  V RDMFYDG+P+ E GAIVCRVA SF+R
Subjt:  DGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLR

Query:  FGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLT
        FG+++I ASR  +D + ++ L ++ IR  F HL +   +               ++  +  AA   EV   TA ++  W  VGF HGV+NTDNMSILGLT
Subjt:  FGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLT

Query:  IDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKE-ANYAMERYGDKFMDDYQTIMTKKIGLPKYSK----QLISKL
        IDYGP+G+L+ +DP +TPNTTD  GRRY +A+QP I  WN+   A+ L    L+ + E     +  Y ++F   + ++M  K+GL KY      +L+  L
Subjt:  IDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAAELINDKE-ANYAMERYGDKFMDDYQTIMTKKIGLPKYSK----QLISKL

Query:  LNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLL----------DIGKERKEAWVSWVKTYIEE-LAGSGI--SDEERKASMDAVNPKYILRN
        L  + + + D T F+R L++I+    T E+ + + L AVL           ++ +E ++A + W+++Y    LA  G    D +R+  M+AVNPKY+LRN
Subjt:  LNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLL----------DIGKERKEAWVSWVKTYIEE-LAGSGI--SDEERKASMDAVNPKYILRN

Query:  YLCQTAIDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWA-YRPGVCFILSSISCSA
        YL Q AIDA ++GD   V  LL+++ RPYD+QPG E++A   P WA +RPG     S +SCS+
Subjt:  YLCQTAIDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWA-YRPGVCFILSSISCSA

Arabidopsis top hitse value%identityAlignment
AT5G13030.1 unknown protein5.4e-28176.36Show/hide
Query:  PSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGV
        PS  +   R      S  S +P   +S DS+A+ L+NQSL + D          + KKKLED NWD+SFV+ELPGDPRTD+I REVLHACYS V PSV V
Subjt:  PSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSVEHEAKKKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGV

Query:  DSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAV
        D PQLVAWS SVA LLDLDP+EF+RPDFPL+ SGA PL G   YAQCYGGHQFGMWAGQLGDGRAITLGE+LNS+ ERWELQLKGAG+TPYSRFADGLAV
Subjt:  DSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILNSRSERWELQLKGAGKTPYSRFADGLAV

Query:  LRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSS
        LRSS+REFLCSE MH LGIP+TRALCLLTTG  VTRDMFYDGNPKEEPGAIVCRV+QSFLRFGSYQIHASRGK+D  IVR LADYAI+HHFPH+E+M  S
Subjt:  LRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALADYAIRHHFPHLENMSSS

Query:  QSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLW
         SLSF T ++D SV+DLTSNKYAAW VE+AERTA+L+A WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLW
Subjt:  QSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFANQPDIGLW

Query:  NIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIG
        NIAQF+ TL+ A+LIN KEANYAMERYGDKFMD+YQ IM+KK+GL KY+K++ISKLLNNM+VDKVDYTNFFR L+N+KA+P+TPE ELL PLKAVLLDIG
Subjt:  NIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIG

Query:  KERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSS
        KERKEAW+ W+++YI+E+ GS +SDEERKA MD+VNPKYILRNYLCQ+AIDAAEQGDF EV  L+++M+RPY+EQPGMEKYARLPPAWAYRPGVC     
Subjt:  KERKEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSS

Query:  ISCSA
        +SCS+
Subjt:  ISCSA

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-0560Show/hide
Query:  LINGRPRGKIMASRGLRQGDPLSPFLFTLVGDAIS
        +ING P+G +  SRGLRQGDPLSP+LF L  + +S
Subjt:  LINGRPRGKIMASRGLRQGDPLSPFLFTLVGDAIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTGCTTTCACATCTCTTCCCCAAGCTCTCGCTGCTTCCCCACATTTCTCTCCCCTGCCATGGCCACCGCCTCGGCTTAGTTCGGCGCCGTTCCTCTCTCCTCAT
TCGCCGCCCTTCTACCTCACTCGTCTCTCCGCCTTCACCACTCGCCGGCCACTCTCGCCACGGCCGCCGGAGAGTTTCCATGGACTCTGCTTCACCGGAGATTTCGGCGT
CGGTCGACTCCGTGGCGGAGGGTTTGAAGAATCAGAGCTTGAACAGCGACGATCGTGACGATGGAGGCAGTAGCGTCGAGCACGAGGCGAAAAAGAAGCTCGAAGATCTG
AACTGGGACAATTCCTTTGTTAGAGAGCTGCCCGGCGATCCCCGTACTGATATCATTCCTCGAGAGGTATTACATGCATGTTACTCAAATGTATTACCTTCAGTTGGAGT
AGACAGTCCTCAGCTTGTTGCTTGGTCTGAATCGGTTGCCAATTTGCTAGATTTGGATCCTCAAGAATTTCAGAGGCCAGATTTCCCCCTCTTGTTCTCTGGAGCATCTC
CATTAGTTGGAGTGTCGCCTTACGCTCAATGTTATGGGGGCCATCAGTTTGGCATGTGGGCCGGACAGTTGGGCGATGGTCGAGCAATAACCCTTGGAGAGATACTTAAT
TCCCGATCTGAAAGGTGGGAGTTGCAGCTAAAAGGTGCTGGGAAGACTCCATATAGTCGGTTTGCTGATGGCTTGGCTGTGCTACGTAGTAGCGTTAGGGAGTTCCTTTG
TAGTGAAGCAATGCACAGTCTTGGAATACCATCAACTCGTGCACTATGTCTACTGACCACAGGAACATTTGTTACCCGAGATATGTTTTATGATGGGAATCCAAAAGAAG
AGCCTGGTGCAATTGTATGCAGAGTGGCTCAATCCTTTTTGCGTTTTGGATCATACCAAATTCATGCCTCTAGAGGAAAAGATGATTATAAAATTGTTCGGGCTTTAGCA
GACTATGCGATCCGCCACCATTTTCCTCACCTAGAGAATATGAGCAGCAGTCAGAGTTTATCTTTCAGCACAGACAATAAAGATAGTTCAGTTATCGATCTCACTTCAAA
CAAGTATGCAGCTTGGACAGTAGAGGTTGCTGAGCGAACTGCTTCCTTAATAGCAAGTTGGCAGGGAGTTGGGTTCACACATGGTGTACTCAACACTGACAATATGAGCA
TCTTGGGTCTTACCATTGATTATGGTCCGTTTGGATTTTTGGATGCTTTTGATCCTAGTTATACGCCTAATACAACTGATCTTCCGGGCAGAAGATACTGTTTTGCAAAT
CAGCCAGATATAGGCTTATGGAATATAGCCCAGTTTGCTTCAACTCTTTCAGCTGCGGAATTAATAAATGATAAAGAAGCAAACTATGCCATGGAAAGATACGGAGACAA
ATTTATGGATGACTATCAAACGATAATGACCAAGAAAATTGGTCTGCCAAAGTACAGCAAACAGTTAATCAGCAAGCTTCTCAACAACATGGCTGTTGATAAGGTTGATT
ATACAAATTTCTTTAGATCACTTTCCAATATCAAAGCTGATCCCAGCACCCCAGAGGAGGAGCTGTTGATCCCTCTGAAGGCAGTTCTACTAGATATTGGCAAGGAGCGC
AAGGAAGCTTGGGTTAGCTGGGTAAAGACGTACATAGAGGAGCTCGCTGGAAGTGGCATCTCAGATGAGGAGCGGAAGGCCTCTATGGATGCAGTAAATCCTAAATATAT
TCTGAGGAACTACCTATGTCAGACTGCCATAGATGCAGCTGAACAGGGTGATTTTGGAGAGGTTCGTCGGCTGCTGAAGATAATGGAACGGCCATATGACGAGCAGCCAG
GAATGGAGAAATATGCACGATTACCCCCAGCTTGGGCTTATCGGCCGGGTGTTTGTTTCATTCTTTCTTCTATTTCTTGTTCTGCTCCATTTCTCTTCATCTTGTTAGTT
TCAGTTATCAAGAGTGAAGACGCTTTCAGAGTCAAGCTGGAAGACGCTTTCTGTGAAGAGGAAATCTATAAAGCAGTTCAGGATATGGGAAACCAAAAATCCCCGGGTCC
GGATGGCATGACGGGAGAATTTTGGAAACATTATTGGAACATCTTGAAGCCCGATATAGTAGAGGTGTTCCAAGAATTTTTCCAAAAAGACCTAGGGAAGGCCTACGATA
TGGTGAATTGGGATTTCCTAGATGGGATCTTAGAGTTGAAGGGCTTTGGCCATAAATGGAGAATGTGGATCAAAGGGTGTCTCAATAATACCAACTTCTCAATTCTCATT
AATGGGAGACCGAGGGGTAAGATTATGGCTTCTAGAGGTTTAAGGCAAGGGGATCCTCTCTCCCCCTTCTTGTTCACGCTGGTGGGAGATGCTATAAGCAAGTCTGTTCA
TTACTGTATAGAGCAGAAGATCTTGAGAGCTAATGAAAGTGACGTGACCAAATGGTGGGATATTCTCAGGATGATAATGAGGGGCTCGGGGCTTGCTTTGAACTTGGCCA
AATCTGTGCTGATTGGTATAAATGTCATCTCCCATCAGACGAACAAAGTGGCCACCATGCTAGGGTGCCAAGCGGACTCTTTGCCCATTAACTACCTTGACTTTCCATTG
GGAGATCTTTACTTGATTTCTGAGAGAAAAGAAGCGGTTATTGCTGATTGCTGGGAGGGCCGTAACCAAACGTGGAATCTGGCTTTTAGAAGAGGATTATTTGATAGGGA
GTTAAACAGCTGGATGACCCTTGTGGAGAAGCTCAGCACGATTAGATTGAATAGTGGCCAAGATGAGATCTGCTGGACCCTTGAAGGATTGGGCAACTATACAGCTAGTT
CAATGTTCCAGAAAATGACCAAAAATCGCCCAAAGTTGGCTATGTCTCCGCTCGGGGAAACTTTAGACCATTTATTTATACACGGTAATTTTGCTAAGAAAGTCCGGTTT
TTTGTGGCTAATCTCTTTGGTATATCCTTCTGTCTTCCCAACAATATTGATGATTGGCTCGTTGAAGGGTTGGCTGCGTGGAACCTAAGGAAAAAAGCCAAGATTATGGC
TGGTTGTGCTTTTAGGGCTGCCTTATGGCTTTTGTGGAAAGAGAGGAACTCTAGAACCTTTGAAGACAAGTCGACTAGCCTTGAGTTCTTTTGTGACAACGTTCAAAATA
CAGCGTCGTGGTGGATTTCTATGCATAAACTTTCTTTTTGTAATTACAGTTTGCTATCAATTATTAAAGATTGGCGAGCTATTTTGTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATTGCTTTCACATCTCTTCCCCAAGCTCTCGCTGCTTCCCCACATTTCTCTCCCCTGCCATGGCCACCGCCTCGGCTTAGTTCGGCGCCGTTCCTCTCTCCTCAT
TCGCCGCCCTTCTACCTCACTCGTCTCTCCGCCTTCACCACTCGCCGGCCACTCTCGCCACGGCCGCCGGAGAGTTTCCATGGACTCTGCTTCACCGGAGATTTCGGCGT
CGGTCGACTCCGTGGCGGAGGGTTTGAAGAATCAGAGCTTGAACAGCGACGATCGTGACGATGGAGGCAGTAGCGTCGAGCACGAGGCGAAAAAGAAGCTCGAAGATCTG
AACTGGGACAATTCCTTTGTTAGAGAGCTGCCCGGCGATCCCCGTACTGATATCATTCCTCGAGAGGTATTACATGCATGTTACTCAAATGTATTACCTTCAGTTGGAGT
AGACAGTCCTCAGCTTGTTGCTTGGTCTGAATCGGTTGCCAATTTGCTAGATTTGGATCCTCAAGAATTTCAGAGGCCAGATTTCCCCCTCTTGTTCTCTGGAGCATCTC
CATTAGTTGGAGTGTCGCCTTACGCTCAATGTTATGGGGGCCATCAGTTTGGCATGTGGGCCGGACAGTTGGGCGATGGTCGAGCAATAACCCTTGGAGAGATACTTAAT
TCCCGATCTGAAAGGTGGGAGTTGCAGCTAAAAGGTGCTGGGAAGACTCCATATAGTCGGTTTGCTGATGGCTTGGCTGTGCTACGTAGTAGCGTTAGGGAGTTCCTTTG
TAGTGAAGCAATGCACAGTCTTGGAATACCATCAACTCGTGCACTATGTCTACTGACCACAGGAACATTTGTTACCCGAGATATGTTTTATGATGGGAATCCAAAAGAAG
AGCCTGGTGCAATTGTATGCAGAGTGGCTCAATCCTTTTTGCGTTTTGGATCATACCAAATTCATGCCTCTAGAGGAAAAGATGATTATAAAATTGTTCGGGCTTTAGCA
GACTATGCGATCCGCCACCATTTTCCTCACCTAGAGAATATGAGCAGCAGTCAGAGTTTATCTTTCAGCACAGACAATAAAGATAGTTCAGTTATCGATCTCACTTCAAA
CAAGTATGCAGCTTGGACAGTAGAGGTTGCTGAGCGAACTGCTTCCTTAATAGCAAGTTGGCAGGGAGTTGGGTTCACACATGGTGTACTCAACACTGACAATATGAGCA
TCTTGGGTCTTACCATTGATTATGGTCCGTTTGGATTTTTGGATGCTTTTGATCCTAGTTATACGCCTAATACAACTGATCTTCCGGGCAGAAGATACTGTTTTGCAAAT
CAGCCAGATATAGGCTTATGGAATATAGCCCAGTTTGCTTCAACTCTTTCAGCTGCGGAATTAATAAATGATAAAGAAGCAAACTATGCCATGGAAAGATACGGAGACAA
ATTTATGGATGACTATCAAACGATAATGACCAAGAAAATTGGTCTGCCAAAGTACAGCAAACAGTTAATCAGCAAGCTTCTCAACAACATGGCTGTTGATAAGGTTGATT
ATACAAATTTCTTTAGATCACTTTCCAATATCAAAGCTGATCCCAGCACCCCAGAGGAGGAGCTGTTGATCCCTCTGAAGGCAGTTCTACTAGATATTGGCAAGGAGCGC
AAGGAAGCTTGGGTTAGCTGGGTAAAGACGTACATAGAGGAGCTCGCTGGAAGTGGCATCTCAGATGAGGAGCGGAAGGCCTCTATGGATGCAGTAAATCCTAAATATAT
TCTGAGGAACTACCTATGTCAGACTGCCATAGATGCAGCTGAACAGGGTGATTTTGGAGAGGTTCGTCGGCTGCTGAAGATAATGGAACGGCCATATGACGAGCAGCCAG
GAATGGAGAAATATGCACGATTACCCCCAGCTTGGGCTTATCGGCCGGGTGTTTGTTTCATTCTTTCTTCTATTTCTTGTTCTGCTCCATTTCTCTTCATCTTGTTAGTT
TCAGTTATCAAGAGTGAAGACGCTTTCAGAGTCAAGCTGGAAGACGCTTTCTGTGAAGAGGAAATCTATAAAGCAGTTCAGGATATGGGAAACCAAAAATCCCCGGGTCC
GGATGGCATGACGGGAGAATTTTGGAAACATTATTGGAACATCTTGAAGCCCGATATAGTAGAGGTGTTCCAAGAATTTTTCCAAAAAGACCTAGGGAAGGCCTACGATA
TGGTGAATTGGGATTTCCTAGATGGGATCTTAGAGTTGAAGGGCTTTGGCCATAAATGGAGAATGTGGATCAAAGGGTGTCTCAATAATACCAACTTCTCAATTCTCATT
AATGGGAGACCGAGGGGTAAGATTATGGCTTCTAGAGGTTTAAGGCAAGGGGATCCTCTCTCCCCCTTCTTGTTCACGCTGGTGGGAGATGCTATAAGCAAGTCTGTTCA
TTACTGTATAGAGCAGAAGATCTTGAGAGCTAATGAAAGTGACGTGACCAAATGGTGGGATATTCTCAGGATGATAATGAGGGGCTCGGGGCTTGCTTTGAACTTGGCCA
AATCTGTGCTGATTGGTATAAATGTCATCTCCCATCAGACGAACAAAGTGGCCACCATGCTAGGGTGCCAAGCGGACTCTTTGCCCATTAACTACCTTGACTTTCCATTG
GGAGATCTTTACTTGATTTCTGAGAGAAAAGAAGCGGTTATTGCTGATTGCTGGGAGGGCCGTAACCAAACGTGGAATCTGGCTTTTAGAAGAGGATTATTTGATAGGGA
GTTAAACAGCTGGATGACCCTTGTGGAGAAGCTCAGCACGATTAGATTGAATAGTGGCCAAGATGAGATCTGCTGGACCCTTGAAGGATTGGGCAACTATACAGCTAGTT
CAATGTTCCAGAAAATGACCAAAAATCGCCCAAAGTTGGCTATGTCTCCGCTCGGGGAAACTTTAGACCATTTATTTATACACGGTAATTTTGCTAAGAAAGTCCGGTTT
TTTGTGGCTAATCTCTTTGGTATATCCTTCTGTCTTCCCAACAATATTGATGATTGGCTCGTTGAAGGGTTGGCTGCGTGGAACCTAAGGAAAAAAGCCAAGATTATGGC
TGGTTGTGCTTTTAGGGCTGCCTTATGGCTTTTGTGGAAAGAGAGGAACTCTAGAACCTTTGAAGACAAGTCGACTAGCCTTGAGTTCTTTTGTGACAACGTTCAAAATA
CAGCGTCGTGGTGGATTTCTATGCATAAACTTTCTTTTTGTAATTACAGTTTGCTATCAATTATTAAAGATTGGCGAGCTATTTTGTATTAG
Protein sequenceShow/hide protein sequence
MSLLSHLFPKLSLLPHISLPCHGHRLGLVRRRSSLLIRRPSTSLVSPPSPLAGHSRHGRRRVSMDSASPEISASVDSVAEGLKNQSLNSDDRDDGGSSVEHEAKKKLEDL
NWDNSFVRELPGDPRTDIIPREVLHACYSNVLPSVGVDSPQLVAWSESVANLLDLDPQEFQRPDFPLLFSGASPLVGVSPYAQCYGGHQFGMWAGQLGDGRAITLGEILN
SRSERWELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHSLGIPSTRALCLLTTGTFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDYKIVRALA
DYAIRHHFPHLENMSSSQSLSFSTDNKDSSVIDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGRRYCFAN
QPDIGLWNIAQFASTLSAAELINDKEANYAMERYGDKFMDDYQTIMTKKIGLPKYSKQLISKLLNNMAVDKVDYTNFFRSLSNIKADPSTPEEELLIPLKAVLLDIGKER
KEAWVSWVKTYIEELAGSGISDEERKASMDAVNPKYILRNYLCQTAIDAAEQGDFGEVRRLLKIMERPYDEQPGMEKYARLPPAWAYRPGVCFILSSISCSAPFLFILLV
SVIKSEDAFRVKLEDAFCEEEIYKAVQDMGNQKSPGPDGMTGEFWKHYWNILKPDIVEVFQEFFQKDLGKAYDMVNWDFLDGILELKGFGHKWRMWIKGCLNNTNFSILI
NGRPRGKIMASRGLRQGDPLSPFLFTLVGDAISKSVHYCIEQKILRANESDVTKWWDILRMIMRGSGLALNLAKSVLIGINVISHQTNKVATMLGCQADSLPINYLDFPL
GDLYLISERKEAVIADCWEGRNQTWNLAFRRGLFDRELNSWMTLVEKLSTIRLNSGQDEICWTLEGLGNYTASSMFQKMTKNRPKLAMSPLGETLDHLFIHGNFAKKVRF
FVANLFGISFCLPNNIDDWLVEGLAAWNLRKKAKIMAGCAFRAALWLLWKERNSRTFEDKSTSLEFFCDNVQNTASWWISMHKLSFCNYSLLSIIKDWRAILY