CuGenDBv2

Tan0016380 (gene) of Snake gourd v1 genome

Gene ID: Tan0016380
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Poly(A) polymerase I
Genome location: LG10:17352380..17359607
RNA-Seq Expression: Tan0016380
Synteny: Tan0016380
Gene Ontology terms:
  GO:0001680 - tRNA 3'-terminal CCA addition (biological process)
  GO:0003723 - RNA binding (molecular function)
  GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
  IPR002646 - Poly A polymerase, head domain
  IPR032828 - tRNA nucleotidyltransferase/poly(A) polymerase, RNA and SrmB-binding domain
  IPR043519 - Nucleotidyltransferase superfamily
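
The genome location field packs a sequence name and a coordinate range into one string. As a minimal sketch, the Python below splits that string and derives the span of the gene. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG10:17352380..17359607")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 7228 bp on LG10; with a 0-based, half-open convention the span would be one base shorter.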


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7018405.1 pcnB [Cucurbita argyrosperma subsp. argyrosperma]; e-value: 7.8e-272; identity: 90.67%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAF AP LA RIHYTR PL+FCIRKAR+QSSV IGS  ESAILCKEE+K+CISFALRRKEIER D SIPQWKTLSSKDLGIDTSMISKPTR+VLNGLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSA+LKEVRKIF+RCLVVGKRFPICHVYV DT+IEVSSFSTSG R GFN+YINKPSNL +PDYIRW NCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+KVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLL+LLSNLDK VAPN+PCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHS GSLYEAIEIAQ+I
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPHASF EI+ESNHKES+YALM QVVDLA SV  VLWKMTDH YVSQAMI YPQAPWSDLVFIPQALSMRVCKIFECVRRGKE   +PKRSRTI++DSL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVF+TVYPRNQ
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

XP_022145835.1 uncharacterized protein LOC111015199 isoform X1 [Momordica charantia]; e-value: 3.0e-271; identity: 89.14%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAFSAP L RR+H+TR PL+FCIRK RL SSVA GS +E AILCKE++K+CISFA RRKEIERTD SIPQWKTL+S++LGI+TSMISKPT  VL+GLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSAQL+EVRKIF+RCLVVGKRFPICHVYVLDTIIEVSSFSTSGSR+GFNNYI KPSNLN+PDYIRWMNCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+K+VYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDK VAPNRPCHSSLWIA+LAFH ALVD+PQDPLVVAAFSLA+H+GGSLYEAIEIA+NI
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPH SFHEIVE+NH+ESDY LMA+VV+LAASVK +LWKMTDHHYVSQAMI+YPQAPWSDLVFIPQALSMRVCKIFECV+RGKESG+VPKRSRTI+++SL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVFDTVYP N+
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

XP_022955968.1 uncharacterized protein LOC111457802 [Cucurbita moschata]; e-value: 4.6e-272; identity: 90.67%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAF AP LA RIHYTR PL+FCIRKAR+QSSV IGS  ESAILCKEE+K+CISFALRRKEIER D SIPQWKTLSSKDLGIDTSMISKPTR+VLNGLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSA+LKEVRKIF+RCLVVGKRFPICHVYV DT+IEVSSFSTSG R GFN+YINKPSNL +PDYIRW NCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+KVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLL+LLSNLDK VAPN+PCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHS GSLYEAIEIAQ+I
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPHASF EI+ESNHKES+YALM QVVDLA SV  VLWKMTDH YVSQAMI YPQAPWSDLVFIPQALSMRVCKIFECVRRGKE   +PKRSRTI++DSL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVF+TVYPRNQ
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

XP_022980788.1 uncharacterized protein LOC111480073 [Cucurbita maxima]; e-value: 4.6e-272; identity: 90.86%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAF AP LA RIH TR PL+FCIRKAR+QSSV IGS  ESAILCKEE+K+CISFALRRKEIER D SIPQWKTLSSKDLGIDTSMISKPTR+VLNGLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSA+LKEVRKIF+RCLVVGKRFPICHVYV DT+IEVSSFSTSG R GFN+YINKPSNL +PDYIRW NCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+KVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLL+LLSNLDKLVAPN+PCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHS GSLYEAIEIAQ+I
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPHASF EI+ESNHKES+YALM QVVDLA SV  VLWKMTDH YVSQAMI YPQAPWSDLVFIPQALSMRVCKIFECVRRGKE   +PKRSRTID+DSL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVF+TVYPRNQ
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

XP_023527892.1 uncharacterized protein LOC111790975 [Cucurbita pepo subsp. pepo]; e-value: 4.6e-272; identity: 90.67%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAF AP LA RIHYTR PL+FCIRKAR+QSSV IGS  ESAILCKEE+K+CISFALRRKEIER D SIPQWKTLSSKDLGIDTSMISKPTR+VLNGLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSA+LKEVRKIF+RCLVVGKRFPICHVYV DT+IEVSSFSTSG R GFN+YINKPSNL +PDYIRW NCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+KVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLL+LLSNLDK VAPN+PCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHS GSLYEAIEIAQ+I
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPHASF EI+ESNHKES+YALM QVVDLA SV  VLWKMTDH YVSQAMI YPQAPWSDLVFIPQALSMRVCKIFECVRRGKE   +PKRSRTI++DSL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVF+TVYPRNQ
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KTW7 Uncharacterized protein; e-value: 1.3e-259; identity: 85.52%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAFSA  LA+RI YTR PL+ CIRKAR++SSV I S AES ILCKEE+K+CISFALRRKEI+++D S+PQWK LS +DLGID+SMISKPTRLVLNGLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLIL+RIPKDFD+ITSAQLKEVR+IFS+CLVVGKRFPICHV VL TI+EVSSFSTS SR GFNNYINKPSNL++PDYIRW NCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+KVVYDYLGAMEDIRKSKVRT+KPANLSFTEDCARILR VR+AARL F F++DIALSIKELSCSVLK+DKGR+LMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDA SNMLLIL SNLDK VAPNRPCHSSLWIALLAFHKALVDQPQDP+VVAAFSLA+HSGGSLYEA+EIAQNI
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPH SFHEIVESNHKESDY+L+ QV+DLA SV  VLWKMT+  YVS+AMIKYPQAPWSDLVFI Q+LS+ VCKIF+CVRRG E+GSVPKRSR I++DSL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        A+GNLSEVRHVFARIVFDTVYPRNQ
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

A0A6J1CVM8 uncharacterized protein LOC111015199 isoform X1; e-value: 1.4e-271; identity: 89.14%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAFSAP L RR+H+TR PL+FCIRK RL SSVA GS +E AILCKE++K+CISFA RRKEIERTD SIPQWKTL+S++LGI+TSMISKPT  VL+GLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSAQL+EVRKIF+RCLVVGKRFPICHVYVLDTIIEVSSFSTSGSR+GFNNYI KPSNLN+PDYIRWMNCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+K+VYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDK VAPNRPCHSSLWIA+LAFH ALVD+PQDPLVVAAFSLA+H+GGSLYEAIEIA+NI
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPH SFHEIVE+NH+ESDY LMA+VV+LAASVK +LWKMTDHHYVSQAMI+YPQAPWSDLVFIPQALSMRVCKIFECV+RGKESG+VPKRSRTI+++SL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVFDTVYP N+
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

A0A6J1CXK6 uncharacterized protein LOC111015199 isoform X2; e-value: 5.1e-269; identity: 88.76%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAFSAP L RR+H+TR PL+FCIRK RL SSVA GS +E AILCKE++K+CISFA RRK  ERTD SIPQWKTL+S++LGI+TSMISKPT  VL+GLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSAQL+EVRKIF+RCLVVGKRFPICHVYVLDTIIEVSSFSTSGSR+GFNNYI KPSNLN+PDYIRWMNCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+K+VYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDK VAPNRPCHSSLWIA+LAFH ALVD+PQDPLVVAAFSLA+H+GGSLYEAIEIA+NI
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPH SFHEIVE+NH+ESDY LMA+VV+LAASVK +LWKMTDHHYVSQAMI+YPQAPWSDLVFIPQALSMRVCKIFECV+RGKESG+VPKRSRTI+++SL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVFDTVYP N+
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

A0A6J1GVG7 uncharacterized protein LOC111457802; e-value: 2.2e-272; identity: 90.67%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAF AP LA RIHYTR PL+FCIRKAR+QSSV IGS  ESAILCKEE+K+CISFALRRKEIER D SIPQWKTLSSKDLGIDTSMISKPTR+VLNGLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSA+LKEVRKIF+RCLVVGKRFPICHVYV DT+IEVSSFSTSG R GFN+YINKPSNL +PDYIRW NCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+KVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLL+LLSNLDK VAPN+PCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHS GSLYEAIEIAQ+I
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPHASF EI+ESNHKES+YALM QVVDLA SV  VLWKMTDH YVSQAMI YPQAPWSDLVFIPQALSMRVCKIFECVRRGKE   +PKRSRTI++DSL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVF+TVYPRNQ
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

A0A6J1J0A7 uncharacterized protein LOC111480073; e-value: 2.2e-272; identity: 90.86%
Query:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE
        MAF AP LA RIH TR PL+FCIRKAR+QSSV IGS  ESAILCKEE+K+CISFALRRKEIER D SIPQWKTLSSKDLGIDTSMISKPTR+VLNGLRK+
Subjt:  MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKE

Query:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF
        GYEVYLVGGCVRDLILRRIPKDFDIITSA+LKEVRKIF+RCLVVGKRFPICHVYV DT+IEVSSFSTSG R GFN+YINKPSNL +PDYIRW NCSQRDF
Subjt:  GYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDF

Query:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL
        TINSLMYDPY+KVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGF FTRDIALSIKELSCSVLK+DKGRILMEMNYMLAFGSAEASL
Subjt:  TINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASL

Query:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI
        RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLL+LLSNLDKLVAPN+PCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHS GSLYEAIEIAQ+I
Subjt:  RLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNI

Query:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL
        SQPHASF EI+ESNHKES+YALM QVVDLA SV  VLWKMTDH YVSQAMI YPQAPWSDLVFIPQALSMRVCKIFECVRRGKE   +PKRSRTID+DSL
Subjt:  SQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSL

Query:  ALGNLSEVRHVFARIVFDTVYPRNQ
        ALGNLSEVRHVFARIVF+TVYPRNQ
Subjt:  ALGNLSEVRHVFARIVFDTVYPRNQ

SwissProt top hits (e-value, % identity, alignment)
P0ABF1 Poly(A) polymerase I; e-value: 9.3e-34; identity: 30.51%
Query:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY
        L R+E E     + PQ   +  +   I    IS+    V+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++VRK+F  C +VG+RF + HV 
Subjt:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY

Query:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR
            IIEV++F           + S+ G N  + + +     +     +  +RDFTINSL Y   +  V DY+G M+D++   +R I      + ED  R
Subjt:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR

Query:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL
        +LRAVR AA+LG   + + A  I  L+  +  I   R+  E   +L  G    + +LL  + L + L P    YF   G         ++  +L N D  
Subjt:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL

Query:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN
        +  +   + +   A + ++         PL+  A  +A  SG + ++A  +A N
Subjt:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN

P0ABF2 Poly(A) polymerase I; e-value: 9.3e-34; identity: 30.51%
Query:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY
        L R+E E     + PQ   +  +   I    IS+    V+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++VRK+F  C +VG+RF + HV 
Subjt:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY

Query:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR
            IIEV++F           + S+ G N  + + +     +     +  +RDFTINSL Y   +  V DY+G M+D++   +R I      + ED  R
Subjt:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR

Query:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL
        +LRAVR AA+LG   + + A  I  L+  +  I   R+  E   +L  G    + +LL  + L + L P    YF   G         ++  +L N D  
Subjt:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL

Query:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN
        +  +   + +   A + ++         PL+  A  +A  SG + ++A  +A N
Subjt:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN

P0ABF3 Poly(A) polymerase I; e-value: 9.3e-34; identity: 30.51%
Query:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY
        L R+E E     + PQ   +  +   I    IS+    V+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++VRK+F  C +VG+RF + HV 
Subjt:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY

Query:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR
            IIEV++F           + S+ G N  + + +     +     +  +RDFTINSL Y   +  V DY+G M+D++   +R I      + ED  R
Subjt:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR

Query:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL
        +LRAVR AA+LG   + + A  I  L+  +  I   R+  E   +L  G    + +LL  + L + L P    YF   G         ++  +L N D  
Subjt:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL

Query:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN
        +  +   + +   A + ++         PL+  A  +A  SG + ++A  +A N
Subjt:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN

Q8Z9C3 Poly(A) polymerase I; e-value: 4.2e-34; identity: 30.79%
Query:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY
        L R+E E     + P    +  +   I    IS+    VL  L K GYE YLVGG VRDL+L + PKDFD+ T+A   +VRK+F  C +VG+RF + HV 
Subjt:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY

Query:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR
            IIEV++F           + S+ G N  + + +     +     +  +RDFTINSL Y   +  V DY+G M+D+++  +R I      + ED  R
Subjt:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR

Query:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL
        +LRAVR AA+L    + + A  I  L+  +  I   R+  E   +L  G+   + + L  + L + L P    YF   G     A   ++  +L N D  
Subjt:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL

Query:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN
        +      + +   A + ++         PL+  A  +A  SG + Y+A  +A N
Subjt:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN

Q8ZRQ8 Poly(A) polymerase I; e-value: 3.2e-34; identity: 30.79%
Query:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY
        L R+E E     + P    +  +   I    IS+    VL  L K GYE YLVGG VRDL+L + PKDFD+ T+A   +VRK+F  C +VG+RF + HV 
Subjt:  LRRKEIERTDG-SIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVY

Query:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR
            IIEV++F           + S+ G N  + + +     +     +  +RDFTINSL Y   +  V DY+G M+D+++  +R I      + ED  R
Subjt:  VLDTIIEVSSF---------STSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCAR

Query:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL
        +LRAVR AA+L    + + A  I  L+  +  I   R+  E   +L  G+   + + L  + L + L P    YF   G     A   ++  +L N D  
Subjt:  ILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKL

Query:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN
        +      + +   A + ++         PL+  A  +A  SG + Y+A  +A N
Subjt:  VAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQN

Arabidopsis top hits (e-value, % identity, alignment)
AT1G28090.1 Polynucleotide adenylyltransferase family protein; e-value: 8.9e-157; identity: 57.34%
Query:  LVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRR
        L F +R + +     +G+ A    + K  ++    F+  R      D S+P WK L + + GI  SMI   TR+VLN L+K+G++VYLVGGCVRDLIL R
Subjt:  LVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRR

Query:  IPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGF--NNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYD
        IPKDFD+IT+A+LKEVRK+F  C +VG+RFPICHVYV D IIEVSSFSTS +R G   N    +P+  ++ DYIRW NC QRDFT+N LM+DP E VVYD
Subjt:  IPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGF--NNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYD

Query:  YLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQ
        Y+G +ED+R SKVRT+  ANLSF ED ARILRA+RIAARLGF  T+D+A+S+KELS S+L++D  RI ME+NYMLA+GSAEASLRLLWRFGL+EILLPIQ
Subjt:  YLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQ

Query:  ASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPHASFHEIVESNHK
        ASY VSQGFRRRD  SNMLL L  NLD+LVAP+RPC   LWI +LAFHKALVDQP+DP VVA+F LA++S  SL EAI IA++ S+ H S  + + S  K
Subjt:  ASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPHASFHEIVESNHK

Query:  ---ESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRR--GKESGSVPKRSRTIDYDSLALGNLSEVRHV
           +S+  +  QV+ LA S++    K+ +  Y++ AM KYPQAP SD+VF+ + +  RV K+F  VRR   +E   VP   R I+Y SLALG+  E R V
Subjt:  ---ESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRR--GKESGSVPKRSRTIDYDSLALGNLSEVRHV

Query:  FARIVFDTVYP
        FARIVFDT+YP
Subjt:  FARIVFDTVYP

AT1G28090.2 Polynucleotide adenylyltransferase family protein; e-value: 2.0e-156; identity: 61.29%
Query:  DGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSS
        D S+P WK L + + GI  SMI   TR+VLN L+K+G++VYLVGGCVRDLIL RIPKDFD+IT+A+LKEVRK+F  C +VG+RFPICHVYV D IIEVSS
Subjt:  DGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSS

Query:  FSTSGSRVGF--NNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTR
        FSTS +R G   N    +P+  ++ DYIRW NC QRDFT+N LM+DP E VVYDY+G +ED+R SKVRT+  ANLSF ED ARILRA+RIAARLGF  T+
Subjt:  FSTSGSRVGF--NNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTR

Query:  DIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLA
        D+A+S+KELS S+L++D  RI ME+NYMLA+GSAEASLRLLWRFGL+EILLPIQASY VSQGFRRRD  SNMLL L  NLD+LVAP+RPC   LWI +LA
Subjt:  DIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLA

Query:  FHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPHASFHEIVESNHK---ESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWS
        FHKALVDQP+DP VVA+F LA++S  SL EAI IA++ S+ H S  + + S  K   +S+  +  QV+ LA S++    K+ +  Y++ AM KYPQAP S
Subjt:  FHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPHASFHEIVESNHK---ESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWS

Query:  DLVFIPQALSMRVCKIFECVRR--GKESGSVPKRSRTIDYDSLALGNLSEVRHVFARIVFDTVYP
        D+VF+ + +  RV K+F  VRR   +E   VP   R I+Y SLALG+  E R VFARIVFDT+YP
Subjt:  DLVFIPQALSMRVCKIFECVRR--GKESGSVPKRSRTIDYDSLALGNLSEVRHVFARIVFDTVYP

AT1G28090.3 Polynucleotide adenylyltransferase family protein; e-value: 2.0e-156; identity: 61.29%
Query:  DGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSS
        D S+P WK L + + GI  SMI   TR+VLN L+K+G++VYLVGGCVRDLIL RIPKDFD+IT+A+LKEVRK+F  C +VG+RFPICHVYV D IIEVSS
Subjt:  DGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSS

Query:  FSTSGSRVGF--NNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTR
        FSTS +R G   N    +P+  ++ DYIRW NC QRDFT+N LM+DP E VVYDY+G +ED+R SKVRT+  ANLSF ED ARILRA+RIAARLGF  T+
Subjt:  FSTSGSRVGF--NNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTR

Query:  DIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLA
        D+A+S+KELS S+L++D  RI ME+NYMLA+GSAEASLRLLWRFGL+EILLPIQASY VSQGFRRRD  SNMLL L  NLD+LVAP+RPC   LWI +LA
Subjt:  DIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLA

Query:  FHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPHASFHEIVESNHK---ESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWS
        FHKALVDQP+DP VVA+F LA++S  SL EAI IA++ S+ H S  + + S  K   +S+  +  QV+ LA S++    K+ +  Y++ AM KYPQAP S
Subjt:  FHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPHASFHEIVESNHK---ESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWS

Query:  DLVFIPQALSMRVCKIFECVRR--GKESGSVPKRSRTIDYDSLALGNLSEVRHVFARIVFDTVYP
        D+VF+ + +  RV K+F  VRR   +E   VP   R I+Y SLALG+  E R VFARIVFDT+YP
Subjt:  DLVFIPQALSMRVCKIFECVRR--GKESGSVPKRSRTIDYDSLALGNLSEVRHVFARIVFDTVYP

AT3G48830.1 polynucleotide adenylyltransferase family protein / RNA recognition motif (RRM)-containing protein; e-value: 9.3e-146; identity: 54.53%
Query:  ARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVG
        A R H+  LPL +C  K +L +  A     +        +K                   P+WK L+SKDLGI TSMISKPTR+VLNGL+ +GY+VYLVG
Subjt:  ARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVG

Query:  GCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYI--NKPSNLNDPDYIRWMNCSQRDFTINSLM
        GCVRDLIL+R PKDFDI+TSA+L+EV + FSRC ++GK+FPICHV++ + +IEVSSFSTS      N      K +   D D IR+ NC QRDFTIN LM
Subjt:  GCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYI--NKPSNLNDPDYIRWMNCSQRDFTINSLM

Query:  YDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRF
        +DPY KV+YDYLG +EDI+K+KVRT+  A  SF ED ARILR  RIAARLGF  +++ A  +K LS  V ++ +GRIL+EMNYMLA+GSAEASLRLLW+F
Subjt:  YDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRF

Query:  GLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPH-A
        G+LEILLPIQA+Y V  GF+RRD  SN+LL L  NLDKL+AP++PCHSSLW+ +LA HKAL DQP+ P VVAAFSLAVH+GG + EA++  + +++PH  
Subjt:  GLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPH-A

Query:  SFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKR
        SF E++E    +S   L+ +V+D  +S+K+ L +MTD  ++S+AM  YPQAP+SD+VFIP  L +   +IFECV+   + G VPK+
Subjt:  SFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKR

AT5G23690.1 Polynucleotide adenylyltransferase family protein (e value: 3.1e-157; %identity: 61.09)
Query:  QWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSG
        +WK L+SKDLG+ +SMI+K TR VLNGL+ +G++VYLVGGCVRDLIL+R PKDFDI+TSA+L+EV + F RC +VG+RFPICHV++ D +IEVSSFSTS 
Subjt:  QWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGCVRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSG

Query:  SRVGFNNYI---NKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIAL
             N          +  D D IR  NC QRDFTIN LM+DPY KVVYDYLG MEDIRK+KVRT+  A  SF +DCARILRA+RIAARLGF  +++ A 
Subjt:  SRVGFNNYI---NKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGAMEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIAL

Query:  SIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKA
         IK LS  V ++DKGRILMEMNYMLA+GSAEASLRLLW+FG+LEILLPIQA+Y    GFRRRD  +NMLL L +NLDKL+AP+RPCHSSLWIA+LAFHKA
Subjt:  SIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDASSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKA

Query:  LVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPH-ASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIP
        L D+P+ P+VVAAFSLAVH+ G + EA+EI + I++PH  SF E+VE         L+ +V+DL AS++D L +MTD +++S+AM  YPQAP+SDLVFIP
Subjt:  LVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPH-ASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKMTDHHYVSQAMIKYPQAPWSDLVFIP

Query:  QALSMRVCKIFECVRRGK-ESGSVPKRSRTIDYDSLALGNLSEVRHVFARIVFDTVYPRN
          L +R  +IF+CV+  +   G   K+   I+Y SL  G   E+RHVFAR+VFDTV+P N
Subjt:  QALSMRVCKIFECVRRGK-ESGSVPKRSRTIDYDSLALGNLSEVRHVFARIVFDTVYPRN


Sequences
CDS sequence
ATGGCATTTTCAGCCCCAGCCCTTGCTCGAAGGATTCATTACACTCGCCTTCCGCTGGTCTTTTGCATTCGCAAGGCACGACTTCAAAGCTCTGTAGCCATTGGATCTGC
TGCTGAATCAGCAATTCTCTGCAAGGAGGAGGAGAAAAATTGTATTTCGTTTGCTTTGCGTAGAAAAGAAATAGAGCGAACTGATGGCTCAATACCTCAATGGAAGACAT
TGAGTTCGAAAGATCTTGGGATTGACACATCAATGATTTCAAAGCCAACTAGGCTGGTTTTAAATGGACTAAGGAAAGAAGGGTATGAGGTCTACCTTGTTGGTGGGTGT
GTTCGGGATCTTATCCTAAGGAGAATTCCTAAAGACTTTGATATCATAACTTCAGCTCAACTTAAAGAGGTACGGAAAATTTTTAGCCGATGTCTAGTTGTTGGAAAGCG
GTTTCCAATCTGCCATGTGTATGTCCTTGATACTATTATAGAGGTCTCAAGCTTTAGCACCTCCGGAAGTAGGGTTGGTTTTAATAATTACATTAACAAACCTTCCAACT
TAAACGACCCTGATTATATTCGCTGGATGAATTGCTCACAGCGAGACTTTACCATTAACAGTTTGATGTATGATCCATACGAAAAGGTTGTATATGATTATTTGGGGGCA
ATGGAGGATATTAGAAAATCCAAGGTACGAACTATAAAACCTGCAAATCTTTCTTTTACTGAGGACTGTGCTCGAATTTTACGTGCGGTCAGAATTGCAGCCCGTTTAGG
ATTTTGTTTCACCAGAGACATAGCACTTTCTATAAAAGAATTATCTTGCTCCGTGTTAAAAATTGACAAGGGGCGGATACTCATGGAAATGAATTATATGCTTGCATTTG
GGTCTGCAGAGGCTTCTTTGAGATTATTATGGAGATTTGGGCTTCTGGAGATACTTCTTCCAATCCAAGCATCATATTTTGTTTCACAAGGTTTCAGGAGGCGTGATGCA
AGCTCAAACATGCTTCTGATTTTGCTTTCCAACCTTGATAAATTAGTGGCACCCAATCGACCATGCCATAGCAGCTTATGGATTGCTCTCTTAGCATTTCACAAAGCTTT
GGTTGACCAGCCTCAAGATCCATTGGTAGTTGCAGCATTTAGCCTTGCTGTCCATAGTGGTGGATCCTTATATGAAGCAATAGAAATAGCCCAAAATATCTCGCAGCCGC
ATGCGTCATTTCATGAAATAGTAGAAAGTAACCACAAAGAATCAGATTATGCACTGATGGCACAGGTTGTTGATCTTGCAGCTTCAGTGAAAGATGTGTTATGGAAGATG
ACCGATCACCATTATGTTTCGCAAGCTATGATCAAATATCCTCAAGCACCTTGGTCTGATCTGGTTTTTATCCCACAGGCTTTATCAATGAGGGTATGTAAAATTTTTGA
ATGTGTTAGAAGGGGCAAGGAAAGTGGATCAGTTCCGAAAAGAAGTAGAACAATCGACTATGATTCCTTGGCTTTGGGTAACTTGTCAGAGGTTCGACATGTTTTTGCTA
GGATTGTTTTCGACACAGTTTACCCTCGAAATCAATAA
mRNA sequence
ATGGCATTTTCAGCCCCAGCCCTTGCTCGAAGGATTCATTACACTCGCCTTCCGCTGGTCTTTTGCATTCGCAAGGCACGACTTCAAAGCTCTGTAGCCATTGGATCTGC
TGCTGAATCAGCAATTCTCTGCAAGGAGGAGGAGAAAAATTGTATTTCGTTTGCTTTGCGTAGAAAAGAAATAGAGCGAACTGATGGCTCAATACCTCAATGGAAGACAT
TGAGTTCGAAAGATCTTGGGATTGACACATCAATGATTTCAAAGCCAACTAGGCTGGTTTTAAATGGACTAAGGAAAGAAGGGTATGAGGTCTACCTTGTTGGTGGGTGT
GTTCGGGATCTTATCCTAAGGAGAATTCCTAAAGACTTTGATATCATAACTTCAGCTCAACTTAAAGAGGTACGGAAAATTTTTAGCCGATGTCTAGTTGTTGGAAAGCG
GTTTCCAATCTGCCATGTGTATGTCCTTGATACTATTATAGAGGTCTCAAGCTTTAGCACCTCCGGAAGTAGGGTTGGTTTTAATAATTACATTAACAAACCTTCCAACT
TAAACGACCCTGATTATATTCGCTGGATGAATTGCTCACAGCGAGACTTTACCATTAACAGTTTGATGTATGATCCATACGAAAAGGTTGTATATGATTATTTGGGGGCA
ATGGAGGATATTAGAAAATCCAAGGTACGAACTATAAAACCTGCAAATCTTTCTTTTACTGAGGACTGTGCTCGAATTTTACGTGCGGTCAGAATTGCAGCCCGTTTAGG
ATTTTGTTTCACCAGAGACATAGCACTTTCTATAAAAGAATTATCTTGCTCCGTGTTAAAAATTGACAAGGGGCGGATACTCATGGAAATGAATTATATGCTTGCATTTG
GGTCTGCAGAGGCTTCTTTGAGATTATTATGGAGATTTGGGCTTCTGGAGATACTTCTTCCAATCCAAGCATCATATTTTGTTTCACAAGGTTTCAGGAGGCGTGATGCA
AGCTCAAACATGCTTCTGATTTTGCTTTCCAACCTTGATAAATTAGTGGCACCCAATCGACCATGCCATAGCAGCTTATGGATTGCTCTCTTAGCATTTCACAAAGCTTT
GGTTGACCAGCCTCAAGATCCATTGGTAGTTGCAGCATTTAGCCTTGCTGTCCATAGTGGTGGATCCTTATATGAAGCAATAGAAATAGCCCAAAATATCTCGCAGCCGC
ATGCGTCATTTCATGAAATAGTAGAAAGTAACCACAAAGAATCAGATTATGCACTGATGGCACAGGTTGTTGATCTTGCAGCTTCAGTGAAAGATGTGTTATGGAAGATG
ACCGATCACCATTATGTTTCGCAAGCTATGATCAAATATCCTCAAGCACCTTGGTCTGATCTGGTTTTTATCCCACAGGCTTTATCAATGAGGGTATGTAAAATTTTTGA
ATGTGTTAGAAGGGGCAAGGAAAGTGGATCAGTTCCGAAAAGAAGTAGAACAATCGACTATGATTCCTTGGCTTTGGGTAACTTGTCAGAGGTTCGACATGTTTTTGCTA
GGATTGTTTTCGACACAGTTTACCCTCGAAATCAATAATGTCGAATCAACCGAGAGGTGAACAGATTACACTCATATCTGAATCTTTCTTTGTGCAATAATCTGGCAGCA
TGAAGAGGTGGCTGATATTGAGTTGATCGGAAAATGCATAACTTGTCGATATCAGAGGGAAAGTGGATTGAATTGGAGTTTTGTTTAATCTGAATCATCTTCCAAATTGA
GTTTTTGTAAATTCATTACAGTAATACATTGCAGAGAAATTTGAGTTTTAGTTTGATCTCATAGTACCATACAATTTCCAATCATTTTAAAGCCCTCAACCTTTGACAAC
CAACCAAGTTTAGAGATGATTATCCAGTTTGTGTTGTGAAATTGAGATGCAAAACAAGGGA
Protein sequence
MAFSAPALARRIHYTRLPLVFCIRKARLQSSVAIGSAAESAILCKEEEKNCISFALRRKEIERTDGSIPQWKTLSSKDLGIDTSMISKPTRLVLNGLRKEGYEVYLVGGC
VRDLILRRIPKDFDIITSAQLKEVRKIFSRCLVVGKRFPICHVYVLDTIIEVSSFSTSGSRVGFNNYINKPSNLNDPDYIRWMNCSQRDFTINSLMYDPYEKVVYDYLGA
MEDIRKSKVRTIKPANLSFTEDCARILRAVRIAARLGFCFTRDIALSIKELSCSVLKIDKGRILMEMNYMLAFGSAEASLRLLWRFGLLEILLPIQASYFVSQGFRRRDA
SSNMLLILLSNLDKLVAPNRPCHSSLWIALLAFHKALVDQPQDPLVVAAFSLAVHSGGSLYEAIEIAQNISQPHASFHEIVESNHKESDYALMAQVVDLAASVKDVLWKM
TDHHYVSQAMIKYPQAPWSDLVFIPQALSMRVCKIFECVRRGKESGSVPKRSRTIDYDSLALGNLSEVRHVFARIVFDTVYPRNQ