CuGenDBv2

Sgr029738 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029738
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Poly(A) polymerase I
Genome location: tig00153449:2325859..2333627
RNA-Seq Expression: Sgr029738
Synteny: Sgr029738
Gene Ontology terms:
GO:0001680 - tRNA 3'-terminal CCA addition (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0004652 - polynucleotide adenylyltransferase activity (molecular function)
InterPro domains:
IPR002646 - Poly A polymerase, head domain
IPR032828 - tRNA nucleotidyltransferase/poly(A) polymerase, RNA and SrmB- binding domain
IPR043519 - Nucleotidyltransferase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6596268.1 hypothetical protein SDJN03_09448, partial [Cucurbita argyrosperma subsp. sororia]; e value 8.2e-149; identity 90.3%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        DA+NIDM KWNKVDGRAFGI+RSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLL R PKDFDVITTAGL+QI KLFH A+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL
        VSSFETVAK SKG+ETVT    PRKCD+KDLIRWRNS+HRDFTINSLFFDPFLN+IYDYAEGIADLR LKLRTLIPASLSF EDCARILRGLRIAARLGL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL

Query:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        SLSKDTETAMRKLS SI SLDK+RLMME NYMLSYGAAVPSL+LL RFNLLEILLPFHAAYLDKQ IKKS+LNS MLMKLFSNLDKLVSCDRPSDCNIW
Subjt:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

KAG7027820.1 pcnB [Cucurbita argyrosperma subsp. argyrosperma]; e value 3.1e-148; identity 87.42%
Query:  CKLKVLCFMPVDAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPI
        C   +L  +   A+NIDM KWNKVDGRAFGI+RSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLL R PKDFDVITTAGL+QI KLFH A+IVGRRFPI
Subjt:  CKLKVLCFMPVDAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPI

Query:  CMVNIKGSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARIL
        CMVNIKGSVIEVSSFETVAK SKG+ETVT    PRKCD+KDLIRWRNS+HRDFTINSLFFDPFLN+IYDYAEGIADLR LKLRTLIPASLSF EDCARIL
Subjt:  CMVNIKGSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARIL

Query:  RGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVS
        RGLRIAARLGLSLSKDTETAMRKLS SI SLDK+RLMME NYMLSYGAAVPSL+LL RFNLLEILLPFHAAYLDKQ IKKS+LNS MLMKLFSNLDKLVS
Subjt:  RGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVS

Query:  CDRPSDCNIW
        CDRPSDCNIW
Subjt:  CDRPSDCNIW

XP_022947617.1 uncharacterized protein LOC111451426 [Cucurbita moschata]; e value 8.2e-149; identity 90.3%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        DA+NIDM KWNKVDGRAFGI+RSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLL R PKDFDVITTAGL+QI KLFH A+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL
        VSSFETVAK SKG+ETVT    PRKCD+KDLIRWRNS+HRDFTINSLFFDPFLN+IYDYAEGIADLR LKLRTLIPASLSF EDCARILRGLRIAARLGL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL

Query:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        SLSKDTETAMRKLS SI SLDK+RLMME NYMLSYGAAVPSL+LL RFNLLEILLPFHAAYLDKQ IKKS+LNS MLMKLFSNLDKLVSCDRPSDCNIW
Subjt:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

XP_022971476.1 uncharacterized protein LOC111470184 [Cucurbita maxima]; e value 5.9e-147; identity 89.3%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        DA+NIDM KWNKVDGRAFGI+RSMIPSSSWMVLKILHNKGFE YLVGGCVRDLLL R PKDFDVITTAGL+QI KLFH A+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL
        VSSFETVAK SKG+ETVT    PRKCD+KDLIRWRNS+HRDFTINSLFFDPFLN+IYDYAEGIADLR LKLRTLIPASLSF EDCARILRGLRIAARLGL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL

Query:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        SLSKDTETAM KLS SI SLDK+RLMME NYMLSYGAAVPSL+LL RFNLLEILLPFHAAYL+KQ IKKS LNS MLMKLFSNLDKLVSCDRPSDCNIW
Subjt:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

XP_023540398.1 uncharacterized protein LOC111800784 [Cucurbita pepo subsp. pepo]; e value 1.1e-148; identity 90.3%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        DA+N+DM KWNKVDGRAFGI+RSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLL R PKDFDVITTAGL+QI KLFH A+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL
        VSSFETVAK SKG+ETVT    PRKCD+KDLIRWRNS+HRDFTINSLFFDPFLNIIYDYAEGIADLR LKLRTLIPASLSF EDCARILRGLRIAARLGL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL

Query:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        SLSKDTETAMRKLS SI SLDK+RLMME NYMLSYGAAVPSL+LL RFNLLEILLPFHAAYLDKQ IKKS+LNS MLMKLFSNLDKLVSCDRPSDCNIW
Subjt:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CVT5 uncharacterized protein LOC111014843 isoform X4; e value 2.9e-144; identity 87.96%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        +AANI MSKWNKVD RAFGI RSMIP SSWMVL+IL  KGFE YLVGGCVRDL+L RVPKDFDVITTAGL+QIRKLFHRA+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL
        VSSFETVAK S+G+ TV   QIPRKC K+DLIRWRNSMHRDFTINSLFFDPF N+IYDYAEGI DLR LKLRTLIPASLSF EDCARILRGLRIAARLGL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL

Query:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        SLSKDTETA+RKLSPSI SLDKTR+MMELNYMLSYGAAVPSL+LL RFNLL+ILLPFHAAYLDKQ IK+S+LNSIMLMKLFSNLDKLVSCDRPSDCNIW
Subjt:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

A0A6J1CW57 uncharacterized protein LOC111014843 isoform X2; e value 1.0e-141; identity 84.84%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        +AANI MSKWNKVD RAFGI RSMIP SSWMVL+IL  KGFE YLVGGCVRDL+L RVPKDFDVITTAGL+QIRKLFHRA+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLK-----------LRTLIPASLSFTEDCARIL
        VSSFETVAK S+G+ TV   QIPRKC K+DLIRWRNSMHRDFTINSLFFDPF N+IYDYAEGI DLR LK           LRTLIPASLSF EDCARIL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLK-----------LRTLIPASLSFTEDCARIL

Query:  RGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVS
        RGLRIAARLGLSLSKDTETA+RKLSPSI SLDKTR+MMELNYMLSYGAAVPSL+LL RFNLL+ILLPFHAAYLDKQ IK+S+LNSIMLMKLFSNLDKLVS
Subjt:  RGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVS

Query:  CDRPSDCNIW
        CDRPSDCNIW
Subjt:  CDRPSDCNIW

A0A6J1CWE8 uncharacterized protein LOC111014843 isoform X3; e value 9.4e-143; identity 87.38%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQ--IRKLFHRARIVGRRFPICMVNIKGSV
        +AANI MSKWNKVD RAFGI RSMIP SSWMVL+IL  KGFE YLVGGCVRDL+L RVPKDFDVITTAGL+Q  IRKLFHRA+IVGRRFPICMVNIKGSV
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQ--IRKLFHRARIVGRRFPICMVNIKGSV

Query:  IEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARL
        IEVSSFETVAK S+G+ TV   QIPRKC K+DLIRWRNSMHRDFTINSLFFDPF N+IYDYAEGI DLR LKLRTLIPASLSF EDCARILRGLRIAARL
Subjt:  IEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARL

Query:  GLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNI
        GLSLSKDTETA+RKLSPSI SLDKTR+MMELNYMLSYGAAVPSL+LL RFNLL+ILLPFHAAYLDKQ IK+S+LNSIMLMKLFSNLDKLVSCDRPSDCNI
Subjt:  GLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNI

Query:  W
        W
Subjt:  W

A0A6J1G6Y5 uncharacterized protein LOC111451426; e value 4.0e-149; identity 90.3%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        DA+NIDM KWNKVDGRAFGI+RSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLL R PKDFDVITTAGL+QI KLFH A+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL
        VSSFETVAK SKG+ETVT    PRKCD+KDLIRWRNS+HRDFTINSLFFDPFLN+IYDYAEGIADLR LKLRTLIPASLSF EDCARILRGLRIAARLGL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL

Query:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        SLSKDTETAMRKLS SI SLDK+RLMME NYMLSYGAAVPSL+LL RFNLLEILLPFHAAYLDKQ IKKS+LNS MLMKLFSNLDKLVSCDRPSDCNIW
Subjt:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

A0A6J1I3F4 uncharacterized protein LOC111470184; e value 2.8e-147; identity 89.3%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        DA+NIDM KWNKVDGRAFGI+RSMIPSSSWMVLKILHNKGFE YLVGGCVRDLLL R PKDFDVITTAGL+QI KLFH A+IVGRRFPICMVNIKGSVIE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL
        VSSFETVAK SKG+ETVT    PRKCD+KDLIRWRNS+HRDFTINSLFFDPFLN+IYDYAEGIADLR LKLRTLIPASLSF EDCARILRGLRIAARLGL
Subjt:  VSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGL

Query:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        SLSKDTETAM KLS SI SLDK+RLMME NYMLSYGAAVPSL+LL RFNLLEILLPFHAAYL+KQ IKKS LNS MLMKLFSNLDKLVSCDRPSDCNIW
Subjt:  SLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

SwissProt top hits (e value, % identity, alignment)
P0ABF1 Poly(A) polymerase I; e value 1.1e-28; identity 33.07%
Query:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF
        I+R  I  ++  V+  L+  G+EA+LVGG VRDLLL + PKDFDV T A  +Q+RKLF   R+VGRRF +  V     +IEV++F    + +  + T + 
Subjt:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF

Query:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK
            ++     L+R         ++  RDFTINSL++      + DY  G+ DL+   +R +      + ED  R+LR +R AA+LG+ +S +T   + +
Subjt:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK

Query:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG
        L+  +  +   RL  E   +L  G    +  LL  ++L + L P    Y  + G
Subjt:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG

P0ABF2 Poly(A) polymerase I; e value 1.1e-28; identity 33.07%
Query:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF
        I+R  I  ++  V+  L+  G+EA+LVGG VRDLLL + PKDFDV T A  +Q+RKLF   R+VGRRF +  V     +IEV++F    + +  + T + 
Subjt:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF

Query:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK
            ++     L+R         ++  RDFTINSL++      + DY  G+ DL+   +R +      + ED  R+LR +R AA+LG+ +S +T   + +
Subjt:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK

Query:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG
        L+  +  +   RL  E   +L  G    +  LL  ++L + L P    Y  + G
Subjt:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG

P0ABF3 Poly(A) polymerase I; e value 1.1e-28; identity 33.07%
Query:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF
        I+R  I  ++  V+  L+  G+EA+LVGG VRDLLL + PKDFDV T A  +Q+RKLF   R+VGRRF +  V     +IEV++F    + +  + T + 
Subjt:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF

Query:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK
            ++     L+R         ++  RDFTINSL++      + DY  G+ DL+   +R +      + ED  R+LR +R AA+LG+ +S +T   + +
Subjt:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK

Query:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG
        L+  +  +   RL  E   +L  G    +  LL  ++L + L P    Y  + G
Subjt:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG

P44439 Poly(A) polymerase I; e value 1.8e-34; identity 32.75%
Query:  NKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHR-ARIVGRRFPICMVNIKGSVIEVSSFETVAK
        N +    F I+      ++  V++ L  +GFEAY+VGGC+RDLLL + PKDFDV T A  +QI+ +F R  R+VGRRF +  +     +IEV++F     
Subjt:  NKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHR-ARIVGRRFPICMVNIKGSVIEVSSFETVAK

Query:  PSKGEETVTFPQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSL
         ++ E         ++ ++  L+R        +++  RDFT+N+L+++P  N + DY EGI DL+  KLR +      + ED  R+LR +R  A+L + L
Subjt:  PSKGEETVTFPQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSL

Query:  SKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVS
         K +E  +R+L+P + ++   RL  E   +L  G  V +  LL ++ L E L P  +AY  +   K+ +    M++   ++ D+ V+
Subjt:  SKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVS

Q8Z9C3 Poly(A) polymerase I; e value 2.0e-28; identity 33.46%
Query:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF
        I+R  I  ++  VL  L+  G+EAYLVGG VRDLLL + PKDFDV T A   Q+RKLF   R+VGRRF +  V     +IEV++F    + S+ + T + 
Subjt:  INRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTF

Query:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK
            ++     L+R         ++  RDFTINSL++      + DY  G+ DL+   +R +      + ED  R+LR +R AA+L + +S +T   + +
Subjt:  PQIPRKCDKKDLIR-------WRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRK

Query:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG
        L+  +  +   RL  E   +L  G    +   L  ++L + L P    Y  + G
Subjt:  LSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQG

Arabidopsis top hits (e value, % identity, alignment)
AT1G28090.1 Polynucleotide adenylyltransferase family protein; e value 8.0e-86; identity 50.14%
Query:  TNAVVFC-PMWSSLSLLPLKLFIFSY--------FLLPRQV--GCKLKVLCFMPVDAANIDMS-KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAY
        + A+V+C P + S+  LP  L   +            PR V    K +   F     +N D S  W K+D   FGI RSMIP S+ MVL  L  KGF+ Y
Subjt:  TNAVVFC-PMWSSLSLLPLKLFIFSY--------FLLPRQV--GCKLKVLCFMPVDAANIDMS-KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAY

Query:  LVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTI
        LVGGCVRDL+L R+PKDFDVITTA LK++RK+F   +IVGRRFPIC V +   +IEVSSF T A+  K     +F + P  CD++D IRW+N + RDFT+
Subjt:  LVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTI

Query:  NSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHL
        N L FDP  N++YDY  G+ DLR  K+RT+  A+LSF ED ARILR +RIAARLG SL+KD   ++++LS S+  LD +R+ ME+NYML+YG+A  SL L
Subjt:  NSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHL

Query:  LLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
        L RF L+EILLP  A+YL  QG ++    S ML+ LF NLD+LV+ DRP    +W
Subjt:  LLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW

AT1G28090.2 Polynucleotide adenylyltransferase family protein; e value 2.1e-86; identity 54.61%
Query:  FMPVDAANIDMS-KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIK
        F     +N D S  W K+D   FGI RSMIP S+ MVL  L  KGF+ YLVGGCVRDL+L R+PKDFDVITTA LK++RK+F   +IVGRRFPIC V + 
Subjt:  FMPVDAANIDMS-KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIK

Query:  GSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIA
          +IEVSSF T A+  K     +F + P  CD++D IRW+N + RDFT+N L FDP  N++YDY  G+ DLR  K+RT+  A+LSF ED ARILR +RIA
Subjt:  GSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIA

Query:  ARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSD
        ARLG SL+KD   ++++LS S+  LD +R+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  QG ++    S ML+ LF NLD+LV+ DRP  
Subjt:  ARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSD

Query:  CNIW
          +W
Subjt:  CNIW

AT1G28090.3 Polynucleotide adenylyltransferase family protein; e value 2.1e-86; identity 54.61%
Query:  FMPVDAANIDMS-KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIK
        F     +N D S  W K+D   FGI RSMIP S+ MVL  L  KGF+ YLVGGCVRDL+L R+PKDFDVITTA LK++RK+F   +IVGRRFPIC V + 
Subjt:  FMPVDAANIDMS-KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIK

Query:  GSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIA
          +IEVSSF T A+  K     +F + P  CD++D IRW+N + RDFT+N L FDP  N++YDY  G+ DLR  K+RT+  A+LSF ED ARILR +RIA
Subjt:  GSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIA

Query:  ARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSD
        ARLG SL+KD   ++++LS S+  LD +R+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  QG ++    S ML+ LF NLD+LV+ DRP  
Subjt:  ARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSD

Query:  CNIW
          +W
Subjt:  CNIW

AT2G17580.1 Polynucleotide adenylyltransferase family protein; e value 4.8e-99; identity 57.91%
Query:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE
        D  ++D SKW KV     GI  SMIP SS  VL++L  +GF+AYLVGGCVRDL+L RVPKD+DVITTA LKQIR+LFHRA+++G+RFPIC V + GS+IE
Subjt:  DAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIE

Query:  VSSFETVA------KPSKGEETVTFPQIPRK----------CDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTED
        VSSF+TVA      + SK +  V+      K           D KD  RWRNS+ RDFTINSLF++PF   IYDYA G+ DL  LKLRTL+PA LSF ED
Subjt:  VSSFETVA------KPSKGEETVTFPQIPRK----------CDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTED

Query:  CARILRGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNL
        CARILRGLRIAARLGLSLSKD +TA+ +   S+ +LD+ RL+ME+NYML+YGAA PS+ LL++F LL +LLPF AAYLD Q  K S  +S+ML++LFSN+
Subjt:  CARILRGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNL

Query:  DKLVSCDRPSDCNIWRITQNNLSRIAEDQAHVALI
        DKLVSCD+P+D  +W         IA    H+AL+
Subjt:  DKLVSCDRPSDCNIWRITQNNLSRIAEDQAHVALI

AT5G23690.1 Polynucleotide adenylyltransferase family protein; e value 5.8e-76; identity 48.8%
Query:  KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVA
        +W +++ +  G++ SMI  S+  VL  L +KG + YLVGGCVRDL+LKR PKDFD++T+A L+++ + F R  IVGRRFPIC V+I   +IEVSSF T A
Subjt:  KWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPKDFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVA

Query:  KPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTET
        + S         +       +D IR  N + RDFTIN L FDP+  ++YDY  G+ D+R  K+RT+I A  SF +DCARILR +RIAARLG  +SK+T  
Subjt:  KPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLKLRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTET

Query:  AMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW
         ++ LS  +  LDK R++ME+NYML+YG+A  SL LL +F +LEILLP  AAYL + G ++    + ML+ LF+NLDKL++ DRP   ++W
Subjt:  AMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKLFSNLDKLVSCDRPSDCNIW


Sequences
CDS sequence
ATGATACAATACTCCTCTTCTTATGGCGTTTTTTTCTCTGAGAAGCAACAACGGCTTCTTATCTCGCCTCAAAGACCTAATCAAGCTTCAGGTGCTCCCCCCCCTCTGTG
TCGGCTTCTTCATTGTCTTTCGTTTTACACAATTCTTCGAACAGTTTCAGCACTTCCTGTCATTTTGTTAACGTTTGGAAATTTGGATGTCAGAGGCTCGGACACGCTTT
TGCTAACGGAGAACTTATTCGGGCTCCTATTTATCCTGAAATGGATTTTCACTCTGTGGGTTATCAAGGTGTTTTCCCCCCCTCTTCCGTTCAAGTTTGATTTGCTTTCA
AACCAATCTAAGGAAGTTACTAATGCAGTAGTATTTTGTCCAATGTGGTCTTCGCTGTCACTCCTGCCATTAAAATTGTTCATTTTTAGCTATTTCTTGCTTCCTAGGCA
AGTTGGTTGTAAGCTTAAGGTACTGTGTTTCATGCCTGTAGATGCAGCCAACATTGACATGTCCAAATGGAATAAAGTTGATGGACGGGCTTTTGGGATCAATCGCTCTA
TGATCCCGTCTTCATCATGGATGGTCTTGAAAATTCTTCACAATAAAGGGTTTGAAGCCTATTTGGTGGGTGGATGTGTGAGAGACTTACTCCTAAAAAGAGTACCAAAA
GACTTTGATGTGATTACCACAGCCGGACTTAAACAGATCCGCAAGCTATTTCATCGTGCACGTATTGTTGGACGTCGGTTTCCTATTTGTATGGTTAATATTAAAGGCTC
TGTTATTGAGGTTTCAAGTTTTGAAACAGTTGCAAAACCTTCTAAAGGAGAAGAAACAGTAACATTTCCACAAATTCCAAGAAAATGTGACAAAAAGGACTTAATCCGAT
GGAGGAACTCTATGCACCGGGATTTCACAATTAATAGTTTATTCTTTGACCCCTTTCTGAACATAATCTATGACTATGCCGAAGGAATAGCAGACTTAAGGTTCTTGAAG
CTGCGGACACTAATTCCTGCATCATTGTCATTCACAGAGGACTGCGCAAGAATTCTGCGGGGCTTAAGAATTGCAGCTCGTTTGGGTTTGTCACTCTCAAAGGATACTGA
GACTGCAATGCGTAAACTTTCTCCTTCCATCACGAGCTTGGATAAGACCAGGTTAATGATGGAATTGAACTATATGCTATCTTATGGAGCTGCTGTTCCCTCTCTCCATT
TGCTTCTGAGGTTCAACCTGCTTGAAATTCTGCTGCCATTTCATGCTGCATATCTTGATAAACAGGGCATTAAGAAATCTACTCTCAATTCCATAATGTTGATGAAACTG
TTCTCCAATTTGGATAAGTTGGTTTCTTGTGATCGGCCTTCAGACTGCAATATATGGCGCATAACTCAAAACAATTTGAGTCGGATTGCGGAAGATCAAGCTCATGTTGC
TTTGATTGGCGGTTCAGAAATCACACTGGAATGGTGTAGTCGGACGGTGGGCATTCTGGAAAATGCTCATGGAGGGCGTGTGAATATGTTGAGAGTTGATTGA
mRNA sequence
ATGATACAATACTCCTCTTCTTATGGCGTTTTTTTCTCTGAGAAGCAACAACGGCTTCTTATCTCGCCTCAAAGACCTAATCAAGCTTCAGGTGCTCCCCCCCCTCTGTG
TCGGCTTCTTCATTGTCTTTCGTTTTACACAATTCTTCGAACAGTTTCAGCACTTCCTGTCATTTTGTTAACGTTTGGAAATTTGGATGTCAGAGGCTCGGACACGCTTT
TGCTAACGGAGAACTTATTCGGGCTCCTATTTATCCTGAAATGGATTTTCACTCTGTGGGTTATCAAGGTGTTTTCCCCCCCTCTTCCGTTCAAGTTTGATTTGCTTTCA
AACCAATCTAAGGAAGTTACTAATGCAGTAGTATTTTGTCCAATGTGGTCTTCGCTGTCACTCCTGCCATTAAAATTGTTCATTTTTAGCTATTTCTTGCTTCCTAGGCA
AGTTGGTTGTAAGCTTAAGGTACTGTGTTTCATGCCTGTAGATGCAGCCAACATTGACATGTCCAAATGGAATAAAGTTGATGGACGGGCTTTTGGGATCAATCGCTCTA
TGATCCCGTCTTCATCATGGATGGTCTTGAAAATTCTTCACAATAAAGGGTTTGAAGCCTATTTGGTGGGTGGATGTGTGAGAGACTTACTCCTAAAAAGAGTACCAAAA
GACTTTGATGTGATTACCACAGCCGGACTTAAACAGATCCGCAAGCTATTTCATCGTGCACGTATTGTTGGACGTCGGTTTCCTATTTGTATGGTTAATATTAAAGGCTC
TGTTATTGAGGTTTCAAGTTTTGAAACAGTTGCAAAACCTTCTAAAGGAGAAGAAACAGTAACATTTCCACAAATTCCAAGAAAATGTGACAAAAAGGACTTAATCCGAT
GGAGGAACTCTATGCACCGGGATTTCACAATTAATAGTTTATTCTTTGACCCCTTTCTGAACATAATCTATGACTATGCCGAAGGAATAGCAGACTTAAGGTTCTTGAAG
CTGCGGACACTAATTCCTGCATCATTGTCATTCACAGAGGACTGCGCAAGAATTCTGCGGGGCTTAAGAATTGCAGCTCGTTTGGGTTTGTCACTCTCAAAGGATACTGA
GACTGCAATGCGTAAACTTTCTCCTTCCATCACGAGCTTGGATAAGACCAGGTTAATGATGGAATTGAACTATATGCTATCTTATGGAGCTGCTGTTCCCTCTCTCCATT
TGCTTCTGAGGTTCAACCTGCTTGAAATTCTGCTGCCATTTCATGCTGCATATCTTGATAAACAGGGCATTAAGAAATCTACTCTCAATTCCATAATGTTGATGAAACTG
TTCTCCAATTTGGATAAGTTGGTTTCTTGTGATCGGCCTTCAGACTGCAATATATGGCGCATAACTCAAAACAATTTGAGTCGGATTGCGGAAGATCAAGCTCATGTTGC
TTTGATTGGCGGTTCAGAAATCACACTGGAATGGTGTAGTCGGACGGTGGGCATTCTGGAAAATGCTCATGGAGGGCGTGTGAATATGTTGAGAGTTGATTGA
Protein sequence
MIQYSSSYGVFFSEKQQRLLISPQRPNQASGAPPPLCRLLHCLSFYTILRTVSALPVILLTFGNLDVRGSDTLLLTENLFGLLFILKWIFTLWVIKVFSPPLPFKFDLLS
NQSKEVTNAVVFCPMWSSLSLLPLKLFIFSYFLLPRQVGCKLKVLCFMPVDAANIDMSKWNKVDGRAFGINRSMIPSSSWMVLKILHNKGFEAYLVGGCVRDLLLKRVPK
DFDVITTAGLKQIRKLFHRARIVGRRFPICMVNIKGSVIEVSSFETVAKPSKGEETVTFPQIPRKCDKKDLIRWRNSMHRDFTINSLFFDPFLNIIYDYAEGIADLRFLK
LRTLIPASLSFTEDCARILRGLRIAARLGLSLSKDTETAMRKLSPSITSLDKTRLMMELNYMLSYGAAVPSLHLLLRFNLLEILLPFHAAYLDKQGIKKSTLNSIMLMKL
FSNLDKLVSCDRPSDCNIWRITQNNLSRIAEDQAHVALIGGSEITLEWCSRTVGILENAHGGRVNMLRVD
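
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal Python sketch for checking that correspondence, assuming the standard nuclear genetic code applies to this gene. The helper name `translate` and the table construction are illustrative and are not part of CuGenDB. The check shown uses only the first 27 nucleotides of the CDS; the full CDS string can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
cds_start = "ATGATACAATACTCCTCTTCTTATGGC"
print(translate(cds_start))  # MIQYSSSYG, the first nine residues of the protein
```

Comparing `translate(full_cds)` with the listed protein sequence (with line breaks removed) gives a quick consistency check between the two records.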