; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003688 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003688
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold4:47584626..47590369
RNA-Seq ExpressionSpg003688
SyntenySpg003688
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039949.1 hypothetical protein E6C27_scaffold122G002290 [Cucumis melo var. makuwa]1.1e-12337.08Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKKEF L++DKRSR +N+L+TE+G YKSFS+++ P+++EWLK+TF+ L+ TPR  RFF E+RY + C+WV K  NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCA----------PRQQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSS------------PL
        ++GRKCC++VPEG DK+GW  F  MLT ++                  +++  +S DSST   S+T +++Y E ++S S S  SSS             L
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCA----------PRQQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSS------------PL

Query:  DEDLFA-NRDLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTAS
         + L A   ++  E  +++  WE  ++LSRRCFH DW K+  +L++  D   S   + PFHADKALL+++++E ANLLC N GW TVG FYVKFE+W+ +
Subjt:  DEDLFA-NRDLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTAS

Query:  KHASPKLIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNP
         HA  K+IP YGGW  FRGIPLH WN+NTF QIG ACGGF++ A  +  KL+L +A IK+K+NY GF+P  +   D+EG +F VQT+    GKWL  RNP
Subjt:  KHASPKLIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNP

Query:  TIHGTFTREAARNYDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QM
        ++HG+FT+ AA  ++EF+  AE Y+FR N A +A                       + +++K     S  +++    +D+S+S              + 
Subjt:  TIHGTFTREAARNYDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QM

Query:  DKGKQPCTD--LEKATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPV
         KGKQ C+    +K +P    + V+F SP+N+T  F+   AP + L          M   S++P     K++YR KS +  + +              P 
Subjt:  DKGKQPCTD--LEKATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPV

Query:  EPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAV
            K    + P+          +F L VDLG +SP+SD   S PE+    PSP + T     S I+ +SL  +++                        
Subjt:  EPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAV

Query:  IDDNSPRKDPQEVIEHGKSNDESFKKKLNDWLTENDFCL
          ++  +K  + + E  +  + +FK+KL DWL EN+  L
Subjt:  IDDNSPRKDPQEVIEHGKSNDESFKKKLNDWLTENDFCL

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]4.7e-12538.49Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        M  ++ LPRS  IEKK+F L++D  SR   +L+TE+G YKSFS+++  E++EWLK +F+TL+ TPR  RFF E+R+EECC+WV KT NR+G IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCAPRQQRF-----HRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAVEIDED
        ++GRKCC++VPEG +KSGW  F+S+L  +  + P +    R       R   SS+ S  DT +RSYAE +   S SD  S+   E++   +         
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCAPRQQRF-----HRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAVEIDED

Query:  NFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVASFNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGGWVSFRGIP
        +F WE   VL+RR FH DW+++ +KL E  D    + PFHADKAL+Y +NEEQA LLC NKGW TVG FYVKFE W+   HASPK+IP YGGW+  RG+P
Subjt:  NFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVASFNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGGWVSFRGIP

Query:  LHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARNYDEFDASA
        LHAWN+ +F QIG+ACGGF+EVA+ T    D+ +A+IKIKDNY GFIP  +   D E  +F VQ +V   GKW   R+P+IHGTFTREAA+ +DEF+ ++
Subjt:  LHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARNYDEFDASA

Query:  ESYSFRGNSACVANHVSASV-----------RKILIKASTTDVSVPNQADASSSKQMDKGKQPCTDLEKATPKL----------------KSVTFPSPRN
        E Y F  N A       + V           +   + A    ++    +D+S  ++M       T + +   K                 + V+F SP++
Subjt:  ESYSFRGNSACVANHVSASV-----------RKILIKASTTDVSVPNQADASSSKQMDKGKQPCTDLEKATPKL----------------KSVTFPSPRN

Query:  QTISFNTTYAPARKLTPMGRNSEKPK-----------SGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEK----
        +T+ ++   AP + L  +    E PK            GK K++YR KS                        P  +       SG   Q IN  K    
Subjt:  QTISFNTTYAPARKLTPMGRNSEKPK-----------SGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEK----

Query:  FTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDESF
        F+L+VDLG LSP SD  +SSP+    + SP + T     S II +S++                         + ++ +N  + + ++  E  K+ +E+F
Subjt:  FTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDESF

Query:  KKKLNDWLTENDFCLVP
        K+KL +WL  N+  L P
Subjt:  KKKLNDWLTENDFCLVP

KAA0041398.1 hypothetical protein E6C27_scaffold206G00440 [Cucumis melo var. makuwa]2.0e-12038.78Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKK+F L++DKRSR +N+L+TE+G YKSFS+++  +++EWLK TF+ L+ TPR  RFF E+RY + C+WV K  NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDS------SSSPLDEDLFA
        ++GRKCC++VPEG DK+GW  F  MLT ++   +   + R        +++  +  DSST   S++ +++YAEV++S S S+S      ++S L   L A
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDS------SSSPLDEDLFA

Query:  NR-DLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPK
         + ++  E  +++  WE  ++LSRRCFH DWDK+  +LRE  D   S   + PFHADKALL++++++ A LLC N GW TVG FYVKFE+W+ S HA  K
Subjt:  NR-DLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPK

Query:  LIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTF
        +IP YGGW  FRGIPLH WN+NTF  IG ACGGF++ A  +  K++L +A IK+K+NY GF+P  +   D+EG  F +Q V    G+WL  RNP+IHG+F
Subjt:  LIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTF

Query:  TREAARNYDEFDASAESYSFRGNSACVANHVSASVRKILIKASTTDVSVPNQADASSSKQM--DKGKQPCTDLEKATPKLKSVTFPSPRNQTISFNTTYA
         + AA N++EF+  AE ++FR N A +               S  D+S   ++  + SKQM  D+ K P   ++K                   FN T  
Subjt:  TREAARNYDEFDASAESYSFRGNSACVANHVSASVRKILIKASTTDVSVPNQADASSSKQM--DKGKQPCTDLEKATPKLKSVTFPSPRNQTISFNTTYA

Query:  PARKLTPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN--
               M   S++P     K++YR KS +    +E    + N     G V+P                     +F L VDLG +SP+SD   S PE+  
Subjt:  PARKLTPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN--

Query:  -TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDE-SFKKKLNDWLTENDFCL
         TPSP   T     S I+ +SL  +++                         +D   +K  +E+I    ++DE SFK+KL +WL EN+  L
Subjt:  -TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDE-SFKKKLNDWLTENDFCL

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa]3.2e-11837.05Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKKEF L++DKRSR +N+L+TE+G YKSFS+++ P+++EWLK+TF+ L+ TPR  RFF E+RY + C+WV    NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV
        ++GRKCC++VPEG DK+GW  F  MLT ++   +     R        +++  +S DSST   S++ +++YAE ++S S S SS S              
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV

Query:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG
            D    ++   L RRCFH DW K+  +LR+  D   S   + PFHADKALL+++++E A LLC N GW TVG FYVKFE+W+ + HA  K+IP YGG
Subjt:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG

Query:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN
        W  FRGIPLH WN+NTF QIG A GGF++ A  +  KL+L +A IK+K+NY GF+P  +   D+EG +F +QTV    GKWL  RNP+IHG+FT+ AA N
Subjt:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN

Query:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE
        ++EF+  AE Y+FR N A +A                       + +++K     S  +++    +D S+S              +  KGKQ C     +
Subjt:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE

Query:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS
        K +P    + V+F SP+N+T  F+   AP + L          M   S++P     K++YR KS    +  E    + +     G ++P           
Subjt:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS

Query:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV
                  +F L VDLG +S +SD   S PE+    PSP + T     S I+ +SL  +++                          ++  +K  + +
Subjt:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV

Query:  IEHGKSNDESFKKKLNDWLTENDFCL
         E  + ++ SFK+KL DWL EN+  L
Subjt:  IEHGKSNDESFKKKLNDWLTENDFCL

TYK29576.1 hypothetical protein E5676_scaffold655G001820 [Cucumis melo var. makuwa]3.2e-11837.05Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKKEF L++DKRSR +N+L+TE+G YKSFS+++ P+++EWLK+TF+ L+ TPR  RFF E+RY + C+WV    NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV
        ++GRKCC++VPEG DK+GW  F  MLT ++   +     R        +++  +S DSST   S++ +++YAE ++S S S SS S              
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV

Query:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG
            D    ++   L RRCFH DW K+  +LR+  D   S   + PFHADKALL+++++E A LLC N GW TVG FYVKFE+W+ + HA  K+IP YGG
Subjt:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG

Query:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN
        W  FRGIPLH WN+NTF QIG A GGF++ A  +  KL+L +A IK+K+NY GF+P  +   D+EG +F +QTV    GKWL  RNP+IHG+FT+ AA N
Subjt:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN

Query:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE
        ++EF+  AE Y+FR N A +A                       + +++K     S  +++    +D S+S              +  KGKQ C     +
Subjt:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE

Query:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS
        K +P    + V+F SP+N+T  F+   AP + L          M   S++P     K++YR KS    +  E    + +     G ++P           
Subjt:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS

Query:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV
                  +F L VDLG +S +SD   S PE+    PSP + T     S I+ +SL  +++                          ++  +K  + +
Subjt:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV

Query:  IEHGKSNDESFKKKLNDWLTENDFCL
         E  + ++ SFK+KL DWL EN+  L
Subjt:  IEHGKSNDESFKKKLNDWLTENDFCL

TrEMBL top hitse value%identityAlignment
A0A5A7TEP0 DUF4283 domain-containing protein9.8e-12138.78Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKK+F L++DKRSR +N+L+TE+G YKSFS+++  +++EWLK TF+ L+ TPR  RFF E+RY + C+WV K  NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDS------SSSPLDEDLFA
        ++GRKCC++VPEG DK+GW  F  MLT ++   +   + R        +++  +  DSST   S++ +++YAEV++S S S+S      ++S L   L A
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDS------SSSPLDEDLFA

Query:  NR-DLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPK
         + ++  E  +++  WE  ++LSRRCFH DWDK+  +LRE  D   S   + PFHADKALL++++++ A LLC N GW TVG FYVKFE+W+ S HA  K
Subjt:  NR-DLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPK

Query:  LIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTF
        +IP YGGW  FRGIPLH WN+NTF  IG ACGGF++ A  +  K++L +A IK+K+NY GF+P  +   D+EG  F +Q V    G+WL  RNP+IHG+F
Subjt:  LIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTF

Query:  TREAARNYDEFDASAESYSFRGNSACVANHVSASVRKILIKASTTDVSVPNQADASSSKQM--DKGKQPCTDLEKATPKLKSVTFPSPRNQTISFNTTYA
         + AA N++EF+  AE ++FR N A +               S  D+S   ++  + SKQM  D+ K P   ++K                   FN T  
Subjt:  TREAARNYDEFDASAESYSFRGNSACVANHVSASVRKILIKASTTDVSVPNQADASSSKQM--DKGKQPCTDLEKATPKLKSVTFPSPRNQTISFNTTYA

Query:  PARKLTPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN--
               M   S++P     K++YR KS +    +E    + N     G V+P                     +F L VDLG +SP+SD   S PE+  
Subjt:  PARKLTPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN--

Query:  -TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDE-SFKKKLNDWLTENDFCL
         TPSP   T     S I+ +SL  +++                         +D   +K  +E+I    ++DE SFK+KL +WL EN+  L
Subjt:  -TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDE-SFKKKLNDWLTENDFCL

A0A5A7TFK7 DUF4283 domain-containing protein2.3e-12538.49Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        M  ++ LPRS  IEKK+F L++D  SR   +L+TE+G YKSFS+++  E++EWLK +F+TL+ TPR  RFF E+R+EECC+WV KT NR+G IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCAPRQQRF-----HRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAVEIDED
        ++GRKCC++VPEG +KSGW  F+S+L  +  + P +    R       R   SS+ S  DT +RSYAE +   S SD  S+   E++   +         
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCAPRQQRF-----HRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAVEIDED

Query:  NFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVASFNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGGWVSFRGIP
        +F WE   VL+RR FH DW+++ +KL E  D    + PFHADKAL+Y +NEEQA LLC NKGW TVG FYVKFE W+   HASPK+IP YGGW+  RG+P
Subjt:  NFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVASFNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGGWVSFRGIP

Query:  LHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARNYDEFDASA
        LHAWN+ +F QIG+ACGGF+EVA+ T    D+ +A+IKIKDNY GFIP  +   D E  +F VQ +V   GKW   R+P+IHGTFTREAA+ +DEF+ ++
Subjt:  LHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARNYDEFDASA

Query:  ESYSFRGNSACVANHVSASV-----------RKILIKASTTDVSVPNQADASSSKQMDKGKQPCTDLEKATPKL----------------KSVTFPSPRN
        E Y F  N A       + V           +   + A    ++    +D+S  ++M       T + +   K                 + V+F SP++
Subjt:  ESYSFRGNSACVANHVSASV-----------RKILIKASTTDVSVPNQADASSSKQMDKGKQPCTDLEKATPKL----------------KSVTFPSPRN

Query:  QTISFNTTYAPARKLTPMGRNSEKPK-----------SGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEK----
        +T+ ++   AP + L  +    E PK            GK K++YR KS                        P  +       SG   Q IN  K    
Subjt:  QTISFNTTYAPARKLTPMGRNSEKPK-----------SGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEK----

Query:  FTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDESF
        F+L+VDLG LSP SD  +SSP+    + SP + T     S II +S++                         + ++ +N  + + ++  E  K+ +E+F
Subjt:  FTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDESF

Query:  KKKLNDWLTENDFCLVP
        K+KL +WL  N+  L P
Subjt:  KKKLNDWLTENDFCLVP

A0A5A7TTA1 DUF4283 domain-containing protein1.6e-11837.05Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKKEF L++DKRSR +N+L+TE+G YKSFS+++ P+++EWLK+TF+ L+ TPR  RFF E+RY + C+WV    NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV
        ++GRKCC++VPEG DK+GW  F  MLT ++   +     R        +++  +S DSST   S++ +++YAE ++S S S SS S              
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV

Query:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG
            D    ++   L RRCFH DW K+  +LR+  D   S   + PFHADKALL+++++E A LLC N GW TVG FYVKFE+W+ + HA  K+IP YGG
Subjt:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG

Query:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN
        W  FRGIPLH WN+NTF QIG A GGF++ A  +  KL+L +A IK+K+NY GF+P  +   D+EG +F +QTV    GKWL  RNP+IHG+FT+ AA N
Subjt:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN

Query:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE
        ++EF+  AE Y+FR N A +A                       + +++K     S  +++    +D S+S              +  KGKQ C     +
Subjt:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE

Query:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS
        K +P    + V+F SP+N+T  F+   AP + L          M   S++P     K++YR KS    +  E    + +     G ++P           
Subjt:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS

Query:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV
                  +F L VDLG +S +SD   S PE+    PSP + T     S I+ +SL  +++                          ++  +K  + +
Subjt:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV

Query:  IEHGKSNDESFKKKLNDWLTENDFCL
         E  + ++ SFK+KL DWL EN+  L
Subjt:  IEHGKSNDESFKKKLNDWLTENDFCL

A0A5D3DLT1 DUF4283 domain-containing protein5.5e-12437.08Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKKEF L++DKRSR +N+L+TE+G YKSFS+++ P+++EWLK+TF+ L+ TPR  RFF E+RY + C+WV K  NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCA----------PRQQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSS------------PL
        ++GRKCC++VPEG DK+GW  F  MLT ++                  +++  +S DSST   S+T +++Y E ++S S S  SSS             L
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFREGRPPCA----------PRQQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSS------------PL

Query:  DEDLFA-NRDLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTAS
         + L A   ++  E  +++  WE  ++LSRRCFH DW K+  +L++  D   S   + PFHADKALL+++++E ANLLC N GW TVG FYVKFE+W+ +
Subjt:  DEDLFA-NRDLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTAS

Query:  KHASPKLIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNP
         HA  K+IP YGGW  FRGIPLH WN+NTF QIG ACGGF++ A  +  KL+L +A IK+K+NY GF+P  +   D+EG +F VQT+    GKWL  RNP
Subjt:  KHASPKLIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNP

Query:  TIHGTFTREAARNYDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QM
        ++HG+FT+ AA  ++EF+  AE Y+FR N A +A                       + +++K     S  +++    +D+S+S              + 
Subjt:  TIHGTFTREAARNYDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QM

Query:  DKGKQPCTD--LEKATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPV
         KGKQ C+    +K +P    + V+F SP+N+T  F+   AP + L          M   S++P     K++YR KS +  + +              P 
Subjt:  DKGKQPCTD--LEKATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPV

Query:  EPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAV
            K    + P+          +F L VDLG +SP+SD   S PE+    PSP + T     S I+ +SL  +++                        
Subjt:  EPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAV

Query:  IDDNSPRKDPQEVIEHGKSNDESFKKKLNDWLTENDFCL
          ++  +K  + + E  +  + +FK+KL DWL EN+  L
Subjt:  IDDNSPRKDPQEVIEHGKSNDESFKKKLNDWLTENDFCL

A0A5D3E0Y8 DUF4283 domain-containing protein1.6e-11837.05Show/hide
Query:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID
        MA +  LPR   +EKKEF L++DKRSR +N+L+TE+G YKSFS+++ P+++EWLK+TF+ L+ TPR  RFF E+RY + C+WV    NRRG IAEI+R+D
Subjt:  MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRID

Query:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV
        ++GRKCC++VPEG DK+GW  F  MLT ++   +     R        +++  +S DSST   S++ +++YAE ++S S S SS S              
Subjt:  EKGRKCCVMVPEGFDKSGWNSFLSMLTFRE--GRPPCAPR--------QQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAV

Query:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG
            D    ++   L RRCFH DW K+  +LR+  D   S   + PFHADKALL+++++E A LLC N GW TVG FYVKFE+W+ + HA  K+IP YGG
Subjt:  EIDEDNFAWENIVVLSRRCFHYDWDKVFKKLREMNDYVAS---FNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGG

Query:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN
        W  FRGIPLH WN+NTF QIG A GGF++ A  +  KL+L +A IK+K+NY GF+P  +   D+EG +F +QTV    GKWL  RNP+IHG+FT+ AA N
Subjt:  WVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDATIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARN

Query:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE
        ++EF+  AE Y+FR N A +A                       + +++K     S  +++    +D S+S              +  KGKQ C     +
Subjt:  YDEFDASAESYSFRGNSACVAN--------------------HVSASVRKILIKASTTDVSVPNQADASSSK-------------QMDKGKQPCTD--LE

Query:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS
        K +P    + V+F SP+N+T  F+   AP + L          M   S++P     K++YR KS    +  E    + +     G ++P           
Subjt:  KATP--KLKSVTFPSPRNQTISFNTTYAPARKL--------TPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPS

Query:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV
                  +F L VDLG +S +SD   S PE+    PSP + T     S I+ +SL  +++                          ++  +K  + +
Subjt:  GPVYQDINGEKFTLSVDLGSLSPISDAPISSPEN---TPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEV

Query:  IEHGKSNDESFKKKLNDWLTENDFCL
         E  + ++ SFK+KL DWL EN+  L
Subjt:  IEHGKSNDESFKKKLNDWLTENDFCL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.9e-0640.91Show/hide
Query:  DIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWEPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNN
        DI   +   S +LP+ YLG+PL      T  + P +EKI+ RI  W    LS  GRL LI SV+++
Subjt:  DIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWEPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNN

AT3G25270.1 Ribonuclease H-like superfamily protein2.4e-1031.45Show/hide
Query:  PGES-ILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGV----LSVPPATIQSLC
        PG++ I   +WK     KIK F+W L   ++ T D L+   R    N   C  C    E+  HLF  C    F   +V++A G+    L     T+++  
Subjt:  PGES-ILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGV----LSVPPATIQSLC

Query:  EDLLAFKASSQRQILLRNFSIASLWIVWNERNNRIFQDK----------VRSGIQLWED
        E LL     + RQ  L N +I  LW +W  RN  +FQ K           R+ +Q WED
Subjt:  EDLLAFKASSQRQILLRNFSIASLWIVWNERNNRIFQDK----------VRSGIQLWED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTTGTTCGACATCTACCAAGATCAGTTCTCATCGAGAAAAAGGAATTCACCTTGACGATCGACAAGCGTTCTCGCGGCAACAATGTTCTTCTTACTGAAATTGG
CTCCTACAAATCTTTCTCTATGTCTCTTCTCCCTGAAGCAATTGAATGGCTCAAGGTAACTTTTCAAACTCTGATTGCTACTCCCCGAAACAATCGCTTTTTTCAGGAAA
GAAGATATGAGGAATGCTGTATTTGGGTGGACAAAACGACAAACAGACGTGGAAATATCGCTGAGATTTTTCGTATTGACGAAAAAGGAAGGAAATGCTGTGTAATGGTC
CCGGAAGGTTTCGACAAAAGTGGTTGGAATTCGTTTTTATCAATGCTCACTTTCAGAGAGGGGCGCCCTCCGTGTGCTCCGAGACAACAGAGATTCCATAGAAGCCCTGA
TTCATCTACTTCTTCTGTCAGTGACACCTCAAAAAGATCTTATGCTGAAGTCCTCACCTCCAAATCTTTGTCAGATTCTAGTAGCTCCCCTTTGGATGAAGATCTTTTCG
CCAATAGAGACCTTGCTGTAGAGATTGATGAGGATAATTTTGCTTGGGAAAATATTGTTGTTCTAAGCCGTCGTTGCTTTCACTATGATTGGGATAAAGTGTTCAAAAAG
CTCCGTGAGATGAACGATTATGTTGCTTCCTTCAACCCCTTTCATGCAGACAAAGCCCTTCTGTATATGAGGAATGAAGAACAGGCAAATCTTCTATGCATGAACAAAGG
GTGGGTTACGGTTGGGAATTTCTATGTTAAATTCGAACGTTGGACTGCTTCAAAGCATGCTTCCCCTAAATTAATCCCCTGCTATGGAGGTTGGGTCAGTTTCAGAGGTA
TTCCCCTCCATGCATGGAACATCAACACTTTCAAGCAAATTGGAAACGCCTGCGGTGGTTTTCTGGAGGTTGCTCGTTGCACTTGGCGTAAACTCGATCTCATAGACGCC
ACCATCAAAATAAAGGATAATTACTGTGGTTTCATCCCTGATTCTGTTATTTTCTCCGATGATGAAGGGACAAATTTTACTGTTCAGACGGTGGTTCCGTTGATCGGAAA
ATGGCTGGTCTGTCGCAACCCTACAATCCATGGTACGTTCACACGGGAAGCTGCTAGAAATTATGACGAGTTTGATGCCTCTGCTGAATCCTACAGTTTTCGAGGCAATT
CTGCATGCGTAGCCAACCACGTCAGTGCATCGGTCAGGAAAATTCTCATCAAAGCATCAACTACTGACGTTTCAGTTCCAAATCAGGCAGACGCTTCATCTTCCAAACAG
ATGGATAAAGGTAAACAACCTTGTACAGACCTCGAAAAAGCAACTCCCAAACTGAAATCTGTGACGTTTCCTTCCCCAAGAAACCAGACTATATCCTTCAATACCACTTA
TGCTCCAGCGAGGAAATTAACTCCAATGGGTCGAAATTCTGAAAAGCCCAAATCCGGCAAAGGTAAGGAGATATATCGACCCAAATCCACTACTGGAATAAAAATATCTG
AGCCCGCCCAAGGGAAAGTCAATCTCGTTGCACCTTTTGGCCCAGTTGAGCCCGCCCACAAGAAAGTCAATCTCGTTGCTCCTTCTGGCCCAGTTTACCAAGATATTAAT
GGTGAAAAGTTTACTTTAAGTGTTGATTTGGGCTCATTGTCCCCTATATCTGATGCACCCATTTCAAGTCCAGAAAATACTCCATCTCCAAAAGCCCACACTGTGATTGA
ACCTCCTTCAGCAATTATTAATGAAAGTCTCAAATTTTTGGTTTCTCCGGACAAAATGGATAGTACGGGTGAAGACAGTTTGAATGGCACTCCTAGATTCAAGAACATTG
AAGCAGTCATTGATGATAATAGTCCTCGGAAGGATCCTCAAGAAGTTATAGAACATGGCAAGTCGAATGACGAAAGTTTCAAGAAGAAGCTCAACGACTGGCTGACTGAA
AACGATTTTTGTCTTGTTCCTACTAAATCTGTTTCGGGTTTATTTTGTAATACTTCTACTTCTGATGATCATGTAACAAGCCGTCTGGCTTTGGGGAATCTTGATTTTGT
CATTCTTACTGAAACTAAATTGACTAACGTCAGCAAGCGTATTATCAAATCGTTATTTAGTTCTATTAGTGTCAATTGGATTGCCCTGGACGCGCTTGGTTCTTCAGGAG
GTTTCTCTTTTATGGGAAGACTTAAGCTTCTAGCTCGGAAAGTCAAAGATTGGAAGTCTTCCAATTCCGAATCTTTCAAAGAAAAGAAAAGGGTCTTAATAACAGAAATA
GATCGCATTGATTCGCTGGAATCCATGGGTTCTTTGGATGATATTGCTAGCTCTCTCAGAAAATCGCTAAAAGCTGATCTCCAGCAAACTGCTCTTTTAGAAGCTCGCTA
TTGGAATCAGCGTTGCAAAAAGCTCTGGACTACTGATATTGCTCGCTTTTGGGGTTGCTGCTCTCATTCTCTGCCAATTGCTTATCTTGGTGTCCCTTTAGGCGGCATTC
CAAAAAATACTCAGTTTTGGGAGCCCACGATTGAGAAGATTCAGAGACGAATTCACAATTGGCGGTTTGTTTCTCTTTCTAAGGGAGGTCGTCTTACTCTTATTCAATCG
GTTCTTAACAATTCTTTTCGTGGTTCTGATGTTCATCGTTGGCTAGCTTCCGAAGACGGTTCCTTCTCCACAAAAGCTGCTCGATCCGTCCTTTTGGTTGCTCCTCCTAG
ACCTTTTTATAGCCCTGGAGAATCAATTCTCAACAACCTTTGGAAAGCTGATATCCCTAAAAAAATAAAGGTTTTCATCTGGTCCCTTTTTCATAGAAGTGTCAACACGA
CTGACAGGCTTCAGAAAATTTTTCGAGGCTCGTACTTCAATCCTTCCATCTGTCTCCTTTGCCGAATGAGTTCCGAAAGTATCGATCATCTATTCATTCATTGCTGCTGT
GTGTCCTTTCTTCGAAACAAGGTTTACCAAGCTTTGGGTGTTCTTTCTGTTCCTCCGGCAACGATTCAATCCCTTTGCGAAGATTTGCTTGCTTTCAAAGCTTCATCCCA
AAGACAAATTCTTCTCCGAAATTTTTCTATTGCCTCCCTTTGGATCGTGTGGAATGAACGCAACAATCGTATTTTTCAGGACAAGGTTCGCAGCGGGATTCAACTTTGGG
AGGATGTCATTTCTCTTGCCGCTTTTTGGGCTACAAGATCAAAAGCTTTCTCTGATTATTCTGCTTCTACTATTGCTTTAAATTGGGAATCCTTTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCATTTGTTCGACATCTACCAAGATCAGTTCTCATCGAGAAAAAGGAATTCACCTTGACGATCGACAAGCGTTCTCGCGGCAACAATGTTCTTCTTACTGAAATTGG
CTCCTACAAATCTTTCTCTATGTCTCTTCTCCCTGAAGCAATTGAATGGCTCAAGGTAACTTTTCAAACTCTGATTGCTACTCCCCGAAACAATCGCTTTTTTCAGGAAA
GAAGATATGAGGAATGCTGTATTTGGGTGGACAAAACGACAAACAGACGTGGAAATATCGCTGAGATTTTTCGTATTGACGAAAAAGGAAGGAAATGCTGTGTAATGGTC
CCGGAAGGTTTCGACAAAAGTGGTTGGAATTCGTTTTTATCAATGCTCACTTTCAGAGAGGGGCGCCCTCCGTGTGCTCCGAGACAACAGAGATTCCATAGAAGCCCTGA
TTCATCTACTTCTTCTGTCAGTGACACCTCAAAAAGATCTTATGCTGAAGTCCTCACCTCCAAATCTTTGTCAGATTCTAGTAGCTCCCCTTTGGATGAAGATCTTTTCG
CCAATAGAGACCTTGCTGTAGAGATTGATGAGGATAATTTTGCTTGGGAAAATATTGTTGTTCTAAGCCGTCGTTGCTTTCACTATGATTGGGATAAAGTGTTCAAAAAG
CTCCGTGAGATGAACGATTATGTTGCTTCCTTCAACCCCTTTCATGCAGACAAAGCCCTTCTGTATATGAGGAATGAAGAACAGGCAAATCTTCTATGCATGAACAAAGG
GTGGGTTACGGTTGGGAATTTCTATGTTAAATTCGAACGTTGGACTGCTTCAAAGCATGCTTCCCCTAAATTAATCCCCTGCTATGGAGGTTGGGTCAGTTTCAGAGGTA
TTCCCCTCCATGCATGGAACATCAACACTTTCAAGCAAATTGGAAACGCCTGCGGTGGTTTTCTGGAGGTTGCTCGTTGCACTTGGCGTAAACTCGATCTCATAGACGCC
ACCATCAAAATAAAGGATAATTACTGTGGTTTCATCCCTGATTCTGTTATTTTCTCCGATGATGAAGGGACAAATTTTACTGTTCAGACGGTGGTTCCGTTGATCGGAAA
ATGGCTGGTCTGTCGCAACCCTACAATCCATGGTACGTTCACACGGGAAGCTGCTAGAAATTATGACGAGTTTGATGCCTCTGCTGAATCCTACAGTTTTCGAGGCAATT
CTGCATGCGTAGCCAACCACGTCAGTGCATCGGTCAGGAAAATTCTCATCAAAGCATCAACTACTGACGTTTCAGTTCCAAATCAGGCAGACGCTTCATCTTCCAAACAG
ATGGATAAAGGTAAACAACCTTGTACAGACCTCGAAAAAGCAACTCCCAAACTGAAATCTGTGACGTTTCCTTCCCCAAGAAACCAGACTATATCCTTCAATACCACTTA
TGCTCCAGCGAGGAAATTAACTCCAATGGGTCGAAATTCTGAAAAGCCCAAATCCGGCAAAGGTAAGGAGATATATCGACCCAAATCCACTACTGGAATAAAAATATCTG
AGCCCGCCCAAGGGAAAGTCAATCTCGTTGCACCTTTTGGCCCAGTTGAGCCCGCCCACAAGAAAGTCAATCTCGTTGCTCCTTCTGGCCCAGTTTACCAAGATATTAAT
GGTGAAAAGTTTACTTTAAGTGTTGATTTGGGCTCATTGTCCCCTATATCTGATGCACCCATTTCAAGTCCAGAAAATACTCCATCTCCAAAAGCCCACACTGTGATTGA
ACCTCCTTCAGCAATTATTAATGAAAGTCTCAAATTTTTGGTTTCTCCGGACAAAATGGATAGTACGGGTGAAGACAGTTTGAATGGCACTCCTAGATTCAAGAACATTG
AAGCAGTCATTGATGATAATAGTCCTCGGAAGGATCCTCAAGAAGTTATAGAACATGGCAAGTCGAATGACGAAAGTTTCAAGAAGAAGCTCAACGACTGGCTGACTGAA
AACGATTTTTGTCTTGTTCCTACTAAATCTGTTTCGGGTTTATTTTGTAATACTTCTACTTCTGATGATCATGTAACAAGCCGTCTGGCTTTGGGGAATCTTGATTTTGT
CATTCTTACTGAAACTAAATTGACTAACGTCAGCAAGCGTATTATCAAATCGTTATTTAGTTCTATTAGTGTCAATTGGATTGCCCTGGACGCGCTTGGTTCTTCAGGAG
GTTTCTCTTTTATGGGAAGACTTAAGCTTCTAGCTCGGAAAGTCAAAGATTGGAAGTCTTCCAATTCCGAATCTTTCAAAGAAAAGAAAAGGGTCTTAATAACAGAAATA
GATCGCATTGATTCGCTGGAATCCATGGGTTCTTTGGATGATATTGCTAGCTCTCTCAGAAAATCGCTAAAAGCTGATCTCCAGCAAACTGCTCTTTTAGAAGCTCGCTA
TTGGAATCAGCGTTGCAAAAAGCTCTGGACTACTGATATTGCTCGCTTTTGGGGTTGCTGCTCTCATTCTCTGCCAATTGCTTATCTTGGTGTCCCTTTAGGCGGCATTC
CAAAAAATACTCAGTTTTGGGAGCCCACGATTGAGAAGATTCAGAGACGAATTCACAATTGGCGGTTTGTTTCTCTTTCTAAGGGAGGTCGTCTTACTCTTATTCAATCG
GTTCTTAACAATTCTTTTCGTGGTTCTGATGTTCATCGTTGGCTAGCTTCCGAAGACGGTTCCTTCTCCACAAAAGCTGCTCGATCCGTCCTTTTGGTTGCTCCTCCTAG
ACCTTTTTATAGCCCTGGAGAATCAATTCTCAACAACCTTTGGAAAGCTGATATCCCTAAAAAAATAAAGGTTTTCATCTGGTCCCTTTTTCATAGAAGTGTCAACACGA
CTGACAGGCTTCAGAAAATTTTTCGAGGCTCGTACTTCAATCCTTCCATCTGTCTCCTTTGCCGAATGAGTTCCGAAAGTATCGATCATCTATTCATTCATTGCTGCTGT
GTGTCCTTTCTTCGAAACAAGGTTTACCAAGCTTTGGGTGTTCTTTCTGTTCCTCCGGCAACGATTCAATCCCTTTGCGAAGATTTGCTTGCTTTCAAAGCTTCATCCCA
AAGACAAATTCTTCTCCGAAATTTTTCTATTGCCTCCCTTTGGATCGTGTGGAATGAACGCAACAATCGTATTTTTCAGGACAAGGTTCGCAGCGGGATTCAACTTTGGG
AGGATGTCATTTCTCTTGCCGCTTTTTGGGCTACAAGATCAAAAGCTTTCTCTGATTATTCTGCTTCTACTATTGCTTTAAATTGGGAATCCTTTATGTAG
Protein sequenceShow/hide protein sequence
MAFVRHLPRSVLIEKKEFTLTIDKRSRGNNVLLTEIGSYKSFSMSLLPEAIEWLKVTFQTLIATPRNNRFFQERRYEECCIWVDKTTNRRGNIAEIFRIDEKGRKCCVMV
PEGFDKSGWNSFLSMLTFREGRPPCAPRQQRFHRSPDSSTSSVSDTSKRSYAEVLTSKSLSDSSSSPLDEDLFANRDLAVEIDEDNFAWENIVVLSRRCFHYDWDKVFKK
LREMNDYVASFNPFHADKALLYMRNEEQANLLCMNKGWVTVGNFYVKFERWTASKHASPKLIPCYGGWVSFRGIPLHAWNINTFKQIGNACGGFLEVARCTWRKLDLIDA
TIKIKDNYCGFIPDSVIFSDDEGTNFTVQTVVPLIGKWLVCRNPTIHGTFTREAARNYDEFDASAESYSFRGNSACVANHVSASVRKILIKASTTDVSVPNQADASSSKQ
MDKGKQPCTDLEKATPKLKSVTFPSPRNQTISFNTTYAPARKLTPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQGKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDIN
GEKFTLSVDLGSLSPISDAPISSPENTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKSNDESFKKKLNDWLTE
NDFCLVPTKSVSGLFCNTSTSDDHVTSRLALGNLDFVILTETKLTNVSKRIIKSLFSSISVNWIALDALGSSGGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEI
DRIDSLESMGSLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWEPTIEKIQRRIHNWRFVSLSKGGRLTLIQS
VLNNSFRGSDVHRWLASEDGSFSTKAARSVLLVAPPRPFYSPGESILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCC
VSFLRNKVYQALGVLSVPPATIQSLCEDLLAFKASSQRQILLRNFSIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM