; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009564 (gene) of Chayote v1 genome

Gene IDSed0009564
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG06:33731538..33732715
RNA-Seq ExpressionSed0009564
SyntenySed0009564
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG67121.1 hypothetical protein EZV62_008396 [Acer yangbiense]4.9e-2628.96Show/hide
Query:  YEKLPDFCFECGCIGHLTKECPKVIEERLILQ--KYEYGDWMGAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEGGRSGGESWRRIMNEEEED
        YE+LP+FC+ C  +GH   EC  V   + +L     ++G W+ A   +K   +   +G G    + + T           EG R G             +
Subjt:  YEKLPDFCFECGCIGHLTKECPKVIEERLILQ--KYEYGDWMGAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEGGRSGGESWRRIMNEEEED

Query:  SDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDVGGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGE-KQGKGHEEAVVKCRMRKFRRFK
         D  +++    +A  K   +         N G     +  G L+   +EG  +I+++    MC  GP   ++ +VG+ K+ K   ++         ++ +
Subjt:  SDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDVGGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGE-KQGKGHEEAVVKCRMRKFRRFK

Query:  GEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFC
        G   SM+  + A QV K K   +  D         +R  P  M  LSWNVRG GN  A A L  +L+ H P ++FL ETK   +   +L+  L +   FC
Subjt:  GEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFC

Query:  IPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSW-DDICFRFSGVYGFPASNDKHLTCALLRRL
        + S+G SGG+ + W + + ++V+SSS  +ID RVS  D  C+RFSG YG P ++++  + +LL RL
Subjt:  IPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSW-DDICFRFSGVYGFPASNDKHLTCALLRRL

VFQ84297.1 unnamed protein product [Cuscuta campestris]9.6e-2238.26Show/hide
Query:  GKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEG
        G+   +D D+   +  KR R PPG M ++SWN RG GN R + ++ ++  +  P+ +FLMETKC  +  E+LR +L F+  F + S+GLSGG+ +LW E 
Subjt:  GKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEG

Query:  VHINVISSSRNYIDCRVSWDDI-CFRFSGVYGFPASNDKHLTCALLRRL
           N+IS SR ++D  VS  ++  +R +G YG P  + +  +  LL+ L
Subjt:  VHINVISSSRNYIDCRVSWDDI-CFRFSGVYGFPASNDKHLTCALLRRL

XP_018826186.1 uncharacterized protein LOC108995141 [Juglans regia]7.4e-2226.53Show/hide
Query:  MEGNVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQKYEYGDWMGAPYVKKSGQKTGEEGVG---QESPKRERTGDRGGRDGRWAEGGRSGGE
        + GN    P  YEKLP  CF CGC+ H    C                           GQ   EE      +ES K+E    R G     A+ G++G  
Subjt:  MEGNVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQKYEYGDWMGAPYVKKSGQKTGEEGVG---QESPKRERTGDRGGRDGRWAEGGRSGGE

Query:  SWRRIMNEEEEDSDKDMNVVKGKIAKAKGGLIGDTREENSN--NRGIVEQGDVGGGLVCLNKEGENIIENS------TLSRMCGSGPSKISNDEVGEKQG
        S        +ED  +     KG+  +++GG   D  EEN +  + G++ +G+         +E EN+++ +       +  +   G   I  ++  +K+G
Subjt:  SWRRIMNEEEEDSDKDMNVVKGKIAKAKGGLIGDTREENSN--NRGIVEQGDVGGGLVCLNKEGENIIENS------TLSRMCGSGPSKISNDEVGEKQG

Query:  KGHEEAVVKCRMRKFRRFKGEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKC
        +    A                             KGK+ + D  +           PP IMKILSWN RG GN R + DL +++  + P ++F+METK 
Subjt:  KGHEEAVVKCRMRKFRRFKGEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKC

Query:  GVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRV------SWDDICFRFSGVYGFPASNDKHLTCALLRRLSP
           SF+ LRR+LQ + CF + ++G  GG+ +LW   V   +++ S+++I+  +       W   CF     YG P +N +  + +LL  L P
Subjt:  GVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRV------SWDDICFRFSGVYGFPASNDKHLTCALLRRLSP

XP_020412490.1 uncharacterized protein LOC18793550 [Prunus persica]7.4e-2241.27Show/hide
Query:  MKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVS--WDDIC
        M +LSWN RG GN R + DL+ ++    P V+FL ET+C   +F  ++ QL FD+CF + ++GLSGG+ + W   +++ + SSS ++ID  V    D + 
Subjt:  MKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVS--WDDIC

Query:  FRFSGVYGFPASNDKHLTCALLRRLS
        +R +G YG+PA+ D HL+  LLR L+
Subjt:  FRFSGVYGFPASNDKHLTCALLRRLS

XP_030486805.1 uncharacterized protein LOC115703709 [Cannabis sativa]7.4e-2241.94Show/hide
Query:  MKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSWDD-ICF
        M ++SWN RG G+ RA   L+ ++  H P VLF+ME+K  +    K R    F +   +P +GLSGG+ +LW   V+INV++   N++DC ++  D + F
Subjt:  MKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSWDD-ICF

Query:  RFSGVYGFPASNDKHLTCALLRRL
         FSG YG PA++ +HLT  LL+RL
Subjt:  RFSGVYGFPASNDKHLTCALLRRL

TrEMBL top hitse value%identityAlignment
A0A2I4F3F9 uncharacterized protein LOC1089951413.6e-2226.53Show/hide
Query:  MEGNVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQKYEYGDWMGAPYVKKSGQKTGEEGVG---QESPKRERTGDRGGRDGRWAEGGRSGGE
        + GN    P  YEKLP  CF CGC+ H    C                           GQ   EE      +ES K+E    R G     A+ G++G  
Subjt:  MEGNVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQKYEYGDWMGAPYVKKSGQKTGEEGVG---QESPKRERTGDRGGRDGRWAEGGRSGGE

Query:  SWRRIMNEEEEDSDKDMNVVKGKIAKAKGGLIGDTREENSN--NRGIVEQGDVGGGLVCLNKEGENIIENS------TLSRMCGSGPSKISNDEVGEKQG
        S        +ED  +     KG+  +++GG   D  EEN +  + G++ +G+         +E EN+++ +       +  +   G   I  ++  +K+G
Subjt:  SWRRIMNEEEEDSDKDMNVVKGKIAKAKGGLIGDTREENSN--NRGIVEQGDVGGGLVCLNKEGENIIENS------TLSRMCGSGPSKISNDEVGEKQG

Query:  KGHEEAVVKCRMRKFRRFKGEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKC
        +    A                             KGK+ + D  +           PP IMKILSWN RG GN R + DL +++  + P ++F+METK 
Subjt:  KGHEEAVVKCRMRKFRRFKGEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKC

Query:  GVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRV------SWDDICFRFSGVYGFPASNDKHLTCALLRRLSP
           SF+ LRR+LQ + CF + ++G  GG+ +LW   V   +++ S+++I+  +       W   CF     YG P +N +  + +LL  L P
Subjt:  GVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRV------SWDDICFRFSGVYGFPASNDKHLTCALLRRLSP

A0A2N9EH45 Reverse transcriptase domain-containing protein1.1e-2325.79Show/hide
Query:  GNVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQK--YEYGDWM----GAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEGGRSGG
        GN       YE+LP+FC+ CG + H  ++C   +  +  L++   +YG W+    G P+ +   +  G   +    P+ +R  +          G  +GG
Subjt:  GNVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQK--YEYGDWM----GAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEGGRSGG

Query:  -------------------------ESWRRIMNEEEE------------DSDKDMNVVKGKIAKAKGGLIGDTREEN----------------SNNRGIV
                                 ES   +++  E+            D+ +   + K  I++ +     D    N                 +N  +V
Subjt:  -------------------------ESWRRIMNEEEE------------DSDKDMNVVKGKIAKAKGGLIGDTREEN----------------SNNRGIV

Query:  EQGDVGGGL-VCLNKEGENIIENSTLSRMCGSGPSKISNDEVGEKQGK------GHEEAVVKCRMR----KFRRFKGEHISMNLDLNAIQVDKGKRKLED
        +    G    +CL +      + S        G     +  VG++  K         + VV+   R    +  R K  H++     +   V +G  KL +
Subjt:  EQGDVGGGL-VCLNKEGENIIENSTLSRMCGSGPSKISNDEVGEKQGK------GHEEAVVKCRMR----KFRRFKGEHISMNLDLNAIQVDKGKRKLED

Query:  MDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVIS
        +D    +  K+ RAPP  M ILSWN RG GN  A+  L ++++   P VLFLMETK      E LR +L+F  CF +PS+G SGG+ +LW +   I V +
Subjt:  MDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVIS

Query:  SSRNYIDCRVSWDD-ICFRFSGVYGFPASNDKHLTCALLRRL
         S+N+ID  V  D+ I +RF+G YGFP  + K  + AL+ +L
Subjt:  SSRNYIDCRVSWDD-ICFRFSGVYGFPASNDKHLTCALLRRL

A0A2N9ERX7 CCHC-type domain-containing protein2.7e-2227.27Show/hide
Query:  NVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQK--YEYGDWMGAPYVKKSGQKTGEEGVGQESPK----RERTGDRGGRDGRWAEGGRSGGE
        N V     YE+LP  CF CG +GHL + C   ++     +    +YG W+ A  V +     G  G+ Q S      R R+   GG    +  G  +G  
Subjt:  NVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQK--YEYGDWMGAPYVKKSGQKTGEEGVGQESPK----RERTGDRGGRDGRWAEGGRSGGE

Query:  SWRRIMNEEEEDSDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDV--GGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGEKQ----GKG
                   D+   +++V+           G++ E    +  + E G V   GG V     G +    +   +  GSGPS +    V E Q    G G
Subjt:  SWRRIMNEEEEDSDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDV--GGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGEKQ----GKG

Query:  HEEA-----------VVKCRMRKFRRFKGEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQ
         ++A           + K + +K   +K + +    +  +  +   KR LE  + V+ +     R P   M++LSWN RG GN + + +L  +L+   P 
Subjt:  HEEA-----------VVKCRMRKFRRFKGEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQ

Query:  VLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSWDD--ICFRFSGVYGFPASNDKHLTCALLRRLS
        +LFL ET+      E LR + +F + FC+P + + GG+ +LW + V + + S S+N+ID  V   D  + FR +  YG P ++ +  T ALL+ LS
Subjt:  VLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSWDD--ICFRFSGVYGFPASNDKHLTCALLRRLS

A0A5C7ID90 WD_REPEATS_REGION domain-containing protein2.4e-2628.96Show/hide
Query:  YEKLPDFCFECGCIGHLTKECPKVIEERLILQ--KYEYGDWMGAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEGGRSGGESWRRIMNEEEED
        YE+LP+FC+ C  +GH   EC  V   + +L     ++G W+ A   +K   +   +G G    + + T           EG R G             +
Subjt:  YEKLPDFCFECGCIGHLTKECPKVIEERLILQ--KYEYGDWMGAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEGGRSGGESWRRIMNEEEED

Query:  SDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDVGGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGE-KQGKGHEEAVVKCRMRKFRRFK
         D  +++    +A  K   +         N G     +  G L+   +EG  +I+++    MC  GP   ++ +VG+ K+ K   ++         ++ +
Subjt:  SDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDVGGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGE-KQGKGHEEAVVKCRMRKFRRFK

Query:  GEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFC
        G   SM+  + A QV K K   +  D         +R  P  M  LSWNVRG GN  A A L  +L+ H P ++FL ETK   +   +L+  L +   FC
Subjt:  GEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFC

Query:  IPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSW-DDICFRFSGVYGFPASNDKHLTCALLRRL
        + S+G SGG+ + W + + ++V+SSS  +ID RVS  D  C+RFSG YG P ++++  + +LL RL
Subjt:  IPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSW-DDICFRFSGVYGFPASNDKHLTCALLRRL

A0A803PLV0 Uncharacterized protein6.1e-2228.96Show/hide
Query:  YEKLPDFCFECGCIGHLTKECPKVIEERLILQKYEYGDWMGAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEG---GRSGGESWRRIMNEEEE
        YE+ P FCF CG IGH  K C K+ E+ L      YG++M A +                  +++  G R  R   W  G   G S G    R  ++   
Subjt:  YEKLPDFCFECGCIGHLTKECPKVIEERLILQKYEYGDWMGAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEG---GRSGGESWRRIMNEEEE

Query:  DSDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDVGGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGEKQGKGHEEAVVKCRMRKFRRFK
        +SD              GG IG   + N ++ G   QG   G L    + G+N           G  P    N+E+ E+  + H++  V           
Subjt:  DSDKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDVGGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGEKQGKGHEEAVVKCRMRKFRRFK

Query:  GEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFC
                    + +D  +RK       EE+E K A      + +  WN RG GN RA   + +++    P V+FL ET C     E+L   + F+ CF 
Subjt:  GEHISMNLDLNAIQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFC

Query:  IPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSWD-DICFRFSGVYGFPASNDKHLTCALLRRL
        +   G SGG+ MLW E   +++ S SRN+ID  V W+ +  FR +G+YG P  N +  T  LLR L
Subjt:  IPSMGLSGGIGMLWGEGVHINVISSSRNYIDCRVSWD-DICFRFSGVYGFPASNDKHLTCALLRRL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTAATGTTGTTCTTTGTCCAATTCTTTATGAAAAACTTCCAGATTTCTGTTTTGAATGTGGATGTATTGGACACTTGACGAAAGAATGTCCTAAAGTGATTGA
AGAAAGGCTAATTCTGCAGAAGTATGAGTATGGTGATTGGATGGGTGCACCTTATGTCAAAAAATCTGGACAAAAAACAGGAGAAGAGGGGGTTGGACAGGAAAGTCCTA
AGAGAGAAAGGACTGGGGATCGTGGAGGGAGAGATGGACGTTGGGCAGAAGGAGGTAGAAGTGGAGGGGAGAGTTGGAGACGTATTATGAATGAGGAAGAGGAGGATTCT
GATAAGGACATGAATGTTGTTAAGGGGAAAATAGCGAAGGCGAAGGGGGGGTTAATAGGGGATACCAGAGAGGAAAATAGTAACAACAGGGGCATTGTTGAACAGGGAGA
TGTTGGAGGAGGTTTGGTTTGTCTCAATAAAGAAGGGGAAAATATTATAGAAAACAGCACTTTAAGTAGGATGTGTGGGTCAGGGCCTTCTAAAATAAGTAATGATGAAG
TGGGAGAGAAACAAGGGAAAGGGCATGAGGAAGCTGTTGTTAAATGTAGAATGAGAAAATTCCGGAGATTCAAAGGAGAACATATTTCTATGAATCTAGACCTGAATGCA
ATTCAAGTTGATAAAGGAAAGAGAAAATTAGAAGATATGGATGTGGTCGAGGAGGTGGAAGGGAAACGTGCTCGAGCCCCACCTGGAATTATGAAAATCCTAAGTTGGAA
TGTCCGAGGAGGGGGGAATTCTCGGGCGTTGGCTGATTTGAAGGACATACTGCGCCTTCATCATCCTCAAGTGTTGTTTTTAATGGAGACAAAATGTGGTGTTGCAAGTT
TTGAGAAACTGAGAAGGCAACTTCAGTTTGATCATTGCTTCTGTATTCCGTCTATGGGGTTAAGTGGTGGCATTGGTATGCTTTGGGGGGAAGGCGTTCACATTAATGTC
ATTTCTTCTTCTCGTAATTACATTGATTGTAGAGTGAGCTGGGATGACATATGCTTTCGTTTTTCGGGAGTTTATGGTTTTCCTGCTTCTAATGATAAACACCTTACATG
TGCTCTTCTAAGGAGGCTTTCTCCATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTAATGTTGTTCTTTGTCCAATTCTTTATGAAAAACTTCCAGATTTCTGTTTTGAATGTGGATGTATTGGACACTTGACGAAAGAATGTCCTAAAGTGATTGA
AGAAAGGCTAATTCTGCAGAAGTATGAGTATGGTGATTGGATGGGTGCACCTTATGTCAAAAAATCTGGACAAAAAACAGGAGAAGAGGGGGTTGGACAGGAAAGTCCTA
AGAGAGAAAGGACTGGGGATCGTGGAGGGAGAGATGGACGTTGGGCAGAAGGAGGTAGAAGTGGAGGGGAGAGTTGGAGACGTATTATGAATGAGGAAGAGGAGGATTCT
GATAAGGACATGAATGTTGTTAAGGGGAAAATAGCGAAGGCGAAGGGGGGGTTAATAGGGGATACCAGAGAGGAAAATAGTAACAACAGGGGCATTGTTGAACAGGGAGA
TGTTGGAGGAGGTTTGGTTTGTCTCAATAAAGAAGGGGAAAATATTATAGAAAACAGCACTTTAAGTAGGATGTGTGGGTCAGGGCCTTCTAAAATAAGTAATGATGAAG
TGGGAGAGAAACAAGGGAAAGGGCATGAGGAAGCTGTTGTTAAATGTAGAATGAGAAAATTCCGGAGATTCAAAGGAGAACATATTTCTATGAATCTAGACCTGAATGCA
ATTCAAGTTGATAAAGGAAAGAGAAAATTAGAAGATATGGATGTGGTCGAGGAGGTGGAAGGGAAACGTGCTCGAGCCCCACCTGGAATTATGAAAATCCTAAGTTGGAA
TGTCCGAGGAGGGGGGAATTCTCGGGCGTTGGCTGATTTGAAGGACATACTGCGCCTTCATCATCCTCAAGTGTTGTTTTTAATGGAGACAAAATGTGGTGTTGCAAGTT
TTGAGAAACTGAGAAGGCAACTTCAGTTTGATCATTGCTTCTGTATTCCGTCTATGGGGTTAAGTGGTGGCATTGGTATGCTTTGGGGGGAAGGCGTTCACATTAATGTC
ATTTCTTCTTCTCGTAATTACATTGATTGTAGAGTGAGCTGGGATGACATATGCTTTCGTTTTTCGGGAGTTTATGGTTTTCCTGCTTCTAATGATAAACACCTTACATG
TGCTCTTCTAAGGAGGCTTTCTCCATGTTGA
Protein sequenceShow/hide protein sequence
MEGNVVLCPILYEKLPDFCFECGCIGHLTKECPKVIEERLILQKYEYGDWMGAPYVKKSGQKTGEEGVGQESPKRERTGDRGGRDGRWAEGGRSGGESWRRIMNEEEEDS
DKDMNVVKGKIAKAKGGLIGDTREENSNNRGIVEQGDVGGGLVCLNKEGENIIENSTLSRMCGSGPSKISNDEVGEKQGKGHEEAVVKCRMRKFRRFKGEHISMNLDLNA
IQVDKGKRKLEDMDVVEEVEGKRARAPPGIMKILSWNVRGGGNSRALADLKDILRLHHPQVLFLMETKCGVASFEKLRRQLQFDHCFCIPSMGLSGGIGMLWGEGVHINV
ISSSRNYIDCRVSWDDICFRFSGVYGFPASNDKHLTCALLRRLSPC