; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008769 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008769
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:29581225..29583520
RNA-Seq ExpressionLag0008769
SyntenyLag0008769
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG53035.1 hypothetical protein EZV62_022204 [Acer yangbiense]3.7e-3030.37Show/hide
Query:  EILTRPRKMRGKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQECEAEKEGQNEEKKYGVALRETQGSKGFYRG
        E++  P + R +  G+ +RVK+KI IS+PLK    +++G   E   V + YE+LP+F ++C R+GH  + C  E     E +K  +     +GS   +  
Subjt:  EILTRPRKMRGKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQECEAEKEGQNEEKKYGVALRETQGSKGFYRG

Query:  WRPEGRGYNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQEN-QPEKSGGKESMNQPANNWPENVPEADKREEEREKVNKDYLTTIEHGL---STTTAMTE
        W         + RG  ++Y         +  SD+  +++ + + E+       +    ++ E   +A    +    V        + GL    T+    +
Subjt:  WRPEGRGYNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQEN-QPEKSGGKESMNQPANNWPENVPEADKREEEREKVNKDYLTTIEHGL---STTTAMTE

Query:  G---KNQSNEEEKG--------KRKGENNQEEGNDKKNRNLDLTYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIVSYSSGHIDL-VVKDPKFF
        G   + + +  + G         ++G ++ +    +  ++L LT           A  +    GSSGGL+LLWK+  DV ++S+S GHID  V  +  F 
Subjt:  G---KNQSNEEEKG--------KRKGENNQEEGNDKKNRNLDLTYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIVSYSSGHIDL-VVKDPKFF

Query:  WRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLD
        W+F+GFYG+ I+ RR  S  LL+RL +  N  LPW+  GDFN+IL   EKEGG++K    M N R+ I  CNL+D G    K+TW   R   N ++ERLD
Subjt:  WRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLD

Query:  RFLIN
        +FL N
Subjt:  RFLIN

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]1.5e-3131.5Show/hide
Query:  GKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQEC--EAEKEGQNEEKKYGVALRETQGSKGFYRGWR---PEG
        G+  G+ +R+++ I +  PLKR   + +G   +   V I YE+LP+FCY C ++GH+ ++C    ++   +   K+G  +R    ++    G +   PEG
Subjt:  GKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQEC--EAEKEGQNEEKKYGVALRETQGSKGFYRGWR---PEG

Query:  RG--------YNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQENQPEKSGGK-ESMNQPANNWPENVPEADK--REEEREKVNKDYLTTIEHGLSTTTAM
                   N R +G  + ++  +D     RD +  + + E    KSG   E+    +     +  E  +  R   +EK+ +D  T     + TT  +
Subjt:  RG--------YNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQENQPEKSGGK-ESMNQPANNWPENVPEADK--REEEREKVNKDYLTTIEHGLSTTTAM

Query:  TE---GKNQSNEE-----------------------EKGKRKGENNQEEGNDKKNRNLDL-TYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIV
        T    G+N SN+E                       EK     E NQ  G  KK+ ++D+  YS      +        R G  GGL LLWK  ++V I 
Subjt:  TE---GKNQSNEE-----------------------EKGKRKGENNQEEGNDKKNRNLDL-TYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIV

Query:  SYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSK
        S++ GHID V+KD     WRFTGFYG  I   R+ S +LL+RL   +N  LPW+V GDFNEIL   EK+GG  +    M + REA+  C L+D G  G+K
Subjt:  SYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSK

Query:  FTWFRGRSKKNKVKERLDR
        +TW   + K   ++ER+DR
Subjt:  FTWFRGRSKKNKVKERLDR

XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber]2.8e-3053.52Show/hide
Query:  SGGLILLWKEGLDVRIVSYSSGHIDLVV--KDPKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENL
        SGGL+LLWK+ L V + SYS  HID +V   D    WRFTGFYGN    RR ES  LLKRLS  +N  LPW+  GDFNE+++S EKEGG S+  Q M N 
Subjt:  SGGLILLWKEGLDVRIVSYSSGHIDLVV--KDPKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENL

Query:  REAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFLINS
         EAI+ C L D G  G  FTW R    +  V+ERLDR L++S
Subjt:  REAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFLINS

XP_023903659.1 uncharacterized protein LOC112015491 [Quercus suber]2.8e-3053.52Show/hide
Query:  SGGLILLWKEGLDVRIVSYSSGHIDLVV--KDPKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENL
        SGGL+LLWK+ L V + SYS  HID +V   D    WRFTGFYGN    RR ES  LLKRLS  +N  LPW+  GDFNE+++S EKEGG S+  Q M N 
Subjt:  SGGLILLWKEGLDVRIVSYSSGHIDLVV--KDPKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENL

Query:  REAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFLINS
         EAI+ C L D G  G  FTW R    +  V+ERLDR L++S
Subjt:  REAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFLINS

XP_023911327.1 uncharacterized protein LOC112022938 [Quercus suber]5.4e-2948.95Show/hide
Query:  GSSGGLILLWKEGLDVRIVSYSSGHIDLVVKDPKF--FWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLME
        G SGGL LLWKEG+DVR  S S+ HID+VV++      WR TGFYG   +E+R  S  LL+ L D     +PW+V GDFNEI++SHEK GG+ + ++ ME
Subjt:  GSSGGLILLWKEGLDVRIVSYSSGHIDLVVKDPKF--FWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLME

Query:  NLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFLIN
        N R+ +  C L D G  G +FTW  GR    + K RLDR + N
Subjt:  NLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFLIN

TrEMBL top hitse value%identityAlignment
A0A2N9FNH6 Reverse transcriptase domain-containing protein2.2e-2850Show/hide
Query:  RTGSSGGLILLWKEGLDVRIVSYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLM
        RTG+ GGL LLWKEG+ V  +S+SS HID+ ++      W FTGFYGN    +R +S TLL+RL  Y +  +PW+V GDFNE+L + EK G  ++    M
Subjt:  RTGSSGGLILLWKEGLDVRIVSYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLM

Query:  ENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDR
        EN R+A+S C L D G  G+KFTW+ GR   + V ERLDR
Subjt:  ENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDR

A0A2N9J3U0 Reverse transcriptase domain-containing protein2.2e-2850Show/hide
Query:  RTGSSGGLILLWKEGLDVRIVSYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLM
        RTG+ GGL LLWKEG+ V  +S+SS HID+ ++      W FTGFYGN    +R +S TLL+RL  Y +  +PW+V GDFNE+L + EK G  ++    M
Subjt:  RTGSSGGLILLWKEGLDVRIVSYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLM

Query:  ENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDR
        EN R+A+S C L D G  G+KFTW+ GR   + V ERLDR
Subjt:  ENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDR

A0A5B6UMM3 Reverse transcriptase1.7e-2830.83Show/hide
Query:  MRGKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQECEA----EKEGQNEEKKYGVALRETQGSKGFYRGWRPE
        +R ++  E  R+++ + + +PL+R      GS     WVP  YEKLP FC+ C  LGH  Q+C      EK    ++  + VAL+      G       E
Subjt:  MRGKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQECEA----EKEGQNEEKKYGVALRETQGSKGFYRGWRPE

Query:  GRGYNPRGRGKG--RSYSWRRDGGKTQRDSDLQENVQENQPEKSGGKESMNQPANNWPENVPEADKREEEREKVNKDYLTTIEHG-LSTTTAMTEGKNQS
           +    + KG  R Y+ R   G    +S +   +  N      G++      + W E +      E+  + V +  +     G L  +    E +N++
Subjt:  GRGYNPRGRGKG--RSYSWRRDGGKTQRDSDLQENVQENQPEKSGGKESMNQPANNWPENVPEADKREEEREKVNKDYLTTIEHG-LSTTTAMTEGKNQS

Query:  NE----EEKGKRK-----GENNQEEGN---DKKNRNLDLTYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIVSYSSGHIDLVVKDPKF--FWRF
        +     +  GKRK     G+ N +E +   +K+         G   GI  EA       GS GGL + WK+ LDV + S+S  HID+++K+      WR+
Subjt:  NE----EEKGKRK-----GENNQEEGN---DKKNRNLDLTYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIVSYSSGHIDLVVKDPKF--FWRF

Query:  TGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFL
        TGFYG+   + +    +LLKRL+    +  PW+V GDFNEILYS EK GG  + ++ ME+ RE +  C L D G +G  +TW RG   +  ++ERLD+ L
Subjt:  TGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFL

Query:  INSTLQSVVSSL
            ++S V SL
Subjt:  INSTLQSVVSSL

A0A5C7H981 Uncharacterized protein1.8e-3030.37Show/hide
Query:  EILTRPRKMRGKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQECEAEKEGQNEEKKYGVALRETQGSKGFYRG
        E++  P + R +  G+ +RVK+KI IS+PLK    +++G   E   V + YE+LP+F ++C R+GH  + C  E     E +K  +     +GS   +  
Subjt:  EILTRPRKMRGKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQECEAEKEGQNEEKKYGVALRETQGSKGFYRG

Query:  WRPEGRGYNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQEN-QPEKSGGKESMNQPANNWPENVPEADKREEEREKVNKDYLTTIEHGL---STTTAMTE
        W         + RG  ++Y         +  SD+  +++ + + E+       +    ++ E   +A    +    V        + GL    T+    +
Subjt:  WRPEGRGYNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQEN-QPEKSGGKESMNQPANNWPENVPEADKREEEREKVNKDYLTTIEHGL---STTTAMTE

Query:  G---KNQSNEEEKG--------KRKGENNQEEGNDKKNRNLDLTYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIVSYSSGHIDL-VVKDPKFF
        G   + + +  + G         ++G ++ +    +  ++L LT           A  +    GSSGGL+LLWK+  DV ++S+S GHID  V  +  F 
Subjt:  G---KNQSNEEEKG--------KRKGENNQEEGNDKKNRNLDLTYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIVSYSSGHIDL-VVKDPKFF

Query:  WRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLD
        W+F+GFYG+ I+ RR  S  LL+RL +  N  LPW+  GDFN+IL   EKEGG++K    M N R+ I  CNL+D G    K+TW   R   N ++ERLD
Subjt:  WRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLD

Query:  RFLIN
        +FL N
Subjt:  RFLIN

A0A5C7H9Y2 CCHC-type domain-containing protein7.3e-3231.5Show/hide
Query:  GKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQEC--EAEKEGQNEEKKYGVALRETQGSKGFYRGWR---PEG
        G+  G+ +R+++ I +  PLKR   + +G   +   V I YE+LP+FCY C ++GH+ ++C    ++   +   K+G  +R    ++    G +   PEG
Subjt:  GKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQEC--EAEKEGQNEEKKYGVALRETQGSKGFYRGWR---PEG

Query:  RG--------YNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQENQPEKSGGK-ESMNQPANNWPENVPEADK--REEEREKVNKDYLTTIEHGLSTTTAM
                   N R +G  + ++  +D     RD +  + + E    KSG   E+    +     +  E  +  R   +EK+ +D  T     + TT  +
Subjt:  RG--------YNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQENQPEKSGGK-ESMNQPANNWPENVPEADK--REEEREKVNKDYLTTIEHGLSTTTAM

Query:  TE---GKNQSNEE-----------------------EKGKRKGENNQEEGNDKKNRNLDL-TYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIV
        T    G+N SN+E                       EK     E NQ  G  KK+ ++D+  YS      +        R G  GGL LLWK  ++V I 
Subjt:  TE---GKNQSNEE-----------------------EKGKRKGENNQEEGNDKKNRNLDL-TYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIV

Query:  SYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSK
        S++ GHID V+KD     WRFTGFYG  I   R+ S +LL+RL   +N  LPW+V GDFNEIL   EK+GG  +    M + REA+  C L+D G  G+K
Subjt:  SYSSGHIDLVVKD-PKFFWRFTGFYGNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSK

Query:  FTWFRGRSKKNKVKERLDR
        +TW   + K   ++ER+DR
Subjt:  FTWFRGRSKKNKVKERLDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGAGGAGAGGCAACATTTGGTTTGCCAAGAGGGATCCAGTGGTGATCAACAAGGATGTAATCATAAAGGAAATACACAAGAAATGGAAGAAGGGGCCCATGCAGA
TAGCTTGAATGAACAAATTGAAAAGCTCAGCTTAGAAGAAGTTGAAAAAAGACAAATATTATGCCTCGAATCTGAGGTGTGGAAGGGAGAATTAGGATCGAGAAATCAGG
GACGAACTTATTCTTATGCAAGTTCTGAAACCAGAGAGACAAAACCAGAATCTGGAGAGGAAATCCGTGGAGCTTTGATGACTCGACAATCATTCTTGAAGAGTCGAGAG
GAGACCGTAGTGTTGAAGAGCTGGATTTCAGGTACGCATATTTTTGGATTCACTTCCACGGCTTACCACGGGTTTGCTTTTGCAGGAAATACACAGAAGCATTGGAAAAC
TACATTGGAAATTTTGACTCGGCCGAGGAAGATGAGGGGCAAGATGACCGGAGAAACGTTGAGAGTGAAGATTAAGATATACATTTCTGAGCCTCTAAAAAGGGACACTA
ACATTCAAATTGGGTCCAAAGCCGAGAAAAAATGGGTCCCAATCATGTACGAGAAGCTCCCGGACTTCTGCTACAGTTGCCGGAGGCTAGGCCATGTATCTCAAGAGTGT
GAGGCTGAAAAAGAAGGGCAGAACGAAGAGAAGAAATACGGAGTGGCCTTAAGAGAGACACAAGGGAGTAAAGGTTTTTACAGGGGATGGAGACCCGAAGGCAGGGGATA
CAATCCGAGAGGAAGAGGTAAAGGTAGAAGTTACAGCTGGAGAAGAGATGGAGGGAAGACGCAAAGGGACAGTGACCTTCAAGAAAATGTCCAAGAAAACCAGCCGGAAA
AGTCAGGGGGAAAGGAAAGTATGAACCAACCGGCAAACAATTGGCCGGAAAATGTGCCAGAGGCCGACAAGAGAGAGGAGGAAAGAGAGAAAGTCAATAAGGACTATCTG
ACAACGATTGAACATGGGCTTAGTACGACAACTGCCATGACAGAGGGAAAAAATCAGAGCAACGAGGAAGAAAAAGGAAAAAGAAAAGGGGAGAACAATCAAGAAGAAGG
GAATGACAAGAAGAATAGGAACTTGGATCTGACTTACTCCGGAGTGCCTGGAGGGATATTGGCGGAGGCTGGTTGCCAGCCCTGCCGGACCGGCAGTAGTGGCGGGTTAA
TTTTATTATGGAAGGAAGGTTTGGATGTTAGAATAGTTTCGTATTCGTCGGGTCATATAGATTTAGTTGTTAAAGATCCTAAATTTTTTTGGAGATTTACAGGATTTTAT
GGCAATCTCATTGCTGAGAGGAGGGTCGAGTCTTCGACTCTTCTTAAGAGGTTAAGCGATTATGCCAACGTGGGCCTCCCCTGGATGGTGGGAGGGGATTTCAATGAGAT
TTTATACAGTCACGAGAAAGAAGGTGGGGCTTCCAAGCGGGCGCAGCTCATGGAGAACCTTCGAGAGGCTATAAGTCATTGCAACCTCCTAGATCGGGGTAACAATGGTA
GCAAATTCACGTGGTTTAGAGGCAGATCGAAAAAGAACAAAGTTAAAGAAAGATTGGATAGATTTTTAATTAACTCCACTCTTCAGTCTGTGGTTAGCAGCCTGTTGGTG
GTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGAGGAGAGGCAACATTTGGTTTGCCAAGAGGGATCCAGTGGTGATCAACAAGGATGTAATCATAAAGGAAATACACAAGAAATGGAAGAAGGGGCCCATGCAGA
TAGCTTGAATGAACAAATTGAAAAGCTCAGCTTAGAAGAAGTTGAAAAAAGACAAATATTATGCCTCGAATCTGAGGTGTGGAAGGGAGAATTAGGATCGAGAAATCAGG
GACGAACTTATTCTTATGCAAGTTCTGAAACCAGAGAGACAAAACCAGAATCTGGAGAGGAAATCCGTGGAGCTTTGATGACTCGACAATCATTCTTGAAGAGTCGAGAG
GAGACCGTAGTGTTGAAGAGCTGGATTTCAGGTACGCATATTTTTGGATTCACTTCCACGGCTTACCACGGGTTTGCTTTTGCAGGAAATACACAGAAGCATTGGAAAAC
TACATTGGAAATTTTGACTCGGCCGAGGAAGATGAGGGGCAAGATGACCGGAGAAACGTTGAGAGTGAAGATTAAGATATACATTTCTGAGCCTCTAAAAAGGGACACTA
ACATTCAAATTGGGTCCAAAGCCGAGAAAAAATGGGTCCCAATCATGTACGAGAAGCTCCCGGACTTCTGCTACAGTTGCCGGAGGCTAGGCCATGTATCTCAAGAGTGT
GAGGCTGAAAAAGAAGGGCAGAACGAAGAGAAGAAATACGGAGTGGCCTTAAGAGAGACACAAGGGAGTAAAGGTTTTTACAGGGGATGGAGACCCGAAGGCAGGGGATA
CAATCCGAGAGGAAGAGGTAAAGGTAGAAGTTACAGCTGGAGAAGAGATGGAGGGAAGACGCAAAGGGACAGTGACCTTCAAGAAAATGTCCAAGAAAACCAGCCGGAAA
AGTCAGGGGGAAAGGAAAGTATGAACCAACCGGCAAACAATTGGCCGGAAAATGTGCCAGAGGCCGACAAGAGAGAGGAGGAAAGAGAGAAAGTCAATAAGGACTATCTG
ACAACGATTGAACATGGGCTTAGTACGACAACTGCCATGACAGAGGGAAAAAATCAGAGCAACGAGGAAGAAAAAGGAAAAAGAAAAGGGGAGAACAATCAAGAAGAAGG
GAATGACAAGAAGAATAGGAACTTGGATCTGACTTACTCCGGAGTGCCTGGAGGGATATTGGCGGAGGCTGGTTGCCAGCCCTGCCGGACCGGCAGTAGTGGCGGGTTAA
TTTTATTATGGAAGGAAGGTTTGGATGTTAGAATAGTTTCGTATTCGTCGGGTCATATAGATTTAGTTGTTAAAGATCCTAAATTTTTTTGGAGATTTACAGGATTTTAT
GGCAATCTCATTGCTGAGAGGAGGGTCGAGTCTTCGACTCTTCTTAAGAGGTTAAGCGATTATGCCAACGTGGGCCTCCCCTGGATGGTGGGAGGGGATTTCAATGAGAT
TTTATACAGTCACGAGAAAGAAGGTGGGGCTTCCAAGCGGGCGCAGCTCATGGAGAACCTTCGAGAGGCTATAAGTCATTGCAACCTCCTAGATCGGGGTAACAATGGTA
GCAAATTCACGTGGTTTAGAGGCAGATCGAAAAAGAACAAAGTTAAAGAAAGATTGGATAGATTTTTAATTAACTCCACTCTTCAGTCTGTGGTTAGCAGCCTGTTGGTG
GTATGA
Protein sequenceShow/hide protein sequence
MDEERQHLVCQEGSSGDQQGCNHKGNTQEMEEGAHADSLNEQIEKLSLEEVEKRQILCLESEVWKGELGSRNQGRTYSYASSETRETKPESGEEIRGALMTRQSFLKSRE
ETVVLKSWISGTHIFGFTSTAYHGFAFAGNTQKHWKTTLEILTRPRKMRGKMTGETLRVKIKIYISEPLKRDTNIQIGSKAEKKWVPIMYEKLPDFCYSCRRLGHVSQEC
EAEKEGQNEEKKYGVALRETQGSKGFYRGWRPEGRGYNPRGRGKGRSYSWRRDGGKTQRDSDLQENVQENQPEKSGGKESMNQPANNWPENVPEADKREEEREKVNKDYL
TTIEHGLSTTTAMTEGKNQSNEEEKGKRKGENNQEEGNDKKNRNLDLTYSGVPGGILAEAGCQPCRTGSSGGLILLWKEGLDVRIVSYSSGHIDLVVKDPKFFWRFTGFY
GNLIAERRVESSTLLKRLSDYANVGLPWMVGGDFNEILYSHEKEGGASKRAQLMENLREAISHCNLLDRGNNGSKFTWFRGRSKKNKVKERLDRFLINSTLQSVVSSLLV
V