; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003280 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003280
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr11:19657596..19662897
RNA-Seq ExpressionHG10003280
SyntenyHG10003280
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5464386.1 hypothetical protein F2P56_014464 [Juglans regia]1.8e-3635.87Show/hide
Query:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQK------------------------------------EVQLFV---IRSSFAVTL---LVAYRSWRFMGVY
        RGLGNPR  R+L DLVR  +P VLFL E K                                    +V + +    RS    TL     A  SW F GVY
Subjt:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQK------------------------------------EVQLFV---IRSSFAVTL---LVAYRSWRFMGVY

Query:  GYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRN-GNLIWERLDRFLCNDDFW
        G P+ +R+H  W+LL  L   +   W++ GD NE+L  N K GG  R    + AF E +D C L+D+GFKG+P+TW  +R     I ERLDR L N  + 
Subjt:  GYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRN-GNLIWERLDRFLCNDDFW

Query:  SLFCQISISHLGWLFPDHRPIMVSLEISTRERKRFRPFRFEEVWALNPDCKNIIMKGDHWELPSSVVLVLLFEMRR
        +LF   S+SH    + DH PI + L+++   RK  + FRFE +W     CK II K          +L ++ ++RR
Subjt:  SLFCQISISHLGWLFPDHRPIMVSLEISTRERKRFRPFRFEEVWALNPDCKNIIMKGDHWELPSSVVLVLLFEMRR

XP_030923204.1 uncharacterized protein LOC115950094 [Quercus lobata]2.8e-3730.24Show/hide
Query:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQKEVQ--------------LFVIR-----------------------SSFAVTLLV---AYRSWRFMGVYGY
        RGLGN RA   L DL+++ +P ++FLSE    Q              LF +                        SS+ + ++V   +  +WR    YG 
Subjt:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQKEVQ--------------LFVIR-----------------------SSFAVTLLV---AYRSWRFMGVYGY

Query:  PESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLF
        P+++ ++  W++L  L S   +PW   GD NE+L    K GGPPRA   + +F++ LD+C  VD+GF G  +TW GR  G LIWERLDR + N ++ + F
Subjt:  PESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLF

Query:  CQISISHLGWLFPDHRPIMVSLEIS-TRERKRFRPFRFEEVWALNPDCKNIIMKGDHWELPSSVVLVLLFEMRRAKFWLFQSQRLNRDYFPPLAELFAIK
            I HL     DH PI++SL+ +   +R + +PF FE +W  +P+CK ++          + ++V   ++++ K  L      NRD+F  + +    K
Subjt:  CQISISHLGWLFPDHRPIMVSLEIS-TRERKRFRPFRFEEVWALNPDCKNIIMKGDHWELPSSVVLVLLFEMRRAKFWLFQSQRLNRDYFPPLAELFAIK

Query:  NGLNLALQTDATHLKVESDCILAINLLNGKQHVV
              L   A  + V + C+  +N L  + +V+
Subjt:  NGLNLALQTDATHLKVESDCILAINLLNGKQHVV

XP_030924743.1 uncharacterized protein LOC115951733 [Quercus lobata]3.3e-3838.05Show/hide
Query:  RGLGNPRAFRSLCDLVRSNNPDVLFLSE-------QKEVQLFVIRSSF------AVTLLVAYRSWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIG
        R LGN RA + L D+V++ +P ++FLSE        K   + V   SF      ++    +  +WRF G YG P++ R+   W++L  L S   +PW   
Subjt:  RGLGNPRAFRSLCDLVRSNNPDVLFLSE-------QKEVQLFVIRSSF------AVTLLVAYRSWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIG

Query:  GDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPIMVSLEISTR
        GD  E+L  + K GG PRA   +  F++ALD C  VD+GF G  +TW GRR G  IWERLDR + N ++ + F    + HL     DHRPI++SL+ ++ 
Subjt:  GDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPIMVSLEISTR

Query:  ERKRFRPFRFEEVWALNPDCKNIIMK
         +KR +PFRFE +W  + +CK ++ +
Subjt:  ERKRFRPFRFEEVWALNPDCKNIIMK

XP_030934533.1 uncharacterized protein LOC115959984 [Quercus lobata]2.8e-3735.98Show/hide
Query:  RGLGNPRAFRSLCDLVRSNNPDVLFLSE---QKEVQLFV-----IRSSFAVTL---------------------LVAY-----------RSWRFMGVYGY
        RGLGN RA + L D+V++ +P ++FLSE    KE   +V         F VT+                        Y           ++WR +G YG 
Subjt:  RGLGNPRAFRSLCDLVRSNNPDVLFLSE---QKEVQLFV-----IRSSFAVTL---------------------LVAY-----------RSWRFMGVYGY

Query:  PESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLF
        P+++ +   W++L  LGS   +PW   GD NE+L  + K GGPPRA H +  F+E LD C  VD+ F G  +TW GRR G LIWERLDR + N ++ + F
Subjt:  PESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLF

Query:  CQISISHLGWLFPDHRPIMVSLEISTRERK-RFRPFRFEEVWALNPDCKNIIMKGDHWELPSSV
            I HL     DH PI++SL+ S   +K   +PFRFE +W L+  C+++I +   W+    V
Subjt:  CQISISHLGWLFPDHRPIMVSLEISTRERK-RFRPFRFEEVWALNPDCKNIIMKGDHWELPSSV

XP_030969676.1 uncharacterized protein LOC115989953 [Quercus lobata]2.5e-3831.14Show/hide
Query:  RGLGNPRAFRSLCDLVRSNNPDVLFLSE-------QKEVQ-------LFVIR-----------------------SSFAVTLLV---AYRSWRFMGVYGY
        RGLGN RA   L DL+++ +P ++FLSE        K V+       LF +                        SS+ + ++V   +  +WR  G YG 
Subjt:  RGLGNPRAFRSLCDLVRSNNPDVLFLSE-------QKEVQ-------LFVIR-----------------------SSFAVTLLV---AYRSWRFMGVYGY

Query:  PESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLF
        P+++ ++  W++L  L S   +PW   GD NE+L    K GGPPRA   + +F++ LD+C  VD+GF G  +TW GR  G LIWERLD  + N ++ + F
Subjt:  PESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLF

Query:  CQISISHLGWLFPDHRPIMVSLEIS-TRERKRFRPFRFEEVWALNPDCKNIIMKGDHWELPSSVVLVLLFEMRRAKFWLFQSQRLNRDYFPPLAELFAIK
            I HL     DHRPI++SL+ +   +R + +PFRFE +W  +P+CK ++          + ++V   ++++ K  L   +  NRD+F  + +    K
Subjt:  CQISISHLGWLFPDHRPIMVSLEIS-TRERKRFRPFRFEEVWALNPDCKNIIMKGDHWELPSSVVLVLLFEMRRAKFWLFQSQRLNRDYFPPLAELFAIK

Query:  NGLNLALQTDATHLKVESDCILAINLLNGKQHVV
              L   A  + V + C+  +N L  + +V+
Subjt:  NGLNLALQTDATHLKVESDCILAINLLNGKQHVV

TrEMBL top hitse value%identityAlignment
A0A2N9G5I8 Reverse transcriptase domain-containing protein1.3e-3537.44Show/hide
Query:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQ--KEVQLFVIRSS--FAVTLLVAY---------RSWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIG
        RGLGNP   + L  LVR  +P VLF+ E    E +L V+R    F++   + +          +WRF G YG PE+ R+HL W LL        +PW+  
Subjt:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQ--KEVQLFVIRSS--FAVTLLVAY---------RSWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIG

Query:  GDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRR-NGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPI-MVSLEIS
        GD NE+L  + K GGP R+   +  F++A+D C  +D+G++G P+TW   R +   +WERLDR L    ++++F +  + HL     DH PI +V+    
Subjt:  GDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRR-NGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPI-MVSLEIS

Query:  TRERKRFRPFRFEEVWALNPDCKNIIM
        T    + R FRFEEVW  NP C+  +M
Subjt:  TRERKRFRPFRFEEVWALNPDCKNIIM

A0A2N9H1U1 Reverse transcriptase domain-containing protein6.6e-3734.22Show/hide
Query:  KTWKKRLRASTDSPPGGRQRCPLKRKGVIVEGGGEGKSK--------RGLGNPRAFRSLCDLVRSNNPDVLFLSEQ--KEVQLFVIRSS--FAVTLLVAY
        K+WK++  +S  +P       PLKR G          +         RGLGNP   + L  LVR  +P VLF+ E    E +L V+R    F+  L+V+ 
Subjt:  KTWKKRLRASTDSPPGGRQRCPLKRKGVIVEGGGEGKSK--------RGLGNPRAFRSLCDLVRSNNPDVLFLSEQ--KEVQLFVIRSS--FAVTLLVAY

Query:  R------------------------------------SWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAF
        R                                    +WRF G YG PE+ R+HL W LL  L     +PW+  GD NE+L  + K GGP R+   +  F
Subjt:  R------------------------------------SWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAF

Query:  KEALDVCHLVDMGFKGSPYTWFGRR-NGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPI-MVSLEISTRERKRFRPFRFEEVWALNPDCKNII
        ++A+DVC  +D+G++G P+TW   R +   +WERLDR L    +++ F +  I HL     DH PI +V+    T  R + RPFRFEEVW  NP C+  +
Subjt:  KEALDVCHLVDMGFKGSPYTWFGRR-NGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPI-MVSLEISTRERKRFRPFRFEEVWALNPDCKNII

Query:  M
        M
Subjt:  M

A0A2N9H936 Uncharacterized protein2.5e-3635.44Show/hide
Query:  NIKMTTKTWKKRLRASTDSPPGGRQRCPLKRKGVIVEGGGEGKSK-----------RGLGNPRAFRSLCDLVRSNNPDVLFLSEQK--EVQLFVIR----
        N K+   TWK+  R +    P   +  P K+K  I  GGG   +            RGLGNP+    L  +VR  +P VLFLSE K  E QL V+R    
Subjt:  NIKMTTKTWKKRLRASTDSPPGGRQRCPLKRKGVIVEGGGEGKSK-----------RGLGNPRAFRSLCDLVRSNNPDVLFLSEQK--EVQLFVIR----

Query:  --SSFAVT-------LLVAYRS-------------------------WRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPP
            F V        L + +RS                         WRF G YG P  + K   WDLL  L  ++ +PW+ GGD NEIL    K G   
Subjt:  --SSFAVT-------LLVAYRS-------------------------WRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPP

Query:  RAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNL-IWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPIMVSLEISTRE-RKRFRPFRFEEVWA
        R    +SAF+  +D C  VD+GF GSPYTW+ ++ G   + ERLDR L   D+   F    ++HL  +F DHRP+ V L +  R+ R R + FRFEE+W 
Subjt:  RAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNL-IWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPIMVSLEISTRE-RKRFRPFRFEEVWA

Query:  LNPDCKNIIMKGDHWE
        ++  C++ I     WE
Subjt:  LNPDCKNIIMKGDHWE

A0A2N9HW04 Reverse transcriptase domain-containing protein6.6e-3734.22Show/hide
Query:  KTWKKRLRASTDSPPGGRQRCPLKRKGVIVEGGGEGKSK--------RGLGNPRAFRSLCDLVRSNNPDVLFLSEQ--KEVQLFVIRSS--FAVTLLVAY
        K+WK++  +S  +P       PLKR G          +         RGLGNP   + L  LVR  +P VLF+ E    E +L V+R    F+  L+V+ 
Subjt:  KTWKKRLRASTDSPPGGRQRCPLKRKGVIVEGGGEGKSK--------RGLGNPRAFRSLCDLVRSNNPDVLFLSEQ--KEVQLFVIRSS--FAVTLLVAY

Query:  R------------------------------------SWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAF
        R                                    +WRF G YG PE+ R+HL W LL  L     +PW+  GD NE+L  + K GGP R+   +  F
Subjt:  R------------------------------------SWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAF

Query:  KEALDVCHLVDMGFKGSPYTWFGRR-NGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPI-MVSLEISTRERKRFRPFRFEEVWALNPDCKNII
        ++A+DVC  +D+G++G P+TW   R +   +WERLDR L    +++ F +  I HL     DH PI +V+    T  R + RPFRFEEVW  NP C+  +
Subjt:  KEALDVCHLVDMGFKGSPYTWFGRR-NGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPI-MVSLEISTRERKRFRPFRFEEVWALNPDCKNII

Query:  M
        M
Subjt:  M

A0A2N9IYL5 RNase H domain-containing protein2.1e-3537.91Show/hide
Query:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQKEVQLFVIRSSFAVTLLVAYRSWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKL
        RGLGNPR  + +  LVR+ +  V+FL E  + +  V R  +A+       +WRF G YG PE+ ++   W+LL RL +   VPW   GD NE++    K 
Subjt:  RGLGNPRAFRSLCDLVRSNNPDVLFLSEQKEVQLFVIRSSFAVTLLVAYRSWRFMGVYGYPESARKHLKWDLLSRLGSNNVVPWMIGGDLNEILCNNVKL

Query:  GGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPIMVSLEISTRERKRFRPFRFEEV
        G   R+   +   ++ LD C  VD+GF G  +TW   R G++ WERLDR +   ++   F    + HL   + DH+PI VS E     ++  RPFRFEEV
Subjt:  GGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPIMVSLEISTRERKRFRPFRFEEV

Query:  WALNPDCKNII
        W  +  C+N+I
Subjt:  WALNPDCKNII

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.5e-0437.18Show/hide
Query:  PPLAELFAIKNGLNLALQTDATHLKVESDCILAINLLNGKQHVVNEIGCCVEEILRLSKEFRAVSFKYINRDSNKVAD
        P  AE +AIK+ +  ALQ + + L V SD    ++ LN     +NEI   + EI  +   FR++SF++I R  N +AD
Subjt:  PPLAELFAIKNGLNLALQTDATHLKVESDCILAINLLNGKQHVVNEIGCCVEEILRLSKEFRAVSFKYINRDSNKVAD

AT1G43760.1 DNAse I-like superfamily protein1.7e-0831.41Show/hide
Query:  SWRFMGVYGYPESARKHLKWD-LLSRLGSNNVVPWMI-GGDLNEILCNN-----VKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLI
        SWR    Y   E  R  + WD  +S L        MI  GD ++I   +     ++   P R    L  F+  L    LVD+  +G  YTW   ++ N I
Subjt:  SWRFMGVYGYPESARKHLKWD-LLSRLGSNNVVPWMI-GGDLNEILCNN-----VKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLI

Query:  WERLDRFLCNDDFWSLF-CQISISHLGWLFPDHRPIMVSLE-ISTRERKRFRPFRF
          +LDR + N D++S F   I++  L  +  DH P ++ LE +  R +K FR F F
Subjt:  WERLDRFLCNDDFWSLF-CQISISHLGWLFPDHRPIMVSLE-ISTRERKRFRPFRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGAAAGGATAGTTGCGGATTTGGATTCGTTGAAATTAACAGGGGAAGAGGAAAACACTTTAATTCCTCTAGATCCTGAGATTGCTCTTGAGATAGATGGACGTCT
TAATCATTGCCTGGGCACTGACGCCCAGATGGAAGTGGTTTCTGAAGAACCGATAACTGATGTCCTTCAGTCTTCTTTCCAAAAGGCCGTTAATAAGGCCCCTTCTTTTT
CGTCTCCAGGCACGGGCTCAGTTCCCTCGCCGGTTTTATTCGAGTCTTTGGAAGAGGAATCCCAAAAGGAAAGAGAGCTAGGTGTAACAAACATCAAGATGACCACCAAA
ACCTGGAAGAAACGTCTGAGAGCTTCGACGGATTCTCCTCCTGGAGGGAGACAAAGATGTCCTTTAAAAAGGAAGGGTGTCATTGTTGAGGGTGGAGGAGAAGGAAAAAG
CAAAAGGGGCTTAGGCAACCCTCGTGCATTCCGATCTTTATGTGATTTAGTCCGTTCGAATAATCCTGATGTTCTTTTCCTCTCTGAGCAAAAGGAGGTTCAACTCTTTG
TAATAAGATCAAGTTTCGCTGTAACTTTGCTGGTTGCTTACCGGTCTTGGAGATTTATGGGTGTGTATGGGTACCCTGAGTCAGCTAGGAAGCATCTTAAATGGGACTTA
TTATCCAGGTTAGGGTCGAATAACGTTGTTCCTTGGATGATCGGGGGCGACCTTAATGAAATTCTTTGTAATAATGTGAAGTTAGGTGGTCCTCCCAGAGCTGTTCATTA
CCTTAGTGCCTTCAAAGAAGCGTTGGATGTGTGTCATCTTGTTGATATGGGCTTTAAAGGTTCTCCTTACACTTGGTTTGGCAGAAGGAATGGGAACTTAATCTGGGAAA
GGCTTGATAGGTTTCTCTGTAATGATGATTTCTGGTCTCTATTTTGCCAAATATCTATCAGTCATCTGGGCTGGCTTTTTCCTGATCATAGACCTATTATGGTTTCATTA
GAGATTTCAACAAGGGAAAGGAAGAGATTTAGACCCTTTAGGTTTGAGGAAGTTTGGGCTCTCAACCCAGATTGCAAAAATATTATAATGAAGGGTGACCACTGGGAGCT
CCCAAGTTCTGTAGTCTTGGTGCTGTTATTCGAGATGAGAAGGGCAAAGTTTTGGCTATTTCAGTCTCAGAGATTAAACAGAGACTACTTTCCCCCTTTAGCTGAATTAT
TTGCAATTAAAAATGGACTGAATTTAGCTCTTCAAACAGATGCTACTCATTTGAAAGTTGAATCTGATTGCATCCTGGCTATTAACCTATTGAATGGAAAACAACATGTT
GTTAACGAGATTGGCTGCTGCGTTGAGGAAATTCTCAGATTGAGCAAAGAGTTTAGAGCTGTTTCTTTTAAATATATTAATAGGGATAGTAACAAAGTCGCTGATCGTCT
TTTGCTAAAAATGCAAGGAATAGTGATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTGAAAGGATAGTTGCGGATTTGGATTCGTTGAAATTAACAGGGGAAGAGGAAAACACTTTAATTCCTCTAGATCCTGAGATTGCTCTTGAGATAGATGGACGTCT
TAATCATTGCCTGGGCACTGACGCCCAGATGGAAGTGGTTTCTGAAGAACCGATAACTGATGTCCTTCAGTCTTCTTTCCAAAAGGCCGTTAATAAGGCCCCTTCTTTTT
CGTCTCCAGGCACGGGCTCAGTTCCCTCGCCGGTTTTATTCGAGTCTTTGGAAGAGGAATCCCAAAAGGAAAGAGAGCTAGGTGTAACAAACATCAAGATGACCACCAAA
ACCTGGAAGAAACGTCTGAGAGCTTCGACGGATTCTCCTCCTGGAGGGAGACAAAGATGTCCTTTAAAAAGGAAGGGTGTCATTGTTGAGGGTGGAGGAGAAGGAAAAAG
CAAAAGGGGCTTAGGCAACCCTCGTGCATTCCGATCTTTATGTGATTTAGTCCGTTCGAATAATCCTGATGTTCTTTTCCTCTCTGAGCAAAAGGAGGTTCAACTCTTTG
TAATAAGATCAAGTTTCGCTGTAACTTTGCTGGTTGCTTACCGGTCTTGGAGATTTATGGGTGTGTATGGGTACCCTGAGTCAGCTAGGAAGCATCTTAAATGGGACTTA
TTATCCAGGTTAGGGTCGAATAACGTTGTTCCTTGGATGATCGGGGGCGACCTTAATGAAATTCTTTGTAATAATGTGAAGTTAGGTGGTCCTCCCAGAGCTGTTCATTA
CCTTAGTGCCTTCAAAGAAGCGTTGGATGTGTGTCATCTTGTTGATATGGGCTTTAAAGGTTCTCCTTACACTTGGTTTGGCAGAAGGAATGGGAACTTAATCTGGGAAA
GGCTTGATAGGTTTCTCTGTAATGATGATTTCTGGTCTCTATTTTGCCAAATATCTATCAGTCATCTGGGCTGGCTTTTTCCTGATCATAGACCTATTATGGTTTCATTA
GAGATTTCAACAAGGGAAAGGAAGAGATTTAGACCCTTTAGGTTTGAGGAAGTTTGGGCTCTCAACCCAGATTGCAAAAATATTATAATGAAGGGTGACCACTGGGAGCT
CCCAAGTTCTGTAGTCTTGGTGCTGTTATTCGAGATGAGAAGGGCAAAGTTTTGGCTATTTCAGTCTCAGAGATTAAACAGAGACTACTTTCCCCCTTTAGCTGAATTAT
TTGCAATTAAAAATGGACTGAATTTAGCTCTTCAAACAGATGCTACTCATTTGAAAGTTGAATCTGATTGCATCCTGGCTATTAACCTATTGAATGGAAAACAACATGTT
GTTAACGAGATTGGCTGCTGCGTTGAGGAAATTCTCAGATTGAGCAAAGAGTTTAGAGCTGTTTCTTTTAAATATATTAATAGGGATAGTAACAAAGTCGCTGATCGTCT
TTTGCTAAAAATGCAAGGAATAGTGATTTAA
Protein sequenceShow/hide protein sequence
MTERIVADLDSLKLTGEEENTLIPLDPEIALEIDGRLNHCLGTDAQMEVVSEEPITDVLQSSFQKAVNKAPSFSSPGTGSVPSPVLFESLEEESQKERELGVTNIKMTTK
TWKKRLRASTDSPPGGRQRCPLKRKGVIVEGGGEGKSKRGLGNPRAFRSLCDLVRSNNPDVLFLSEQKEVQLFVIRSSFAVTLLVAYRSWRFMGVYGYPESARKHLKWDL
LSRLGSNNVVPWMIGGDLNEILCNNVKLGGPPRAVHYLSAFKEALDVCHLVDMGFKGSPYTWFGRRNGNLIWERLDRFLCNDDFWSLFCQISISHLGWLFPDHRPIMVSL
EISTRERKRFRPFRFEEVWALNPDCKNIIMKGDHWELPSSVVLVLLFEMRRAKFWLFQSQRLNRDYFPPLAELFAIKNGLNLALQTDATHLKVESDCILAINLLNGKQHV
VNEIGCCVEEILRLSKEFRAVSFKYINRDSNKVADRLLLKMQGIVI