; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg013600 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg013600
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold2:24474591..24492500
RNA-Seq ExpressionSpg013600
SyntenySpg013600
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]2.7e-2030.74Show/hide
Query:  AEEVIQRQLENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWS
        +E  I +  ENL LE+E+   V +I +D I + D+D    L  K+L+ + +N + F  L+ +IW     V +E  G N F+  F ++ ++N+V   GPW 
Subjt:  AEEVIQRQLENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWS

Query:  FA----------------------------------------TRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMAD
        F                                         T  W     L   IG+ V   + E+ +  G+ +RVKVQ+DI KPL+R   IK G   +
Subjt:  FA----------------------------------------TRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMAD

Query:  KAWIRVTYEKLPDFCYFCGKLGHVVQECEEE
           + + YE+LPDFC+ CG++GH V+EC +E
Subjt:  KAWIRVTYEKLPDFCYFCGKLGHVVQECEEE

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]8.3e-2232.73Show/hide
Query:  GKEDTTIAILVLWNIWNARNITNLNNQQPDLDQLQ----RNILN------NIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVG
        G+E+   ++++ W IW  RN +      P+   +Q    R I+N      N++   +N+D        +N  +   WKPP  + WKLN +A+W      G
Subjt:  GKEDTTIAILVLWNIWNARNITNLNNQQPDLDQLQ----RNILN------NIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVG

Query:  GLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVS
        G+GW +RD  G +I A C+  +   +I  LE  AI EG++A      I    C+ P+ +ESDS+EA+  L+R  +DQ+EI  ++EEI      + ++S+ 
Subjt:  GLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVS

Query:  KCSRSENRMAHNLAQAAASN
          SR  N++AH LA+ A  N
Subjt:  KCSRSENRMAHNLAQAAASN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]2.4e-2133.98Show/hide
Query:  EDTTIAILVLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGS
        ED  + ++  W IWN RN      +      + + +   + E  S Q ++S +   + L + L W+PP    W LNADASW +    GG+GW IR  +G 
Subjt:  EDTTIAILVLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGS

Query:  LIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVSKCSRSENRMAHN
        ++ AG +  +   ++K LEA AILEG++      ++   G   PL +E+DS E    LNR  ED ++   +VEEI N      +L+ +K  R  N  AH+
Subjt:  LIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVSKCSRSENRMAHN

Query:  LAQAAA
        LAQ A+
Subjt:  LAQAAA

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]1.3e-2232.88Show/hide
Query:  GKEDTTIAILVLWNIWNARN---ITNLNNQQPDLD-QLQRNILN------NIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVG
        G+E+   ++++ W IW  RN      ++++  D+   + R I+N      N++   +N+D       G+N  +   WKPP  + WKLN DA+W      G
Subjt:  GKEDTTIAILVLWNIWNARN---ITNLNNQQPDLD-QLQRNILN------NIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVG

Query:  GLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSH--GDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLS
        G+GW +RD  G +I A C+  +   +I  LE  AI EG++A        I+   C+ P+ +ESDS+EA+  L+R  +DQ+EI  ++EEI      + ++S
Subjt:  GLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSH--GDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLS

Query:  VSKCSRSENRMAHNLAQAAASN
        +   SR  N++AH+LA+ A  N
Subjt:  VSKCSRSENRMAHNLAQAAASN

XP_035547259.1 uncharacterized protein LOC118348856 [Juglans regia]1.5e-2334Show/hide
Query:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF--------
        ++L L E+E+G  + +E   ++         L   +L+ R  N +   ++M K+W  +  V  +  G NLFL +F   R K R+L EGPWSF        
Subjt:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF--------

Query:  -ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPDFCYFCGKLGHVVQECEEEGSGGCSKLD
            ++K    +GN IG     + +++    GE LR++V+++I++PL RG  +K G      WIR  YE+LP+FCY CGKLGH  ++CE  G+ G    D
Subjt:  -ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPDFCYFCGKLGHVVQECEEEGSGGCSKLD

TrEMBL top hitse value%identityAlignment
A0A2K3PIT6 Ribonuclease H (Fragment)1.7e-2034.26Show/hide
Query:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF--------
        EN   +EEE   VV   DD +   D+ F  SL  K+ +    N + F  ++ + W L   V I+   +NLFL +F  +R    VLR GPWSF        
Subjt:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF--------

Query:  ---------------------------ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPDF
                                     RS   A+ LGN+IG+FV A+S ++G   G+ LRVKV +D+ KPL+RGT +       +  +   YE+LP+F
Subjt:  ---------------------------ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPDF

Query:  CYFCGKLGHVVQECEE
        CY CG++GH +++CEE
Subjt:  CYFCGKLGHVVQECEE

A0A5C7GU64 CCHC-type domain-containing protein1.3e-2030.74Show/hide
Query:  AEEVIQRQLENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWS
        +E  I +  ENL LE+E+   V +I +D I + D+D    L  K+L+ + +N + F  L+ +IW     V +E  G N F+  F ++ ++N+V   GPW 
Subjt:  AEEVIQRQLENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWS

Query:  FA----------------------------------------TRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMAD
        F                                         T  W     L   IG+ V   + E+ +  G+ +RVKVQ+DI KPL+R   IK G   +
Subjt:  FA----------------------------------------TRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMAD

Query:  KAWIRVTYEKLPDFCYFCGKLGHVVQECEEE
           + + YE+LPDFC+ CG++GH V+EC +E
Subjt:  KAWIRVTYEKLPDFCYFCGKLGHVVQECEEE

A0A6J1BSZ1 uncharacterized protein LOC1110054813.7e-2032.5Show/hide
Query:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLS-ENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF-------
        +N  L  EE    VDI+   ++ T K    SL CK+LS R+I+     + +   W L  +   ++  G N+FL  F     +NR+LR GPW+F       
Subjt:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLS-ENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF-------

Query:  ----------------------------ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPD
                                    A  +   A  LGN+IG F   ES+ N    G  LRV+V+ D+ KPL RG  +         WI + YE+LPD
Subjt:  ----------------------------ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPD

Query:  FCYFCGKLGHVVQECEEEGSGGCSK-LDYGVDLRNTQGSK
        F Y CG+L H++++C +      SK L YG  LR  QG K
Subjt:  FCYFCGKLGHVVQECEEEGSGGCSK-LDYGVDLRNTQGSK

A0A6J1DNV9 uncharacterized protein LOC1110224031.2e-2133.98Show/hide
Query:  EDTTIAILVLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGS
        ED  + ++  W IWN RN      +      + + +   + E  S Q ++S +   + L + L W+PP    W LNADASW +    GG+GW IR  +G 
Subjt:  EDTTIAILVLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGS

Query:  LIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVSKCSRSENRMAHN
        ++ AG +  +   ++K LEA AILEG++      ++   G   PL +E+DS E    LNR  ED ++   +VEEI N      +L+ +K  R  N  AH+
Subjt:  LIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVSKCSRSENRMAHN

Query:  LAQAAA
        LAQ A+
Subjt:  LAQAAA

A0A6P9EID8 uncharacterized protein LOC1183488567.3e-2434Show/hide
Query:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF--------
        ++L L E+E+G  + +E   ++         L   +L+ R  N +   ++M K+W  +  V  +  G NLFL +F   R K R+L EGPWSF        
Subjt:  ENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSF--------

Query:  -ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPDFCYFCGKLGHVVQECEEEGSGGCSKLD
            ++K    +GN IG     + +++    GE LR++V+++I++PL RG  +K G      WIR  YE+LP+FCY CGKLGH  ++CE  G+ G    D
Subjt:  -ATRSWKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPDFCYFCGKLGHVVQECEEEGSGGCSKLD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein1.5e-0540.91Show/hide
Query:  PSC-WKLNADASWFEGKGVGGLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHG
        PSC  K N DAS  EG  V GLGW IR+S G+++  G  KF+   + +  E  A++  ++A S+ G
Subjt:  PSC-WKLNADASWFEGKGVGGLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHG

AT2G02650.1 Ribonuclease H-like superfamily protein2.3e-0624.88Show/hide
Query:  DTTIAILVLWNIWNARNITNLNN--QQPDLDQLQRNILNNIEECKSNQDKSSRAEH------GENLLSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWT
        D  +   ++W +W +RN+       Q PD  + ++ I +  E   +N+   +   H        +      W PP     K N D+ + +G      GWT
Subjt:  DTTIAILVLWNIWNARNITNLNN--QQPDLDQLQRNILNNIEECKSNQDKSSRAEH------GENLLSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWT

Query:  IRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLV-VESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVSKCS-
        IR+ NG ++  G  K + +      EA   L  ++   +HG        L  V  ESDS   V  +N + ED S +  ++ +I +      +L +  CS 
Subjt:  IRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCKLPLV-VESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVSKCS-

Query:  RSENRMAHNLAQAAASN
           NR  ++ A A AS+
Subjt:  RSENRMAHNLAQAAASN

AT2G33160.1 glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein3.0e-0628.69Show/hide
Query:  DKMQQGLGKEDTT-------IAILVLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENLLSHL-LWKPPDPSCWKLNADASWFE
        DKMQ      +TT       + + +LW +WN+RNI     +    +   +    +++E       S    HG  + SH+  W+ P     K N D S+  
Subjt:  DKMQQGLGKEDTT-------IAILVLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENLLSHL-LWKPPDPSCWKLNADASWFE

Query:  GKGVGGLGWTIRDSNGSLIGAG
        G   G  GW +RD NG    AG
Subjt:  GKGVGGLGWTIRDSNGSLIGAG

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)1.2e-0538.98Show/hide
Query:  WKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAI
        W PP     K N D+ + +G+      W IRDSNG +I +GC K ++++S   L+AEA+
Subjt:  WKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAI

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.7e-0726.56Show/hide
Query:  VLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENL--LSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGSLIGAGC
        ++W IW + N    N+ +          LN+ +E   N   + +     N     +  W PP     K N DAS  E   V GLGW +R+S G++I  G 
Subjt:  VLWNIWNARNITNLNNQQPDLDQLQRNILNNIEECKSNQDKSSRAEHGENL--LSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGSLIGAGC

Query:  KKFKRNWSIKCLEAEAILEGMKAYSSHG
         KF+   + +  E   ++  ++A    G
Subjt:  KKFKRNWSIKCLEAEAILEGMKAYSSHG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGGAAAGCGCAGAAGAGGTAATACAGAGGCAGTTGGAGAATCTGGGTTTGGAGGAGGAAGAAAGGGGAAGGGTGGTTGATATCGAAGATGATGACATTGACGA
GACGGACAAAGATTTCTTGAACTCTCTGGCGTGCAAAATTTTGTCCTGTCGAACAATTAATCCAAAAGGTTTTGCCAGCTTGATGCCAAAGATCTGGGGGCTCTCCGAAA
ATGTGCGAATAGAAAAAGCAGGGAGAAACCTTTTCTTATGCAAATTTCGACACCAGCGTTTCAAAAACAGAGTTCTACGAGAGGGACCTTGGAGTTTCGCGACTCGATCC
TGGAAGTATGCAATCGCCCTAGGGAACTCGATTGGCAAATTCGTCATGGCTGAATCGGATGAGAATGGGAAGATGTCGGGCGAGACTCTACGGGTCAAAGTTCAGATGGA
TATAAACAAACCGCTGAGGAGAGGAACGAATATCAAGACTGGTTCGATGGCTGATAAAGCGTGGATCAGAGTTACATATGAAAAGTTACCAGATTTTTGTTATTTCTGCG
GGAAGTTGGGCCATGTGGTTCAAGAATGTGAAGAGGAGGGAAGCGGCGGCTGCAGCAAATTGGATTACGGAGTAGATCTCAGAAACACCCAGGGAAGCAAAGGATACTAC
CGTGGAAAAAAGCCAGATCACAGGGAACCGAATCTGAGAGGAAGGGGAAGAGGTAACATGAACAGAGGCAGATCTGCGAAATGGGAGTCAGGGGGAGGAAGTGGAAAAGA
TGAAGACAACAGGACATGGCGACATTTAGGTGAAAATACCCAAGATGGAGAGGAAGAAAATCCATTCGAAAAGGAGGTTGCGGATCGCAACGATCTGAACGAGGAAGGAA
AAAAAGAATCACCGCTGAAGCTAGGGCCAAGCCCAAGCCCATTAGGTTTATTTTTAGCCACTCACGCAGCCCTAGGTCTTCATATTTTTCCTCTTCATCTCTTTTCTCCC
TCCTTCTCCAAGACAACCACGAGCAGCCCCCCATTCCCAGTTTCCATCCTCGACGGCCAGCAACGCCGAGACCCATCTCCGGCGTCCTCGAGCAGATCTCCGACCGGCCC
TCTCCCTCACTCGCGTGTTCCCTCCGGCAAGTCCCGGCGTTGCAGCGGCGTGGGTGCGGTTTCAGCAGCAGTTCAGCGTGTCCAGCGGGTCTCCGGCGACAGAGGCCGCA
ACGGCGACGACGACTCGGCCTCAGCGGCGGCGGCGTCTCGTCTCAGCAGCAGCGGCGCACGGCATTCGCAGCAGGTTCACGTTCAACGGGTGGTGGATCTACACAGCTTT
GCGGCGGCAGTGACGAGAGCTCCGACGGGCTCGTGCTTTCCGGCGTCTCCCACAGCAGATCGACGGTGGTTGGGTTGCAGCAGCGTTTGGGGGGTCGTTTTCTCCGTTTT
TCCGACGTTTCGCGTAGTGGGACGTCGGCCGCTCCGCATCGCAGGCACGTGGCGACAAAATGAATGCAGGTATTTTACAAACCCGATTGACATATGGGACAAGATGCAGC
AGGGGTTGGGAAAAGAAGATACAACAATTGCTATATTGGTGCTCTGGAATATATGGAACGCAAGGAACATAACCAATCTCAACAATCAGCAACCAGATCTCGACCAATTA
CAGAGGAATATTCTAAACAACATTGAAGAATGCAAGAGCAATCAAGATAAATCTTCGAGAGCAGAGCACGGAGAGAACCTTTTGAGTCACCTTTTGTGGAAGCCGCCGGA
CCCAAGTTGCTGGAAATTGAACGCTGATGCCTCGTGGTTTGAAGGGAAAGGCGTGGGAGGGCTGGGGTGGACCATCCGTGACTCTAACGGATCTCTCATCGGAGCGGGCT
GCAAGAAATTTAAGAGAAATTGGTCAATCAAATGCCTTGAAGCCGAAGCGATTCTTGAGGGTATGAAAGCGTACAGTTCTCACGGCGACATCGAAGGGAGAGGATGCAAG
CTACCTCTGGTCGTTGAATCTGACTCCATAGAAGCTGTTGGAGCCTTGAATCGTGACTGGGAAGACCAATCGGAGATCAAACTCATCGTGGAAGAAATCGAGAACAGCGA
AGGCTTTGTAGGGGTGCTCAGCGTCTCCAAGTGTTCTAGATCGGAGAATAGGATGGCGCATAACCTCGCTCAAGCCGCTGCGTCGAATGGGTCTTTTGGTTTTTTTGGCT
CATCTTTTCATCAAGATGAAGAAGATGAACAGTTTTGGAGGGAAGAAGCCTTCTCCACCTGCTTTGCCTCATGTTTTGTCGAGGGAGTTAATAGTCATAGCGGGGAAAAT
GAATTGGTGGTGTCTCAATTGTTGACGCTTGGTCCCTATGAAATCAAGAAAGGCGTTCTTGATATTTCAGAGACGGCAATGAGAGGGGTGGCGGAATTGAGGGGTTCATC
ATATCATCCACTGAGCATGGATCTCACGGTCGGTTCCCAGAACATGCATGCTTTAGGAGTAAAATTGAATGAGTTAAGTGCTTTCGATCTTCAATTGCTTCCGCATGTGT
TATATGAACACCTTCAATTTGAAATTGTGTATTTGATAAAGAAATGCGACCGCATTTCTGGGAAGGCTAAAATCAAATGCGACCGCATTTCTGGAAAAAACAGAGGCCGT
TCCGAGTCCGTCGCAGGTGTTCGGGATGCAAAAAGATGCCAAAGAAACGAAGTGAATCGAAGATGGAAGCTTATTGAAGTGGGCAAGCCGCAGACCAGCGTCGAGACCCT
AGCTTCTGGGCGTCTCGACGCTGGGCTTTCCTTAATTAAATCAGGCGGCAAAAGGTCACAGCGTTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGGAAAGCGCAGAAGAGGTAATACAGAGGCAGTTGGAGAATCTGGGTTTGGAGGAGGAAGAAAGGGGAAGGGTGGTTGATATCGAAGATGATGACATTGACGA
GACGGACAAAGATTTCTTGAACTCTCTGGCGTGCAAAATTTTGTCCTGTCGAACAATTAATCCAAAAGGTTTTGCCAGCTTGATGCCAAAGATCTGGGGGCTCTCCGAAA
ATGTGCGAATAGAAAAAGCAGGGAGAAACCTTTTCTTATGCAAATTTCGACACCAGCGTTTCAAAAACAGAGTTCTACGAGAGGGACCTTGGAGTTTCGCGACTCGATCC
TGGAAGTATGCAATCGCCCTAGGGAACTCGATTGGCAAATTCGTCATGGCTGAATCGGATGAGAATGGGAAGATGTCGGGCGAGACTCTACGGGTCAAAGTTCAGATGGA
TATAAACAAACCGCTGAGGAGAGGAACGAATATCAAGACTGGTTCGATGGCTGATAAAGCGTGGATCAGAGTTACATATGAAAAGTTACCAGATTTTTGTTATTTCTGCG
GGAAGTTGGGCCATGTGGTTCAAGAATGTGAAGAGGAGGGAAGCGGCGGCTGCAGCAAATTGGATTACGGAGTAGATCTCAGAAACACCCAGGGAAGCAAAGGATACTAC
CGTGGAAAAAAGCCAGATCACAGGGAACCGAATCTGAGAGGAAGGGGAAGAGGTAACATGAACAGAGGCAGATCTGCGAAATGGGAGTCAGGGGGAGGAAGTGGAAAAGA
TGAAGACAACAGGACATGGCGACATTTAGGTGAAAATACCCAAGATGGAGAGGAAGAAAATCCATTCGAAAAGGAGGTTGCGGATCGCAACGATCTGAACGAGGAAGGAA
AAAAAGAATCACCGCTGAAGCTAGGGCCAAGCCCAAGCCCATTAGGTTTATTTTTAGCCACTCACGCAGCCCTAGGTCTTCATATTTTTCCTCTTCATCTCTTTTCTCCC
TCCTTCTCCAAGACAACCACGAGCAGCCCCCCATTCCCAGTTTCCATCCTCGACGGCCAGCAACGCCGAGACCCATCTCCGGCGTCCTCGAGCAGATCTCCGACCGGCCC
TCTCCCTCACTCGCGTGTTCCCTCCGGCAAGTCCCGGCGTTGCAGCGGCGTGGGTGCGGTTTCAGCAGCAGTTCAGCGTGTCCAGCGGGTCTCCGGCGACAGAGGCCGCA
ACGGCGACGACGACTCGGCCTCAGCGGCGGCGGCGTCTCGTCTCAGCAGCAGCGGCGCACGGCATTCGCAGCAGGTTCACGTTCAACGGGTGGTGGATCTACACAGCTTT
GCGGCGGCAGTGACGAGAGCTCCGACGGGCTCGTGCTTTCCGGCGTCTCCCACAGCAGATCGACGGTGGTTGGGTTGCAGCAGCGTTTGGGGGGTCGTTTTCTCCGTTTT
TCCGACGTTTCGCGTAGTGGGACGTCGGCCGCTCCGCATCGCAGGCACGTGGCGACAAAATGAATGCAGGTATTTTACAAACCCGATTGACATATGGGACAAGATGCAGC
AGGGGTTGGGAAAAGAAGATACAACAATTGCTATATTGGTGCTCTGGAATATATGGAACGCAAGGAACATAACCAATCTCAACAATCAGCAACCAGATCTCGACCAATTA
CAGAGGAATATTCTAAACAACATTGAAGAATGCAAGAGCAATCAAGATAAATCTTCGAGAGCAGAGCACGGAGAGAACCTTTTGAGTCACCTTTTGTGGAAGCCGCCGGA
CCCAAGTTGCTGGAAATTGAACGCTGATGCCTCGTGGTTTGAAGGGAAAGGCGTGGGAGGGCTGGGGTGGACCATCCGTGACTCTAACGGATCTCTCATCGGAGCGGGCT
GCAAGAAATTTAAGAGAAATTGGTCAATCAAATGCCTTGAAGCCGAAGCGATTCTTGAGGGTATGAAAGCGTACAGTTCTCACGGCGACATCGAAGGGAGAGGATGCAAG
CTACCTCTGGTCGTTGAATCTGACTCCATAGAAGCTGTTGGAGCCTTGAATCGTGACTGGGAAGACCAATCGGAGATCAAACTCATCGTGGAAGAAATCGAGAACAGCGA
AGGCTTTGTAGGGGTGCTCAGCGTCTCCAAGTGTTCTAGATCGGAGAATAGGATGGCGCATAACCTCGCTCAAGCCGCTGCGTCGAATGGGTCTTTTGGTTTTTTTGGCT
CATCTTTTCATCAAGATGAAGAAGATGAACAGTTTTGGAGGGAAGAAGCCTTCTCCACCTGCTTTGCCTCATGTTTTGTCGAGGGAGTTAATAGTCATAGCGGGGAAAAT
GAATTGGTGGTGTCTCAATTGTTGACGCTTGGTCCCTATGAAATCAAGAAAGGCGTTCTTGATATTTCAGAGACGGCAATGAGAGGGGTGGCGGAATTGAGGGGTTCATC
ATATCATCCACTGAGCATGGATCTCACGGTCGGTTCCCAGAACATGCATGCTTTAGGAGTAAAATTGAATGAGTTAAGTGCTTTCGATCTTCAATTGCTTCCGCATGTGT
TATATGAACACCTTCAATTTGAAATTGTGTATTTGATAAAGAAATGCGACCGCATTTCTGGGAAGGCTAAAATCAAATGCGACCGCATTTCTGGAAAAAACAGAGGCCGT
TCCGAGTCCGTCGCAGGTGTTCGGGATGCAAAAAGATGCCAAAGAAACGAAGTGAATCGAAGATGGAAGCTTATTGAAGTGGGCAAGCCGCAGACCAGCGTCGAGACCCT
AGCTTCTGGGCGTCTCGACGCTGGGCTTTCCTTAATTAAATCAGGCGGCAAAAGGTCACAGCGTTTGTGA
Protein sequenceShow/hide protein sequence
MEKESAEEVIQRQLENLGLEEEERGRVVDIEDDDIDETDKDFLNSLACKILSCRTINPKGFASLMPKIWGLSENVRIEKAGRNLFLCKFRHQRFKNRVLREGPWSFATRS
WKYAIALGNSIGKFVMAESDENGKMSGETLRVKVQMDINKPLRRGTNIKTGSMADKAWIRVTYEKLPDFCYFCGKLGHVVQECEEEGSGGCSKLDYGVDLRNTQGSKGYY
RGKKPDHREPNLRGRGRGNMNRGRSAKWESGGGSGKDEDNRTWRHLGENTQDGEEENPFEKEVADRNDLNEEGKKESPLKLGPSPSPLGLFLATHAALGLHIFPLHLFSP
SFSKTTTSSPPFPVSILDGQQRRDPSPASSSRSPTGPLPHSRVPSGKSRRCSGVGAVSAAVQRVQRVSGDRGRNGDDDSASAAAASRLSSSGARHSQQVHVQRVVDLHSF
AAAVTRAPTGSCFPASPTADRRWLGCSSVWGVVFSVFPTFRVVGRRPLRIAGTWRQNECRYFTNPIDIWDKMQQGLGKEDTTIAILVLWNIWNARNITNLNNQQPDLDQL
QRNILNNIEECKSNQDKSSRAEHGENLLSHLLWKPPDPSCWKLNADASWFEGKGVGGLGWTIRDSNGSLIGAGCKKFKRNWSIKCLEAEAILEGMKAYSSHGDIEGRGCK
LPLVVESDSIEAVGALNRDWEDQSEIKLIVEEIENSEGFVGVLSVSKCSRSENRMAHNLAQAAASNGSFGFFGSSFHQDEEDEQFWREEAFSTCFASCFVEGVNSHSGEN
ELVVSQLLTLGPYEIKKGVLDISETAMRGVAELRGSSYHPLSMDLTVGSQNMHALGVKLNELSAFDLQLLPHVLYEHLQFEIVYLIKKCDRISGKAKIKCDRISGKNRGR
SESVAGVRDAKRCQRNEVNRRWKLIEVGKPQTSVETLASGRLDAGLSLIKSGGKRSQRL