; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036201 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036201
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold5:43924943..43933560
RNA-Seq ExpressionSpg036201
SyntenySpg036201
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PQQ10307.1 uncharacterized protein Pyn_17609 [Prunus yedoensis var. nudiflora]7.2e-3731.99Show/hide
Query:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD
        EE+   +EK  +  +++   + +E  DI+D  + L  S+  K+ T+     E F   M +IW     V V+  G N+FL  F T+ D+ +++R  PW +D
Subjt:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD

Query:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT
         AL+L E P GR   S M  +YA FWI  HN+P  C        +GN++G   DV+   +G   G  LR+R+  DV + L RG  +T+ SG   +++   
Subjt:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT

Query:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET---NSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYERNSD
        +E+LP+FC+ CGR+GHV +EC  +++  +   E  YGI L+ T   NS++     +       ++   +G   RG    GG   +  W      N D
Subjt:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET---NSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYERNSD

VVA32948.1 PREDICTED: DUF4283 domain-containing [Prunus dulcis]3.8e-3834.57Show/hide
Query:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD
        +E+   +EK  +  ++++  + +   DI+D  + L  S+  K+LT+     E F   M +IW     V V+  G N+FL  F T+ D+ R++R  PW +D
Subjt:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD

Query:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT
         AL+L E P G    S M  +YA FWI  HN+P  C        +GN+ G   DVI   +G+  G  LR+R+  DV + L+RG  +T+ SG    ++   
Subjt:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT

Query:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET
        +E+LP+FC+ CGR+GHV +EC  +++  +  +E  YG  L+ T
Subjt:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.2e-3638.18Show/hide
Query:  LNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRV-NVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYDD
        L E+ +   L  +EDK  V I+   +E   K L  S+ CK+L+ + I   V  + +   W ++ +  +V+  G NIFL  F    D++RI+R  PW +D 
Subjt:  LNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRV-NVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYDD

Query:  ALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPITF
        ALI+ + P        M+FR  S W+HF +L   C  +  A  LGNAIG FEDV S+ N    G  LRVR++FDV + L RG  + +       WIPI +
Subjt:  ALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPITF

Query:  EKLPDFCYSCGRIGHVQQEC
        E+LPDF Y CGR+ H+ ++C
Subjt:  EKLPDFCYSCGRIGHVQQEC

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]6.1e-3632.31Show/hide
Query:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD
        +E+ +  E       E++ V       I  AD ++   +  K+ TSK I  E    +M  +W V      E  G NI++  F++  +K R++   PW ++
Subjt:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD

Query:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT
         +L++   P        M F + +FWI  HN+P  C   + A  LG  +G+ E++  DG    +G  +RVR+K DV + L+RG  +   S     W P+ 
Subjt:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT

Query:  FEKLPDFCYSCGRIGHVQQECIEERPE---GHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGR-GRGRSARGGRNFNEFWNKKYERNSDGNWD
        +EKLPDFCY CG+IGH  +EC E+R +    +    YG  LR T   K    P+ E +W       RGGR GRG    GGR     W ++ E   D +  
Subjt:  FEKLPDFCYSCGRIGHVQQECIEERPE---GHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGR-GRGRSARGGRNFNEFWNKKYERNSDGNWD

Query:  KSPTRVPTAAEETNDSAEGEERGTA
        +S  R   A EE  D    EE  TA
Subjt:  KSPTRVPTAAEETNDSAEGEERGTA

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]6.1e-3632.08Show/hide
Query:  NEELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIY
        +E L    +K  L  +ED+  + ++   ++ A++ L+ S+  K+L  ++I  +V S ++   W VE ++ VE  G+N+FL  F  + D +R+++  PW +
Subjt:  NEELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIY

Query:  DDALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPI
        D ALI+ ++P    +IS +EF   +FWIH  +LP     +  A  LGNAIG F DV  +  G   G SLR+R+  D+ + L+RG  I +       WIPI
Subjt:  DDALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPI

Query:  TFEKLPDFCYSCGRIGHVQQEC----IEERPEGHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYER-NSDGN
         +E+LPDFCY CG IGH   +C    +  + +      YG  LR   S  G  + ++     GK P      G        R   E   +  E+ N DG 
Subjt:  TFEKLPDFCYSCGRIGHVQQEC----IEERPEGHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYER-NSDGN

Query:  WDKSPTRVPTAAEETNDS
          ++      AAE+T D+
Subjt:  WDKSPTRVPTAAEETNDS

TrEMBL top hitse value%identityAlignment
A0A314YVX1 CCHC-type domain-containing protein3.5e-3731.99Show/hide
Query:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD
        EE+   +EK  +  +++   + +E  DI+D  + L  S+  K+ T+     E F   M +IW     V V+  G N+FL  F T+ D+ +++R  PW +D
Subjt:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD

Query:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT
         AL+L E P GR   S M  +YA FWI  HN+P  C        +GN++G   DV+   +G   G  LR+R+  DV + L RG  +T+ SG   +++   
Subjt:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT

Query:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET---NSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYERNSD
        +E+LP+FC+ CGR+GHV +EC  +++  +   E  YGI L+ T   NS++     +       ++   +G   RG    GG   +  W      N D
Subjt:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET---NSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYERNSD

A0A5E4G034 PREDICTED: DUF4283 domain-containing1.8e-3834.57Show/hide
Query:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD
        +E+   +EK  +  ++++  + +   DI+D  + L  S+  K+LT+     E F   M +IW     V V+  G N+FL  F T+ D+ R++R  PW +D
Subjt:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD

Query:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT
         AL+L E P G    S M  +YA FWI  HN+P  C        +GN+ G   DVI   +G+  G  LR+R+  DV + L+RG  +T+ SG    ++   
Subjt:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT

Query:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET
        +E+LP+FC+ CGR+GHV +EC  +++  +  +E  YG  L+ T
Subjt:  FEKLPDFCYSCGRIGHVQQEC--IEERPEGHEEGTYGIKLRET

A0A6J1BSZ1 uncharacterized protein LOC1110054815.9e-3738.18Show/hide
Query:  LNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRV-NVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYDD
        L E+ +   L  +EDK  V I+   +E   K L  S+ CK+L+ + I   V  + +   W ++ +  +V+  G NIFL  F    D++RI+R  PW +D 
Subjt:  LNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRV-NVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYDD

Query:  ALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPITF
        ALI+ + P        M+FR  S W+HF +L   C  +  A  LGNAIG FEDV S+ N    G  LRVR++FDV + L RG  + +       WIPI +
Subjt:  ALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPITF

Query:  EKLPDFCYSCGRIGHVQQEC
        E+LPDF Y CGR+ H+ ++C
Subjt:  EKLPDFCYSCGRIGHVQQEC

A0A6J1D765 uncharacterized protein LOC1110179022.9e-3632.31Show/hide
Query:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD
        +E+ +  E       E++ V       I  AD ++   +  K+ TSK I  E    +M  +W V      E  G NI++  F++  +K R++   PW ++
Subjt:  EELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYD

Query:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT
         +L++   P        M F + +FWI  HN+P  C   + A  LG  +G+ E++  DG    +G  +RVR+K DV + L+RG  +   S     W P+ 
Subjt:  DALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPIT

Query:  FEKLPDFCYSCGRIGHVQQECIEERPE---GHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGR-GRGRSARGGRNFNEFWNKKYERNSDGNWD
        +EKLPDFCY CG+IGH  +EC E+R +    +    YG  LR T   K    P+ E +W       RGGR GRG    GGR     W ++ E   D +  
Subjt:  FEKLPDFCYSCGRIGHVQQECIEERPE---GHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGR-GRGRSARGGRNFNEFWNKKYERNSDGNWD

Query:  KSPTRVPTAAEETNDSAEGEERGTA
        +S  R   A EE  D    EE  TA
Subjt:  KSPTRVPTAAEETNDSAEGEERGTA

A0A6J1DU55 uncharacterized protein LOC1110231352.9e-3632.08Show/hide
Query:  NEELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIY
        +E L    +K  L  +ED+  + ++   ++ A++ L+ S+  K+L  ++I  +V S ++   W VE ++ VE  G+N+FL  F  + D +R+++  PW +
Subjt:  NEELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIACKILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIY

Query:  DDALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPI
        D ALI+ ++P    +IS +EF   +FWIH  +LP     +  A  LGNAIG F DV  +  G   G SLR+R+  D+ + L+RG  I +       WIPI
Subjt:  DDALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGEFEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPI

Query:  TFEKLPDFCYSCGRIGHVQQEC----IEERPEGHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYER-NSDGN
         +E+LPDFCY CG IGH   +C    +  + +      YG  LR   S  G  + ++     GK P      G        R   E   +  E+ N DG 
Subjt:  TFEKLPDFCYSCGRIGHVQQEC----IEERPEGHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWPFLRGGRGRGRSARGGRNFNEFWNKKYER-NSDGN

Query:  WDKSPTRVPTAAEETNDS
          ++      AAE+T D+
Subjt:  WDKSPTRVPTAAEETNDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGTCCTGGCGCTTCTCTCTCGAGCAGGGGGAACTCCATGGACAAAGGTGGGCAACTCCAAGTTTGCCGAAGGGGATCCATTAGTGATCAGCAAGGTGAAAGTGA
AGGCTCTAGAAGTCAAGACATGGAAGAGGAAGGTAGTACCAAACAACAAATGGAAGAAACAACAAGCAATGGAGAGTCATTAGGCCATGAACCCAATGAGGAGCTAAATG
AACAAATAGAAAAGCTAAGCCTGGCTGAGCAGGAGGATAAGAGAGTAGTGGCTATCGAAGACGGCGACATTGAAGATGCAGACAAGGATCTTAGTGATTCGATTGCTTGT
AAAATTCTTACATCAAAGCTCATTCTGTGGGAGGTGTTCTCAGACATCATGCCACGAATATGGGGTGTCGAAGGCAGAGTTAATGTAGAAAAATCGGGGAGGAATATCTT
CCTCTGTAAATTCAGAACCCAGAAGGATAAACACCGAATAATCAGAGGAGCCCCTTGGATTTATGATGATGCTTTGATCTTATTCGAGGAACCAAAAGGAAGATGTGATA
TCAGTGCGATGGAGTTCAGGTACGCATCTTTTTGGATCCACTTCCATAACTTACCTCGTGTTTGTTATTGCAGGAAATACGCTGAAGCGCTGGGGAATGCTATTGGTGAA
TTTGAAGACGTAATATCCGATGGAAATGGGAGAATCAGTGGTGAAAGTCTTAGAGTAAGGATCAAATTTGACGTGAGGGAGTCGTTAAAAAGGGGAACAAACATAACAGT
GGGGTCCGGTGCGGCCAAAAAATGGATTCCAATCACATTTGAAAAACTCCCTGATTTCTGTTATTCATGTGGTAGAATAGGTCACGTCCAACAAGAATGTATAGAGGAAA
GACCTGAGGGTCACGAAGAAGGCACATACGGAATAAAACTCAGAGAAACAAATAGTAGTAAAGGCTTTTACAGGCCCAAAAGAGAGGAGTACTGGGAGGGAAAATGGCCT
TTTCTGAGAGGAGGCAGAGGTAGAGGACGAAGTGCAAGAGGTGGACGAAACTTCAACGAGTTCTGGAACAAAAAGTACGAGAGAAACAGTGATGGAAACTGGGATAAAAG
TCCGACTAGAGTTCCGACGGCTGCCGAAGAAACAAACGATTCAGCAGAGGGAGAAGAGAGAGGTACGGCTGATGGGACTAATCTGTGCTGTCAGGAGGACTTGTCGGCAA
GGAATGAGGAAGACAAAAGACAAAAAAACGGTAACATGGGCAAAAAGAATACTCACAGTGTTCGAGGGGATGAGTCGGATTCAAAAGTCATTAAAAATGGGTTCAGTGTA
CCCGAACAAAAAGACAAGAATAATGGGCTGTTGATGAAGGGGCCCTTGATGAATATTGGTGCTAACAGGGCAGGCCTGACGAAGGAAATGAATCTGGATAAAGCTGACAC
CAAGGTGGGGAACGAAGAGAACTCCTTGTATACTGATCAAGCTAGCAGGAGTGTTGAAGCTTCGGAACCTAGCAAAAACACAGCTAGCAACAGACCTTTCAATGAAAACT
CTGAAGAAAGAAACAAAAGGAAAGCCAAAGAGAAGTTGCCAAACGAAAACAACAAAGAAAAGAAAAGCCCATCAAAGACAAGGAACTCAAGTTGTAAAAAATGGAAACGG
CTGGCTAGGGGAGAGGTTGCTGGGAAAATGTCTGGGAAATCTGATGTGGAAGATATGATCATTGATATGGGGAAAAGGAAGTTGAATGAAGAGAAGGCAATTGAGAGGCT
TGATAGGTTTTTGGCCAATTCCCAATTCATTAACTTGTTTAATGAGATCGAGGTCAAGCACCTTCCGAAGCACAATTTCGATCACAAACCGATATTGGCTAATGTGAACC
ACCGGAAAGCAGTTATAGACAGAAGGACTACAATGAAGCCGATTAGATTTGAAGAAGGTTGGGTGCAGTTTGAGGAGTGTAAAGATATAGTGAACACCCACTGGAAATTC
CATAGGAAGGCAACAATCGATACCTTCCAAGCTAAAGTAACAGATTGCTTGCGGAGTCTTAAAAAATGGAATTCGGTCAGACTAAAAGGATCATTAACTACGGCTATTCA
CAGGAAAGAACAGGAGATTAATCATATTAATATGTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCCAGAG
GAAGCCTTGAGGGGAATTCTTTGAGAATCAACAAACCCTCGCCGTCGCAACTCGTCGCCCCTCGCCCGTCGTTTTCTTTTTCTTCTTCTTCGTCTGCAGCTCACCGTGAC
CACCCTCGCCGTCGCCGTCGCCGCCCCTCCGCCCGTCCTTTTCTTTTTCTTCTTCTTCGTCTGCAGCTCACCGTGACCGCACGCTGTCGAGAAAGGCAGAGGATGGAGTC
CACTTATTTTGGCAATGCAAACTGGAACCAGTTCAAGAAGTCGAATTCCAGGAAAGATACAAATCAATTTCACAAAGATATTATGAGTAAAATGAGCTACTACCAGAATC
AGAGTAAGTACCAGGAGAAAAGCTCAGCGAAGAACCTCCAGAGTCATGGGTGTTGGAATCCGTCGTCCGTTGGCCATTGGAAGATGAACTCGGACGCAACCTGGTTCGAC
TCCTCAAATTCGGGGGGAGTGGGATGGATCATTCGTGACTCATCCGGTTCTTTGATCGAAGCAGGGTGCAAGATAGTTGAGGCGAAAGATGAAATAAAAAATTTGGAAGC
CATAGCCATTTTAGATGGTCTCAAGCATTTATCAGAAAGCTACAGATCGTACTCGGAGATGGATAAGCCTCCGTTGGTGGTGGAATCGAAGTTTTCAAGCTCCTTAACCT
TGAATCTGAAGATTGCTCGGAAATCTCGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGTCCTGGCGCTTCTCTCTCGAGCAGGGGGAACTCCATGGACAAAGGTGGGCAACTCCAAGTTTGCCGAAGGGGATCCATTAGTGATCAGCAAGGTGAAAGTGA
AGGCTCTAGAAGTCAAGACATGGAAGAGGAAGGTAGTACCAAACAACAAATGGAAGAAACAACAAGCAATGGAGAGTCATTAGGCCATGAACCCAATGAGGAGCTAAATG
AACAAATAGAAAAGCTAAGCCTGGCTGAGCAGGAGGATAAGAGAGTAGTGGCTATCGAAGACGGCGACATTGAAGATGCAGACAAGGATCTTAGTGATTCGATTGCTTGT
AAAATTCTTACATCAAAGCTCATTCTGTGGGAGGTGTTCTCAGACATCATGCCACGAATATGGGGTGTCGAAGGCAGAGTTAATGTAGAAAAATCGGGGAGGAATATCTT
CCTCTGTAAATTCAGAACCCAGAAGGATAAACACCGAATAATCAGAGGAGCCCCTTGGATTTATGATGATGCTTTGATCTTATTCGAGGAACCAAAAGGAAGATGTGATA
TCAGTGCGATGGAGTTCAGGTACGCATCTTTTTGGATCCACTTCCATAACTTACCTCGTGTTTGTTATTGCAGGAAATACGCTGAAGCGCTGGGGAATGCTATTGGTGAA
TTTGAAGACGTAATATCCGATGGAAATGGGAGAATCAGTGGTGAAAGTCTTAGAGTAAGGATCAAATTTGACGTGAGGGAGTCGTTAAAAAGGGGAACAAACATAACAGT
GGGGTCCGGTGCGGCCAAAAAATGGATTCCAATCACATTTGAAAAACTCCCTGATTTCTGTTATTCATGTGGTAGAATAGGTCACGTCCAACAAGAATGTATAGAGGAAA
GACCTGAGGGTCACGAAGAAGGCACATACGGAATAAAACTCAGAGAAACAAATAGTAGTAAAGGCTTTTACAGGCCCAAAAGAGAGGAGTACTGGGAGGGAAAATGGCCT
TTTCTGAGAGGAGGCAGAGGTAGAGGACGAAGTGCAAGAGGTGGACGAAACTTCAACGAGTTCTGGAACAAAAAGTACGAGAGAAACAGTGATGGAAACTGGGATAAAAG
TCCGACTAGAGTTCCGACGGCTGCCGAAGAAACAAACGATTCAGCAGAGGGAGAAGAGAGAGGTACGGCTGATGGGACTAATCTGTGCTGTCAGGAGGACTTGTCGGCAA
GGAATGAGGAAGACAAAAGACAAAAAAACGGTAACATGGGCAAAAAGAATACTCACAGTGTTCGAGGGGATGAGTCGGATTCAAAAGTCATTAAAAATGGGTTCAGTGTA
CCCGAACAAAAAGACAAGAATAATGGGCTGTTGATGAAGGGGCCCTTGATGAATATTGGTGCTAACAGGGCAGGCCTGACGAAGGAAATGAATCTGGATAAAGCTGACAC
CAAGGTGGGGAACGAAGAGAACTCCTTGTATACTGATCAAGCTAGCAGGAGTGTTGAAGCTTCGGAACCTAGCAAAAACACAGCTAGCAACAGACCTTTCAATGAAAACT
CTGAAGAAAGAAACAAAAGGAAAGCCAAAGAGAAGTTGCCAAACGAAAACAACAAAGAAAAGAAAAGCCCATCAAAGACAAGGAACTCAAGTTGTAAAAAATGGAAACGG
CTGGCTAGGGGAGAGGTTGCTGGGAAAATGTCTGGGAAATCTGATGTGGAAGATATGATCATTGATATGGGGAAAAGGAAGTTGAATGAAGAGAAGGCAATTGAGAGGCT
TGATAGGTTTTTGGCCAATTCCCAATTCATTAACTTGTTTAATGAGATCGAGGTCAAGCACCTTCCGAAGCACAATTTCGATCACAAACCGATATTGGCTAATGTGAACC
ACCGGAAAGCAGTTATAGACAGAAGGACTACAATGAAGCCGATTAGATTTGAAGAAGGTTGGGTGCAGTTTGAGGAGTGTAAAGATATAGTGAACACCCACTGGAAATTC
CATAGGAAGGCAACAATCGATACCTTCCAAGCTAAAGTAACAGATTGCTTGCGGAGTCTTAAAAAATGGAATTCGGTCAGACTAAAAGGATCATTAACTACGGCTATTCA
CAGGAAAGAACAGGAGATTAATCATATTAATATGTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCCAGAG
GAAGCCTTGAGGGGAATTCTTTGAGAATCAACAAACCCTCGCCGTCGCAACTCGTCGCCCCTCGCCCGTCGTTTTCTTTTTCTTCTTCTTCGTCTGCAGCTCACCGTGAC
CACCCTCGCCGTCGCCGTCGCCGCCCCTCCGCCCGTCCTTTTCTTTTTCTTCTTCTTCGTCTGCAGCTCACCGTGACCGCACGCTGTCGAGAAAGGCAGAGGATGGAGTC
CACTTATTTTGGCAATGCAAACTGGAACCAGTTCAAGAAGTCGAATTCCAGGAAAGATACAAATCAATTTCACAAAGATATTATGAGTAAAATGAGCTACTACCAGAATC
AGAGTAAGTACCAGGAGAAAAGCTCAGCGAAGAACCTCCAGAGTCATGGGTGTTGGAATCCGTCGTCCGTTGGCCATTGGAAGATGAACTCGGACGCAACCTGGTTCGAC
TCCTCAAATTCGGGGGGAGTGGGATGGATCATTCGTGACTCATCCGGTTCTTTGATCGAAGCAGGGTGCAAGATAGTTGAGGCGAAAGATGAAATAAAAAATTTGGAAGC
CATAGCCATTTTAGATGGTCTCAAGCATTTATCAGAAAGCTACAGATCGTACTCGGAGATGGATAAGCCTCCGTTGGTGGTGGAATCGAAGTTTTCAAGCTCCTTAACCT
TGAATCTGAAGATTGCTCGGAAATCTCGCTGA
Protein sequenceShow/hide protein sequence
MKGPGASLSSRGNSMDKGGQLQVCRRGSISDQQGESEGSRSQDMEEEGSTKQQMEETTSNGESLGHEPNEELNEQIEKLSLAEQEDKRVVAIEDGDIEDADKDLSDSIAC
KILTSKLILWEVFSDIMPRIWGVEGRVNVEKSGRNIFLCKFRTQKDKHRIIRGAPWIYDDALILFEEPKGRCDISAMEFRYASFWIHFHNLPRVCYCRKYAEALGNAIGE
FEDVISDGNGRISGESLRVRIKFDVRESLKRGTNITVGSGAAKKWIPITFEKLPDFCYSCGRIGHVQQECIEERPEGHEEGTYGIKLRETNSSKGFYRPKREEYWEGKWP
FLRGGRGRGRSARGGRNFNEFWNKKYERNSDGNWDKSPTRVPTAAEETNDSAEGEERGTADGTNLCCQEDLSARNEEDKRQKNGNMGKKNTHSVRGDESDSKVIKNGFSV
PEQKDKNNGLLMKGPLMNIGANRAGLTKEMNLDKADTKVGNEENSLYTDQASRSVEASEPSKNTASNRPFNENSEERNKRKAKEKLPNENNKEKKSPSKTRNSSCKKWKR
LARGEVAGKMSGKSDVEDMIIDMGKRKLNEEKAIERLDRFLANSQFINLFNEIEVKHLPKHNFDHKPILANVNHRKAVIDRRTTMKPIRFEEGWVQFEECKDIVNTHWKF
HRKATIDTFQAKVTDCLRSLKKWNSVRLKGSLTTAIHRKEQEINHINMCEHHHYGYTGMTLRLLEAGDWWESRGSLEGNSLRINKPSPSQLVAPRPSFSFSSSSSAAHRD
HPRRRRRRPSARPFLFLLLRLQLTVTARCRERQRMESTYFGNANWNQFKKSNSRKDTNQFHKDIMSKMSYYQNQSKYQEKSSAKNLQSHGCWNPSSVGHWKMNSDATWFD
SSNSGGVGWIIRDSSGSLIEAGCKIVEAKDEIKNLEAIAILDGLKHLSESYRSYSEMDKPPLVVESKFSSSLTLNLKIARKSR