CuGenDBv2

Tan0005661 (gene) of Snake gourd v1 genome

Gene ID: Tan0005661
Organism: Trichosanthes anguina (Snake gourd v1)
Description: ARM repeat superfamily protein
Genome location: LG04:15689280..15695471
RNA-Seq Expression: Tan0005661
Synteny: Tan0005661
Gene Ontology terms:
GO:0042254 - ribosome biogenesis (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0005730 - nucleolus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
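
The genome location field above follows a `linkage group:start..end` pattern. As a minimal sketch of how it can be read programmatically, the Python below splits the string and computes the span. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end = parse_location("LG04:15689280..15695471")
print(chrom, start, end, span_length(start, end))
```

Under the 1-based inclusive assumption, the gene region on LG04 spans 6192 bp.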


Homology
GenBank top hits
XP_011655026.1 uncharacterized protein LOC101212969 isoform X2 [Cucumis sativus] (e value: 7.3e-88; % identity: 85.51)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRII SKLT HLCRREP RTL FR FSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSV+LL+VKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDSAED KVNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        LRYDVSS
Subjt:  LRYDVSS

XP_022941868.1 uncharacterized protein LOC111447100 isoform X2 [Cucurbita moschata] (e value: 6.6e-89; % identity: 86.47)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRIIVSKLT HLCRREPARTL FRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        L YDVSS
Subjt:  LRYDVSS

XP_022996604.1 uncharacterized protein LOC111491787 isoform X2 [Cucurbita maxima] (e value: 3.3e-88; % identity: 85.99)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRIIVSKLT HLCRREPARTL FRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS+SDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        L YDVSS
Subjt:  LRYDVSS

XP_023526983.1 uncharacterized protein LOC111790336 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 6.6e-89; % identity: 86.47)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRIIVSKLT HLCRREPARTL FRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        L YDVSS
Subjt:  LRYDVSS

XP_038891777.1 uncharacterized protein LOC120081166 isoform X1 [Benincasa hispida] (e value: 3.3e-88; % identity: 85.99)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRII SKLT HLCRREP RTL FR FSAYDEREIEKEAERKVGWLLKLIFAGTATF+GYQIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED KVNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        L YDVSS
Subjt:  LRYDVSS

TrEMBL top hits
A0A0A0KM85 Uncharacterized protein (e value: 3.5e-88; % identity: 85.51)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRII SKLT HLCRREP RTL FR FSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSV+LL+VKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDSAED KVNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        LRYDVSS
Subjt:  LRYDVSS

A0A6J1FNN3 uncharacterized protein LOC111447100 isoform X1 (e value: 1.0e-87; % identity: 85.65)
Query:  MRIIVSKLT--PHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        MRIIVSKLT   HLCRREPARTL FRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS  
Subjt:  MRIIVSKLT--PHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF
                         IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRF
Subjt:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF

Query:  RDLRYDVSS
        RDL YDVSS
Subjt:  RDLRYDVSS

A0A6J1FTA5 uncharacterized protein LOC111447100 isoform X2 (e value: 3.2e-89; % identity: 86.47)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRIIVSKLT HLCRREPARTL FRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        L YDVSS
Subjt:  LRYDVSS

A0A6J1K583 uncharacterized protein LOC111491787 isoform X1 (e value: 5.1e-87; % identity: 85.17)
Query:  MRIIVSKLT--PHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        MRIIVSKLT   HLCRREPARTL FRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS  
Subjt:  MRIIVSKLT--PHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF
                         IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS+SDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRF
Subjt:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF

Query:  RDLRYDVSS
        RDL YDVSS
Subjt:  RDLRYDVSS

A0A6J1K968 uncharacterized protein LOC111491787 isoform X2 (e value: 1.6e-88; % identity: 85.99)
Query:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT
        MRIIVSKLT HLCRREPARTL FRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFS    
Subjt:  MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGT

Query:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD
                       IDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS+SDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRD
Subjt:  SSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRD

Query:  LRYDVSS
        L YDVSS
Subjt:  LRYDVSS

SwissProt top hits
No hits found

Arabidopsis top hits
AT3G56210.1 ARM repeat superfamily protein (e value: 7.8e-56; % identity: 53.11)
Query:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        MRI+ +++  H CR   A  R+ +F   +  D+  +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+  
Subjt:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF
                         IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S ED  ++ YKSN++++ 
Subjt:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF

Query:  RDLRYDVSS
         +    VSS
Subjt:  RDLRYDVSS

AT3G56210.2 ARM repeat superfamily protein (e value: 7.8e-56; % identity: 53.11)
Query:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        MRI+ +++  H CR   A  R+ +F   +  D+  +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+  
Subjt:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF
                         IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S ED  ++ YKSN++++ 
Subjt:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF

Query:  RDLRYDVSS
         +    VSS
Subjt:  RDLRYDVSS

AT3G56210.4 ARM repeat superfamily protein (e value: 7.8e-56; % identity: 53.11)
Query:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        MRI+ +++  H CR   A  R+ +F   +  D+  +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+  
Subjt:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF
                         IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S ED  ++ YKSN++++ 
Subjt:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRF

Query:  RDLRYDVSS
         +    VSS
Subjt:  RDLRYDVSS

AT3G56210.5 ARM repeat superfamily protein (e value: 2.5e-54; % identity: 52.61)
Query:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        MRI+ +++  H CR   A  R+ +F   +  D+  +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+  
Subjt:  MRIIVSKLTPHLCRREPA--RTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMK
                         IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S   EA   L   GA+ ++KSTP+S ED  ++ YKSN+++
Subjt:  GTSSPYPPPELVEAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMK

Query:  RFRDLRYDVSS
        +  +    VSS
Subjt:  RFRDLRYDVSS


Sequences
CDS sequence
ATGCGCATAATTGTATCGAAGCTAACCCCTCATCTCTGCAGAAGGGAACCTGCGCGGACCCTGCACTTTCGCCCCTTTTCAGCTTACGACGAAAGAGAGATCGAGAAGGA
GGCTGAAAGAAAAGTAGGATGGTTATTAAAACTAATCTTTGCTGGGACTGCGACGTTTCTGGGTTACCAGATTTTTCCTTACATGGGGGATAACTTGTTGCAGCAATCTG
TGGCACTTTTGCAAGTCAAGGATCCACTGTTTAAGAGGATGGGAGCGTCTAGATTGGCTCGCTTTTCGATTGATGGTACCAGCTCACCTTATCCCCCTCCTGAATTGGTT
GAAGCTTTTGAAATTATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGGGCTCAAGAGCTCTTAAACATGCTCGGGGCTGCCAAAGATGACCGTACACGTAA
GGAAGCTTTGAAGGCTTTACATGCCATCTCACATTCAGATGAAGCTGTTGATGCCTTGCATAAAGCTGGGGCAATCTTGGTTATTAAATCTACCCCGGATTCAGCTGAAG
ATAAGAAAGTGAACGAGTACAAGTCAAACCTAATGAAGAGATTTAGGGATCTTAGATATGATGTTTCATCTTGA
mRNA sequence
TTTAGCCTTAAACTCGAGGCCAAAAATTTTAGAGACTGAGCGGGTCGCAGTTCACAAACGAAGCCGACCCATAGTGCCGGAGCCTGCGGCCGATTCTCTGTTTTGCACCA
TGCGCATAATTGTATCGAAGCTAACCCCTCATCTCTGCAGAAGGGAACCTGCGCGGACCCTGCACTTTCGCCCCTTTTCAGCTTACGACGAAAGAGAGATCGAGAAGGAG
GCTGAAAGAAAAGTAGGATGGTTATTAAAACTAATCTTTGCTGGGACTGCGACGTTTCTGGGTTACCAGATTTTTCCTTACATGGGGGATAACTTGTTGCAGCAATCTGT
GGCACTTTTGCAAGTCAAGGATCCACTGTTTAAGAGGATGGGAGCGTCTAGATTGGCTCGCTTTTCGATTGATGGTACCAGCTCACCTTATCCCCCTCCTGAATTGGTTG
AAGCTTTTGAAATTATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGGGCTCAAGAGCTCTTAAACATGCTCGGGGCTGCCAAAGATGACCGTACACGTAAG
GAAGCTTTGAAGGCTTTACATGCCATCTCACATTCAGATGAAGCTGTTGATGCCTTGCATAAAGCTGGGGCAATCTTGGTTATTAAATCTACCCCGGATTCAGCTGAAGA
TAAGAAAGTGAACGAGTACAAGTCAAACCTAATGAAGAGATTTAGGGATCTTAGATATGATGTTTCATCTTGACAAGAATATGTGCATATGTAATGTACTGGAACACCTT
GAAGTGGTCAACTAATGGTTTATTTCCATCCCAAGCTTCTTGAGGGGAACCAAGAGCCAATAAAGTAATGAGGAACAACTTCTAAGCTTTGATTACTCATTGATTCCATG
AAAGACAAATTTTTCGCCTTGTCATCGAAGCTTCTTCCTCTTCTTGTTAGGAATATGTGCATATGCAATGTACTTGAACACCTTGAAGTGTTCAACTAATGGTTTATTTC
CATCCCAAGCTTCTTGAGGAGTTATATTTTTAACCGAAAAAGTGGGATTTCAATTCAAGATATGAACATCCCAATTTATTGCTTCAGGCCAGAATTGCTTTAGAACTCTT
CTATGTGCAATCATACTTCGAGCTCAGTTGAGAATGATATAATTTTTTTTTTTTGACAACGCCATTTAGTTGAAGTGTATAGGTTGCTGTTAGTTGGTGTCGAATTTTAT
ACTCATCATAAAAATTTGCAAATTCTTG
Protein sequence
MRIIVSKLTPHLCRREPARTLHFRPFSAYDEREIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDGTSSPYPPPELV
EAFEIIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDKKVNEYKSNLMKRFRDLRYDVSS
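
The CDS and protein sequences above can be cross-checked with a short translation routine. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The `translate` helper is hypothetical, and for brevity only the first 45 and last 15 nucleotides of the CDS listed above are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record's CDS uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First 45 and last 15 nucleotides of the CDS sequence shown in this record.
CDS_START = "ATGCGCATAATTGTATCGAAGCTAACCCCTCATCTCTGCAGAAGG"
CDS_END = "GATGTTTCATCTTGA"

print(translate(CDS_START))
print(translate(CDS_END))
```

The two translated fragments can be compared against the beginning and end of the protein sequence listed above; passing the full CDS (with line breaks removed) to `translate` would allow the same comparison over the whole sequence.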