; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020387 (gene) of Snake gourd v1 genome

Gene IDTan0020387
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTranslation initiation factor IF-2, putative isoform 1
Genome locationLG03:74447272..74451054
RNA-Seq ExpressionTan0020387
SyntenyTan0020387
Gene Ontology termsGO:0006413 - translational initiation (biological process)
GO:0003743 - translation initiation factor activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594850.1 hypothetical protein SDJN03_11403, partial [Cucurbita argyrosperma subsp. sororia]7.9e-11088.8Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKKTVK SSPS  EKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFF+EKLPNLSISG+GE+
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS
        GEIEVQWK+T  EL  NP +G+D+HA+LL RLSIAYPN SAGM+SLNGFE SSKSVKTNPFNVE+LQIPSLVLE EPSD+MMLGM DILQTPGV+NQRLS
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        IGMTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

KAG6604007.1 hypothetical protein SDJN03_04616, partial [Cucurbita argyrosperma subsp. sororia]5.6e-10886.31Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKK+VKKSSPS   +AKD + NELKSE+QA LVSDEDVERHA AIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLP+LSIS RGE 
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI
        GEIEVQWKDT  ELHTNP +GIDIHA+LL RLS AYPNCSAGM+S NGFE SSKSVKTNPFNVENLQIP+  LEEPSD M+LGMPD+LQTPGV+NQRLSI
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GMTPKTRRLPKPGE+LVSIHGSPLGVY+EDNMEAIHESEEG
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

KAG7026813.1 hypothetical protein SDJN02_10820, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-10788.61Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKKTVK SSPS  EKAKDEAENELKSEEQAPLV DEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFF+EKLPNLSISG+GE+
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS
        GEIEVQWK+T  EL  NP +G+D+HA+LL RLSIAYPN SAGM+SLNGFE SSKSVKTNPFNVE+LQIPSLVLE EPSD+MMLGM DILQTPGV+NQRLS
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
        IGMTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

XP_022133198.1 uncharacterized protein LOC111005854 [Momordica charantia]4.3e-10888.38Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKKT KK SPS       EAENE K EEQAPLVS+EDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTP+LQFFEEKLPNLSIS  GE 
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI
        GEIEVQWKD AGELHT P +GIDIHA+LL RLSIAYPNCSAGMQS+NGFE SSKSVKTN FNVENLQIPS VLEEPSD+MMLGMPDILQTPGV NQRLSI
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_022962735.1 uncharacterized protein LOC111463139 [Cucurbita moschata]9.6e-10887.97Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKKTVKKSSP   EK KDEAENELKSEEQ PLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFF+EKLPNLSISG+GE+
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS
        GEIEVQWK+T  EL  NP +G+D+HA+LL+RLSIAYPN SAGM+SLNGFE SSKSVKTNPFNVE+LQIPSLVLE EPSD+MMLGM DILQTPGV+NQRLS
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        IGMTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

TrEMBL top hitse value%identityAlignment
A0A1S4DWB9 uncharacterized protein LOC1034894361.4e-10484.23Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKKTVKKSSPS G  AKDEA +++K+       SDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLP+LSIS RG+ 
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI
        GEIEVQWKDT  ELHTNP +G+DIHA+LL RLS AYP CSAGM+S NGFE SSKSVKTNPFN ENLQIP+ VLEEPSD M+LGMPDILQTPG++NQRLSI
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A6J1BYE7 uncharacterized protein LOC1110058542.1e-10888.38Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKKT KK SPS       EAENE K EEQAPLVS+EDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTP+LQFFEEKLPNLSIS  GE 
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI
        GEIEVQWKD AGELHT P +GIDIHA+LL RLSIAYPNCSAGMQS+NGFE SSKSVKTN FNVENLQIPS VLEEPSD+MMLGMPDILQTPGV NQRLSI
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A6J1HHY4 uncharacterized protein LOC1114631394.7e-10887.97Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKKTVKKSSP   EK KDEAENELKSEEQ PLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFF+EKLPNLSISG+GE+
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS
        GEIEVQWK+T  EL  NP +G+D+HA+LL+RLSIAYPN SAGM+SLNGFE SSKSVKTNPFNVE+LQIPSLVLE EPSD+MMLGM DILQTPGV+NQRLS
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        IGMTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

A0A6J1ILD3 uncharacterized protein LOC1114784963.7e-10584.58Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKK+VKKSSPS   +AKD + NELKSE+QA LVSDEDVERHA AIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFF EKLP+LSIS RGE 
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI
        GEIEVQWKDT  ELHTNP +GIDIHA+LL RLS AYPNCSAG++S NGFE SSKSVKTNPFNVENLQIP+ VLEEPSD M+LGMPD+LQTPG +NQRLSI
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        GMTPKTRRLPKPGE++VSIHGSPLGVY+E NMEAIHESEE
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

A0A6J1KR76 uncharacterized protein LOC1114969031.1e-10687.14Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        MARRKAKK VKKSSP   EK KDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFF+EKLPNLSISG+ E+
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS
        GEIEVQWK+T  EL  NP +G+D+HA+LL RLSIAYPN SAGM+SLNGFE SSKSVKTNPFNVE+LQIPSLVLE EPSD+MMLGM DILQTPG +NQRLS
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLE-EPSDTMMLGMPDILQTPGVNNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        IGMTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G39630.1 unknown protein1.5e-5350.84Show/hide
Query:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL
        M +RKAK+ VK     L E+ +++ E  ++ EE+     DE+VER  AAIRAIRDVEIE+++T LRLLRSYF +EQL TP+L FF+E LP+LSIS   E 
Subjt:  MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGEL

Query:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI
        GEIE++W+D  G+       G+D++ ++L+RLS+ + +  +   SL G+++   +VK N    +N Q+ +LV +  S+  ML   D  QTPGVN QRLS 
Subjt:  GEIEVQWKDTAGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE
        GMTPKT RLPK GEM++S+HGSPLGVYKED NM AI+E
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGGCGAAAGGCGAAGAAAACTGTTAAGAAGTCGAGCCCTTCACTTGGAGAGAAAGCGAAGGATGAAGCAGAAAATGAGTTAAAAAGTGAGGAGCAAGCACCTTT
GGTGTCTGACGAGGATGTTGAACGGCATGCTGCTGCAATTCGTGCCATTCGGGATGTGGAGATTGAGCGTTTGATTACTGAATTGCGGTTGCTTCGTTCGTATTTCAACA
AAGAGCAATTGCAAACTCCTCTATTGCAATTTTTCGAGGAAAAACTTCCAAACTTGTCCATTTCAGGAAGAGGCGAACTAGGAGAAATTGAAGTACAATGGAAGGATACT
GCGGGTGAATTACACACCAATCCAACTGAGGGAATAGATATACATGCTGCTCTTCTTCGTCGACTCTCCATAGCTTATCCAAACTGCTCTGCTGGGATGCAATCGTTAAA
TGGATTTGAAATTTCCAGTAAATCAGTGAAAACAAATCCTTTCAATGTTGAGAACCTGCAAATTCCAAGCTTGGTTTTGGAGGAGCCCTCGGATACTATGATGCTTGGAA
TGCCAGACATTCTCCAAACTCCTGGGGTAAATAACCAAAGATTGTCGATTGGGATGACACCGAAAACCCGAAGGCTTCCGAAACCCGGCGAGATGCTTGTGTCTATCCAT
GGATCTCCCCTTGGTGTTTACAAGGAAGACAACATGGAAGCAATACATGAATCAGAAGAGGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATTTCCAACTTCCAAGTAAAAAGGAACGAACCGAACCGAACTGACTCAAATGGTAAGTTTTGGTTTTAGGGTTTCTCCATTTCAGAATTTCCTTCTACTCTCACAGGCCA
CAGCCACCACCGTTCGCTCTCCCCGTGCTCTCGCATTTGCCGTCGACTACTCAGCCACCTCCGGCGACAACAACAAAGAACCAACCATCTTCTCGGTTTTGCTTTTTGAT
ATTGCAGTGTGCGGATTCGTTCCAGAATGCTGCAACGTGAGTAGTCCAGTAAAGTCAAGAGAGACTCGTTGCTGAACATGGCAAGGCGAAAGGCGAAGAAAACTGTTAAG
AAGTCGAGCCCTTCACTTGGAGAGAAAGCGAAGGATGAAGCAGAAAATGAGTTAAAAAGTGAGGAGCAAGCACCTTTGGTGTCTGACGAGGATGTTGAACGGCATGCTGC
TGCAATTCGTGCCATTCGGGATGTGGAGATTGAGCGTTTGATTACTGAATTGCGGTTGCTTCGTTCGTATTTCAACAAAGAGCAATTGCAAACTCCTCTATTGCAATTTT
TCGAGGAAAAACTTCCAAACTTGTCCATTTCAGGAAGAGGCGAACTAGGAGAAATTGAAGTACAATGGAAGGATACTGCGGGTGAATTACACACCAATCCAACTGAGGGA
ATAGATATACATGCTGCTCTTCTTCGTCGACTCTCCATAGCTTATCCAAACTGCTCTGCTGGGATGCAATCGTTAAATGGATTTGAAATTTCCAGTAAATCAGTGAAAAC
AAATCCTTTCAATGTTGAGAACCTGCAAATTCCAAGCTTGGTTTTGGAGGAGCCCTCGGATACTATGATGCTTGGAATGCCAGACATTCTCCAAACTCCTGGGGTAAATA
ACCAAAGATTGTCGATTGGGATGACACCGAAAACCCGAAGGCTTCCGAAACCCGGCGAGATGCTTGTGTCTATCCATGGATCTCCCCTTGGTGTTTACAAGGAAGACAAC
ATGGAAGCAATACATGAATCAGAAGAGGGTTGATTAGCTGAATCAAGGCCATTTGACACAGCTTGTCAAATGTAAAATTTCAATTGGAATCATCCAATATCGTACCTGTT
CTATCAACCTTCATGTATTGTATCTAACATCTTGATTACTACTTATTAGAGCTACTCTTACATGTGTGTAACTAAAAATTAGTGTAACTTTACTCCAAGAAACGAGTATG
TACAAACGCACAATAGTTCCACTAAATTATTGTGCATGATTATTTGTTCTCAAAATTAGATGTTGATATTGATCAACGGCGAGTCACA
Protein sequenceShow/hide protein sequence
MARRKAKKTVKKSSPSLGEKAKDEAENELKSEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPNLSISGRGELGEIEVQWKDT
AGELHTNPTEGIDIHAALLRRLSIAYPNCSAGMQSLNGFEISSKSVKTNPFNVENLQIPSLVLEEPSDTMMLGMPDILQTPGVNNQRLSIGMTPKTRRLPKPGEMLVSIH
GSPLGVYKEDNMEAIHESEEG