; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G007240 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G007240
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionTranslation initiation factor IF-2, putative isoform 1
Genome locationchr01:5740196..5742970
RNA-Seq ExpressionLsi01G007240
SyntenyLsi01G007240
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034696.1 Translation initiation factor IF-2, putative isoform 1 [Cucumis melo var. makuwa]7.0e-10273.87Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAA------NDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ
        MARRKAKKTVKKSS S  R AKDEAA      +DEDVERHAAAI AIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSIS RG+QGEIEVQ
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAA------NDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ

Query:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI
        WKDTEDELHTNPADG+DIHASLLHRLSTAYP  SAGMRSFNGFEFSSKSVKTNPFN ENLQIPNFVLEEPSDN+VLGMPDILQTPG              
Subjt:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI

Query:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHGMFI
                                               +SNQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHG+FI
Subjt:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHGMFI

KAG6604007.1 hypothetical protein SDJN03_04616, partial [Cucurbita argyrosperma subsp. sororia]3.5e-10172.66Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAAN------------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQ
        MARRKAKK+VKKSS SP+REAKD +AN            DEDVERHA AI AIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQ
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAAN------------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQ

Query:  GEIEVQWKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSD
        GEIEVQWKDTEDELHTNPADGIDIHASLLHRLSTAYPN SAGMRSFNGFEFSSKSVKTNPFNVENLQIPNF LEEPSDN+VLGMPD+LQTPG        
Subjt:  GEIEVQWKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSD

Query:  IAIFSIFGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                                     VSNQRLS GMTPKTRRLPKPGE+LVSIHGSPLGVY+EDNMEAIH
Subjt:  IAIFSIFGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

KAG7034170.1 hypothetical protein SDJN02_03897, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-10474.74Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAAN------------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQ
        MARRKAKK+VKKSS SPVREAKD +AN            DEDVERHA AI AIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQ
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAAN------------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQ

Query:  GEIEVQWKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSD
        GEIEVQWKDTEDELHTNPADGIDIHASLLHRLSTAYPN SAGMRSFNGFEFSSKSVKTNPFNVENLQIPNF LEEPSDN+VLGMPD+LQTPGVRIFM   
Subjt:  GEIEVQWKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSD

Query:  IAIFSIFGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                                     VSNQRLS GMTPKTRRLPKPGE+LVSIHGSPLGVY+EDNMEAIH
Subjt:  IAIFSIFGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

XP_004143493.2 uncharacterized protein LOC101215210 [Cucumis sativus]4.6e-10175.81Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQWKDTED
        MARRKAKKTVKKSS SP+ EAKDEAANDEDVERHAAAI AIRDVEI RLITELRLLRSYFNKEQLQTPLLQFFEEKLP LSISR GEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQWKDTED

Query:  ELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSIFGGVFT
        EL TNPADGIDIHASLLHRLS AYPN SAGMRSFNGFEFSSKSVKTNPF VENLQIPNFVLEEPSDN+VLGMPDI QTPG                    
Subjt:  ELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSIFGGVFT

Query:  QNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                         VSNQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  QNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

XP_031741813.1 uncharacterized protein LOC101204054 [Cucumis sativus]1.3e-10073.85Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAAN------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ
        MARRKAKKTVKKSS S  R+AKDEA N      DEDVERHAAAI AIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLPSLSIS RG+QGEIEV+
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAAN------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ

Query:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI
        WKDTEDELHTNPADG+DIHASLLHRLSTAYP  SAGMRSFNGFEFSSKSVKTNPFN ENLQIPNFVLEEPSD++ LGMPDI+QTPGVRIFM         
Subjt:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI

Query:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                               +SNQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

TrEMBL top hitse value%identityAlignment
A0A0A0KK81 Uncharacterized protein2.2e-10175.81Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQWKDTED
        MARRKAKKTVKKSS SP+ EAKDEAANDEDVERHAAAI AIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLP LSISR GEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQWKDTED

Query:  ELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSIFGGVFT
        EL TNPADGIDIHASLLHRLS AYPN SAGMRSFNGFEFSSKSVKTNPF VENLQIPNFVLEEPSDN+VLGMPDI QTPG                    
Subjt:  ELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSIFGGVFT

Query:  QNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                         VSNQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  QNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

A0A0A0KRH4 Uncharacterized protein6.4e-10173.85Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAAN------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ
        MARRKAKKTVKKSS S  R+AKDEA N      DEDVERHAAAI AIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLPSLSIS RG+QGEIEV+
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAAN------DEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ

Query:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI
        WKDTEDELHTNPADG+DIHASLLHRLSTAYP  SAGMRSFNGFEFSSKSVKTNPFN ENLQIPNFVLEEPSD++ LGMPDI+QTPGVRIFM         
Subjt:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI

Query:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                               +SNQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

A0A1S4DWB9 uncharacterized protein LOC1034894364.2e-10073.85Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAA------NDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ
        MARRKAKKTVKKSS S  R AKDEAA      +DEDVERHAAAI AIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSIS RG+QGEIEVQ
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAA------NDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ

Query:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI
        WKDTEDELHTNPADG+DIHASLLHRLSTAYP  SAGMRSFNGFEFSSKSVKTNPFN ENLQIPNFVLEEPSDN+VLGMPDILQTPG              
Subjt:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI

Query:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                               +SNQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

A0A5A7SXV4 Translation initiation factor IF-2, putative isoform 13.4e-10273.87Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAA------NDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ
        MARRKAKKTVKKSS S  R AKDEAA      +DEDVERHAAAI AIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSIS RG+QGEIEVQ
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAA------NDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ

Query:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI
        WKDTEDELHTNPADG+DIHASLLHRLSTAYP  SAGMRSFNGFEFSSKSVKTNPFN ENLQIPNFVLEEPSDN+VLGMPDILQTPG              
Subjt:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSI

Query:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHGMFI
                                               +SNQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHG+FI
Subjt:  FGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHGMFI

A0A5D3CLG2 Uncharacterized protein5.4e-10075.09Show/hide
Query:  MARRKAKKTVKKSSLSPVREAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQWKDTED
        MARRKAKKTVKKS+ SPV EAKDE ANDEDVERHAAAI AIRDVEI RLITELRLLRSYFNKEQLQTPLLQFFEEKLP LSISR GEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSSLSPVREAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQWKDTED

Query:  ELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSIFGGVFT
        EL TNPADGIDIHASLLHRLS AYPN SAGMRSFNGFEFSSKSVKTNPF VENLQIPNF LEEPSDN+VLGMPDILQTPG                    
Subjt:  ELHTNPADGIDIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSIFGGVFT

Query:  QNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
                                         V NQRLS GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  QNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G39630.1 unknown protein7.1e-4442.11Show/hide
Query:  MARRKAKKTVK-----KSSLSPVR-EAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ
        M +RKAK+ VK     ++    +R E K E   DE+VER  AAI AIRDVEIE+++T LRLLRSYF +EQL TP+L FF+E LP LSISR  E GEIE++
Subjt:  MARRKAKKTVK-----KSSLSPVR-EAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQ

Query:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPN-YSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFS
        W+D   +      +G+D++ S+L RLS  + + YS    S  G++    +VK N    +N Q+ N V +  S+N +L   D  QTPG             
Subjt:  WKDTEDELHTNPADGIDIHASLLHRLSTAYPN-YSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFS

Query:  IFGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIH
                                                V+ QRLSFGMTPKT RLPK GEM++S+HGSPLGVYKED NM AI+
Subjt:  IFGGVFTQNLWAHRRASFWILLFLLESSVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGGCGAAAGGCGAAGAAAACAGTTAAGAAGTCCAGCCTTTCACCTGTACGAGAAGCAAAGGATGAAGCAGCAAATGACGAGGATGTTGAACGGCATGCTGCTGC
AATCTGTGCCATTCGGGATGTGGAGATCGAGCGTTTGATTACTGAATTGCGGTTGCTTCGTTCGTATTTCAACAAAGAGCAATTGCAAACTCCTCTATTGCAATTTTTCG
AAGAAAAACTTCCAAGCTTGTCCATTTCGAGAAGAGGCGAACAAGGAGAAATTGAAGTACAATGGAAGGATACAGAGGATGAATTACACACCAATCCAGCTGATGGAATA
GATATACATGCTTCTCTCCTTCATCGCCTGTCCACAGCTTATCCTAACTACTCTGCTGGAATGCGATCTTTTAATGGATTTGAATTTTCCAGTAAATCAGTGAAAACAAA
TCCTTTCAATGTTGAGAACCTGCAAATCCCGAACTTCGTTTTGGAGGAGCCCTCGGATAATGTGGTTCTCGGCATGCCAGACATACTCCAAACTCCTGGGGTTCGCATCT
TTATGTTGTCTGATATAGCCATTTTTTCAATATTTGGTGGTGTTTTTACTCAAAATCTCTGGGCACATAGACGCGCTTCATTTTGGATTTTATTATTTCTACTCGAGTCT
TCAGTAAATGTTCCTATGGATGTCAATACCATGATAAATGTGAGTAACCAAAGACTGTCCTTTGGGATGACACCGAAAACCCGAAGATTGCCAAAGCCGGGTGAGATGCT
TGTGTCTATCCATGGATCCCCCCTTGGTGTTTACAAGGAAGACAACATGGAAGCAATCCATGGTATGTTCATTTAG
mRNA sequenceShow/hide mRNA sequence
TTCAATTTGAAACGAGTATTTTTCCTCCGCCCAATCTTCTTCCCCACGTCTTAACATTGGTCCTCCGCAGGATCTGAAAAGGAGGAAAAAGGAAATTTCGCGGCAGTTCA
CCGGCGTAGTTTAATCCATTTATTCCCATCTCTGGCGGTTGCACGTCTTCAAGTTTTGGAAACCCACTGAGAACTTTAATTAATTTCACTGTGCACTACATTAGGGTTTG
AAAGTCTTTATTGTCGTTTTCATTTATGAAAATCCAGTGACGAAGTGCGGATTCGTCTCAGAATGCTGCAACGTGAGTAGTCCATTAAAGTCAAGAAAGACTGGTTGCTG
GGAATGGCAAGGCGAAAGGCGAAGAAAACAGTTAAGAAGTCCAGCCTTTCACCTGTACGAGAAGCAAAGGATGAAGCAGCAAATGACGAGGATGTTGAACGGCATGCTGC
TGCAATCTGTGCCATTCGGGATGTGGAGATCGAGCGTTTGATTACTGAATTGCGGTTGCTTCGTTCGTATTTCAACAAAGAGCAATTGCAAACTCCTCTATTGCAATTTT
TCGAAGAAAAACTTCCAAGCTTGTCCATTTCGAGAAGAGGCGAACAAGGAGAAATTGAAGTACAATGGAAGGATACAGAGGATGAATTACACACCAATCCAGCTGATGGA
ATAGATATACATGCTTCTCTCCTTCATCGCCTGTCCACAGCTTATCCTAACTACTCTGCTGGAATGCGATCTTTTAATGGATTTGAATTTTCCAGTAAATCAGTGAAAAC
AAATCCTTTCAATGTTGAGAACCTGCAAATCCCGAACTTCGTTTTGGAGGAGCCCTCGGATAATGTGGTTCTCGGCATGCCAGACATACTCCAAACTCCTGGGGTTCGCA
TCTTTATGTTGTCTGATATAGCCATTTTTTCAATATTTGGTGGTGTTTTTACTCAAAATCTCTGGGCACATAGACGCGCTTCATTTTGGATTTTATTATTTCTACTCGAG
TCTTCAGTAAATGTTCCTATGGATGTCAATACCATGATAAATGTGAGTAACCAAAGACTGTCCTTTGGGATGACACCGAAAACCCGAAGATTGCCAAAGCCGGGTGAGAT
GCTTGTGTCTATCCATGGATCCCCCCTTGGTGTTTACAAGGAAGACAACATGGAAGCAATCCATGGTATGTTCATTTAG
Protein sequenceShow/hide protein sequence
MARRKAKKTVKKSSLSPVREAKDEAANDEDVERHAAAICAIRDVEIERLITELRLLRSYFNKEQLQTPLLQFFEEKLPSLSISRRGEQGEIEVQWKDTEDELHTNPADGI
DIHASLLHRLSTAYPNYSAGMRSFNGFEFSSKSVKTNPFNVENLQIPNFVLEEPSDNVVLGMPDILQTPGVRIFMLSDIAIFSIFGGVFTQNLWAHRRASFWILLFLLES
SVNVPMDVNTMINVSNQRLSFGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHGMFI