; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020203 (gene) of Snake gourd v1 genome

Gene IDTan0020203
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTranscription factor bHLH61 isoform 1
Genome locationLG11:2460785..2462501
RNA-Seq ExpressionTan0020203
SyntenyTan0020203
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022964712.1 uncharacterized protein LOC111464709 isoform X1 [Cucurbita moschata]7.8e-6690Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPM-VTVETLVKGFSINVFSEKSCQGLLVSIL
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN    HPM VTVE LVKGFSINVFSEKSCQGLLVSIL
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPM-VTVETLVKGFSINVFSEKSCQGLLVSIL

Query:  EAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        EAFEELGLNV+EARVSCTD+FQLQA  EI+EQG EA+DAQAVKEAVV+AIKSWSQ+GEQD
Subjt:  EAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

XP_022964721.1 uncharacterized protein LOC111464709 isoform X2 [Cucurbita moschata]3.2e-6790.57Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN    HPMVTVE LVKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE

Query:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        AFEELGLNV+EARVSCTD+FQLQA  EI+EQG EA+DAQAVKEAVV+AIKSWSQ+GEQD
Subjt:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]1.6e-6690.12Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHH--PM-VTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPN +SH   PM VTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHH--PM-VTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNVIEARVSCTD+FQLQAI EI+E+G EAIDAQAVKEAVVQAIKSW QSGEQD
Subjt:  ILEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]1.6e-6690.12Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHH--PMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSHA LNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPN +SH   PMVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHH--PMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNVIEARVSCTD+FQLQAI EI+E+G EAIDAQAVKEAVVQAIKSW QSGEQD
Subjt:  ILEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]6.4e-6890.68Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHH--PMVTVETLVKGFSINVFSEKSCQGLLVSI
        MVSREHKKA LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPN +SH   PMVTVE LVKGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHH--PMVTVETLVKGFSINVFSEKSCQGLLVSI

Query:  LEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        LE FEELGLNVIEARVSCTD+FQLQAI EI+E+G EAIDAQAVKEAVVQAIKSW QSGEQD
Subjt:  LEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

TrEMBL top hitse value%identityAlignment
A0A6J1C0X4 uncharacterized protein LOC1110064101.6e-6491.08Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE
        MVSREHKKA LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPN +   PMVTVETLVKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE

Query:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGE
         FEELGLNV+EARVSCTDSFQLQAIGEIDEQG EAIDAQAVKEAVVQAIKSWS+S E
Subjt:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGE

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X13.8e-6690Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPM-VTVETLVKGFSINVFSEKSCQGLLVSIL
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN    HPM VTVE LVKGFSINVFSEKSCQGLLVSIL
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPM-VTVETLVKGFSINVFSEKSCQGLLVSIL

Query:  EAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        EAFEELGLNV+EARVSCTD+FQLQA  EI+EQG EA+DAQAVKEAVV+AIKSWSQ+GEQD
Subjt:  EAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X21.5e-6790.57Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN    HPMVTVE LVKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE

Query:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        AFEELGLNV+EARVSCTD+FQLQA  EI+EQG EA+DAQAVKEAVV+AIKSWSQ+GEQD
Subjt:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X21.5e-6790.57Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN    HPMVTVE LVKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE

Query:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        AFEELGLNV+EARVSCTD+FQLQA  EI+EQG EA+DAQAVKEAVV+AIKSWSQ+GEQD
Subjt:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X13.8e-6690Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPM-VTVETLVKGFSINVFSEKSCQGLLVSIL
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN    HPM VTVE LVKGFSINVFSEKSCQGLLVSIL
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPM-VTVETLVKGFSINVFSEKSCQGLLVSIL

Query:  EAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD
        EAFEELGLNV+EARVSCTD+FQLQA  EI+EQG EA+DAQAVKEAVV+AIKSWSQ+GEQD
Subjt:  EAFEELGLNVIEARVSCTDSFQLQAIGEIDEQG-EAIDAQAVKEAVVQAIKSWSQSGEQD

SwissProt top hitse value%identityAlignment
Q9LPW3 Transcription factor SCREAM21.4e-0427.27Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMV-TVETLV--------------------
        +++   ++  L+++L +LRS+      +++ASI+ DA  Y++EL Q++     D+ T   S  P+S S HP+  T +TL                     
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMV-TVETLV--------------------

Query:  ----------KGFSINVFSEKSCQGLLVSILEAFEELGLNVIEARVSCTDSFQL
                  K  +I++F  +   GLL+S + A + LGL+V +A +SC + F L
Subjt:  ----------KGFSINVFSEKSCQGLLVSILEAFEELGLNVIEARVSCTDSFQL

Q9LSE2 Transcription factor ICE13.6e-0525.29Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDI-ATVQNSIHPNSISHHPMV-TVETLV-------------------
        +++   ++  L+++L +LRS+      +++ASI+ DA  Y++EL Q++  L+ ++ +T   S+ P S S HP+  T +TL                    
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDI-ATVQNSIHPNSISHHPMV-TVETLV-------------------

Query:  ----------KGFSINVFSEKSCQGLLVSILEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEAIDAQAVK
                  +  +I++F  +   GLL++ ++A + LGL+V +A +SC + F L     E  ++G+ I    +K
Subjt:  ----------KGFSINVFSEKSCQGLLVSILEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEAIDAQAVK

Q9LSL1 Transcription factor bHLH931.0e-0428.57Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERL---NQDIATVQNSIHPNSISH-HPMVTVETLVKG---FSINVFSEKS--
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L    Q++    NS H         +   E LV+    F I+   E +  
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERL---NQDIATVQNSIHPNSISH-HPMVTVETLVKG---FSINVFSEKS--

Query:  ---CQ---GLLVSILEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEAIDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA   E  EQ + I ++ +K+A+
Subjt:  ---CQ---GLLVSILEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEAIDAQAVKEAV

Q9LXA9 Transcription factor bHLH611.0e-0728.57Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNS-ISHHPMV--TVETLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H ++ I++  MV  +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNS-ISHHPMV--TVETLVKGFSINVFSEKSC---QG

Query:  LLVSILEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ   + ++A K+A+++
Subjt:  LLVSILEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEAIDAQAVKEAVVQ

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein5.8e-1131.33Show/hide
Query:  MVSREHKKAALHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSI
        MV+ E KK A   K   L+++T+   ++++ S+++ +A  YI  LK ++E L ++   ++ +    S+     V VE + + F + + S +  +  LV+I
Subjt:  MVSREHKKAALHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSI

Query:  LEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEAIDAQAVKEAVVQAI
        LEAFEE+GLNV +AR SC DSF ++AI     + +      + + +V+A+
Subjt:  LEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEAIDAQAVKEAVVQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)7.3e-4664.71Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE
        MVSRE K+ +L EK QLLRSITNSHA N  SII+DASKYI++LKQKVER NQD    Q+S  P      PMVTVETL KGF INVFS K+  G+LVS+LE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILE

Query:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEAIDAQAVKEAVVQAIKSWSQ
        AFE++GLNV+EAR SCTDSF L A+G  +E GE +DA+AVK+AV  AI+SW +
Subjt:  AFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEAIDAQAVKEAVVQAIKSWSQ

AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.5e-0625.29Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDI-ATVQNSIHPNSISHHPMV-TVETLV-------------------
        +++   ++  L+++L +LRS+      +++ASI+ DA  Y++EL Q++  L+ ++ +T   S+ P S S HP+  T +TL                    
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDI-ATVQNSIHPNSISHHPMV-TVETLV-------------------

Query:  ----------KGFSINVFSEKSCQGLLVSILEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEAIDAQAVK
                  +  +I++F  +   GLL++ ++A + LGL+V +A +SC + F L     E  ++G+ I    +K
Subjt:  ----------KGFSINVFSEKSCQGLLVSILEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEAIDAQAVK

AT3G56220.1 transcription regulators3.5e-4058.97Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSIL
        MVSREHK+ ++L EK  LLRSIT+SHA ++ SIIVDASKYI++LKQKVE++N   AT        S   +PMVTVETL KGF I V S K+  G+LV +L
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSIL

Query:  EAFEELGLNVIEARVSCTDSFQLQAIGEI-DEQGEAIDAQAVKEAVVQAIKSWSQS
        E FE+LGL+V+EARVSCTD+F L AIG   ++ G+ IDA+AVK+AV +AI++WS S
Subjt:  EAFEELGLNVIEARVSCTDSFQLQAIGEI-DEQGEAIDAQAVKEAVVQAIKSWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein7.2e-0928.57Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNS-ISHHPMV--TVETLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H ++ I++  MV  +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNS-ISHHPMV--TVETLVKGFSINVFSEKSC---QG

Query:  LLVSILEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ   + ++A K+A+++
Subjt:  LLVSILEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEAIDAQAVKEAVVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTCCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGCCTCGATTATAGTGGATGCATC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATTCAATTTCTCATCATCCCATGGTTACAG
TGGAAACCCTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAAAAGAGCTGCCAAGGCCTCCTTGTCTCAATATTAGAAGCCTTTGAAGAGCTGGGGCTTAATGTTATT
GAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAAGCCATTGATGCTCAAGCTGTAAAAGAAGCTGTAGTTCAAGC
TATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAA
mRNA sequenceShow/hide mRNA sequence
GTGGGAAGCTAGCAAACAGAAGATCATATAGATTTGAGGGCTCATCGTGCCCTTATAAAAACACCATCAGAGTTTTTTTTTTTTTTGCAGAGATACTACAAGAAAATAAA
AGATAGAGAAAAAAAAAGGCAAAAAAAAAAAAAAAAAAGAAAAAGAAGAAGAAGAAGAAGAAGGCTGAAAAAAGATATAAAGAGAGAGAAAGAGAGAGAAAAAAAAAAGA
AAAAGAAAAAGAAGATAGAATCCATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTCCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAG
GCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATTCAAT
TTCTCATCATCCCATGGTTACAGTGGAAACCCTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAAAAGAGCTGCCAAGGCCTCCTTGTCTCAATATTAGAAGCCTTTG
AAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAAGCCATTGATGCTCAAGCT
GTAAAAGAAGCTGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAAAAAGAAAAATCAACTCTTCAAGAAGGAGAATTT
Protein sequenceShow/hide protein sequence
MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNSISHHPMVTVETLVKGFSINVFSEKSCQGLLVSILEAFEELGLNVI
EARVSCTDSFQLQAIGEIDEQGEAIDAQAVKEAVVQAIKSWSQSGEQD