CuGenDBv2

Tan0018433 (gene) of Snake gourd v1 genome

Gene ID: Tan0018433
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UBP1-associated proteins 1C-like isoform X3
Genome location: LG09:67572066..67574086
RNA-Seq Expression: Tan0018433
Synteny: Tan0018433
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology
GenBank top hits (hit; e value; % identity; alignment)
XP_008443847.1 PREDICTED: uncharacterized protein LOC103487343 [Cucumis melo]; e value: 1.1e-63; % identity: 49.68
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG-LAFDEQFSMRLLDQRTNH
        M+FRFRAIDNK PA A+    +SD P++D S N EL KQRI+EEI +REI  RRMLEAEIRREL++++ELA+ RA  +TEG L+FD QF +R +++  N 
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG-LAFDEQFSMRLLDQRTNH

Query:  IVD-QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN-------------PIPWTSKKKPAKEE
        I+D  SS  LLAVPGS+SSLNL          KEEPKP +DE +KLI L +PDP KF  KRKA G  E A                P PW   KK AKEE
Subjt:  IVD-QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN-------------PIPWTSKKKPAKEE

Query:  FICTMCNVTAKSEITFNTHLRGKKHRTKQG--------HALQIQTREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA
        F+C+MCNV A SEI+FN H+ GKKH+ K+G        H    Q  E++  K   P K  +K           F CEIC +G P MA+M +H  GRKH+A
Subjt:  FICTMCNVTAKSEITFNTHLRGKKHRTKQG--------HALQIQTREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA

Query:  RLFKLG-KCNLDDQKE
        RL KL  +C  +DQK+
Subjt:  RLFKLG-KCNLDDQKE

XP_021660819.1 uncharacterized protein LOC110650243 [Hevea brasiliensis]; e value: 9.1e-31; % identity: 32.63
Query:  MDFRFRAIDNKPP---AAASRSGYSSDHPLR------DASQNAELV----------KQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG
        M+F+FRA+D KPP   +++S  GY S+   R      D  QN EL+          KQRI EEII +EIA RR+LEAE+RREL++++E+AM R   R  G
Subjt:  MDFRFRAIDNKPP---AAASRSGYSSDHPLR------DASQNAELV----------KQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG

Query:  LAFDEQFSMRL--------LDQRTNHIVDQSS----RGLL-AVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEA
        L+ +E+ +MRL        ++Q  N  ++  S     G+    P  S +L L  V+P  E          D KDKLI L KP+     AKRKA  P E+ 
Subjt:  LAFDEQFSMRL--------LDQRTNHIVDQSS----RGLL-AVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEA

Query:  DTNPIPWTSKKKPAKEEFICTMCNVTAKSEITFNTHLRGKKHRTKQ--------------------------------GHALQIQTREEVVHKQPSPAKP
            +P    KK  KEE+ CT+CNV+A SE   N HL+GK+H+ K+                                G  L+ +   E++  + +    
Subjt:  DTNPIPWTSKKKPAKEEFICTMCNVTAKSEITFNTHLRGKKHRTKQ--------------------------------GHALQIQTREEVVHKQPSPAKP

Query:  EEKGEEKNNKGS----------------------------------FTFWCEICQIGTPNMAIMETHYKGRKHRARLFKL
        ++K E K + G+                                  F FWCE+CQIG  +  +ME H KG+KH+ +L +L
Subjt:  EEKGEEKNNKGS----------------------------------FTFWCEICQIGTPNMAIMETHYKGRKHRARLFKL

XP_022926958.1 uncharacterized protein LOC111433915 [Cucurbita moschata]; e value: 5.0e-37; % identity: 78.33
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI
        +DFR RA+DNKP  AAS S  SSDHPLRD S NAELVKQRI+EEI  RE ASRRMLEAEIRRELII+QEL++ RAT RTEGLAFDE F+MR+LD R NHI
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI

Query:  VDQ-SSRGLLAVPGSSSSLN
        VDQ SSRGLLAVPGS SSLN
Subjt:  VDQ-SSRGLLAVPGSSSSLN

XP_022965220.1 uncharacterized protein LOC111465139 isoform X1 [Cucurbita maxima]; e value: 1.5e-30; % identity: 71.67
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI
        MDF  RA+DN+P  AAS S  SSDHPLRD S NAELVKQRI+ +I  REIASRRMLEAE R ELII+QEL++ RAT  TEGLAFDE F MR+LD R N I
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI

Query:  VDQ-SSRGLLAVPGSSSSLN
        VDQ SSR LLA PGS SSLN
Subjt:  VDQ-SSRGLLAVPGSSSSLN

XP_038880353.1 zinc finger protein 385B [Benincasa hispida]; e value: 3.6e-80; % identity: 59.33
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQ-RTNH
        MDFRFRA DNK PA A+ +   S   L   S NAEL+KQR+++EI+IREIASRRMLEAEIRRELII+QELA  R   RTEGL FD+QFS+RLLDQ R NH
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQ-RTNH

Query:  IVDQSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN---PIPWTSKKKPAKEEFICTMCNVTAK
         +    RGLL VPGSSSS   LPV P P PQ EEPKP DD+K+KLI LPKPDP KF+ KRKA G + E DT+   P  W S KK AKEEF+C+MCNV   
Subjt:  IVDQSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN---PIPWTSKKKPAKEEFICTMCNVTAK

Query:  SEITFNTHLRGKKHRTKQGHALQIQ---TREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRARLFKLG-KCNLDDQKE
        SEI+FN HL+GKKH  K+G +LQ Q     E++  K  +  K  +K     NK  F FWC+IC+IGTP MAIM +H  G+KH+ARL KL  +  LDDQK+
Subjt:  SEITFNTHLRGKKHRTKQGHALQIQ---TREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRARLFKLG-KCNLDDQKE

TrEMBL top hits (hit; e value; % identity; alignment)
A0A1S3B9S7 uncharacterized protein LOC103487343; e value: 5.1e-64; % identity: 49.68
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG-LAFDEQFSMRLLDQRTNH
        M+FRFRAIDNK PA A+    +SD P++D S N EL KQRI+EEI +REI  RRMLEAEIRREL++++ELA+ RA  +TEG L+FD QF +R +++  N 
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG-LAFDEQFSMRLLDQRTNH

Query:  IVD-QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN-------------PIPWTSKKKPAKEE
        I+D  SS  LLAVPGS+SSLNL          KEEPKP +DE +KLI L +PDP KF  KRKA G  E A                P PW   KK AKEE
Subjt:  IVD-QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN-------------PIPWTSKKKPAKEE

Query:  FICTMCNVTAKSEITFNTHLRGKKHRTKQG--------HALQIQTREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA
        F+C+MCNV A SEI+FN H+ GKKH+ K+G        H    Q  E++  K   P K  +K           F CEIC +G P MA+M +H  GRKH+A
Subjt:  FICTMCNVTAKSEITFNTHLRGKKHRTKQG--------HALQIQTREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA

Query:  RLFKLG-KCNLDDQKE
        RL KL  +C  +DQK+
Subjt:  RLFKLG-KCNLDDQKE

A0A5D3B800 UBP1-associated proteins 1C-like isoform X3; e value: 5.1e-64; % identity: 49.68
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG-LAFDEQFSMRLLDQRTNH
        M+FRFRAIDNK PA A+    +SD P++D S N EL KQRI+EEI +REI  RRMLEAEIRREL++++ELA+ RA  +TEG L+FD QF +R +++  N 
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG-LAFDEQFSMRLLDQRTNH

Query:  IVD-QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN-------------PIPWTSKKKPAKEE
        I+D  SS  LLAVPGS+SSLNL          KEEPKP +DE +KLI L +PDP KF  KRKA G  E A                P PW   KK AKEE
Subjt:  IVD-QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTN-------------PIPWTSKKKPAKEE

Query:  FICTMCNVTAKSEITFNTHLRGKKHRTKQG--------HALQIQTREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA
        F+C+MCNV A SEI+FN H+ GKKH+ K+G        H    Q  E++  K   P K  +K           F CEIC +G P MA+M +H  GRKH+A
Subjt:  FICTMCNVTAKSEITFNTHLRGKKHRTKQG--------HALQIQTREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA

Query:  RLFKLG-KCNLDDQKE
        RL KL  +C  +DQK+
Subjt:  RLFKLG-KCNLDDQKE

A0A5N6RXK0 Uncharacterized protein; e value: 1.1e-29; % identity: 32.45
Query:  MDFRFRAIDNKPPAA---ASRSGYSSDHPLRDASQ-----------NAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFR--ATTRTEGLAF
        M+F+FRA+D +PP      S   Y +DH LR                 EL K RI EE+I+ +IA RR+LEAE+R  L+I++E+A+ R  AT   EGL+ 
Subjt:  MDFRFRAIDNKPPAA---ASRSGYSSDHPLRDASQ-----------NAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFR--ATTRTEGLAF

Query:  DEQFSMRLLDQRTNHIVDQSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDE------KDKLIFLPKPDPEKFKAKRKAGGPSEEADTNPIPWTSK
        +E+ +MR  D R   ++        A PG   +++  P+     P+  E  PLD +      KDKLI L KPDP     KRKA  P   +     P+  K
Subjt:  DEQFSMRLLDQRTNHIVDQSSRGLLAVPGSSSSLNLLPVRPFPEPQKEEPKPLDDE------KDKLIFLPKPDPEKFKAKRKAGGPSEEADTNPIPWTSK

Query:  KKPAKEEFICTMCNVTAKSEITFNTHLRGKKHRTKQGHALQIQTREEVVHKQPSP---AKP--------------------EEKGEEKNNKGS-------
        K+P KEE+ C +C V+A SE  FN HL+GKKH+ K+   L+ QT  ++ +  P P    KP                    E   + K   GS       
Subjt:  KKPAKEEFICTMCNVTAKSEITFNTHLRGKKHRTKQGHALQIQTREEVVHKQPSP---AKP--------------------EEKGEEKNNKGS-------

Query:  -------------------------------------------FTFWCEICQIGTPNMAIMETHYKGRKHRARLFKLGK
                                                   F FWCE+CQIG  +  +ME H  G+KHR+RL +LG+
Subjt:  -------------------------------------------FTFWCEICQIGTPNMAIMETHYKGRKHRARLFKLGK

A0A6J1EJN4 uncharacterized protein LOC111433915; e value: 2.4e-37; % identity: 78.33
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI
        +DFR RA+DNKP  AAS S  SSDHPLRD S NAELVKQRI+EEI  RE ASRRMLEAEIRRELII+QEL++ RAT RTEGLAFDE F+MR+LD R NHI
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI

Query:  VDQ-SSRGLLAVPGSSSSLN
        VDQ SSRGLLAVPGS SSLN
Subjt:  VDQ-SSRGLLAVPGSSSSLN

A0A6J1HN47 uncharacterized protein LOC111465139 isoform X1; e value: 7.5e-31; % identity: 71.67
Query:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI
        MDF  RA+DN+P  AAS S  SSDHPLRD S NAELVKQRI+ +I  REIASRRMLEAE R ELII+QEL++ RAT  TEGLAFDE F MR+LD R N I
Subjt:  MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHI

Query:  VDQ-SSRGLLAVPGSSSSLN
        VDQ SSR LLA PGS SSLN
Subjt:  VDQ-SSRGLLAVPGSSSSLN

SwissProt top hits (hit; e value; % identity; alignment)
No hits found

Arabidopsis top hits (hit; e value; % identity; alignment)
AT2G24030.1 zinc ion binding; nucleic acid binding; e value: 6.3e-14; % identity: 29.41
Query:  MDFRFRAID-NKPPAAA------------------SRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG
        M+FR+RAID N+PP A                   S  G  S+  +R+A +  E+ K++I +EIII E A +R L AE+ +E+ I++E+A+ R  + TE 
Subjt:  MDFRFRAID-NKPPAAA------------------SRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEG

Query:  LAFDEQFSM-----RLLDQRTNHIVD-----QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEE----PKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEE
        ++ +E+ +M     +L +Q  N   +      S    L   GS +SL   P+   P+ Q+         L+  K+ LI L + D    K K  + G  + 
Subjt:  LAFDEQFSM-----RLLDQRTNHIVD-----QSSRGLLAVPGSSSSLNLLPVRPFPEPQKEE----PKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEE

Query:  ADTNPIP---WTSKKKPAKEEFI----CTMCNVTAKSE---ITFNTHLRGKKHRTKQGH--ALQIQTREEVVHKQPSPAK-PEEKGEEKNNKGSFTFWCE
             +P    TS  +  KE+FI            K+E      N  L+ K+ + K+    A+ ++T E V  K P   K    K  E   + +  FWCE
Subjt:  ADTNPIP---WTSKKKPAKEEFI----CTMCNVTAKSE---ITFNTHLRGKKHRTKQGH--ALQIQTREEVVHKQPSPAK-PEEKGEEKNNKGSFTFWCE

Query:  ICQIGTPNMAIMETHYKGRKHRA
        IC++GT    +M  H  G+KH+A
Subjt:  ICQIGTPNMAIMETHYKGRKHRA

AT2G24030.2 zinc ion binding; nucleic acid binding; e value: 9.1e-05; % identity: 28.57
Query:  GSSSSLNLLPVRPFPEPQKEE----PKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTNPIP---WTSKKKPAKEEFI----CTMCNVTAKSE---
        GS +SL   P+   P+ Q+         L+  K+ LI L + D    K K  + G  +      +P    TS  +  KE+FI            K+E   
Subjt:  GSSSSLNLLPVRPFPEPQKEE----PKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTNPIP---WTSKKKPAKEEFI----CTMCNVTAKSE---

Query:  ITFNTHLRGKKHRTKQGH--ALQIQTREEVVHKQPSPAK-PEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA
           N  L+ K+ + K+    A+ ++T E V  K P   K    K  E   + +  FWCEIC++GT    +M  H  G+KH+A
Subjt:  ITFNTHLRGKKHRTKQGH--ALQIQTREEVVHKQPSPAK-PEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRA

AT5G61190.1 putative endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain; e value: 6.5e-11; % identity: 29.12
Query:  KAKRKAGGPSEEA-------DTNPIPWTSKKKPAKEEFICTMCNVTAKSEITFNTHLRGKKHRT--KQGHALQIQTR--EEVVHKQPSPA----------
        K+K    GP+E +       + N       +  A  EF+C MCNV  +S+I FN+HLRGKKH     Q  AL + T+  E+ V ++  P+          
Subjt:  KAKRKAGGPSEEA-------DTNPIPWTSKKKPAKEEFICTMCNVTAKSEITFNTHLRGKKHRT--KQGHALQIQTR--EEVVHKQPSPA----------

Query:  -------------KPEEKGEEKNNK---------GSFTFWCEICQIGTPNMAIMETHYKGRKHRARLFKLGKCNLDDQKEPQ
                     K  EKG+E   +          S  + C +C +G  +  + ETH +G+KH A L +     L D K+ Q
Subjt:  -------------KPEEKGEEKNNK---------GSFTFWCEICQIGTPNMAIMETHYKGRKHRARLFKLGKCNLDDQKEPQ


Sequences
CDS sequence
ATGGATTTCAGGTTCCGAGCCATCGATAACAAGCCACCGGCCGCCGCTTCCCGTTCCGGATACTCCTCCGATCACCCGCTGCGAGATGCTTCTCAAAATGCGGAGCTCGT
GAAACAGAGGATTGAAGAAGAGATAATAATTAGAGAGATTGCGAGCCGCAGAATGCTCGAGGCGGAGATTAGGAGGGAGCTCATTATCCAGCAAGAACTAGCGATGTTTA
GGGCTACAACCCGGACGGAGGGGCTAGCATTTGACGAGCAATTTTCCATGCGATTGTTGGACCAGAGGACGAATCACATTGTCGATCAGTCTTCCAGAGGTTTATTGGCA
GTTCCAGGTTCCAGTTCTTCGCTAAACTTGTTGCCAGTTCGTCCATTTCCGGAGCCTCAAAAGGAAGAACCAAAGCCTTTGGATGATGAAAAGGACAAGTTAATCTTTCT
GCCGAAGCCAGACCCAGAAAAATTCAAAGCGAAGAGGAAAGCTGGGGGACCATCAGAGGAGGCAGATACAAACCCAATTCCTTGGACCAGTAAGAAGAAACCAGCCAAGG
AAGAGTTTATTTGTACAATGTGCAACGTTACAGCCAAAAGTGAAATTACATTCAATACACACTTAAGAGGCAAGAAGCACAGAACCAAACAGGGACATGCCCTACAAATA
CAAACTAGAGAAGAAGTAGTCCACAAGCAACCAAGCCCAGCGAAACCGGAAGAGAAAGGGGAAGAAAAAAATAATAAGGGCAGCTTTACATTCTGGTGCGAAATCTGCCA
AATTGGAACTCCAAATATGGCCATTATGGAGACACATTACAAAGGGAGGAAGCATAGGGCTCGTCTGTTCAAACTTGGTAAATGCAATTTGGACGACCAAAAGGAACCGC
AGGGGCTGGTTTCGACCACCTAA
mRNA sequence
ATGGATTTCAGGTTCCGAGCCATCGATAACAAGCCACCGGCCGCCGCTTCCCGTTCCGGATACTCCTCCGATCACCCGCTGCGAGATGCTTCTCAAAATGCGGAGCTCGT
GAAACAGAGGATTGAAGAAGAGATAATAATTAGAGAGATTGCGAGCCGCAGAATGCTCGAGGCGGAGATTAGGAGGGAGCTCATTATCCAGCAAGAACTAGCGATGTTTA
GGGCTACAACCCGGACGGAGGGGCTAGCATTTGACGAGCAATTTTCCATGCGATTGTTGGACCAGAGGACGAATCACATTGTCGATCAGTCTTCCAGAGGTTTATTGGCA
GTTCCAGGTTCCAGTTCTTCGCTAAACTTGTTGCCAGTTCGTCCATTTCCGGAGCCTCAAAAGGAAGAACCAAAGCCTTTGGATGATGAAAAGGACAAGTTAATCTTTCT
GCCGAAGCCAGACCCAGAAAAATTCAAAGCGAAGAGGAAAGCTGGGGGACCATCAGAGGAGGCAGATACAAACCCAATTCCTTGGACCAGTAAGAAGAAACCAGCCAAGG
AAGAGTTTATTTGTACAATGTGCAACGTTACAGCCAAAAGTGAAATTACATTCAATACACACTTAAGAGGCAAGAAGCACAGAACCAAACAGGGACATGCCCTACAAATA
CAAACTAGAGAAGAAGTAGTCCACAAGCAACCAAGCCCAGCGAAACCGGAAGAGAAAGGGGAAGAAAAAAATAATAAGGGCAGCTTTACATTCTGGTGCGAAATCTGCCA
AATTGGAACTCCAAATATGGCCATTATGGAGACACATTACAAAGGGAGGAAGCATAGGGCTCGTCTGTTCAAACTTGGTAAATGCAATTTGGACGACCAAAAGGAACCGC
AGGGGCTGGTTTCGACCACCTAA
Protein sequence
MDFRFRAIDNKPPAAASRSGYSSDHPLRDASQNAELVKQRIEEEIIIREIASRRMLEAEIRRELIIQQELAMFRATTRTEGLAFDEQFSMRLLDQRTNHIVDQSSRGLLA
VPGSSSSLNLLPVRPFPEPQKEEPKPLDDEKDKLIFLPKPDPEKFKAKRKAGGPSEEADTNPIPWTSKKKPAKEEFICTMCNVTAKSEITFNTHLRGKKHRTKQGHALQI
QTREEVVHKQPSPAKPEEKGEEKNNKGSFTFWCEICQIGTPNMAIMETHYKGRKHRARLFKLGKCNLDDQKEPQGLVSTT