; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001015 (gene) of Snake gourd v1 genome

Gene IDTan0001015
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG08:17243434..17245510
RNA-Seq ExpressionTan0001015
SyntenyTan0001015
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032020.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-7042.58Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL++VLAKKHELM+TA EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL+M+ +FN+AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------
        CFHCN++G+WKRNCPK+LAE K                                                                              
Subjt:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLE
                                                                              +GYPK TRG  FYDPK+N+V VSTNATFLE
Subjt:  ---------------------------------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLE

Query:  KDHIRDHLPRSKIVLNEM
        KDHIR+H P +KIVLNE+
Subjt:  KDHIRDHLPRSKIVLNEM

KAA0050670.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-7043.91Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+AS +EVLAKKHE M+T +EIM+S Q+MFGQ S+Q++HD+L +++N RM EG  VRE+VL+MM +FN+AEMNGA IDE+SQVSFILE+LL+
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRG T GTK +  S    K K KKG    K +   A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------
        CFHCN++G+WKRNCPK+LAE+K                                                                              
Subjt:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------

Query:  ---------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM
                                                      +GY K TRGG FYDPK+N+V VSTNATFLE+DHIR+H P SKIVLN++
Subjt:  ---------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-6662.77Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL+EVLAKKHE M+TA+EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL+MM +FN+AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERKNQGYPKETR
        CFHCN++G+WKRNCPK+LAE+K     K T+
Subjt:  CFHCNEDGYWKRNCPKFLAERKNQGYPKETR

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-1368.25Show/hide
Query:  NCPKFLAERKN----QGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM
        N PK L  R       GYPK TRGG FYDPK+N+V VSTNATFLE+DHIR+H PRSKIVLNE+
Subjt:  NCPKFLAERKN----QGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]7.0e-7052.88Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL+EVLAKKHE M+TA+EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL++M +FN+AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVK-------
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K       
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVK-------

Query:  -----------------EVAEKGKCFHCNEDGYWK---------RNC--PKFLAE------RKNQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRD
                         ++      +    + +W           NC   K ++E        ++GYPK TRGG FYDPK+N+V VSTNATFLE+DHIR+
Subjt:  -----------------EVAEKGKCFHCNEDGYWK---------RNC--PKFLAE------RKNQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRD

Query:  HLPRSKIVLNEM
        H PRSKIVLNE+
Subjt:  HLPRSKIVLNEM

KAA0066490.1 gag/pol protein [Cucumis melo var. makuwa]6.3e-7142.32Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL+EVLAKKHE ++TA+EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL+MM +F++AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERKNQ----------------------------------------------------------------------------
        CFHCN++G+WKRNCPK+LAE+K                                                                              
Subjt:  CFHCNEDGYWKRNCPKFLAERKNQ----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------GYPKETRGGLFYDPKENRVLVSTN
                                                                                    GYPK TRGG FYDPK+N+V VSTN
Subjt:  ----------------------------------------------------------------------------GYPKETRGGLFYDPKENRVLVSTN

Query:  ATFLEKDHIRDHLPRSKIVLNEM
        ATFLE+DHIR+H PRSKIVLNE+
Subjt:  ATFLEKDHIRDHLPRSKIVLNEM

TrEMBL top hitse value%identityAlignment
A0A5A7SMN4 Gag/pol protein1.5e-7042.58Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL++VLAKKHELM+TA EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL+M+ +FN+AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------
        CFHCN++G+WKRNCPK+LAE K                                                                              
Subjt:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLE
                                                                              +GYPK TRG  FYDPK+N+V VSTNATFLE
Subjt:  ---------------------------------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLE

Query:  KDHIRDHLPRSKIVLNEM
        KDHIR+H P +KIVLNE+
Subjt:  KDHIRDHLPRSKIVLNEM

A0A5A7U676 Gag/pol protein8.9e-7143.91Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+AS +EVLAKKHE M+T +EIM+S Q+MFGQ S+Q++HD+L +++N RM EG  VRE+VL+MM +FN+AEMNGA IDE+SQVSFILE+LL+
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRG T GTK +  S    K K KKG    K +   A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------
        CFHCN++G+WKRNCPK+LAE+K                                                                              
Subjt:  CFHCNEDGYWKRNCPKFLAERK------------------------------------------------------------------------------

Query:  ---------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM
                                                      +GY K TRGG FYDPK+N+V VSTNATFLE+DHIR+H P SKIVLN++
Subjt:  ---------------------------------------------NQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM

A0A5A7U869 Gag/pol protein6.0e-6762.77Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL+EVLAKKHE M+TA+EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL+MM +FN+AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERKNQGYPKETR
        CFHCN++G+WKRNCPK+LAE+K     K T+
Subjt:  CFHCNEDGYWKRNCPKFLAERKNQGYPKETR

A0A5A7U869 Gag/pol protein1.0e-1368.25Show/hide
Query:  NCPKFLAERKN----QGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM
        N PK L  R       GYPK TRGG FYDPK+N+V VSTNATFLE+DHIR+H PRSKIVLNE+
Subjt:  NCPKFLAERKN----QGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEM

A0A5A7V6N0 Gag/pol protein3.4e-7052.88Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL+EVLAKKHE M+TA+EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL++M +FN+AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVK-------
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K       
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVK-------

Query:  -----------------EVAEKGKCFHCNEDGYWK---------RNC--PKFLAE------RKNQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRD
                         ++      +    + +W           NC   K ++E        ++GYPK TRGG FYDPK+N+V VSTNATFLE+DHIR+
Subjt:  -----------------EVAEKGKCFHCNEDGYWK---------RNC--PKFLAE------RKNQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRD

Query:  HLPRSKIVLNEM
        H PRSKIVLNE+
Subjt:  HLPRSKIVLNEM

A0A5A7VH46 Gag/pol protein3.1e-7142.32Show/hide
Query:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK
        ANEKA+ YI+ASL+EVLAKKHE ++TA+EIM+S Q+MFGQ S+Q++HD+LK+++N RM EG  VRE+VL+MM +F++AEMNGA IDE+SQVSFILE+L +
Subjt:  ANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLK

Query:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK
        SFLQFRSN VMNKI+YTLTTLLNE Q F+SLM+I+  + EANVA   R +HRGSTSGTK +  S    K K KKG    K + A A+  KK K  A KG 
Subjt:  SFLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVA--YRSYHRGSTSGTKPIAPSRPKGKKKMKKG----KVDRATAQKGKKVKEVAEKGK

Query:  CFHCNEDGYWKRNCPKFLAERKNQ----------------------------------------------------------------------------
        CFHCN++G+WKRNCPK+LAE+K                                                                              
Subjt:  CFHCNEDGYWKRNCPKFLAERKNQ----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------GYPKETRGGLFYDPKENRVLVSTN
                                                                                    GYPK TRGG FYDPK+N+V VSTN
Subjt:  ----------------------------------------------------------------------------GYPKETRGGLFYDPKENRVLVSTN

Query:  ATFLEKDHIRDHLPRSKIVLNEM
        ATFLE+DHIR+H PRSKIVLNE+
Subjt:  ATFLEKDHIRDHLPRSKIVLNEM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATACGATTGATGGATCGATCGGGGCCAATGAAAAGGCCAAGGTCTACATCATTGCCAGTTTAACTGAAGTATTGGCAAAGAAACACGAGTTGATGATCACCGCTAA
GGAGATCATGGAGTCGTTTCAAGACATGTTTGGACAACAGTCCTTTCAGGTCAGACACGATTCGCTCAAACACGTCTTCAACGTCCGGATGAAAGAAGGGATGTTTGTCC
GAGAATACGTTCTAGACATGATGACCTACTTTAATCTAGCGGAGATGAACGGGGCTTCGATCGACGAGTCGAGCCAGGTCAGCTTTATTCTGGAAACTCTTCTGAAGAGT
TTCCTTCAGTTTCGTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTGACTACCCTCCTCAATGAGCAACAGAATTTCCAGTCCTTGATGAGGATCAGGACATCGGA
AGTTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACTAAACCTATTGCTCCTTCTCGCCCGAAAGGGAAGAAGAAGATGAAGAAGGGTAAAG
TTGACCGAGCTACCGCCCAAAAGGGCAAGAAGGTCAAGGAAGTTGCAGAAAAAGGAAAGTGTTTCCACTGCAATGAGGATGGTTACTGGAAGAGAAATTGTCCCAAGTTC
CTTGCCGAGAGGAAGAATCAAGGTTATCCAAAAGAGACTAGGGGTGGTCTATTTTACGATCCTAAGGAAAATAGGGTGCTTGTATCGACAAACGCCACTTTTCTTGAGAA
AGACCACATCAGGGATCATTTGCCTAGGAGCAAAATCGTGTTGAACGAAATGGACAGTACGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATACGATTGATGGATCGATCGGGGCCAATGAAAAGGCCAAGGTCTACATCATTGCCAGTTTAACTGAAGTATTGGCAAAGAAACACGAGTTGATGATCACCGCTAA
GGAGATCATGGAGTCGTTTCAAGACATGTTTGGACAACAGTCCTTTCAGGTCAGACACGATTCGCTCAAACACGTCTTCAACGTCCGGATGAAAGAAGGGATGTTTGTCC
GAGAATACGTTCTAGACATGATGACCTACTTTAATCTAGCGGAGATGAACGGGGCTTCGATCGACGAGTCGAGCCAGGTCAGCTTTATTCTGGAAACTCTTCTGAAGAGT
TTCCTTCAGTTTCGTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTGACTACCCTCCTCAATGAGCAACAGAATTTCCAGTCCTTGATGAGGATCAGGACATCGGA
AGTTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACTAAACCTATTGCTCCTTCTCGCCCGAAAGGGAAGAAGAAGATGAAGAAGGGTAAAG
TTGACCGAGCTACCGCCCAAAAGGGCAAGAAGGTCAAGGAAGTTGCAGAAAAAGGAAAGTGTTTCCACTGCAATGAGGATGGTTACTGGAAGAGAAATTGTCCCAAGTTC
CTTGCCGAGAGGAAGAATCAAGGTTATCCAAAAGAGACTAGGGGTGGTCTATTTTACGATCCTAAGGAAAATAGGGTGCTTGTATCGACAAACGCCACTTTTCTTGAGAA
AGACCACATCAGGGATCATTTGCCTAGGAGCAAAATCGTGTTGAACGAAATGGACAGTACGTAA
Protein sequenceShow/hide protein sequence
MHTIDGSIGANEKAKVYIIASLTEVLAKKHELMITAKEIMESFQDMFGQQSFQVRHDSLKHVFNVRMKEGMFVREYVLDMMTYFNLAEMNGASIDESSQVSFILETLLKS
FLQFRSNVVMNKISYTLTTLLNEQQNFQSLMRIRTSEVEANVAYRSYHRGSTSGTKPIAPSRPKGKKKMKKGKVDRATAQKGKKVKEVAEKGKCFHCNEDGYWKRNCPKF
LAERKNQGYPKETRGGLFYDPKENRVLVSTNATFLEKDHIRDHLPRSKIVLNEMDST