CuGenDBv2

Tan0012931 (gene) of Snake gourd v1 genome

Gene ID: Tan0012931
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CACTA en-spm transposon protein
Genome location: LG04:10446771..10448254
RNA-Seq Expression: Tan0012931
Synteny: Tan0012931
Gene Ontology terms: NA
InterPro domains: IPR004252 - Probable transposase, Ptta/En/Spm, plant
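
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts as shown below. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "LG04:10446771..10448254".
_LOCATION_RE = re.compile(r"^([^:]+):(\d+)\.\.(\d+)$")


def parse_location(text: str) -> Location:
    """Split 'seqid:start..end' into its three components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG04:10446771..10448254")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1484 bp on LG04.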


Homology
GenBank top hits
XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] (e value: 1.2e-84; % identity: 46.78)
Query:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT
        + H  Q+ +   +L+ ++   +  ST +  T AS         SR   RR RGHSR +EL+R+VN HGRI IEIDE+VGK VC+ AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP
        IPL  K WSDV K+VRD + D+L                            +YF  D+ K H+ +YV + + +TFKEYR++LY++Y  F DPKEAR CPP
Subjt:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP

Query:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQE
        +RI +  DWN+LC+RWET EWK+ TE NKKSR+ +P+ HRT SKSFVQV                     HF  KDGWVN++A+DAYL+MQ+L+EAS QE
Subjt:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQE

Query:  GSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-E
           P+S  EVCK VLG RSG+IKGLG +   SSSSSVTS  Q +KE EKK+E+M+ E+                LT++LS WE RWAE    +   QG +
Subjt:  GSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-E

Query:  GSSN
        G SN
Subjt:  GSSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] (e value: 8.1e-89; % identity: 50.13)
Query:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT
        + H  Q+ +   +L+ ++   +  ST +  T AS         SR   RR RGHSR +EL+R+VN HGRI IEIDE+VGK VC+ AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDNIKDRLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEK
        IPL  K WSDV K+VRD + D+L +YF  D+ K H+ +YV + + +TFKEYR++LY++Y  F DPKEAR CPP+RI +  DWN+LC+RWET EWK+ TE 
Subjt:  IPLHYKTWSDVPKQVRDNIKDRLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEK

Query:  NKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGW
        NKKSR+ +P+ HRT SKSFVQV                     HF  KDGWVN++A+DAYL+MQ+L+EAS QE   P+S  EVCK VLG RSG+IKGLG 
Subjt:  NKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGW

Query:  DLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN
        +   SSSSSVTS  Q +KE EKK+E+M+ E+                LT++LS WE RWAE    +   QG +G SN
Subjt:  DLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida] (e value: 2.1e-68; % identity: 41.09)
Query:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT
        + H  Q+ +   +L+ ++   +  ST +  T AS         SR   RR RGHSR +EL+R+VN HGRI IEIDE+VGK VC+ AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP
        IPL  K WSDV K+VRD + D+L                            +YF  D+ K H+ +YV + + +TFKEYR++LY++Y  F DPKEAR CPP
Subjt:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP

Query:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQE
        +RI +  DWN+LC+RWET EWK+ TE NKKSR+ +P+ HRT SKSFVQV                     HF  KDGWVN++A+DAYL+MQ+L+EAS QE
Subjt:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQE

Query:  GSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-E
           P+                                 SS + +KE EKK+E+M+ E+                LT++LS WE RWAE    +   QG +
Subjt:  GSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-E

Query:  GSSN
        G SN
Subjt:  GSSN

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida] (e value: 3.9e-59; % identity: 45.64)
Query:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT
        + H  Q+ +   +L+ ++   +  ST +  T AS         SR   RR RGHSR +EL+R+VN HGRI IEIDE+VGK VC+ AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP
        IPL  K WSDV K+VRD + D+L                            +YF  D+ K H+ +YV + + +TFKEYR++LY++Y  F DPKEAR CPP
Subjt:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP

Query:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAY
        +RI +  DWN+LC+RWET EWK+ TE NKKSR+ +P+ HRT SKSFVQV                     HF  KDGWVN++A+DAY
Subjt:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAY

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] (e value: 1.2e-84; % identity: 46.78)
Query:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT
        + H  Q+ +   +L+ ++   +  ST +  T AS         SR   RR RGHSR +EL+R+VN HGRI IEIDE+VGK VC+ AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRAS-------VASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP
        IPL  K WSDV K+VRD + D+L                            +YF  D+ K H+ +YV + + +TFKEYR++LY++Y  F DPKEAR CPP
Subjt:  IPLHYKTWSDVPKQVRDNIKDRL---------------------------STYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPP

Query:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQE
        +RI +  DWN+LC+RWET EWK+ TE NKKSR+ +P+ HRT SKSFVQV                     HF  KDGWVN++A+DAYL+MQ+L+EAS QE
Subjt:  ERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQE

Query:  GSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-E
           P+S  EVCK VLG RSG+IKGLG +   SSSSSVTS  Q +KE EKK+E+M+ E+                LT++LS WE RWAE    +   QG +
Subjt:  GSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-E

Query:  GSSN
        G SN
Subjt:  GSSN

TrEMBL top hits
A0A5A7SPZ3 Transposase (e value: 1.4e-46; % identity: 35.95)
Query:  RSTAQH----RTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIKDRLSTY
        R + QH      R+   ++ R R  RG+ R IEL+++V  HG++ IEI+E+ GK V + A   +  IGT  R+TI L  + W  +P  V++ + DR  T+
Subjt:  RSTAQH----RTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIKDRLSTY

Query:  FIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV----
        F  D T   + +Y++  + + F+E+RA L++YY +FDD  EAR  PP++I +  DWNM+CDRWET  WK+  E NK+SR+ +  NH   +KSF+QV    
Subjt:  FIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV----

Query:  -----------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQ
                         HF  K+GW+ND A+DAY    +++  S++ G + IS ++ CK+VLG+ S  I  L      S  S+V+S+ + EK     +++
Subjt:  -----------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQ

Query:  MQAEIGTLTTKLSSWEERWAEFTKYMDERQG
        +  +   LT +L+ WE+RW +  K +  R G
Subjt:  MQAEIGTLTTKLSSWEERWAEFTKYMDERQG

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class (e value: 2.0e-45; % identity: 37.7)
Query:  LEEENALEVNRSTAQHRTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIK
        +E+ N   V+ ST    + + + ++ RGR  RG+ R IEL+++V  HG+I IEI+E+ GK V + A   +  IGT  R+TIPL  + W  VP  VR+ + 
Subjt:  LEEENALEVNRSTAQHRTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIK

Query:  DRLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFV
        DRL T+F  D T   + +Y+E  + + F+E+RA+L++YY +FDD  EAR  PP RI    DWNM+CDRWET  W       KK   ++        + F 
Subjt:  DRLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFV

Query:  QVHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEK-KVEQMQAEIGTLTTKLSS
        + HF  K+GW+ND A+DAYL+MQ+++  S++ G + IS ++ C+ VLG+RS         +N  S  S+ S+    +E EK ++  ++     LT +L+ 
Subjt:  QVHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEK-KVEQMQAEIGTLTTKLSS

Query:  WEERW
        WE+ +
Subjt:  WEERW

A0A5A7TRX4 DUF4216 domain-containing protein (e value: 3.4e-48; % identity: 37.54)
Query:  EEENALEVNRSTAQHRTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIKD
        E ++   ++       T    +S S GR  RG+ R IEL+++V  HG+I IEI+E+ GK V + A   +  IGT  R+TIPL  + W  VP  VR+ + D
Subjt:  EEENALEVNRSTAQHRTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIKD

Query:  RLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQ
         L T+F  D T   + +Y+E  + +TF+E+RA+L++YY +FDD  EAR  P  RI +  DWNM+CDRWET  WK+  E NK+S + +  NH T +KSF+Q
Subjt:  RLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQ

Query:  V---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHE
        V                     HF  K+GW ND A+DAYL+MQ+++  S++ G + IS ++ C+ VLG+RS         +N  S  S+ S+    +E E
Subjt:  V---------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHE

Query:  K-KVEQMQAEIGTLTTKLSSWEERW
        K ++  ++     LT +L+ WE+ +
Subjt:  K-KVEQMQAEIGTLTTKLSSWEERW

A0A5A7US78 Uncharacterized protein (e value: 9.2e-46; % identity: 37.38)
Query:  LEEENALEVNRSTAQHRTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIK
        +E+ N   ++ ST    + + + ++ RGR  RG+ R IEL+++V  HG+I IEI+E+ GK V +     +  IGT  R+TIPL  + W  VP  VR+ + 
Subjt:  LEEENALEVNRSTAQHRTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDNIK

Query:  DRLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFV
        D L T+F  D T   + +Y+E  + +TF+E+RA L++YY +FDD  EAR  PP RI +  DWNM+CDRWET  WK+     KK   ++        + F 
Subjt:  DRLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFV

Query:  QVHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEK-KVEQMQAEIGTLTTKLSS
        + HF  K+GW+ND A+DAYL+MQ+++  S++ G + IS ++ CK VLG+RS         +N  S  S+ S+    +E +K ++  ++     LT +L+ 
Subjt:  QVHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEK-KVEQMQAEIGTLTTKLSS

Query:  WEERW
        WE+ +
Subjt:  WEERW

A0A6J1DUH3 uncharacterized protein LOC111023212 (e value: 1.2e-58; % identity: 49.4)
Query:  YFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---
        +F VDL+K  +N+++E+ +  +FK+YR++L+QYY EF+DP EAR  PPER+ NP DWN LCDRWET EWKEIT+KNKK+RA LP NHR  SKSF+Q+   
Subjt:  YFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQV---

Query:  ------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWD-----LNSSSSSSVTSSSQHEKEH
                          H++ KDG VND+A DAY  MQ L++A +QEG EP++Q E C+ VLG R  H+KGLG+          SSS+VTSS+ +EKE 
Subjt:  ------------------HFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWD-----LNSSSSSSVTSSSQHEKEH

Query:  EKKVEQMQAEIGTLTTK-------LSSWEERWAEFTKYMDERQGEGSSN
        EKKVE M+ E+  + T+       +S+WE+RW E +++M  RQG+G SN
Subjt:  EKKVEQMQAEIGTLTTK-------LSSWEERWAEFTKYMDERQGEGSSN
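
The alignments above follow the usual BLAST protein layout: between the Query and Subjt rows sits a midline that repeats the residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, percent identity for one alignment block can be counted from the query row and the midline. The function name `percent_identity` is hypothetical, and the caller is assumed to have already stripped the `Query:` label and the matching leading indent. The values listed next to each hit are computed over the whole alignment, so a single block will generally give a different number.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment block.

    `query` is the aligned query row (gaps written as '-');
    `midline` is the row beneath it, aligned column by column.
    """
    if not query:
        raise ValueError("empty alignment block")
    # Trailing blanks on the midline are often lost; pad to the query width.
    midline = midline.ljust(len(query))
    # A column is an identity when the midline repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if q != "-" and m == q
    )
    # Gap columns count toward the alignment length, as in BLAST reports.
    return 100.0 * identities / len(query)


# Final block of the first GenBank hit: query "GSSN", midline "G SN".
print(percent_identity("GSSN", "G SN"))
```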

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTGCATCCAAGTCAAGATCATGATGTAGTTGCTGTGTTAGAAGAGGAGAATGCTCTGGAGGTTAATCGTTCGACAGCGCAACATCGTACACGAGCTTCAGTAGCCTC
TAGATCGCGAGGCAGGAGGGCCAGAGGGCATAGCCGAAGGATTGAGTTAGAGCGCTATGTCAATGCACATGGTAGAATACCCATCGAGATCGATGAGAAGGTCGGCAAAC
TAGTGTGTAGTAAGGCCACTACGTTCAGTGGAGCCATTGGTACCATCACCCGAGATACAATTCCACTGCATTATAAAACGTGGAGCGACGTCCCAAAGCAAGTTCGAGAC
AACATAAAAGATCGACTCTCTACATACTTTATCGTGGATTTGACCAAACATCATATAAATAGATACGTAGAGCGACTGATATCATCCACATTTAAGGAATATAGGGCAGA
ACTGTATCAATACTACCTTGAGTTTGACGACCCCAAAGAGGCTCGTGAATGTCCTCCAGAAAGAATCGATAATCCAGCTGATTGGAATATGTTATGTGATCGATGGGAGA
CCGCTGAATGGAAGGAAATAACGGAGAAAAATAAGAAAAGTCGAGCCAATCTTCCTCACAACCATCGAACTAGGTCAAAGTCATTTGTTCAAGTTCACTTCAGCATAAAG
GACGGATGGGTGAACGACCATGCGAGAGATGCATATTTGAAAATGCAACAACTTCTTGAGGCATCATCACAGGAAGGATCTGAGCCAATCTCACAGTCAGAAGTTTGTAA
AATGGTTTTGGGTACTCGATCAGGCCACATAAAAGGTCTTGGTTGGGACCTAAATTCTAGTTCGTCGTCTAGCGTCACATCTTCTTCCCAACATGAAAAAGAGCATGAAA
AGAAGGTGGAGCAGATGCAAGCTGAGATTGGTACCTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGGCTGAATTCACAAAGTACATGGATGAAAGGCAGGGTGAA
GGTTCTTCAAACCCCTAG
mRNA sequence
ATGGTGCATCCAAGTCAAGATCATGATGTAGTTGCTGTGTTAGAAGAGGAGAATGCTCTGGAGGTTAATCGTTCGACAGCGCAACATCGTACACGAGCTTCAGTAGCCTC
TAGATCGCGAGGCAGGAGGGCCAGAGGGCATAGCCGAAGGATTGAGTTAGAGCGCTATGTCAATGCACATGGTAGAATACCCATCGAGATCGATGAGAAGGTCGGCAAAC
TAGTGTGTAGTAAGGCCACTACGTTCAGTGGAGCCATTGGTACCATCACCCGAGATACAATTCCACTGCATTATAAAACGTGGAGCGACGTCCCAAAGCAAGTTCGAGAC
AACATAAAAGATCGACTCTCTACATACTTTATCGTGGATTTGACCAAACATCATATAAATAGATACGTAGAGCGACTGATATCATCCACATTTAAGGAATATAGGGCAGA
ACTGTATCAATACTACCTTGAGTTTGACGACCCCAAAGAGGCTCGTGAATGTCCTCCAGAAAGAATCGATAATCCAGCTGATTGGAATATGTTATGTGATCGATGGGAGA
CCGCTGAATGGAAGGAAATAACGGAGAAAAATAAGAAAAGTCGAGCCAATCTTCCTCACAACCATCGAACTAGGTCAAAGTCATTTGTTCAAGTTCACTTCAGCATAAAG
GACGGATGGGTGAACGACCATGCGAGAGATGCATATTTGAAAATGCAACAACTTCTTGAGGCATCATCACAGGAAGGATCTGAGCCAATCTCACAGTCAGAAGTTTGTAA
AATGGTTTTGGGTACTCGATCAGGCCACATAAAAGGTCTTGGTTGGGACCTAAATTCTAGTTCGTCGTCTAGCGTCACATCTTCTTCCCAACATGAAAAAGAGCATGAAA
AGAAGGTGGAGCAGATGCAAGCTGAGATTGGTACCTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGGCTGAATTCACAAAGTACATGGATGAAAGGCAGGGTGAA
GGTTCTTCAAACCCCTAG
Protein sequence
MVHPSQDHDVVAVLEEENALEVNRSTAQHRTRASVASRSRGRRARGHSRRIELERYVNAHGRIPIEIDEKVGKLVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRD
NIKDRLSTYFIVDLTKHHINRYVERLISSTFKEYRAELYQYYLEFDDPKEARECPPERIDNPADWNMLCDRWETAEWKEITEKNKKSRANLPHNHRTRSKSFVQVHFSIK
DGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDLNSSSSSSVTSSSQHEKEHEKKVEQMQAEIGTLTTKLSSWEERWAEFTKYMDERQGE
GSSNP
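
The protein sequence can be checked against the CDS by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which is not stated in this record; `translate` is a hypothetical helper. It is applied only to the first 30 and last 18 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS listed in this record.
print(translate("ATGGTGCATCCAAGTCAAGATCATGATGTA"))
print(translate("GGTTCTTCAAACCCCTAG"))
```

The two fragments translate to `MVHPSQDHDV` and `GSSNP`, which match the start and end of the protein sequence above.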