CuGenDBv2

Tan0018218 (gene) of Snake gourd v1 genome

Gene ID: Tan0018218
Organism: Trichosanthes anguina (Snake gourd v1)
Description: C2H2-type domain-containing protein
Genome location: LG09:73123725..73129466
RNA-Seq Expression: Tan0018218
Synteny: Tan0018218
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR013087 - Zinc finger C2H2-type
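
The genome location field above can be split into a linkage group and a coordinate pair. The following is a minimal sketch, assuming the `LG:start..end` form shown in this record and assuming the coordinates are 1-based and inclusive (the record does not state the convention). `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("LG09:73123725..73129466")
print(seqid, start, end, span_length(start, end))
# prints: LG09 73123725 73129466 5742
```

Under the inclusive-coordinate assumption the gene model spans 5742 bp on LG09; with a half-open convention the figure would be one less.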


Homology
GenBank top hits (E-value, % identity, alignment)
KAG7023082.1 hypothetical protein SDJN02_14106, partial [Cucurbita argyrosperma subsp. argyrosperma] | E-value: 1.6e-144 | % identity: 87.37
Query:  MTRATKKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVS
        +TRA KKAT G+I+LL  LL LQ+ V FALALPPSESNQD EQSAT+          RPLEQD+EH +EVHCSRERSRTAWNILEEHLLPF+EKENYQVS
Subjt:  MTRATKKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVS

Query:  TKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLA
        TKCRLH NNDLYRDQEQHKIH DINHWQCGYCRKSFRAEKFLDKHFDNRHY+LLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLA
Subjt:  TKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLA

Query:  DSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        D+CFPINEGPSASRLHELFLHQFCGAHSCT KQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRI+K+GRK+KPL
Subjt:  DSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

XP_004138066.1 uncharacterized protein LOC101218367 isoform X2 [Cucumis sativus] | E-value: 1.3e-143 | % identity: 88.19
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KK TA TIILLS+LL LQE+VHFA +LPPS +NQD EQSAT           RPLEQ+EEHVDEVHCSRERSRTAWNI+EEHLLPFMEKENY+VST+CRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRH NLLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLADSCFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSA+RLHELFLHQFCGAHSCTGKQKPFSRGA RQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVL+RISKAGRK KPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

XP_022921975.1 uncharacterized protein LOC111430069 [Cucurbita moschata] | E-value: 9.2e-145 | % identity: 88.89
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KKAT G+I+LL VLL LQ+ VHFALALPPSESNQD EQSAT+          RPLEQD+EH +EVHCSRERSRTAWNILEEHLLPF+EKENYQVSTKCRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIH DINHWQCGYCRKSFRAEKFLDKHFDNRHY+LLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLAD+CFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSASRLHELFLHQFCGAHSCT KQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISK+GRKTKPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

XP_023515783.1 uncharacterized protein LOC111779843 [Cucurbita pepo subsp. pepo] | E-value: 1.3e-143 | % identity: 88.54
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KKAT G+I+LL  LL LQ+ VHFALALP SESNQD EQSAT+          RPLEQD+EH +EVHCSRERSRTAWNILEEHLLPF+EKENYQVSTKCRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIH DINHWQCGYCRKSFRAEKFLDKHFDNRHY+LLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLAD+CFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSASRLHELFLHQFCGAHSCT KQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

XP_038879885.1 uncharacterized protein LOC120071608 [Benincasa hispida] | E-value: 6.0e-144 | % identity: 89.58
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KKATAGTIILLS LL LQ++VHFA ALPPS SNQDVEQSATS          RPL QD+EHVDEVHCSRERSRTAWNI+EEHLLPFMEKENYQVSTKCRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIHLDINHWQCGYC KSFRAE FLDKHFDNRH NLLNVSHGKCLADLCGALHCDLKMD KSRKSKCNPAAAARNKHLC+SLADSCFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSA+RLHELFLHQFCGAHSCTGKQKPFSRGA RQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKA RK+KPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0LU52 C2H2-type domain-containing protein | E-value: 6.4e-144 | % identity: 88.19
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KK TA TIILLS+LL LQE+VHFA +LPPS +NQD EQSAT           RPLEQ+EEHVDEVHCSRERSRTAWNI+EEHLLPFMEKENY+VST+CRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRH NLLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLADSCFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSA+RLHELFLHQFCGAHSCTGKQKPFSRGA RQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVL+RISKAGRK KPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

A0A1S3CLJ0 uncharacterized protein LOC103502344 | E-value: 4.6e-142 | % identity: 87.5
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KK TA TIILLS LL LQE++HFA  LPPS +NQD EQSAT           RPLEQ+EEHVDEVHCSRERSRTAWNI+EEHLLPFME ENY+VST+CRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRH NLLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLADSCFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSA+RLHELFLHQFCGAHSCTGKQKPFSRGA RQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVL+RISKAGRK+KPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

A0A6J1C036 uncharacterized protein LOC111007249 | E-value: 7.6e-137 | % identity: 83.5
Query:  MTRATKKAT--AGTIILLSVLLPLQEIVHFALALPPSESNQD--VEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKEN
        M RA KKAT  AG+IIL+ +   LQ  VHFA ALPPSE+ QD  VEQSATS          RPL++ EEHVDEVHCSRERS+TAWNI+EEHLLPF+EKEN
Subjt:  MTRATKKAT--AGTIILLSVLLPLQEIVHFALALPPSESNQD--VEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKEN

Query:  YQVSTKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLC
        YQVST+CRLH NNDL+RDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRH+NLLNVSHGKCLADLCGALHCD+KMD KSRKSKC+PAAAARNKHLC
Subjt:  YQVSTKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLC

Query:  ESLADSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        ESLADSCFPINEGPSASRLH+LFLHQFCGAHSCTGK KPFS+GAERQPGIFYMASSILILMLLP+FYVIVYLHRRES+N I+VL+RISKAGRKTKPL
Subjt:  ESLADSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

A0A6J1E7A2 uncharacterized protein LOC111430069 | E-value: 4.5e-145 | % identity: 88.89
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KKAT G+I+LL VLL LQ+ VHFALALPPSESNQD EQSAT+          RPLEQD+EH +EVHCSRERSRTAWNILEEHLLPF+EKENYQVSTKCRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIH DINHWQCGYCRKSFRAEKFLDKHFDNRHY+LLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLAD+CFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSASRLHELFLHQFCGAHSCT KQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISK+GRKTKPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

A0A6J1JH74 uncharacterized protein LOC111485086 | E-value: 3.2e-143 | % identity: 87.85
Query:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL
        KKAT G+IILL VL+ LQ+ V FALALPPSESNQD EQSAT+          RPLEQD+EH +EVHCSRERSRTAWNILEEH LPF+EKENYQVSTKCRL
Subjt:  KKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRL

Query:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP
        H NNDL+RDQEQHKIH DINHWQCGYCRKSFRAEK+LDKHFDNRHY+LLNVSHGKCLADLCGALHCDLKMD KSRKSKC PAAAARNKHLCESLAD+CFP
Subjt:  HLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFP

Query:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
        INEGPSASRLHELFLHQFCGAHSCT KQ+PFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
Subjt:  INEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL

SwissProt top hits (E-value, % identity, alignment)
No hits found
Arabidopsis top hits (E-value, % identity, alignment)
AT5G40710.1 zinc finger (C2H2 type) family protein | E-value: 9.1e-82 | % identity: 58.52
Query:  EEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLA
        E+   E+HCSRERSR AW I++E+L+P++EKE YQ+ + CR+H +ND+YR+QE+HK+  DIN W+CG+C+K+F  EK+LDKHFD+RHYNLLN SHGKCL+
Subjt:  EEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLA

Query:  DLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFY
        DLCGALHCDL +DT   KSKCNPAAAA+N+HLCESLA+SCFP+N+G SA+RLH+ FL QFC AH+C+G  KP S+  +++  I Y+  SI++L++L ++Y
Subjt:  DLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFY

Query:  VIVYLHRRESRNGIEVLRRISKAGRKTKP
          VYL RR  +   + L+RI   G K KP
Subjt:  VIVYLHRRESRNGIEVLRRISKAGRKTKP

AT5G63280.1 C2H2-like zinc finger protein | E-value: 2.5e-92 | % identity: 62.93
Query:  EQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGK
        E +  +  E+HCSRERSR AW I++++L PF+E+E Y++   CRLH +NDLYRDQE HK+H+D+  W+CGYC+KSF  EKFLDKHF  RHYNLLN +  K
Subjt:  EQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRLHLNNDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGK

Query:  CLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLP
        CLADLCGALHCD  + +K  KSKCNP A A+N+HLCES+A+SCFP+++GPSASRLHE FL QFC AH+CTG  KPF RG +++ G+FY+A SIL LMLLP
Subjt:  CLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFPINEGPSASRLHELFLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLP

Query:  IFYVIVYLHRRESRNGIEVLRRISKAGRKTKP
        +FY++V+LH+RE R+G + LRRI K+G+KTKP
Subjt:  IFYVIVYLHRRESRNGIEVLRRISKAGRKTKP


Sequences
CDS sequence
ATGGGAATGACCAGAGCAACGAAGAAAGCAACGGCAGGCACCATAATTCTTCTTTCTGTTTTACTTCCTCTACAAGAAATTGTTCATTTTGCTTTGGCTCTACCTCCTTC
AGAGAGTAATCAGGACGTGGAGCAATCTGCAACTTCGAGGCGATATCTTTTGTGTGCTTCTCATCCAAGACCTCTCGAGCAAGATGAAGAGCATGTTGATGAAGTACATT
GTTCCAGAGAAAGAAGTAGGACAGCCTGGAATATTCTTGAGGAGCATTTGCTGCCGTTTATGGAGAAGGAAAATTATCAGGTTTCAACCAAGTGTAGGCTTCATCTAAAC
AACGATCTCTACAGAGATCAGGAGCAGCACAAGATTCATCTTGATATAAATCATTGGCAGTGTGGATACTGTCGAAAAAGCTTTCGTGCTGAAAAATTTCTTGATAAGCA
TTTTGACAACAGACACTACAATCTTCTGAATGTTAGCCACGGGAAGTGCTTAGCTGATTTATGTGGAGCATTGCATTGTGACCTAAAGATGGATACCAAGTCACGTAAAT
CTAAATGCAATCCGGCAGCTGCTGCTAGGAACAAGCATTTATGTGAGAGTCTTGCTGATAGCTGTTTTCCAATTAACGAGGGACCATCAGCTAGCCGTCTTCACGAGTTG
TTTCTTCACCAATTCTGTGGAGCTCATTCTTGCACTGGGAAACAAAAGCCATTTTCTAGAGGAGCGGAGAGGCAACCAGGCATCTTTTACATGGCATCTTCAATACTGAT
TTTGATGTTGCTACCTATTTTCTATGTCATTGTCTATTTACACCGCAGAGAATCGAGAAATGGAATCGAAGTGCTTAGACGAATATCAAAAGCTGGACGTAAAACCAAAC
CCTTGTAA
mRNA sequence
AAAAAATCCCAACCTACGAATAGTCCGTCCCGTATTAAACAATAAACGGTGGAATTTCCGCGCCACGCGTACGTGATTCGCGAGACAAAGTTCGTATTATTCGCGGCTAG
AATTCGAATCTCCAAAACCCCAATCGGGTCACAGTGGCTCGTAGCCCGGTTTCGCGTCTTCGAAAATGGGAATGACCAGAGCAACGAAGAAAGCAACGGCAGGCACCATA
ATTCTTCTTTCTGTTTTACTTCCTCTACAAGAAATTGTTCATTTTGCTTTGGCTCTACCTCCTTCAGAGAGTAATCAGGACGTGGAGCAATCTGCAACTTCGAGGCGATA
TCTTTTGTGTGCTTCTCATCCAAGACCTCTCGAGCAAGATGAAGAGCATGTTGATGAAGTACATTGTTCCAGAGAAAGAAGTAGGACAGCCTGGAATATTCTTGAGGAGC
ATTTGCTGCCGTTTATGGAGAAGGAAAATTATCAGGTTTCAACCAAGTGTAGGCTTCATCTAAACAACGATCTCTACAGAGATCAGGAGCAGCACAAGATTCATCTTGAT
ATAAATCATTGGCAGTGTGGATACTGTCGAAAAAGCTTTCGTGCTGAAAAATTTCTTGATAAGCATTTTGACAACAGACACTACAATCTTCTGAATGTTAGCCACGGGAA
GTGCTTAGCTGATTTATGTGGAGCATTGCATTGTGACCTAAAGATGGATACCAAGTCACGTAAATCTAAATGCAATCCGGCAGCTGCTGCTAGGAACAAGCATTTATGTG
AGAGTCTTGCTGATAGCTGTTTTCCAATTAACGAGGGACCATCAGCTAGCCGTCTTCACGAGTTGTTTCTTCACCAATTCTGTGGAGCTCATTCTTGCACTGGGAAACAA
AAGCCATTTTCTAGAGGAGCGGAGAGGCAACCAGGCATCTTTTACATGGCATCTTCAATACTGATTTTGATGTTGCTACCTATTTTCTATGTCATTGTCTATTTACACCG
CAGAGAATCGAGAAATGGAATCGAAGTGCTTAGACGAATATCAAAAGCTGGACGTAAAACCAAACCCTTGTAACTGGTAGCACCCTTTTCCCCCTTCACATCAAGAAAAC
TACATTATCAGATTTTTAAGTTTGGTAACTTCTGGGACTTTGGTAGAAATAGGATGCCATATCCAATGTTCTATAGCCATGGCGGCACCTAAATGTACATCGAACATATC
TTGCATTAAAAACTGAGTCCCACCCCCCCCCCCCCCCCCTCCTATTACAAAAGTGTCTTTCATTGGCATCATGTAGTCCTGAAGATGAGTATGGAGCTAACTGC
Protein sequence
MGMTRATKKATAGTIILLSVLLPLQEIVHFALALPPSESNQDVEQSATSRRYLLCASHPRPLEQDEEHVDEVHCSRERSRTAWNILEEHLLPFMEKENYQVSTKCRLHLN
NDLYRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHYNLLNVSHGKCLADLCGALHCDLKMDTKSRKSKCNPAAAARNKHLCESLADSCFPINEGPSASRLHEL
FLHQFCGAHSCTGKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
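
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies (the record does not name a translation table). `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nt of the CDS and its last 18 nt (including the TAA stop).
print(translate("ATGGGAATGACCAGAGCAACG"))  # prints: MGMTRAT
print(translate("AAAACCAAACCCTTGTAA"))     # prints: KTKPL
```

Both fragments agree with the start (`MGMTRAT`) and end (`KTKPL`) of the protein sequence listed above. Passing the full CDS, joined across its lines, to `translate` would extend the same check to the whole record.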