; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011385 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011385
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:23352144..23358124
RNA-Seq ExpressionLag0011385
SyntenyLag0011385
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052371.1 putative mitochondrial protein [Cucumis melo var. makuwa]7.5e-2138.31Show/hide
Query:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------
        C+ + T +H    ++  IL YLKG   H L+L KS + SL GFADADWASD DDRKSTS  CV+FG NL++WGSKKQ I+ RSS EAEYR          
Subjt:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------

Query:  --------------------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGLG
                                                                K+ V+HLP    +ADILTKPLSA +F KL++ + V + + IGL 
Subjt:  --------------------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGLG

Query:  G
        G
Subjt:  G

KAA0059238.1 putative mitochondrial protein [Cucumis melo var. makuwa]9.8e-2138.31Show/hide
Query:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------
        C+ + T +H    ++  IL YLKG   H L+L KS + SL GF DADWASD DDRKSTS FCV+FG NL++WGSKKQ I+ RSS +AEYR          
Subjt:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------

Query:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG
                                                                K+ V+HL     +ADILTKPLSA +  KL++KL V + +SIGL 
Subjt:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG

Query:  G
        G
Subjt:  G

KAA0063213.1 putative mitochondrial protein [Cucumis melo var. makuwa]1.7e-2037.81Show/hide
Query:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------
        C+ + T +H    ++  IL YLKG   H L+L KS + SL GF DADWASD DDRKSTS FCV+FG NL++WGSKKQ I+ RSS +AEYR          
Subjt:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------

Query:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG
                                                                K+ ++HL     +ADILTKPLSA +  KL++KL V + +SIGL 
Subjt:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG

Query:  G
        G
Subjt:  G

XP_022135935.1 uncharacterized protein LOC111007767 [Momordica charantia]1.5e-2442.55Show/hide
Query:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------
        H Q +  IL YLKGT  H L L K+  FS HG+ DA WASD DDRKSTS FCVFFGGNL+TWGSKKQ I+ RSS+E EYR                    
Subjt:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------

Query:  ---------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGL
                                                     K  V HLP    VAD+LTKPL A+NF +L+ KLNVR+P SI L
Subjt:  ---------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGL

XP_022157873.1 uncharacterized protein LOC111024485 [Momordica charantia]1.0e-3053.12Show/hide
Query:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------
        H Q +  IL YLKGT  H LFL K   FSLHG+AD DWASD DDRKSTS FCVFFGGNL+TWGSKKQ I+ RSS EAEYR                    
Subjt:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------

Query:  ---------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLGG
                       K+ V HL     +AD+LTKPL A+NF +L+ KLNVR+PSSIGL G
Subjt:  ---------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLGG

TrEMBL top hitse value%identityAlignment
A0A5A7UFS3 Putative mitochondrial protein3.6e-2138.31Show/hide
Query:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------
        C+ + T +H    ++  IL YLKG   H L+L KS + SL GFADADWASD DDRKSTS  CV+FG NL++WGSKKQ I+ RSS EAEYR          
Subjt:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------

Query:  --------------------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGLG
                                                                K+ V+HLP    +ADILTKPLSA +F KL++ + V + + IGL 
Subjt:  --------------------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGLG

Query:  G
        G
Subjt:  G

A0A5A7UVQ3 Putative mitochondrial protein4.8e-2138.31Show/hide
Query:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------
        C+ + T +H    ++  IL YLKG   H L+L KS + SL GF DADWASD DDRKSTS FCV+FG NL++WGSKKQ I+ RSS +AEYR          
Subjt:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------

Query:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG
                                                                K+ V+HL     +ADILTKPLSA +  KL++KL V + +SIGL 
Subjt:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG

Query:  G
        G
Subjt:  G

A0A5A7V553 Putative mitochondrial protein8.1e-2137.81Show/hide
Query:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------
        C+ + T +H    ++  IL YLKG   H L+L KS + SL GF DADWASD DDRKSTS FCV+FG NL++WGSKKQ I+ RSS +AEYR          
Subjt:  CRPLSTNRHQQPLII--ILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR----------

Query:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG
                                                                K+ ++HL     +ADILTKPLSA +  KL++KL V + +SIGL 
Subjt:  --------------------------------------------------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLG

Query:  G
        G
Subjt:  G

A0A6J1C2F7 uncharacterized protein LOC1110077677.1e-2542.55Show/hide
Query:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------
        H Q +  IL YLKGT  H L L K+  FS HG+ DA WASD DDRKSTS FCVFFGGNL+TWGSKKQ I+ RSS+E EYR                    
Subjt:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------

Query:  ---------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGL
                                                     K  V HLP    VAD+LTKPL A+NF +L+ KLNVR+P SI L
Subjt:  ---------------------------------------------KIGVQHLP----VADILTKPLSAINFIKLRSKLNVREPSSIGL

A0A6J1DUJ4 uncharacterized protein LOC1110244855.1e-3153.12Show/hide
Query:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------
        H Q +  IL YLKGT  H LFL K   FSLHG+AD DWASD DDRKSTS FCVFFGGNL+TWGSKKQ I+ RSS EAEYR                    
Subjt:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYR--------------------

Query:  ---------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLGG
                       K+ V HL     +AD+LTKPL A+NF +L+ KLNVR+PSSIGL G
Subjt:  ---------------KIGVQHL----PVADILTKPLSAINFIKLRSKLNVREPSSIGLGG

SwissProt top hitse value%identityAlignment
P0CV72 Secreted RxLR effector protein 1612.0e-0840.51Show/hide
Query:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQ-PILRSSIEAEY
        H Q L  +L YL+ T T+ L   ++    L G++DADWA D++ R+STS +     G  ++W SKKQ  +  SS E EY
Subjt:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQ-PILRSSIEAEY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-0738.89Show/hide
Query:  ISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEY
        +SR        H + +  IL YL+GT T D       D  L G+ DAD A D+D+RKS++ +   F G  I+W SK Q  +  S+ EAEY
Subjt:  ISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEY

P92519 Uncharacterized mitochondrial protein AtMg008101.6e-1333.09Show/hide
Query:  PSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDR
        P  SD+R    A+ +L  T  D   ++    +R         PT++    L           +L Y+KGT  H L++HK+   ++  F D+DWA     R
Subjt:  PSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDR

Query:  KSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKIGV
        +ST+ FC F G N+I+W +K+QP + RSS E EYR + +
Subjt:  KSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKIGV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.8e-1637.09Show/hide
Query:  ADRPPLSTFDYRALPSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLH
        A  P LS +    L   ++YR    ++ +L  T     P I   + R           +S+   + T  H Q L  IL YL GT  H +FL K    SLH
Subjt:  ADRPPLSTFDYRALPSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLH

Query:  GFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKI
         ++DADWA D DD  ST+ + V+ G + I+W SKKQ  ++RSS EAEYR +
Subjt:  GFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.8e-1534.44Show/hide
Query:  ADRPPLSTFDYRALPSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLH
        A  P L+      LP  ++YR    ++ +L  T  D   ++                 +S+   + T+ H   L  +L YL GT  H +FL K    SLH
Subjt:  ADRPPLSTFDYRALPSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLH

Query:  GFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKI
         ++DADWA D DD  ST+ + V+ G + I+W SKKQ  ++RSS EAEYR +
Subjt:  GFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-1345.12Show/hide
Query:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYRKI
        HQQ ++ IL Y+KGT    LF     +  L  F+DA + S  D R+ST+ +C+F G +LI+W SKKQ ++ +SS EAEYR +
Subjt:  HQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPIL-RSSIEAEYRKI

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.1e-0538.1Show/hide
Query:  ISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFC
        +S+    S     Q +  +L Y+KGT    LF   + D  L  FAD+DWAS  D R+S + FC
Subjt:  ISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDRKSTSSFC

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-1433.09Show/hide
Query:  PSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDR
        P  SD+R    A+ +L  T  D   ++    +R         PT++    L           +L Y+KGT  H L++HK+   ++  F D+DWA     R
Subjt:  PSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHKSPDFSLHGFADADWASDLDDR

Query:  KSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKIGV
        +ST+ FC F G N+I+W +K+QP + RSS E EYR + +
Subjt:  KSTSSFCVFFGGNLITWGSKKQP-ILRSSIEAEYRKIGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGACCACCCTCGACCTCCACCGACCGACTGCTATCATCCACCTTCGACTACCAACCGCGGCCAATCGTTACTGTCGACCTCCGTTGACCGGCCTTCGTCTATTGG
TCATTGTCAACCTCTGTCGACTGGCTGCCACTGTCCATCTCCGACTACTGGTTGTCGTCGACCTCCATCGACTGTCGCCGCTGACCGACCACCACTGTCGACCTTTGACT
ACTGGGTGCCACCGTCGACCTCCAACTACTGGCCGCCACCTTCGACCTGCAACTACTGGTCGTTTCTGACCCCATCGACCAACCGCAATCGTCGACCTCTAACAACCGGC
CACGACCAACCGCTACCGTCGACTTTTGACTATCGATCGTCATCGACCTCTGCTGACCGACCACCACTATCGACCTTTGACTATCGGGCACTACCGTCGACCTCCGACTA
CCGGGTGTCGCCGACAGCCGTTAACCATCTGCGAACGACCTCTACTGACCGGCCTCCGTCGATCGGCTGCCACCTTCGACGTCCGACTACCGATTGCCACCGTCGACCTC
CAACTATCAGCCGCTGCCGACCTCTGTCGACCAACCGCCACCAGCAACCTCTAATTATAATTTTAATGTATCTTAAGGGTACGTTTACTCATGACCTTTTTCTTCATAAA
TCTCCTGATTTCTCATTGCATGGGTTTGCTGATGCGGACTGGGCTTCTGACCTCGATGACCGCAAGTCTACATCCAGTTTTTGTGTATTTTTTGGCGGTAATTTGATTAC
GTGGGGTTCTAAGAAGCAACCGATCTTGAGGTCTAGTATTGAGGCTGAGTATAGGAAAATTGGTGTTCAACATTTGCCGGTTGCTGATATACTCACTAAACCGTTGTCTG
CTATTAATTTCATTAAACTAAGGTCCAAGCTCAATGTTCGAGAGCCCTCTTCCATTGGCTTGGGGGGTTGGGGTGTTAAGGAAGCCATTAAACACTCAGAATTTGTTAAG
AGGGCAGGAGTTGGAGGTTTGAAGCCAGGTCAAAATGCAGGATTTGTGATACACGTGTCCACAAGGCATCTTCGCAATCTCCTTCCCGTTCCCGTTCTTCATTTCATCGT
AGCAAACACTGCAATCCTCCATCTCATCTTCATTTCAAACTCAAATTCCTCATTCGTCAGTCTCTCAATGGCCGACTCCGGCGCCCCTCTCCGAGTTTCTTCGACCACCA
GAATCCCCTCCTCCCGATGCTCTGTCCTCCATCCAAAACATCTTCATGTAGTCGATGTTGGCCTCTATGCTGACGCCGCCGCCCAATCTTTCGGCGAAGGAGGCGATCTT
CTGGAACAGGATGTCGGCCACGGAGGTGTGGACGTTGAACGAACTCAGGAACTCAACCACGCAAAAATGGGCCAGGAACGACGCTTCTTCCAGGTCGCTCAAACAGAGAG
GAAACAGAGCCTCCCCGATCAGAGTGGAGGGAGTTTGTTGAAGGATTTGGTGTGGGAATCCAGAAATCAGGCGGCTACAATGGCCAAATTGGAGTTTCAGAACTTGATCG
CCGGTGCCGCGGATACAGTATTGGAGGCAGTGGTGGAGAAGAAAGACATGGTGTCTGATCAAAACCAAAGCAAATTAGCAAAGCAGAGAATGGATGAAGAGAGTTTGAAT
GATGCGCGGGGAGGCGGGAGCGGGGTCTCACGGAAACAAATCCATTTCAAATTTGGAAAATTTCTCAGAATCCGACCGTTTGCGGCGCCGGTGACCGGTGACCGCCAACG
GCTCTTTTATTCGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCGACCACCCTCGACCTCCACCGACCGACTGCTATCATCCACCTTCGACTACCAACCGCGGCCAATCGTTACTGTCGACCTCCGTTGACCGGCCTTCGTCTATTGG
TCATTGTCAACCTCTGTCGACTGGCTGCCACTGTCCATCTCCGACTACTGGTTGTCGTCGACCTCCATCGACTGTCGCCGCTGACCGACCACCACTGTCGACCTTTGACT
ACTGGGTGCCACCGTCGACCTCCAACTACTGGCCGCCACCTTCGACCTGCAACTACTGGTCGTTTCTGACCCCATCGACCAACCGCAATCGTCGACCTCTAACAACCGGC
CACGACCAACCGCTACCGTCGACTTTTGACTATCGATCGTCATCGACCTCTGCTGACCGACCACCACTATCGACCTTTGACTATCGGGCACTACCGTCGACCTCCGACTA
CCGGGTGTCGCCGACAGCCGTTAACCATCTGCGAACGACCTCTACTGACCGGCCTCCGTCGATCGGCTGCCACCTTCGACGTCCGACTACCGATTGCCACCGTCGACCTC
CAACTATCAGCCGCTGCCGACCTCTGTCGACCAACCGCCACCAGCAACCTCTAATTATAATTTTAATGTATCTTAAGGGTACGTTTACTCATGACCTTTTTCTTCATAAA
TCTCCTGATTTCTCATTGCATGGGTTTGCTGATGCGGACTGGGCTTCTGACCTCGATGACCGCAAGTCTACATCCAGTTTTTGTGTATTTTTTGGCGGTAATTTGATTAC
GTGGGGTTCTAAGAAGCAACCGATCTTGAGGTCTAGTATTGAGGCTGAGTATAGGAAAATTGGTGTTCAACATTTGCCGGTTGCTGATATACTCACTAAACCGTTGTCTG
CTATTAATTTCATTAAACTAAGGTCCAAGCTCAATGTTCGAGAGCCCTCTTCCATTGGCTTGGGGGGTTGGGGTGTTAAGGAAGCCATTAAACACTCAGAATTTGTTAAG
AGGGCAGGAGTTGGAGGTTTGAAGCCAGGTCAAAATGCAGGATTTGTGATACACGTGTCCACAAGGCATCTTCGCAATCTCCTTCCCGTTCCCGTTCTTCATTTCATCGT
AGCAAACACTGCAATCCTCCATCTCATCTTCATTTCAAACTCAAATTCCTCATTCGTCAGTCTCTCAATGGCCGACTCCGGCGCCCCTCTCCGAGTTTCTTCGACCACCA
GAATCCCCTCCTCCCGATGCTCTGTCCTCCATCCAAAACATCTTCATGTAGTCGATGTTGGCCTCTATGCTGACGCCGCCGCCCAATCTTTCGGCGAAGGAGGCGATCTT
CTGGAACAGGATGTCGGCCACGGAGGTGTGGACGTTGAACGAACTCAGGAACTCAACCACGCAAAAATGGGCCAGGAACGACGCTTCTTCCAGGTCGCTCAAACAGAGAG
GAAACAGAGCCTCCCCGATCAGAGTGGAGGGAGTTTGTTGAAGGATTTGGTGTGGGAATCCAGAAATCAGGCGGCTACAATGGCCAAATTGGAGTTTCAGAACTTGATCG
CCGGTGCCGCGGATACAGTATTGGAGGCAGTGGTGGAGAAGAAAGACATGGTGTCTGATCAAAACCAAAGCAAATTAGCAAAGCAGAGAATGGATGAAGAGAGTTTGAAT
GATGCGCGGGGAGGCGGGAGCGGGGTCTCACGGAAACAAATCCATTTCAAATTTGGAAAATTTCTCAGAATCCGACCGTTTGCGGCGCCGGTGACCGGTGACCGCCAACG
GCTCTTTTATTCGCCTTAA
Protein sequenceShow/hide protein sequence
MTDHPRPPPTDCYHPPSTTNRGQSLLSTSVDRPSSIGHCQPLSTGCHCPSPTTGCRRPPSTVAADRPPLSTFDYWVPPSTSNYWPPPSTCNYWSFLTPSTNRNRRPLTTG
HDQPLPSTFDYRSSSTSADRPPLSTFDYRALPSTSDYRVSPTAVNHLRTTSTDRPPSIGCHLRRPTTDCHRRPPTISRCRPLSTNRHQQPLIIILMYLKGTFTHDLFLHK
SPDFSLHGFADADWASDLDDRKSTSSFCVFFGGNLITWGSKKQPILRSSIEAEYRKIGVQHLPVADILTKPLSAINFIKLRSKLNVREPSSIGLGGWGVKEAIKHSEFVK
RAGVGGLKPGQNAGFVIHVSTRHLRNLLPVPVLHFIVANTAILHLIFISNSNSSFVSLSMADSGAPLRVSSTTRIPSSRCSVLHPKHLHVVDVGLYADAAAQSFGEGGDL
LEQDVGHGGVDVERTQELNHAKMGQERRFFQVAQTERKQSLPDQSGGSLLKDLVWESRNQAATMAKLEFQNLIAGAADTVLEAVVEKKDMVSDQNQSKLAKQRMDEESLN
DARGGGSGVSRKQIHFKFGKFLRIRPFAAPVTGDRQRLFYSP