CuGenDBv2

Lag0024056 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024056
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: chr10:160361..162746
RNA-Seq Expression: Lag0024056
Synteny: Lag0024056
Gene Ontology terms:
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
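
The genome location in this record has the form chromosome:start..end. As a minimal sketch of how such a string could be split into its parts, the following assumes the coordinates are 1-based and inclusive, which the record itself does not state. The names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive length of the interval under the assumption above.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr10:160361..162746")
print(loc.chrom, loc.start, loc.end, loc.span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` property would need to change.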


Homology
GenBank top hits (columns: e value, % identity, alignment)
BBH07150.1 TatD related DNase [Prunus dulcis]; e value 2.7e-49; identity 48.63%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+ G+YGPC  ++   F +ELAD+ G C   WC+ GDFNVVR+  ++ N  R TKSMR FN FI   NL D  + N  +TWS + E AV  RLDRFL+S 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
         W D F  +R   L R T DH P+      ++WGP PFRFENMW+ HPDFK+ ++ WW E    GW G++FM++LK LK+ +K
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]; e value 5.7e-44; identity 45.36%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+  VYGP  +   K F  EL+D+AGL    WC+ GDFNV+R   ++L  SR T SM+ F+ FI+   LID+P+ +  +TWS M    V  RLDRFL S 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
        +W  AF +     L R T DH+P+       +WGP PFRFENMW+ HP FK+   +WW E   +GW G +FM KL+ +KA +K
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa]; e value 4.8e-51; identity 26.29%
Query:  ITKRDFHDDWGRILDIMQQQL---ETALVINPFQPDKALLKCPSKDLANLLTTNKGWVSFGPVILKVEKWNKILHSKINVVPSYGG--------------
        + +R FHDDW +I+D ++ Q    ++     PF  DKALL    K+LA LL  N GW + GP  +K EKW+K  H+   V+PSYGG              
Subjt:  ITKRDFHDDWGRILDIMQQQL---ETALVINPFQPDKALLKCPSKDLANLLTTNKGWVSFGPVILKVEKWNKILHSKINVVPSYGG--------------

Query:  --------------------------------IKGNYCGFIQGEIEVVDKD-QVFKTQIVTFKEGNLLIDRIVGVHGSFSPEATHAFHK-GPFVCR----
                                        +K NY GF+   I++ D++   F  Q VT  +G  L +R   +HGSF+  A   F++  P+  +    
Subjt:  --------------------------------IKGNYCGFIQGEIEVVDKD-QVFKTQIVTFKEGNLLIDRIVGVHGSFSPEATHAFHK-GPFVCR----

Query:  ---------SKPISFSWEMRRLT--RLVGWIKPIKNAPR----------------GKAQRIRPRLEK-ESRLPKKPRSLC-----SKKAILNLKRKRK--
                   P S     +++   R +G  K IK   +                  +++I+ +  + E    KK + +C      K + +N KRK    
Subjt:  ---------SKPISFSWEMRRLT--RLVGWIKPIKNAPR----------------GKAQRIRPRLEK-ESRLPKKPRSLC-----SKKAILNLKRKRK--

Query:  ---------TQDTMKTRIMKI-----KGTGECSTQEPLE--------IREDYEEAHCSSKNDDLEKNKIIPLSLNEME--------EGYKEIDEQEEVKP
                 +  +  T+ +K+         E  ++ P +        ++    E   S  +   +K KI P   NE E            + D      P
Subjt:  ---------TQDTMKTRIMKI-----KGTGECSTQEPLE--------IREDYEEAHCSSKNDDLEKNKIIPLSLNEME--------EGYKEIDEQEEVKP

Query:  KALP-VISPTD----------------------KKKNRANDTPDGFV-ISKELED-------------DVEFTDVK----------------EEQSKQVV
          +P   SPT+                      KK+N   +T D  V   ++L D             + +F  V                 E  S+ V+
Subjt:  KALP-VISPTD----------------------KKKNRANDTPDGFV-ISKELED-------------DVEFTDVK----------------EEQSKQVV

Query:  E-------KKDRDDEIKGWVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWS
        +       + D ++    W+  +YGP   K+   F +EL ++  +C   W + GDFNV+RW E+    +  + SM++FN FI++ NLID P+ N ++TWS
Subjt:  E-------KKDRDDEIKGWVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWS

Query:  RMGEKAVASRLDRFLISRQWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGL
         +  +A  SRLDRFL S  W + F       L R T DHFP+     ++ WGP PFRF N ++  PD+KK +E WW   +  G+AG+ FM +LK L
Subjt:  RMGEKAVASRLDRFLISRQWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGL

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]; e value 2.7e-49; identity 48.63%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+ G+YGPC  ++   F +ELAD+ G C   WC+ GDFNVVR+  ++ N  R TKSMR FN FI   NL D  + N  +TWS + E AV  RLDRFL+S 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
         W D F  +R   L R T DH P+      ++WGP PFRFENMW+ HPDFK+ ++ WW E    GW G++FM++LK LK+ +K
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

XP_021820446.1 uncharacterized protein LOC110762145 [Prunus avium]; e value 7.2e-47; identity 48.63%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+ G+YGPC  +D   F  ELA + GLC   WCI GDFNVVR+V D+ N    T SMR FN FI   NL D  + N  +TWS   E  V  RLDRFL + 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
         W D F  FR   L R T DH P+      ++WGP PFRFENMW++HPDF +  + WWAE +  GW GF FM +LK +K  ++
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A4Y1RS61 TatD related DNase; e value 1.3e-49; identity 48.63%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+ G+YGPC  ++   F +ELAD+ G C   WC+ GDFNVVR+  ++ N  R TKSMR FN FI   NL D  + N  +TWS + E AV  RLDRFL+S 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
         W D F  +R   L R T DH P+      ++WGP PFRFENMW+ HPDFK+ ++ WW E    GW G++FM++LK LK+ +K
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

A0A5A7TTA1 DUF4283 domain-containing protein; e value 2.3e-51; identity 26.29%
Query:  ITKRDFHDDWGRILDIMQQQL---ETALVINPFQPDKALLKCPSKDLANLLTTNKGWVSFGPVILKVEKWNKILHSKINVVPSYGG--------------
        + +R FHDDW +I+D ++ Q    ++     PF  DKALL    K+LA LL  N GW + GP  +K EKW+K  H+   V+PSYGG              
Subjt:  ITKRDFHDDWGRILDIMQQQL---ETALVINPFQPDKALLKCPSKDLANLLTTNKGWVSFGPVILKVEKWNKILHSKINVVPSYGG--------------

Query:  --------------------------------IKGNYCGFIQGEIEVVDKD-QVFKTQIVTFKEGNLLIDRIVGVHGSFSPEATHAFHK-GPFVCR----
                                        +K NY GF+   I++ D++   F  Q VT  +G  L +R   +HGSF+  A   F++  P+  +    
Subjt:  --------------------------------IKGNYCGFIQGEIEVVDKD-QVFKTQIVTFKEGNLLIDRIVGVHGSFSPEATHAFHK-GPFVCR----

Query:  ---------SKPISFSWEMRRLT--RLVGWIKPIKNAPR----------------GKAQRIRPRLEK-ESRLPKKPRSLC-----SKKAILNLKRKRK--
                   P S     +++   R +G  K IK   +                  +++I+ +  + E    KK + +C      K + +N KRK    
Subjt:  ---------SKPISFSWEMRRLT--RLVGWIKPIKNAPR----------------GKAQRIRPRLEK-ESRLPKKPRSLC-----SKKAILNLKRKRK--

Query:  ---------TQDTMKTRIMKI-----KGTGECSTQEPLE--------IREDYEEAHCSSKNDDLEKNKIIPLSLNEME--------EGYKEIDEQEEVKP
                 +  +  T+ +K+         E  ++ P +        ++    E   S  +   +K KI P   NE E            + D      P
Subjt:  ---------TQDTMKTRIMKI-----KGTGECSTQEPLE--------IREDYEEAHCSSKNDDLEKNKIIPLSLNEME--------EGYKEIDEQEEVKP

Query:  KALP-VISPTD----------------------KKKNRANDTPDGFV-ISKELED-------------DVEFTDVK----------------EEQSKQVV
          +P   SPT+                      KK+N   +T D  V   ++L D             + +F  V                 E  S+ V+
Subjt:  KALP-VISPTD----------------------KKKNRANDTPDGFV-ISKELED-------------DVEFTDVK----------------EEQSKQVV

Query:  E-------KKDRDDEIKGWVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWS
        +       + D ++    W+  +YGP   K+   F +EL ++  +C   W + GDFNV+RW E+    +  + SM++FN FI++ NLID P+ N ++TWS
Subjt:  E-------KKDRDDEIKGWVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWS

Query:  RMGEKAVASRLDRFLISRQWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGL
         +  +A  SRLDRFL S  W + F       L R T DHFP+     ++ WGP PFRF N ++  PD+KK +E WW   +  G+AG+ FM +LK L
Subjt:  RMGEKAVASRLDRFLISRQWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGL

A0A5E4F090 Reverse transcriptase domain-containing protein (Fragment); e value 1.3e-49; identity 48.63%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+ G+YGPC  ++   F +ELAD+ G C   WC+ GDFNVVR+  ++ N  R TKSMR FN FI   NL D  + N  +TWS + E AV  RLDRFL+S 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
         W D F  +R   L R T DH P+      ++WGP PFRFENMW+ HPDFK+ ++ WW E    GW G++FM++LK LK+ +K
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

M5VS59 Reverse transcriptase domain-containing protein (Fragment); e value 1.7e-49; identity 48.09%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+ G+YGPC  ++   F +ELAD+ G C   WC+ GDFNVVR+  ++ N  R TKSMR FN FI   NL D  + N  +TWS + E AV  RLDRFL+S 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
         W D F  +R   L R T DH P+      ++WGP PFRFENMW++HPDF + ++ WW E    GW G++FM +LK LK+ +K
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

M5WJ76 Reverse transcriptase domain-containing protein (Fragment); e value 5.8e-50; identity 48.09%
Query:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR
        W+ G+YGPC  ++   F +ELAD+ G C  +WC+ GDFNVVR+  ++ N  R TKSMR FN FI   NL D  + N  +TWS + E AV  RLDRFL+S 
Subjt:  WVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAVASRLDRFLISR

Query:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK
         W + F  +R   L R T DH P+      ++WGP PFRFENMW++HPDFK+ ++ WW E    GW G++FM +LK LK+ +K
Subjt:  QWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIK

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATGTGAGGAAGATCGACTGGGATGAAGTGATAGTCATCACTAAACGTGATTTCCATGATGACTGGGGGAGAATCCTAGATATTATGCAACAACAACTTGAGACTGC
CCTAGTCATAAACCCCTTTCAGCCAGATAAGGCTCTGCTAAAATGCCCTTCCAAGGACTTGGCGAATTTATTAACTACAAACAAGGGATGGGTCAGCTTCGGGCCTGTCA
TTTTAAAAGTTGAGAAGTGGAACAAAATTTTACATAGCAAAATTAATGTGGTGCCCAGTTATGGTGGAATCAAGGGCAACTATTGTGGGTTCATCCAAGGAGAAATCGAA
GTCGTAGACAAGGATCAGGTTTTCAAAACTCAGATAGTCACATTTAAGGAGGGGAACTTGCTGATCGACCGAATTGTTGGAGTTCATGGGAGCTTCTCGCCGGAAGCGAC
GCATGCTTTCCACAAAGGACCTTTTGTCTGCAGGTCAAAACCCATCAGTTTCTCGTGGGAGATGAGACGGCTGACGAGACTAGTAGGATGGATTAAGCCCATCAAGAACG
CCCCAAGAGGAAAAGCCCAAAGAATAAGACCAAGACTCGAAAAGGAGTCTCGTTTGCCAAAGAAGCCCAGGTCACTTTGTTCAAAAAAGGCAATACTCAATTTAAAGAGG
AAACGAAAGACCCAAGACACAATGAAGACCCGGATAATGAAGATAAAGGGGACGGGGGAATGTTCGACTCAGGAACCGCTAGAAATTCGAGAGGATTATGAAGAGGCTCA
TTGCTCCAGTAAGAATGACGATCTAGAAAAAAACAAAATAATACCTCTATCTCTAAATGAGATGGAGGAAGGGTACAAGGAGATAGATGAGCAAGAAGAGGTTAAACCGA
AGGCCCTACCCGTGATATCGCCAACAGATAAGAAGAAAAATAGAGCAAATGACACTCCAGACGGCTTTGTTATTAGTAAAGAATTGGAGGACGACGTTGAGTTTACAGAT
GTTAAGGAAGAGCAGAGTAAGCAAGTAGTCGAGAAGAAGGATCGGGACGATGAGATAAAAGGGTGGGTAATAGGGGTGTATGGGCCATGTGCGGCTAAAGATGGGAAATT
CTTTTTACAAGAGTTAGCGGATGTTGCTGGTCTTTGCCAAGGTATATGGTGCATAGCAGGTGATTTCAATGTGGTTAGATGGGTGGAGGATAGGCTTAACGTTAGTAGGC
CTACCAAAAGTATGAGGAAATTTAATCGTTTCATCGCCTCTTATAATCTTATTGACATTCCGATGAATAATGGCAGGTACACTTGGTCGAGAATGGGAGAAAAGGCAGTT
GCTTCTAGGCTGGATAGATTCTTGATATCCAGACAGTGGGCTGATGCTTTTAAGGAATTTAGGCTGGATAGACTCCAGAGGCCAACATTCGACCATTTTCCCTTAGCCTT
TTCGGTTGGGGCAATGAGGTGGGGGCCTATGCCGTTTAGATTTGAAAATATGTGGATCGACCACCCGGATTTCAAAAAGATGGTGGAAAAGTGGTGGGCAGAGCTTAACC
CGAGTGGATGGGCAGGGTTTAGATTCATGGCCAAGCTGAAGGGACTGAAAGCTCACATCAAAGATTAG
mRNA sequence
ATGGATGTGAGGAAGATCGACTGGGATGAAGTGATAGTCATCACTAAACGTGATTTCCATGATGACTGGGGGAGAATCCTAGATATTATGCAACAACAACTTGAGACTGC
CCTAGTCATAAACCCCTTTCAGCCAGATAAGGCTCTGCTAAAATGCCCTTCCAAGGACTTGGCGAATTTATTAACTACAAACAAGGGATGGGTCAGCTTCGGGCCTGTCA
TTTTAAAAGTTGAGAAGTGGAACAAAATTTTACATAGCAAAATTAATGTGGTGCCCAGTTATGGTGGAATCAAGGGCAACTATTGTGGGTTCATCCAAGGAGAAATCGAA
GTCGTAGACAAGGATCAGGTTTTCAAAACTCAGATAGTCACATTTAAGGAGGGGAACTTGCTGATCGACCGAATTGTTGGAGTTCATGGGAGCTTCTCGCCGGAAGCGAC
GCATGCTTTCCACAAAGGACCTTTTGTCTGCAGGTCAAAACCCATCAGTTTCTCGTGGGAGATGAGACGGCTGACGAGACTAGTAGGATGGATTAAGCCCATCAAGAACG
CCCCAAGAGGAAAAGCCCAAAGAATAAGACCAAGACTCGAAAAGGAGTCTCGTTTGCCAAAGAAGCCCAGGTCACTTTGTTCAAAAAAGGCAATACTCAATTTAAAGAGG
AAACGAAAGACCCAAGACACAATGAAGACCCGGATAATGAAGATAAAGGGGACGGGGGAATGTTCGACTCAGGAACCGCTAGAAATTCGAGAGGATTATGAAGAGGCTCA
TTGCTCCAGTAAGAATGACGATCTAGAAAAAAACAAAATAATACCTCTATCTCTAAATGAGATGGAGGAAGGGTACAAGGAGATAGATGAGCAAGAAGAGGTTAAACCGA
AGGCCCTACCCGTGATATCGCCAACAGATAAGAAGAAAAATAGAGCAAATGACACTCCAGACGGCTTTGTTATTAGTAAAGAATTGGAGGACGACGTTGAGTTTACAGAT
GTTAAGGAAGAGCAGAGTAAGCAAGTAGTCGAGAAGAAGGATCGGGACGATGAGATAAAAGGGTGGGTAATAGGGGTGTATGGGCCATGTGCGGCTAAAGATGGGAAATT
CTTTTTACAAGAGTTAGCGGATGTTGCTGGTCTTTGCCAAGGTATATGGTGCATAGCAGGTGATTTCAATGTGGTTAGATGGGTGGAGGATAGGCTTAACGTTAGTAGGC
CTACCAAAAGTATGAGGAAATTTAATCGTTTCATCGCCTCTTATAATCTTATTGACATTCCGATGAATAATGGCAGGTACACTTGGTCGAGAATGGGAGAAAAGGCAGTT
GCTTCTAGGCTGGATAGATTCTTGATATCCAGACAGTGGGCTGATGCTTTTAAGGAATTTAGGCTGGATAGACTCCAGAGGCCAACATTCGACCATTTTCCCTTAGCCTT
TTCGGTTGGGGCAATGAGGTGGGGGCCTATGCCGTTTAGATTTGAAAATATGTGGATCGACCACCCGGATTTCAAAAAGATGGTGGAAAAGTGGTGGGCAGAGCTTAACC
CGAGTGGATGGGCAGGGTTTAGATTCATGGCCAAGCTGAAGGGACTGAAAGCTCACATCAAAGATTAG
Protein sequence
MDVRKIDWDEVIVITKRDFHDDWGRILDIMQQQLETALVINPFQPDKALLKCPSKDLANLLTTNKGWVSFGPVILKVEKWNKILHSKINVVPSYGGIKGNYCGFIQGEIE
VVDKDQVFKTQIVTFKEGNLLIDRIVGVHGSFSPEATHAFHKGPFVCRSKPISFSWEMRRLTRLVGWIKPIKNAPRGKAQRIRPRLEKESRLPKKPRSLCSKKAILNLKR
KRKTQDTMKTRIMKIKGTGECSTQEPLEIREDYEEAHCSSKNDDLEKNKIIPLSLNEMEEGYKEIDEQEEVKPKALPVISPTDKKKNRANDTPDGFVISKELEDDVEFTD
VKEEQSKQVVEKKDRDDEIKGWVIGVYGPCAAKDGKFFLQELADVAGLCQGIWCIAGDFNVVRWVEDRLNVSRPTKSMRKFNRFIASYNLIDIPMNNGRYTWSRMGEKAV
ASRLDRFLISRQWADAFKEFRLDRLQRPTFDHFPLAFSVGAMRWGPMPFRFENMWIDHPDFKKMVEKWWAELNPSGWAGFRFMAKLKGLKAHIKD
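
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The record does not say which table was used, and the helper name `translate` is hypothetical. Only the first 30 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGGATGTGAGGAAGATCGACTGGGATGAA"
print(translate(cds_start))  # MDVRKIDWDE
```

The printed peptide matches the first ten residues of the protein sequence listed here. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole record.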